Voice deepfakes are coming for your bank balance

[Ariel Davis/The New York Times]

Emily Flitter and Stacy Cowley

September 1, 2023 01.09.2023 • 05:07

This spring, Clive Kabatznik, an investor in Florida, called his local Bank of America representative to discuss a big money transfer he was planning to make. Then he called again.

Except the second phone call wasn’t from Kabatznik. Rather, a software program had artificially generated his voice and tried to trick the banker into moving the money elsewhere.

Kabatznik and his banker were the targets of a cutting-edge scam attempt that has grabbed the attention of cybersecurity experts: the use of artificial intelligence to generate voice deepfakes, or vocal renditions that mimic real people’s voices.

The problem is still new enough that there is no comprehensive accounting of how often it happens. But one expert whose company, Pindrop, monitors the audio traffic for many of the largest US banks said he had seen a jump in its prevalence this year – and in the sophistication of scammers’ voice fraud attempts. Another large voice authentication vendor, Nuance, saw its first successful deepfake attack on a financial services client late last year.

In Kabatznik’s case, the fraud was detectable. But the speed of technological development, the falling costs of generative artificial intelligence programs and the wide availability of recordings of people’s voices on the internet have created the perfect conditions for voice-related AI scams.

Customer data such as bank account details that have been stolen by hackers – and are widely available on underground markets – help scammers pull off these attacks. They become even easier with wealthy clients, whose public appearances, including speeches, are often widely available on the internet. Finding audio samples for everyday customers can also be as easy as conducting an online search – say, on social media apps such as TikTok and Instagram – for the name of someone whose bank account information the scammers already have.

“There’s a lot of audio content out there,” said Vijay Balasubramaniyan, the CEO and a founder of Pindrop, which reviews automatic voice-verification systems for eight of the 10 largest US lenders.

Over the past decade, Pindrop has reviewed recordings of more than 5 billion calls coming into call centers run by the financial companies it serves. The centers handle products such as bank accounts, credit cards and other services offered by big retail banks. All of the call centers receive calls from fraudsters, typically ranging from 1,000 to 10,000 a year. It’s common for 20 calls to come in from fraudsters each week, Balasubramaniyan said.

So far, fake voices created by computer programs account for only “a handful” of these calls, he said – and they’ve begun to happen only within the past year.

Most of the fake voice attacks that Pindrop has seen have come into credit card service call centers, where human representatives deal with customers needing help with their cards.

Balasubramaniyan played a reporter an anonymized recording of one such call that took place in March. Although a very rudimentary example – the voice in this case sounds robotic, more like an e-reader than a person – the call illustrates how scams could occur as AI makes it easier to imitate human voices.

A banker can be heard greeting the customer. Then the voice, similar to an automated one, says, “My card was declined.”

“May I ask whom I have the pleasure of speaking with?” the banker replies.

“My card was declined,” the voice says again.

The banker asks for the customer’s name again. A silence ensues, during which the faint sound of keystrokes can be heard. According to Balasubramaniyan, the number of keystrokes correspond to the number of letters in the customer’s name. The fraudster is typing words into a program that then reads them.

In this instance, the caller’s synthetic speech led the employee to transfer the call to a different department and flag it as potentially fraudulent, Balasubramaniyan said.

Calls like the one he shared, which use type-to-text technology, are some of the easiest attacks to defend against: Call centers can use screening software to pick up technical clues that speech is machine-generated.

“Synthetic speech leaves artifacts behind, and a lot of anti-spoofing algorithms key off those artifacts,” said Peter Soufleris, CEO of IngenID, a voice biometrics technology vendor.

But, as with many security measures, it’s an arms race between attackers and defenders – and one that has recently evolved. A scammer can now simply speak into a microphone or type in a prompt and have that speech very quickly translated into the target’s voice.

Balasubramaniyan noted that one generative AI system, Microsoft’s VALL-E, could create a voice deepfake that said whatever a user wished using just three seconds of sampled audio.

On “60 Minutes” in May, Rachel Tobac, a security consultant, used software to so convincingly clone the voice of Sharyn Alfonsi, one of the program’s correspondents, that she fooled a “60 Minutes” employee into giving her Alfonsi’s passport number.

The attack took only five minutes to put together, said Tobac, CEO of SocialProof Security. The tool she used became available for purchase in January.

While scary deepfake demos are a staple of security conferences, real-life attacks are still extremely rare, said Brett Beranek, general manager of security and biometrics at Nuance, a voice technology vendor that Microsoft acquired in 2021. The only successful breach of a Nuance customer, in October, took the attacker more than a dozen attempts to pull off.

Beranek’s biggest concern is not attacks on call centers or automated systems, like the voice biometrics systems that many banks have deployed. He worries about the scams in which a caller reaches an individual directly.

“I had a conversation just earlier this week with one of our customers,” he said. “They were saying, hey, Brett, it’s great that we have our contact center secured – but what if somebody just calls our CEO directly on their cellphone and pretends to be somebody else?”

That’s what happened in Kabatznik’s case. According to the banker’s description, he appeared to be trying to get her to transfer money to a new location, but the voice was repetitive, talking over her and using garbled phrases. The banker hung up.

“It was like I was talking to her, but it made no sense,” Kabatznik said she had told him. (A Bank of America spokesperson declined to make the banker available for an interview.)

After two more calls like that came through in quick succession, the banker reported the matter to Bank of America’s security team, Kabatznik said. Concerned about the security of Kabatznik’s account, she stopped responding to his calls and emails – even the ones that were coming from the real Kabatznik. It took about 10 days for the two of them to reestablish a connection, when Kabatznik arranged to visit her at her office.

“We regularly train our team to identify and recognize scams and help our clients avoid them,” said William Halldin, a Bank of America spokesperson. He said he could not comment on specific customers or their experiences.

Although the attacks are getting more sophisticated, they stem from a basic cybersecurity threat that has been around for decades: a data breach that reveals the personal information of bank customers. From 2020 to 2022, bits of personal data on more than 300 million people fell into the hands of hackers, leading to $8.8 billion in losses, according to the Federal Trade Commission.

Once they’ve harvested a batch of numbers, hackers sift through the information and match it to real people. Those who steal the information are almost never the same people who end up with it. Instead, the thieves put it up for sale. Specialists can use any one of a handful of easily accessible programs to spoof target customers’ phone numbers – which is what likely happened in Kabatznik’s case.

Recordings of his voice are easy to find. On the internet there are videos of him speaking at a conference and participating in a fundraiser.

“I think it’s pretty scary,” Kabatznik said. “The problem is, I don’t know what you do about it. Do you just go underground and disappear?”

This article originally appeared in The New York Times.

Technology Banking Crime

Subscribe to our Newsletters

Checking email? You’re probably not breathing

Confused, frustrated and stranded at the airport with a service animal

A Trump mug shot for history

13 (great) songs with parenthetical titles

Travel photography: How to make the most of your cellphone camera

A way to feel music through the skin