Synthetic human-like fakes
'''Definitions'''
<section begin=definitions-of-synthetic-human-like-fakes />
When the '''[[Glossary#No camera|camera does not exist]]''', but the subject being imaged with a simulation of a (movie) camera deceives the watcher into believing it is some living or dead person, it is a '''[[Synthetic human-like fakes#Digital look-alikes|digital look-alike]]'''.
In 2017–2018 this started to be referred to as [[w:deepfake]], even though altering video footage of humans with a computer to a deceiving effect is actually some 20 years older than the name "deep fakes" or "deepfakes".<ref name="Bohacek and Farid 2022 protecting against fakes">
{{cite journal
 | last1 = Boháček
 | first1 = Matyáš
 | last2 = Farid
 | first2 = Hany
 | date = 2022-11-23
 | title = Protecting world leaders against deep fakes using facial, gestural, and vocal mannerisms
 | url = https://www.pnas.org/doi/10.1073/pnas.2216035119
 | journal = [[w:Proceedings of the National Academy of Sciences of the United States of America]]
 | volume = 119
 | issue = 48
 | doi = 10.1073/pnas.2216035119
 | access-date = 2023-01-05
}}
</ref><ref name="Bregler1997">
{{cite journal
 | last1 = Bregler
 | first1 = Christoph
 | last2 = Covell
 | first2 = Michele
 | last3 = Slaney
 | first3 = Malcolm
 | date = 1997-08-03
 | title = Video Rewrite: Driving Visual Speech with Audio
 | url = https://www2.eecs.berkeley.edu/Research/Projects/CS/vision/human/bregler-sig97.pdf
 | journal = SIGGRAPH '97: Proceedings of the 24th annual conference on Computer graphics and interactive techniques
 | pages = 353–360
 | doi = 10.1145/258734.258880
 | access-date = 2022-09-09
}}
</ref>
When it cannot be determined by human testing or media forensics whether some fake voice is a synthetic fake of some person's voice or an actual recording of that person's real voice, it is a pre-recorded '''[[Synthetic human-like fakes#Digital sound-alikes|digital sound-alike]]'''. This is now commonly referred to as an [[w:audio deepfake]].
A '''real-time digital look-and-sound-alike''' in a video call was used to defraud a substantial amount of money in 2023.<ref name="Reuters real-time digital look-and-sound-alike crime 2023">
{{cite web
 | url = https://www.reuters.com/technology/deepfake-scam-china-fans-worries-over-ai-driven-fraud-2023-05-22/
 | title = 'Deepfake' scam in China fans worries over AI-driven fraud
 | date = 2023-05-22
 | website = [[w:Reuters.com]]
 | publisher = [[w:Reuters]]
 | access-date = 2023-06-05
}}
</ref>
<section end=definitions-of-synthetic-human-like-fakes /> | |||
::[[Synthetic human-like fakes|Read more about '''synthetic human-like fakes''']], see and support '''[[organizations and events against synthetic human-like fakes]]''' and what they are doing, what kinds of '''[[Laws against synthesis and other related crimes]]''' have been formulated, [[Synthetic human-like fakes#Timeline of synthetic human-like fakes|examine the SSFWIKI '''timeline''' of synthetic human-like fakes]] or [[Mediatheque|view the '''Mediatheque''']].
[[File:Screenshot at 27s of a moving digital-look-alike made to appear Obama-like by Monkeypaw Productions and Buzzfeed 2018.png|thumb|right|480px|link=Mediatheque/2018/Obama's appearance thieved - a public service announcement digital look-alike by Monkeypaw Productions and Buzzfeed|{{#lst:Mediatheque|Obama-like-fake-2018}}]]
<small>[[:File:Deb-2000-reflectance-separation.png|Original picture]] by [[w:Paul Debevec]] et al. - Copyright ACM 2000 https://dl.acm.org/citation.cfm?doid=311779.344855</small>]]
In the cinemas we have seen digital look-alikes for over 20 years. These digital look-alikes have "clothing" (a simulation of clothing is not clothing) or "superhero costumes" and "superbaddie costumes", and they don't need to care about the laws of physics, let alone the laws of physiology. It is generally accepted that digital look-alikes made their public debut in the sequels of The Matrix, i.e. [[w:The Matrix Reloaded]] and [[w:The Matrix Revolutions]], released in 2003. It can be considered almost certain that it was not possible to make these before the year 1999, as the final piece of the puzzle needed to make a (still) digital look-alike that passes human testing, the [[Glossary#Reflectance capture|reflectance capture]] over the human face, was achieved for the first time in 1999 at the [[w:University of Southern California]] and was presented to the crème de la crème
of the computer graphics field in their annual gathering SIGGRAPH 2000.<ref name="Deb2000">
{{cite book
=== The problems with digital look-alikes ===
Most unfortunately for humankind, organized criminal leagues that possess the '''weapons capability''' of making believable-looking '''synthetic pornography''' are producing, on industrial production pipelines, '''terroristic synthetic pornography'''<ref group="footnote" name="About the term terroristic synthetic pornography">It is terminologically more precise, more inclusive and more useful to talk about 'terroristic synthetic pornography', if we want to talk about things with their real names, than about 'synthetic rape porn', because synthesizing recordings of consensual-looking sex scenes can also be terroristic in intent.</ref> by animating digital look-alikes and distributing them on the murky Internet in exchange for money stacks that are getting thinner and thinner as time goes by.
These industrially produced pornographic delusions are causing great human suffering, especially to their direct victims, but they are also tearing our communities and societies apart, sowing blind rage, perceptions of deepening chaos and feelings of powerlessness, and provoking violence.
This kind of '''hate illustration''' increases and strengthens hate feelings, hate thinking, hate speech and hate crimes, and tears our fragile social constructions apart; with time it perverts humankind's view of humankind into an almost unrecognizable shape, unless we interfere with resolve.
'''Child-like sexual abuse images'''
Sadly, by 2023 there is a market for synthetic human-like sexual abuse material that looks like children. [https://www.bbc.com/news/uk-65932372 '''''Illegal trade in AI child sex abuse images exposed''''' at bbc.com] (2023-06-28) reports [[w:Stable Diffusion]] being abused to produce such images. The [[w:Internet Watch Foundation]] also reports on the alarming production of synthetic human-like sex abuse material portraying minors. See [https://www.iwf.org.uk/news-media/news/prime-minister-must-act-on-threat-of-ai-as-iwf-sounds-alarm-on-first-confirmed-ai-generated-images-of-child-sexual-abuse/ '''''Prime Minister must act on threat of AI as IWF ‘sounds alarm’ on first confirmed AI-generated images of child sexual abuse''''' at iwf.org.uk] (2023-08-18).
=== Fixing the problems from digital look-alikes === | |||
We need to act on three fields: [[Laws against synthesis and other related crimes|legal]], technological and cultural.
'''Technological''': A computer vision system like [[FacePinPoint.com]] for finding unauthorized pornography / nudes existed from 2017 to 2021 and could be revived if funding is found. It was a service practically identical to the SSFWIKI original concept [[Adequate Porn Watcher AI (concept)]].
'''Legal''': Legislators around the planet have been waking up to the reality that not everything that seems to be a video of people is a video of people, and various laws have been passed to protect humans and humanity from the menaces of synthetic human-like fakes, mostly digital look-alikes so far; hopefully humans will also be protected by laws from the other aspects of synthetic human-like fakes. See [[Laws against synthesis and other related crimes]].
=== Age analysis and rejuvenating and aging syntheses ===
== Digital sound-alikes ==
=== University of Florida published an antidote to synthetic human-like fake voices in 2022 === | |||
'''2022''' saw a brilliant '''<font color="green">counter-measure</font>''' presented to peers at the 31st [[w:USENIX]] Security Symposium, 10-12 August 2022, by [[w:University of Florida]] researchers: <u><big>'''[[Detecting deep-fake audio through vocal tract reconstruction]]'''</big></u>.
The university's foundation has applied for a patent; let us hope that they will [[w:copyleft]] the patent, as this protective method needs to be rolled out to protect humanity.
'''Below transcluded [[Detecting deep-fake audio through vocal tract reconstruction|from the article]]''' | |||
{{#lst:Detecting deep-fake audio through vocal tract reconstruction|what-is-it}} {{#lst:Detecting deep-fake audio through vocal tract reconstruction|original-reporting}} | |||
'''This new counter-measure needs to be rolled out to protect humans against fake human-like voices.'''
{{#lst:Detecting deep-fake audio through vocal tract reconstruction|embed}} | |||
=== On known history of digital sound-alikes === | |||
[[File:Helsingin-Sanomat-2012-David-Martin-Howard-of-University-of-York-on-apporaching-digital-sound-alikes.jpg|right|thumb|338px|A picture of a cut-away titled "''Voice-terrorist could mimic a leader''" from a 2012 [[w:Helsingin Sanomat]] warning that the sound-like-anyone machines are approaching. Thank you to homie [https://pure.york.ac.uk/portal/en/researchers/david-martin-howard(ecfa9e9e-1290-464f-981a-0c70a534609e).html Prof. David Martin Howard] of the [[w:University of York]], UK and the anonymous editor for the heads-up.]]
{{#ev:youtube|0sR1rU3gLzQ|640px|right|[https://www.youtube.com/watch?v=0sR1rU3gLzQ Video 'This AI Clones Your Voice After Listening for 5 Seconds' by '2 minute papers' at YouTube] describes the voice thieving machine by Google Research presented at [[w:NeurIPS|w:NeurIPS]] 2018.}}
In November 2024, Nvidia researchers announced that they had made and trained a [https://fugatto.github.io/ Foundational Generative Audio Transformer (Opus 1) at fugatto.github.io], or Fugatto for short. The researchers state: ''Fugatto is a versatile audio synthesis and transformation model capable of following free-form text instructions with optional audio inputs.''<ref>https://research.nvidia.com/publication/2024-11_fugatto-1-foundational-generative-audio-transformer-opus-1</ref>
=== Documented crimes with digital sound-alikes ===
==== 2021 digital sound-alike enabled fraud ====
<section begin=2021 digital sound-alike enabled fraud />The 2nd publicly known fraud done with a digital sound-alike<ref group="1st seen in" name="2021 digital sound-alike fraud case">https://www.reddit.com/r/VocalSynthesis/</ref> took place on Friday 2021-01-15. A bank in Hong Kong was manipulated to wire money to numerous bank accounts by using a voice stolen from one of their client company's directors. They managed to defraud $35 million of the U.A.E.-based company's money.<ref name="Forbes reporting on 2021 digital sound-alike fraud">https://www.forbes.com/sites/thomasbrewster/2021/10/14/huge-bank-fraud-uses-deep-fake-voice-tech-to-steal-millions/</ref> This case came to light when Forbes saw [https://www.documentcloud.org/documents/21085009-hackers-use-deep-voice-tech-in-400k-theft a document] where the U.A.E. financial authorities were seeking administrative assistance from the US authorities towards recovering a small portion of the defrauded money that had been sent to bank accounts in the USA.<ref name="Forbes reporting on 2021 digital sound-alike fraud" />
'''Reporting on the 2021 digital sound-alike enabled fraud'''
<section end=2021 digital sound-alike enabled fraud />
'''More fraud cases with digital sound-alikes''' | |||
* [https://www.washingtonpost.com/technology/2023/03/05/ai-voice-scam/ '''''They thought loved ones were calling for help. It was an AI scam.''''' at washingtonpost.com], March 2023 reporting | |||
=== Example of a hypothetical 4-victim digital sound-alike attack ===
# Victim #3 - It could also be viewed that victim #3 is our law enforcement systems as they are put to chase after and interrogate the innocent victim #1
# Victim #4 - Our judiciary which prosecutes and possibly convicts the innocent victim #1.
=== Examples of speech synthesis software not quite able to fool a human yet ===
[[File:Spectrogram-19thC.png|thumb|right|640px|A [[w:spectrogram]] of a male voice saying 'nineteenth century']]
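A spectrogram like the one pictured is the standard visualization behind much of audio forensics: the signal is sliced into short frames and each frame is transformed into its frequency content. The following stdlib-only sketch illustrates the mechanics with a naive DFT on an invented test tone; the frame size, hop and sampling rate are illustrative choices, not values from any particular forensic tool.

```python
import cmath
import math

def spectrogram(samples, frame_size=256, hop=128):
    """Naive magnitude spectrogram: slice the signal into overlapping
    frames, apply a Hann window, and take a DFT of each frame.
    O(frame_size^2) per frame -- fine for a demo, not for production."""
    frames = []
    for start in range(0, len(samples) - frame_size + 1, hop):
        frame = [samples[start + n] *
                 (0.5 - 0.5 * math.cos(2 * math.pi * n / frame_size))
                 for n in range(frame_size)]
        spectrum = []
        for k in range(frame_size // 2):  # non-negative frequency bins only
            acc = sum(frame[n] * cmath.exp(-2j * math.pi * k * n / frame_size)
                      for n in range(frame_size))
            spectrum.append(abs(acc))
        frames.append(spectrum)
    return frames

rate = 8000   # Hz, illustrative sampling rate
freq = 1000   # Hz, illustrative test tone
tone = [math.sin(2 * math.pi * freq * t / rate) for t in range(2048)]
spec = spectrogram(tone)
# The strongest bin should sit at freq / rate * frame_size = bin 32
peak_bin = max(range(len(spec[0])), key=lambda k: spec[0][k])
```

A real voice produces many simultaneous frequency bands whose pattern over time is what both human analysts and detection software inspect; the single bright band of this pure tone is the simplest possible case.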
=== What should we do about digital sound-alikes? === | |||
Living people can defend<ref group="footnote" name="judiciary maybe not aware">Whether a suspect can defend against faked synthetic speech that sounds like him/her depends on how up-to-date the judiciary is. If no information and instructions about digital sound-alikes have been given to the judiciary, they likely will not believe the defense of denying that the recording is of the suspect's voice.</ref> themselves against a digital sound-alike by denying the things the digital sound-alike says if they are presented to the target, but dead people cannot. Digital sound-alikes offer criminals new disinformation attack vectors and wreak havoc on provability.
For these reasons the bannable '''raw materials''', i.e. covert voice models, '''[[Law proposals to ban covert modeling|should be prohibited by law]]''' in order to protect humans from abuse by criminal parties.
It is high time to act and to '''[[Law proposals to ban covert modeling|criminalize the covert modeling of human voice!]]''' | |||
== Text syntheses ==
In [[w:natural language processing]], development in [[w:natural-language understanding]] leads to more cunning [[w:natural-language generation]] AI.
'''[[w:Large language model]]s''' ('''LLM''') are very large [[w:language model]]s consisting of a [[w:Artificial neural network|w:neural network]] with many parameters. | |||
[[w:OpenAI]]'s [[w:OpenAI#GPT|w:Generative Pre-trained Transformer]] ('''GPT''') is a left-to-right [[w:transformer (machine learning model)]]-based [[w:Natural-language generation|text generation]] model, succeeded by [[w:OpenAI#GPT-2|w:GPT-2]] and [[w:OpenAI#GPT-3|w:GPT-3]].
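"Left-to-right" generation means every new token is chosen using only the tokens already produced. The toy sketch below illustrates that mechanic with a bigram Markov model standing in for a transformer; the training corpus is invented for the example and the model is deliberately crude.

```python
import random
from collections import defaultdict

def train_bigrams(text):
    """Count word-to-next-word transitions -- a deliberately crude
    stand-in for the learned conditional distribution of a GPT-style
    model."""
    words = text.split()
    table = defaultdict(list)
    for a, b in zip(words, words[1:]):
        table[a].append(b)
    return table

def generate(table, start, length, seed=0):
    """Left-to-right generation: each new word is sampled conditioned
    only on the prefix produced so far (here, just the previous word)."""
    rng = random.Random(seed)
    out = [start]
    while len(out) < length:
        followers = table.get(out[-1])
        if not followers:
            break  # dead end: no observed continuation
        out.append(rng.choice(followers))
    return " ".join(out)

corpus = ("the camera does not exist but the image deceives "
          "the watcher and the image deceives the forensic examiner")
model = train_bigrams(corpus)
sentence = generate(model, "the", 8)
```

A large language model replaces the bigram lookup table with a neural network conditioned on a much longer prefix, but the sampling loop is conceptually the same.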
November 2022 saw the publication of OpenAI's '''[[w:ChatGPT]]''', a conversational artificial intelligence.
'''[[w:Bard (chatbot)]]''' is a conversational [[w:generative artificial intelligence]] [[w:chatbot]] developed by [[w:Google]], based on the [[w:LaMDA]] family of [[w:large language models]]. It was developed as a direct response to the rise of [[w:OpenAI]]'s [[w:ChatGPT]], and was released in March 2023. ([https://en.wikipedia.org/w/index.php?title=Bard_(chatbot)&oldid=1152361586 Wikipedia])
''' Reporting / announcements ''' (in reverse chronology)
* [https://blogs.microsoft.com/blog/2023/02/07/reinventing-search-with-a-new-ai-powered-microsoft-bing-and-edge-your-copilot-for-the-web/ '''''Reinventing search with a new AI-powered Microsoft Bing and Edge, your copilot for the web''''' at blogs.microsoft.com] '''February 2023''' (2023-02-07). The new improved Bing, available only in Microsoft's Edge browser is reportedly based on a language model refined from GPT 3.5.<ref>https://www.theverge.com/2023/2/7/23587454/microsoft-bing-edge-chatgpt-ai</ref> | |||
* [https://openai.com/blog/new-ai-classifier-for-indicating-ai-written-text '''New AI classifier for indicating AI-written text''' at openai.com], a '''January 2023''' blog post about OpenAI's AI classifier for detecting AI-written texts.
* [https://analyticssteps.com/blogs/detection-fake-and-false-news-text-analysis-approaches-and-cnn-deep-learning-model '''"Detection of Fake and False News (Text Analysis): Approaches and CNN as Deep Learning Model"''' at analyticssteps.com], a 2019 summary written by Shubham Panth.
=== Detectors for synthesized texts === | |||
Introduction of [[w:ChatGPT]] by OpenAI brought the need for software to detect machine-generated texts.

Try AI plagiarism detection for free
* [https://contentdetector.ai/ '''AI Content Detector''' at contentdetector.ai] - ''AI Content Detector - Detect ChatGPT Plagiarism'' ('''try for free''')
* [https://platform.openai.com/ai-text-classifier '''AI Text Classifier''' at platform.openai.com] - ''The AI Text Classifier is a fine-tuned GPT model that predicts how likely it is that a piece of text was generated by AI from a variety of sources, such as ChatGPT.'' ('''free account required''')
* [https://gptradar.com/ '''GPT Radar''' at gptradar.com] - ''AI text detector app'' ('''try for free''')<ref group="1st seen in" name="Wordlift.io 2023">https://wordlift.io/blog/en/best-plagiarism-checkers-for-ai-generated-content/</ref> | |||
* [https://gptzero.me/ '''GPTZero''' at gptzero.me] - ''The World's #1 AI Detector with over 1 Million Users'' ('''try for free''') | |||
* [https://copyleaks.com/plagiarism-checker '''Plagiarism Checker''' at copyleaks.com] - ''Plagiarism Checker by Copyleaks'' ('''try for free''')<ref group="1st seen in" name="Wordlift.io 2023" />
* https://gowinston.ai/ - ''The most powerful AI content detection solution'' ('''free-tier available''')<ref group="1st seen in" name="Wordlift.io 2023" /> | |||
* [https://www.zerogpt.com/ '''ZeroGPT''' at zerogpt.com]<ref group="1st seen in" name="Wordlift.io 2023" /> - ''GPT-4 And ChatGPT detector by ZeroGPT: detect OpenAI text - ZeroGPT the most Advanced and Reliable Chat GPT and GPT-4 detector tool'' ('''try for free''') | |||
For-a-fee AI plagiarism detection tools | |||
* https://originality.ai/ - ''The Most Accurate AI Content Detector and Plagiarism Checker Built for Serious Content Publishers''<ref group="1st seen in" name="Wordlift.io 2023" /> | |||
* https://www.turnitin.com/ - ''Empower students to do their best, original work''<ref group="1st seen in" name="Wordlift.io 2023" /> | |||
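Detectors like those listed above reportedly rely on statistical signals such as perplexity: how predictable a text is to a language model. The sketch below shows only the mechanics of that idea, using an add-one smoothed unigram model as a deliberately crude stand-in for a real LLM; the reference corpus, example sentences and the notion of a threshold are all invented for the illustration.

```python
import math
from collections import Counter

def avg_neg_log_prob(text, counts, total, vocab_size):
    """Average negative log-probability per word under an add-one
    smoothed unigram model. Lower = more 'predictable' to the model.
    Real detectors query an LLM here; the unigram table only shows
    the shape of the computation."""
    words = text.lower().split()
    nll = 0.0
    for w in words:
        p = (counts.get(w, 0) + 1) / (total + vocab_size)  # add-one smoothing
        nll -= math.log(p)
    return nll / max(len(words), 1)

# Invented reference corpus standing in for the detector's training data
reference = ("the quick brown fox jumps over the lazy dog "
             "the dog barks and the fox runs away").split()
counts = Counter(reference)
total, vocab_size = len(reference), len(counts) + 1  # +1 for unseen words

familiar = avg_neg_log_prob("the fox jumps over the dog",
                            counts, total, vocab_size)
unfamiliar = avg_neg_log_prob("zyzzyva quokka axolotl pangolin",
                              counts, total, vocab_size)
# A detector would compare such scores against a tuned threshold
```

The familiar sentence scores lower (more predictable) than the unfamiliar one; real products combine stronger models with many more signals, which is why their accuracy claims vary so widely.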
== Handwriting syntheses ==
If the handwriting-like synthesis passes human and media forensics testing, it is a '''digital handwrite-alike'''.
Here we find a possible '''risk''' similar to that which became a reality when the '''[[w:speaker recognition]] systems''' turned out to be instrumental in the development of '''[[#Digital sound-alikes|digital sound-alikes]]'''. After the knowledge needed to recognize a speaker was [[w:Transfer learning|w:transferred]] into a generative task in 2018 by Google researchers, we can no longer effectively determine for English speakers which recording is of human origin and which is of machine origin.
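The speaker-recognition step referred to above is commonly implemented by encoding each voice clip into a fixed-length embedding vector and comparing embeddings with cosine similarity. The sketch below uses invented toy vectors, not real embeddings, purely to show the comparison step.

```python
import math

def cosine_similarity(a, b):
    """Cosine of the angle between two embedding vectors:
    close to 1.0 = same direction (likely the same speaker)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

# Invented toy 4-dimensional "speaker embeddings"; real systems
# produce vectors with hundreds of dimensions from a neural encoder.
alice_clip_1 = [0.9, 0.1, 0.3, 0.2]
alice_clip_2 = [0.8, 0.2, 0.4, 0.1]
bob_clip     = [0.1, 0.9, 0.1, 0.8]

same_speaker = cosine_similarity(alice_clip_1, alice_clip_2)
different_speaker = cosine_similarity(alice_clip_1, bob_clip)
# A verification system accepts when the score clears a tuned threshold
```

The danger noted in the text is that the very encoder producing these embeddings can condition a synthesizer, turning a recognition capability into an imitation capability; the same could conceivably happen with handwriting recognition.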
'''Handwriting-like syntheses''':
== 2020's synthetic human-like fakes ==
* '''2023''' | '''<font color="orange">Real-time digital look-and-sound-alike crime</font>''' | In April a man in northern China was defrauded of 4.3 million yuan by a criminal employing a digital look-and-sound-alike pretending to be his friend on a video call made with a stolen messaging service account.<ref name="Reuters real-time digital look-and-sound-alike crime 2023"/>
* '''2023''' | '''<font color="orange">Election meddling with digital look-alikes</font>''' | The [[w:2023 Turkish presidential election]] saw numerous deepfake controversies. | |||
** "''Ahead of the election in Turkey, President Recep Tayyip Erdogan showed a video linking his main challenger Kemal Kilicdaroglu to the militant Kurdish organization PKK.''" [...] "''Research by DW's fact-checking team in cooperation with DW's Turkish service shows that the video at the campaign rally was '''manipulated''' by '''combining two separate videos''' with totally different backgrounds and content.''" [https://www.dw.com/en/fact-check-turkeys-erdogan-shows-false-kilicdaroglu-video/a-65554034 reports dw.com] | |||
* '''2023''' | March 7th | '''<font color="red">science / demonstration</font>''' | Microsoft researchers submitted a paper for publication outlining their [https://arxiv.org/abs/2303.03926 '''Cross-lingual neural codec language modeling system''' at arxiv.org] dubbed [https://www.microsoft.com/en-us/research/project/vall-e-x/vall-e-x/ '''VALL-E X''' at microsoft.com], which extends VALL-E's capabilities to be cross-lingual while maintaining the same "''emotional tone''" from sample to fake.
* '''2023''' | January 5th | '''<font color="red">science / demonstration</font>''' | Microsoft researchers announced [https://www.microsoft.com/en-us/research/project/vall-e/ '''''VALL-E''''' - Neural Codec Language Models are Zero-Shot Text to Speech Synthesizers (at microsoft.com)], which is able to thieve a voice from only '''3 seconds of sample''' and is also able to mimic the "''emotional tone''" of the sample the synthesis is produced from.<ref>
{{cite web | |||
| url = https://arstechnica.com/information-technology/2023/01/microsofts-new-ai-can-simulate-anyones-voice-with-3-seconds-of-audio/ | |||
| title = Microsoft’s new AI can simulate anyone’s voice with 3 seconds of audio | |||
| last = Edwards | |||
| first = Benj | |||
| date = 2023-01-10 | |||
| website = [[w:Arstechnica.com]] | |||
| publisher = Arstechnica | |||
| access-date = 2023-05-05 | |||
| quote = For the paper's conclusion, they write: "Since VALL-E could synthesize speech that maintains speaker identity, it may carry potential risks in misuse of the model, such as spoofing voice identification or impersonating a specific speaker. To mitigate such risks, it is possible to build a detection model to discriminate whether an audio clip was synthesized by VALL-E. We will also put Microsoft AI Principles into practice when further developing the models." | |||
}} | |||
</ref> | |||
* '''2023''' | January 1st | '''<font color="green">Law</font>''' | {{#lst:Law on sexual offences in Finland 2023|what-is-it}} | |||
* '''2022''' | <font color="orange">'''science'''</font> and <font color="green">'''demonstration'''</font> | [[w:OpenAI]][https://openai.com/ (.com)] published [[w:ChatGPT]], a conversational AI accessible with a free account at [https://chat.openai.com/ chat.openai.com]. The initial version was published on 2022-11-30.
* '''2022''' | '''<font color="green">brief report of counter-measures</font>''' | {{#lst:Protecting world leaders against deep fakes using facial, gestural, and vocal mannerisms|what-is-it}} Publication date 2022-11-23.
* '''2022''' | '''<font color="green">counter-measure</font>''' | {{#lst:Detecting deep-fake audio through vocal tract reconstruction|what-is-it}}
:{{#lst:Detecting deep-fake audio through vocal tract reconstruction|original-reporting}}. Presented to peers in August 2022 and to the general public in September 2022.
* '''2022''' | <font color="orange">'''disinformation attack'''</font> | In June 2022 a fake digital look-and-sound-alike in the appearance and voice of [[w:Vitali Klitschko]], mayor of [[w:Kyiv]], held fake video phone calls with several European mayors. The Germans determined that the video phone call was fake by contacting the Ukrainian officials. This attempt at covert disinformation attack was originally reported by [[w:Der Spiegel]].<ref>https://www.theguardian.com/world/2022/jun/25/european-leaders-deepfake-video-calls-mayor-of-kyiv-vitali-klitschko</ref><ref>https://www.dw.com/en/vitali-klitschko-fake-tricks-berlin-mayor-in-video-call/a-62257289</ref> | |||
* '''2022''' | science | [[w:DALL-E]] 2, a successor designed to generate more realistic images at higher resolutions that "can combine concepts, attributes, and styles" was published in April 2022.<ref>{{Cite web |title=DALL·E 2 |url=https://openai.com/dall-e-2/ |access-date=2023-04-22 |website=OpenAI |language=en-US}}</ref> ([https://en.wikipedia.org/w/index.php?title=DALL-E&oldid=1151136107 Wikipedia]) | * '''2022''' | science | [[w:DALL-E]] 2, a successor designed to generate more realistic images at higher resolutions that "can combine concepts, attributes, and styles" was published in April 2022.<ref>{{Cite web |title=DALL·E 2 |url=https://openai.com/dall-e-2/ |access-date=2023-04-22 |website=OpenAI |language=en-US}}</ref> ([https://en.wikipedia.org/w/index.php?title=DALL-E&oldid=1151136107 Wikipedia]) | ||
Line 600: | Line 720: | ||
* '''2013''' | demonstration | A '''[https://ict.usc.edu/pubs/Scanning%20and%20Printing%20a%203D%20Portrait%20of%20President%20Barack%20Obama.pdf 'Scanning and Printing a 3D Portrait of President Barack Obama' at ict.usc.edu]'''. A 7D model and a 3D bust was made of President Obama with his consent. Relevancy: <font color="green">'''Relevancy: certain'''</font> | * '''2013''' | demonstration | A '''[https://ict.usc.edu/pubs/Scanning%20and%20Printing%20a%203D%20Portrait%20of%20President%20Barack%20Obama.pdf 'Scanning and Printing a 3D Portrait of President Barack Obama' at ict.usc.edu]'''. A 7D model and a 3D bust was made of President Obama with his consent. Relevancy: <font color="green">'''Relevancy: certain'''</font> | ||
* '''2011''' | <font color="green">'''Law in Finland'''</font> | Distribution and attempt of distribution and also possession of '''synthetic [[w:Child sexual abuse material|CSAM]]''' was '''criminalized''' on Wednesday 2011-06-01, upon the initiative of the [[w:Vanhanen II Cabinet]]. These protections against CSAM were moved into 19 §, 20 § and 21 § of Chapter 20 when the [[Law on sexual offences in Finland 2023]] was improved and gathered into Chapter 20 upon the initiative of the [[w:Marin Cabinet]]. | |||
== 2000's synthetic human-like fakes == | == 2000's synthetic human-like fakes == | ||
Line 644: | Line 766: | ||
* <font color="red">'''1999'''</font> | <font color="red">'''institute founded'''</font> | The '''[[w:Institute for Creative Technologies]]''' was founded by the [[w:United States Army]] in the [[w:University of Southern California]]. It collaborates with the [[w:United States Army Futures Command]], [[w:United States Army Combat Capabilities Development Command]], [[w:Combat Capabilities Development Command Soldier Center]] and [[w:United States Army Research Laboratory]].<ref name="ICT-about">https://ict.usc.edu/about/</ref>. In 2016 [[w:Hao Li]] was appointed to direct the institute. | * <font color="red">'''1999'''</font> | <font color="red">'''institute founded'''</font> | The '''[[w:Institute for Creative Technologies]]''' was founded by the [[w:United States Army]] in the [[w:University of Southern California]]. It collaborates with the [[w:United States Army Futures Command]], [[w:United States Army Combat Capabilities Development Command]], [[w:Combat Capabilities Development Command Soldier Center]] and [[w:United States Army Research Laboratory]].<ref name="ICT-about">https://ict.usc.edu/about/</ref>. In 2016 [[w:Hao Li]] was appointed to direct the institute. | ||
* '''1997''' | '''technology / science''' | [https://www2.eecs.berkeley.edu/Research/Projects/CS/vision/human/bregler-sig97.pdf ''''Video rewrite: Driving visual speech with audio'''' at www2.eecs.berkeley.edu]<ref name="Bregler1997" | * '''1997''' | '''technology / science''' | [https://www2.eecs.berkeley.edu/Research/Projects/CS/vision/human/bregler-sig97.pdf ''''Video rewrite: Driving visual speech with audio'''' at www2.eecs.berkeley.edu]<ref name="Bregler1997" /><ref group="1st seen in" name="Bohacek-Farid-2022"> | ||
PROTECTING PRESIDENT ZELENSKYY AGAINST DEEP FAKES https://arxiv.org/pdf/2206.12043.pdf | PROTECTING PRESIDENT ZELENSKYY AGAINST DEEP FAKES https://arxiv.org/pdf/2206.12043.pdf | ||
Line 707: | Line 809: | ||
== Contact information of organizations == | == Contact information of organizations == | ||
Please contact these organizations and tell them to work harder against the disinformation weapons | Please contact [[Organizations, studies and events against synthetic human-like fakes|these organizations]] and tell them to work harder against the disinformation weapons | ||
= 1st seen in = | = 1st seen in = |
Latest revision as of 19:40, 24 December 2024
Definitions
When the camera does not exist, but the subject being imaged with a simulation of a (movie) camera deceives the watcher to believe it is some living or dead person, it is a digital look-alike.
In 2017-2018 this started to be referred to as w:deepfake, even though altering video footage of humans with a computer with a deceiving effect is actually 20 years older than the name "deep fakes" or "deepfakes".[1][2]
When it cannot be determined by human testing or media forensics whether some fake voice is a synthetic fake of some person's voice or an actual recording of that person's real voice, it is a pre-recorded digital sound-alike. This is now commonly referred to as w:audio deepfake.
Real-time digital look-and-sound-alike in a video call was used to defraud a substantial amount of money in 2023.[3]
- Read more about synthetic human-like fakes, see and support organizations and events against synthetic human-like fakes and what they are doing, what kinds of Laws against synthesis and other related crimes have been formulated, examine the SSFWIKI timeline of synthetic human-like fakes or view the Mediatheque.
Digital look-alikes[edit | edit source]
Introduction to digital look-alikes[edit | edit source]
In the cinemas we have seen digital look-alikes for over 20 years. These digital look-alikes have "clothing" (a simulation of clothing is not clothing) or "superhero costumes" and "superbaddie costumes", and they don't need to care about the laws of physics, let alone the laws of physiology. It is generally accepted that digital look-alikes made their public debut in the sequels of The Matrix i.e. w:The Matrix Reloaded and w:The Matrix Revolutions released in 2003. It can be considered almost certain that it was not possible to make these before the year 1999, when the final piece of the puzzle needed to make a (still) digital look-alike that passes human testing, the reflectance capture over the human face, was achieved for the first time at the w:University of Southern California and presented to the crème de la crème of the computer graphics field at their annual gathering, SIGGRAPH 2000.[5]
“Do you think that was w:Hugo Weaving's left cheekbone that w:Keanu Reeves punched in with his right fist?”
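The reflectance capture mentioned above rests on an idea that is, at its core, linear: photograph the subject once per light source, and a new lighting environment is then approximately a weighted sum of those one-light-at-a-time basis photographs. The following toy sketch only illustrates that summation idea; the 2x2 "images", the weights and the function name are invented for illustration, and the actual USC light-stage pipeline is far richer.

```python
# Toy sketch of the linearity behind reflectance capture: photograph the
# face once per light source ("one light at a time"), then relight by a
# weighted sum of those basis images. All values here are invented.

def relight(olat_images, light_weights):
    """Combine per-light basis images linearly into one relit image."""
    height, width = len(olat_images[0]), len(olat_images[0][0])
    out = [[0.0] * width for _ in range(height)]
    for image, weight in zip(olat_images, light_weights):
        for y in range(height):
            for x in range(width):
                out[y][x] += weight * image[y][x]
    return out

# Two basis "photographs": face lit from the left, face lit from the right.
lit_left = [[0.9, 0.2], [0.8, 0.1]]
lit_right = [[0.1, 0.8], [0.2, 0.9]]

# New lighting environment: 30 % left light, 70 % right light.
relit = relight([lit_left, lit_right], [0.3, 0.7])
```

Because the combination is linear, any lighting that can be expressed as a mix of the captured lights can be reproduced, which is part of what makes relit digital look-alikes so convincing.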
The problems with digital look-alikes[edit | edit source]
Extremely unfortunately for humankind, organized criminal leagues that possess the weapons capability of making believable-looking synthetic pornography are producing terroristic synthetic pornography[footnote 1] on industrial production pipelines by animating digital look-alikes and distributing it on the murky Internet in exchange for money stacks that are getting thinner and thinner as time goes by.
These industrially produced pornographic delusions are causing great human suffering, especially in their direct victims, but they are also tearing our communities and societies apart, sowing blind rage, perceptions of deepening chaos, feelings of powerlessness and provoke violence.
These kinds of hate illustrations increase and strengthen hate feelings, hate thinking, hate speech and hate crimes, tear our fragile social constructions apart and with time pervert humankind's view of humankind into an almost unrecognizable shape, unless we interfere with resolve.
Children-like sexual abuse images
Sadly, by 2023 there is a market for synthetic human-like sexual abuse material that looks like children. 'Illegal trade in AI child sex abuse images exposed' at bbc.com 2023-06-28 reports w:Stable Diffusion being abused to produce these kinds of images. The w:Internet Watch Foundation also reports on the alarming production of synthetic human-like sex abuse material portraying minors. See 'Prime Minister must act on threat of AI as IWF ‘sounds alarm’ on first confirmed AI-generated images of child sexual abuse' at iwf.org.uk (2023-08-18)
Fixing the problems from digital look-alikes[edit | edit source]
We need to act on 3 fields: legal, technological and cultural.
Technological: A computer vision system like FacePinPoint.com for seeking unauthorized pornography / nudes existed from 2017 to 2021 and could be revived if funding is found. It was a service practically identical with the SSFWIKI original concept Adequate Porn Watcher AI (concept).
Legal: Legislators around the planet have been waking up to the reality that not everything that seems to be a video of people is a video of people, and various laws have been passed to protect humans and humanity from the menaces of synthetic human-like fakes, mostly digital look-alikes so far, but hopefully humans will be protected also from other aspects of synthetic human-like fakes by laws. See Laws against synthesis and other related crimes
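On the technological field, one can sketch how a seeking system in the spirit of FacePinPoint.com might work: compare a compact numerical "embedding" of a found image against embeddings of images a person has registered for protection. This is an assumption for illustration only; the vectors, the threshold and the cosine-similarity matcher below are invented, not FacePinPoint's actual method, and a real system would use a trained face-recognition network to produce the embeddings.

```python
# Hypothetical sketch of embedding-based matching for seeking
# unauthorized imagery. All vectors and the threshold are invented.
import math

def cosine_similarity(a, b):
    """Cosine of the angle between two embedding vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(x * x for x in b))
    return dot / (norm_a * norm_b)

def is_likely_match(candidate, registered_embeddings, threshold=0.9):
    """Flag the candidate if it is close to any registered embedding."""
    return any(cosine_similarity(candidate, registered) >= threshold
               for registered in registered_embeddings)

registered = [[0.9, 0.1, 0.4], [0.2, 0.8, 0.5]]
near_duplicate_flagged = is_likely_match([0.88, 0.12, 0.42], registered)
unrelated_flagged = is_likely_match([-0.9, 0.3, -0.2], registered)
```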
Age analysis and rejuvenating and aging syntheses[edit | edit source]
- 'An Overview of Two Age Synthesis and Estimation Techniques' at arxiv.org (.pdf), submitted for review on 2020-01-26
- 'Dual Reference Age Synthesis' at sciencedirect.com (preprint at arxiv.org) published on 2020-10-21 in w:Neurocomputing (journal)
- 'A simple automatic facial aging/rejuvenating synthesis method' at ieeexplore.ieee.org read free at researchgate.net, published at the proceedings of the 2011 IEEE International Conference on Systems, Man and Cybernetics
- 'Age Synthesis and Estimation via Faces: A Survey' at ieeexplore.ieee.org (paywall) at researchgate.net published November 2010
Temporal limit of digital look-alikes[edit | edit source]
w:History of film technology has information about where the border is.
Digital look-alikes cannot be used to attack people who existed before the technological invention of film. For moving pictures the breakthrough is attributed to w:Auguste and Louis Lumière's w:Cinematograph premiered in Paris on 28 December 1895, though this was only the commercial and popular breakthrough, as even earlier moving pictures exist. (adapted from w:History of film)
The w:Kinetoscope is an even earlier motion picture exhibition device. A prototype for the Kinetoscope was shown to a convention of the National Federation of Women's Clubs on May 20, 1891.[6] The first public demonstration of the Kinetoscope was held at the Brooklyn Institute of Arts and Sciences on May 9, 1893. (Wikipedia)[6]
Digital sound-alikes[edit | edit source]
University of Florida published an antidote to synthetic human-like fake voices in 2022[edit | edit source]
2022 saw a brilliant counter-measure presented to peers at the 31st w:USENIX Security Symposium, 10-12 August 2022, by the w:University of Florida: Detecting deep-fake audio through vocal tract reconstruction.
The university's foundation has applied for a patent; let us hope that they will w:copyleft the patent, as this protective method needs to be rolled out to protect humanity.
Below transcluded from the article
Detecting deep-fake audio through vocal tract reconstruction is an epic scientific work against fake human-like voices from the w:University of Florida, published to peers in August 2022.
The work Who Are You (I Really Wanna Know)? Detecting Audio DeepFakes Through Vocal Tract Reconstruction at usenix.org, presentation page, version included in the proceedings[7] and slides from researchers of the Florida Institute for Cybersecurity Research (FICS) at fics.institute.ufl.edu in the w:University of Florida received funding from the w:Office of Naval Research and was presented on 2022-08-11 at the 31st w:USENIX Security Symposium.
This work was done by PhD student Logan Blue, Kevin Warren, Hadi Abdullah, Cassidy Gibson, Luis Vargas, Jessica O’Dell, Kevin Butler and Professor Patrick Traynor.
The University of Florida Research Foundation Inc has filed for and received an US patent titled 'Detecting deep-fake audio through vocal tract reconstruction' registration number US20220036904A1 (link to patents.google.com) with 20 claims. The patent application was published on Thursday 2022-02-03. The patent application was approved on 2023-07-04 and has an adjusted expiration date of Sunday 2041-12-29. PhD student Logan Blue and professor Patrick Traynor wrote an article for the general public on the work titled Deepfake audio has a tell – researchers use fluid dynamics to spot artificial imposter voices at theconversation.com[8] that was published Tuesday 2022-09-20 and permanently w:copylefted it under Creative Commons Attribution-NoDerivatives (CC-BY-ND).
This new counter-measure needs to be rolled out to protect humans against fake human-like voices.
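The gist of the counter-measure, as the researchers describe it, is to estimate the vocal tract shape implied by the audio and to reject audio whose implied anatomy is not humanly possible. The sketch below illustrates only that final plausibility check; the area estimates and the "plausible range" are invented placeholders, and the actual work derives its estimates through fluid-dynamics modelling of the speech.

```python
# Toy sketch of the plausibility check behind "Detecting deep-fake audio
# through vocal tract reconstruction". All numbers here are invented.

PLAUSIBLE_MIN_CM2 = 0.1    # invented lower bound for a tract segment area
PLAUSIBLE_MAX_CM2 = 15.0   # invented upper bound

def looks_human(estimated_areas_cm2):
    """True when every estimated tract segment is human-plausible."""
    return all(PLAUSIBLE_MIN_CM2 <= area <= PLAUSIBLE_MAX_CM2
               for area in estimated_areas_cm2)

real_voice_estimate = [1.2, 2.5, 4.0, 3.1]    # plausible throughout
deepfake_estimate = [0.8, 42.0, 3.0, 0.02]    # anatomically impossible
```

The insight is that a deepfake generator optimizes for how the audio sounds, not for whether a physically possible vocal tract could have produced it.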
Below is an exact copy of the original article from the SSF! wordpress titled "Amazing method and results from University of Florida scientists in 2022 against the menaces of digital sound-alikes / audio deepfakes". Thank you to the original writers for having the wisdom of licensing the article under CC-BY-ND.
On known history of digital sound-alikes[edit | edit source]
The first English-speaking digital sound-alikes were introduced in 2016 by Adobe and Deepmind, but neither was made publicly available.
Then in 2018 at the w:Conference on Neural Information Processing Systems (NeurIPS) the work 'Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis' (at arXiv.org) was presented. The pre-trained model is able to steal voices from a sample of only 5 seconds with almost convincing results.
The Iframe below is transcluded from 'Audio samples from "Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis"' at google.github.io, the audio samples of a sound-like-anyone machine presented at the 2018 w:NeurIPS conference by Google researchers.
Have a listen.
Observe how good the "VCTK p240" system is at deceiving listeners into thinking that a person is doing the talking.
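The transfer learning in the 2018 work can be sketched as a two-stage pipeline: a speaker-verification encoder compresses the short voice sample into a fixed-size "speaker embedding", and the synthesizer is then conditioned on that embedding so the output comes out in the sampled voice. Both stages below are invented toy stand-ins for illustration; the real encoder and synthesizer are large trained neural networks.

```python
# Toy sketch of embedding-conditioned voice synthesis. The "encoder" is
# a per-dimension average and the "synthesizer" only tags its output;
# everything is a stand-in for the trained networks in the actual work.

def speaker_embedding(voice_sample_frames, dim=3):
    """Collapse variable-length audio frames into a fixed-size vector."""
    count = len(voice_sample_frames)
    return [sum(frame[d] for frame in voice_sample_frames) / count
            for d in range(dim)]

def synthesize(text, embedding):
    """Pretend synthesizer: tags each output "frame" with the embedding,
    showing how the stolen voice is threaded through the synthesis."""
    return [(char, tuple(embedding)) for char in text]

# Three toy feature frames standing in for a ~5 second voice sample.
five_second_sample = [[0.1, 0.4, 0.2], [0.3, 0.2, 0.2], [0.2, 0.3, 0.2]]
embedding = speaker_embedding(five_second_sample)
fake_speech = synthesize("hi", embedding)
```

The key point is that only the small embedding, not the whole sample, carries the victim's voice identity into the synthesizer, which is why 5 seconds of audio suffice.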
Reporting on the sound-like-anyone-machines
- "Artificial Intelligence Can Now Copy Your Voice: What Does That Mean For Humans?" May 2019 reporting at forbes.com on w:Baidu Research's attempt at the sound-like-anyone-machine demonstrated at the 2018 w:NeurIPS conference.
The video to the right, 'This AI Clones Your Voice After Listening for 5 Seconds' by '2 minute papers' at YouTube, describes the voice-thieving machine presented by Google Research at w:NeurIPS 2018.
In November 2024, Nvidia researchers announced they have made and trained a Foundational Generative Audio Transformer (Opus 1) at fugatto.github.io or Fugatto for short.
The researchers state Fugatto is a versatile audio synthesis and transformation model capable of following free-form text instructions with optional audio inputs. [9]
Documented crimes with digital sound-alikes[edit | edit source]
In 2019 reports of crimes being committed with digital sound-alikes started surfacing. As of Jan 2022 no reports of other types of attack than fraud have been found.
2019 digital sound-alike enabled fraud[edit | edit source]
By 2019 digital sound-like-anyone technology had found its way into the hands of criminals. In 2019 Symantec researchers knew of 3 cases where digital sound-alike technology had been used for w:crime.[10]
Of these crimes the most publicized was a fraud case in March 2019 where 220,000€ were defrauded with the use of a real-time digital sound-alike.[11] The company that was the victim of this fraud had bought some kind of cyberscam insurance from French insurer w:Euler Hermes and the case came to light when Mr. Rüdiger Kirsch of Euler Hermes informed w:The Wall Street Journal about it.[12]
Reporting on the 2019 digital sound-alike enabled fraud
- Fraudsters Used AI to Mimic CEO’s Voice in Unusual Cybercrime Case at wsj.com original reporting, date unknown, updated 2019-08-30[11]
- "Fake voices 'help cyber-crooks steal cash'" at bbc.com July 2019 reporting [13]
- "An artificial-intelligence first: Voice-mimicking software reportedly used in a major theft" at washingtonpost.com documents a w:fraud committed with digital sound-like-anyone-machine, July 2019 reporting.[10]
- A Voice Deepfake Was Used To Scam A CEO Out Of $243,000 at forbes.com, 2019-09-03 reporting[12]
2020 digital sound-alike fraud attempt[edit | edit source]
In June 2020 fraud was attempted with a poor-quality pre-recorded digital sound-alike delivered by voicemail. (Listen to a redacted clip at soundcloud.com) The recipient in a tech company didn't believe the voicemail to be real and alerted the company, and they realized that someone had tried to scam them. The company called in Nisos to investigate the issue. Nisos analyzed the evidence and was certain it was a fake, one with aspects of a cut-and-paste job to it. Nisos prepared a report titled "The Rise of Synthetic Audio Deepfakes" at nisos.com on the issue and shared it with Motherboard, part of w:Vice (magazine), prior to its release.[14]
2021 digital sound-alike enabled fraud[edit | edit source]
The 2nd publicly known fraud done with a digital sound-alike[1st seen in 1] took place on Friday 2021-01-15. A bank in Hong Kong was manipulated to wire money to numerous bank accounts by using a voice stolen from one of their client company's directors. The criminals managed to defraud $35 million of the U.A.E.-based company's money.[15] This case came to light when Forbes saw a document in which the U.A.E. financial authorities were seeking administrative assistance from the US authorities towards recovering a small portion of the defrauded money that had been sent to bank accounts in the USA.[15]
Reporting on the 2021 digital sound-alike enabled fraud
- Fraudsters Cloned Company Director’s Voice In $35 Million Bank Heist, Police Find at forbes.com 2021-10-14 original reporting
- Deepfaked Voice Enabled $35 Million Bank Heist in 2020 at unite.ai[1st seen in 1] reporting updated on 2021-10-15
- USD 35m voice cloning heist at aiaaic.org, October 2021 AIAAIC repository entry
More fraud cases with digital sound-alikes
- They thought loved ones were calling for help. It was an AI scam. at washingtonpost.com, March 2023 reporting
Example of a hypothetical 4-victim digital sound-alike attack[edit | edit source]
A very simple example of a digital sound-alike attack is as follows:
Someone uses a digital sound-alike to call somebody's voicemail from an unknown number and to speak, for example, illegal threats. In this example there are at least two victims:
- Victim #1 - The person whose voice has been stolen into a covert model and a digital sound-alike made from it to frame them for crimes
- Victim #2 - The person to whom the illegal threat is presented in a recorded form by a digital sound-alike that deceptively sounds like victim #1
- Victim #3 - It could also be viewed that victim #3 is our law enforcement systems as they are put to chase after and interrogate the innocent victim #1
- Victim #4 - Our judiciary which prosecutes and possibly convicts the innocent victim #1.
Examples of speech synthesis software not quite able to fool a human yet[edit | edit source]
Some other contenders to create digital sound-alikes exist, though as of 2019 their speech synthesis in most use scenarios does not yet fool a human, because the results contain telltale signs that give them away as a speech synthesizer.
- Lyrebird.ai (listen)
- CandyVoice.com (test with your choice of text)
- Merlin, a w:neural network based speech synthesis system by the Centre for Speech Technology Research at the w:University of Edinburgh
- 'Neural Voice Cloning with a Few Samples' at papers.nips.cc, w:Baidu Research's shot at a sound-like-anyone-machine, did not convince in 2018
Temporal limit of digital sound-alikes[edit | edit source]
The temporal limit of whom, dead or living, the digital sound-alikes can attack is defined by the w:history of sound recording.
The article starts by mentioning that the invention of the w:phonograph by w:Thomas Edison in 1877 is considered the start of sound recording.
The phonautograph is the earliest known device for recording w:sound. Previously, tracings had been obtained of the sound-producing vibratory motions of w:tuning forks and other objects by physical contact with them, but not of actual sound waves as they propagated through air or other media. Invented by Frenchman w:Édouard-Léon Scott de Martinville, it was patented on March 25, 1857.[16]
Apparently, it did not occur to anyone before the 1870s that the recordings, called phonautograms, contained enough information about the sound that they could, in theory, be used to recreate it. Because the phonautogram tracing was an insubstantial two-dimensional line, direct physical playback was impossible in any case. Several phonautograms recorded before 1861 were successfully played as sound in 2008 by optically scanning them and using a computer to process the scans into digital audio files. (Wikipedia)
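The 2008 recovery can be sketched as follows: the optically scanned tracing yields a series of stylus displacement values over time, which then only need to be normalized into digital audio samples for playback. The tracing values and the function below are invented for illustration; the actual restoration work was considerably more involved.

```python
# Toy sketch of turning a scanned phonautogram tracing into audio
# samples. The tracing values are invented; writing a playable .wav
# file from the samples is omitted for brevity.

def trace_to_pcm(tracing, bit_depth=16):
    """Map scanned stylus displacements onto signed PCM sample values."""
    peak = max(abs(value) for value in tracing)
    full_scale = 2 ** (bit_depth - 1) - 1     # 32767 for 16-bit audio
    return [round(value / peak * full_scale) for value in tracing]

# One invented cycle of a scanned tracing, displacement in arbitrary units.
scanned_tracing = [0.0, 0.5, 1.0, 0.5, 0.0, -0.5, -1.0, -0.5]
pcm_samples = trace_to_pcm(scanned_tracing)
```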
What should we do about digital sound-alikes?[edit | edit source]
Living people can defend[footnote 2] themselves against a digital sound-alike by denying the things the digital sound-alike says if they are presented to the target, but dead people cannot. Digital sound-alikes offer criminals new disinformation attack vectors and wreak havoc on provability.
For these reasons the bannable raw materials i.e. covert voice models should be prohibited by law in order to protect humans from abuse by criminal parties.
It is high time to act and to criminalize the covert modeling of human voice!
Text syntheses[edit | edit source]
w:Chatbots and w:spamming have existed for a long time, but only now, armed with AI, are they becoming more deceiving.
In w:natural language processing, development in w:natural-language understanding leads to more cunning w:natural-language generation AI.
w:Large language models (LLM) are very large w:language models consisting of a w:neural network with many parameters.
w:OpenAI's w:Generative Pre-trained Transformer (GPT) is a left-to-right w:transformer (machine learning model)-based text generation model succeeded by w:GPT-2 and w:GPT-3
November 2022 saw the publication of OpenAI's w:ChatGPT, a conversational artificial intelligence.
w:Bard (chatbot) is a conversational w:generative artificial intelligence w:chatbot developed by w:Google, based on the w:LaMDA family of w:large language models. It was developed as a direct response to the rise of w:OpenAI's w:ChatGPT, and was released in March 2023. (Wikipedia)
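The "left-to-right" generation that the GPT family performs can be sketched as a loop that repeatedly appends the model's predicted next token to the text so far. In the toy sketch below a hand-written lookup table stands in for the transformer, so only the loop logic is representative; the table contents are invented.

```python
# Toy sketch of left-to-right (autoregressive) text generation as done
# by GPT-style models. A lookup table replaces the neural network here.

TOY_NEXT_TOKEN = {          # invented next-token table
    "<s>": "the",
    "the": "voice",
    "voice": "is",
    "is": "fake",
    "fake": "</s>",
}

def generate(next_token_model, max_tokens=10):
    """Generate a token sequence one token at a time, left to right."""
    tokens = ["<s>"]
    while len(tokens) < max_tokens:
        nxt = next_token_model.get(tokens[-1], "</s>")
        tokens.append(nxt)
        if nxt == "</s>":
            break
    return tokens[1:-1]     # drop the start and end markers
```

A real large language model conditions each prediction on the whole preceding context, not just the previous token, which is what makes the output so much more cunning than this toy.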
Reporting / announcements (in reverse chronology)
- Reinventing search with a new AI-powered Microsoft Bing and Edge, your copilot for the web at blogs.microsoft.com February 2023 (2023-02-07). The new improved Bing, available only in Microsoft's Edge browser is reportedly based on a language model refined from GPT 3.5.[17]
- New AI classifier for indicating AI-written text at openai.com, a January 2023 blog post about OpenAI's AI classifier for detecting AI-written texts.
- Introducing ChatGPT at openai.com November 2022 (2022-11-30)
- 'A college kid’s fake, AI-generated blog fooled tens of thousands. This is how he made it.' at technologyreview.com August 2020 reporting in the w:MIT Technology Review by Karen Hao about GPT-3.
- 'OpenAI’s latest AI text generator GPT-3 amazes early adopters' at siliconangle.com July 2020 reporting on GPT-3
- OpenAI releases the full version of GPT-2 at openai.com in August 2019
- 'OpenAI releases curtailed version of GPT-2 language model' at venturebeat.com, August 2019 reporting on the original release of the curtailed version of GPT-2
External links
- "Detection of Fake and False News (Text Analysis): Approaches and CNN as Deep Learning Model" at analyticsteps.com, a 2019 summary written by Shubham Panth.
Detectors for synthesized texts[edit | edit source]
Introduction of w:ChatGPT by OpenAI brought the need for software to detect machine-generated texts.
Try AI plagiarism detection for free
- AI Content Detector at contentdetector.ai - AI Content Detector - Detect ChatGPT Plagiarism (try for free)
- AI Text Classifier at platform.openai.com - The AI Text Classifier is a fine-tuned GPT model that predicts how likely it is that a piece of text was generated by AI from a variety of sources, such as ChatGPT. (free account required)
- GPT Radar at gptradar.com - AI text detector app (try for free)[1st seen in 2]
- GPTZero at gptzero.me - The World's #1 AI Detector with over 1 Million Users (try for free)
- Plagiarism Checker at copyleaks.com - Plagiarism Checker by Copyleaks (try for free)[1st seen in 2]
- https://gowinston.ai/ - The most powerful AI content detection solution (free-tier available)[1st seen in 2]
- ZeroGPT at zerogpt.com[1st seen in 2] - GPT-4 And ChatGPT detector by ZeroGPT: detect OpenAI text - ZeroGPT the most Advanced and Reliable Chat GPT and GPT-4 detector tool (try for free)
For-a-fee AI plagiarism detection tools
- https://originality.ai/ - The Most Accurate AI Content Detector and Plagiarism Checker Built for Serious Content Publishers[1st seen in 2]
- https://www.turnitin.com/ - Empower students to do their best, original work[1st seen in 2]
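One signal detectors of this kind reportedly use is that machine-generated text tends to be statistically "flatter" than human text. The crude word-length-variance score below is an invented illustration of that idea only, not any listed vendor's actual method, and the threshold is a made-up placeholder.

```python
# Toy sketch of a flatness-based signal for spotting machine text.
# Word-length variance stands in for the richer statistics real
# detectors compute; all thresholds and examples are invented.

def burstiness(text):
    """Variance of word lengths, a rough proxy for human 'burstiness'."""
    lengths = [len(word) for word in text.split()]
    mean = sum(lengths) / len(lengths)
    return sum((n - mean) ** 2 for n in lengths) / len(lengths)

def looks_machine_generated(text, threshold=2.0):
    """Very flat text is treated as suspicious; the threshold is invented."""
    return burstiness(text) < threshold

flat_text = "aaa bbb ccc ddd eee fff"
bursty_text = "I, on reflection, absolutely disagree entirely"
```

Real detectors combine many such signals and, in the case of model-based classifiers, a trained network, which is why their accuracy varies so much across text types.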
Handwriting syntheses[edit | edit source]
Handwriting syntheses could be used
- Defensively, to hide one's handwriting style from public view
- Offensively, to thieve somebody else's handwriting style
If the handwriting-like synthesis passes human and media forensics testing, it is a digital handwrite-alike.
Here we find a possible risk similar to the one that became a reality when w:speaker recognition systems turned out to be instrumental in the development of digital sound-alikes. After the knowledge needed to recognize a speaker was w:transferred into a generative task in 2018 by Google researchers, we can no longer effectively determine for English speakers which recording is of human origin and which is of machine origin.
Handwriting-like syntheses: w:Recurrent neural networks (RNN) seem to be a popular choice for this task.
- GitHub topic handwriting-synthesis has 29 public repositories as of September 2021.
- GitHub topic handwriting-generation has 21 public repositories as of September 2021.
- Deep imitator: Handwriting calligraphy imitation via deep attention networks' at sciencedirect.com, published in w:Pattern Recognition (journal) in August 2020.
- Scribe - Generating Realistic Handwriting with TensorFlow at greydanus.github.io blog post published on 2016-08-21. Scribe code at github.com
- My Text in Your Handwriting at dl.acm.org, a system from w:University College London published on 2016-05-18 in w:ACM Transactions on Graphics.[1st seen in 3]
- Generating Sequences With Recurrent Neural Networks at arxiv.org by Alex Graves published on 2013-08-04 in Neural and Evolutionary Computing.
- Recurrent neural network handwriting generation demo at cs.toronto.edu is a demonstration site for publication
- Calligrapher.ai - Realistic computer-generated handwriting - The user may control parameters: speed, legibility, stroke width and style. The domain is registered by some organization in Iceland and the website offers no about-page[1st seen in 4]. According to this reddit post Calligrapher.ai is based on Graves' 2013 work, but "adds an w:inference model to allow for sampling latent style vectors (similar to the VAE model used by SketchRNN)".[18]
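In Graves' 2013 approach listed above, the network outputs at each step a mixture of 2-D Gaussians over the next pen offset, and handwriting is produced by sampling from that mixture. The sketch below shows only that sampling step, with fixed invented mixture parameters standing in for the RNN's per-step output.

```python
# Toy sketch of the per-step sampling in Graves-style handwriting
# generation. The mixture weights, means and standard deviation are
# invented stand-ins for what the trained RNN would emit at each step.
import random

def sample_offset(rng, weights, means, stddev):
    """Pick a mixture component, then sample a (dx, dy) pen offset."""
    component = rng.choices(range(len(weights)), weights=weights)[0]
    mean_dx, mean_dy = means[component]
    return (rng.gauss(mean_dx, stddev), rng.gauss(mean_dy, stddev))

rng = random.Random(0)                        # seeded for repeatability
mixture_weights = [0.7, 0.3]                  # stand-in for the RNN output
component_means = [(1.0, 0.0), (0.0, -1.0)]   # rightward stroke vs downstroke
stroke = [sample_offset(rng, mixture_weights, component_means, stddev=0.1)
          for _ in range(5)]
```

Because each offset is sampled rather than predicted deterministically, every generated "handwriting" run comes out slightly different, which is part of what makes such output look human.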
Handwriting recognition
- w:Handwriting recognition (HWR), also known as Handwritten Text Recognition (HTR), is the ability of a computer to receive and interpret intelligible w:handwritten input (Wikipedia)
- w:Intelligent word recognition, or IWR, is the recognition of unconstrained handwritten words.[19] (Wikipedia)
- GitHub topic handwriting-recognition contains 238 repositories as of September 2021.
Singing syntheses[edit | edit source]
As of 2020 the digital sing-alikes may not yet be here, but when we hear a faked singing voice and cannot hear that it is fake, then we will know. An ability to sing does not seem to add much hostile capability compared to the ability to thieve spoken word.
- 'Fast and High-Quality Singing Voice Synthesis System based on Convolutional Neural Networks' at arxiv.org, a 2019 singing voice synthesis technique using w:convolutional neural networks (CNN). Accepted into the 2020 International Conference on Acoustics, Speech, and Signal Processing (ICASSP).
- 'State of art of real-time singing voice synthesis' at compmus.ime.usp.br presented at the 2019 17th Brazilian Symposium on Computer Music
- 'Synthesis and expressive transformation of singing voice' at theses.fr as .pdf a 2017 doctorate thesis by Luc Ardaillon
- 'Synthesis of the Singing Voice by Performance Sampling and Spectral Models' at mtg.upf.edu, a 2007 journal article in the w:IEEE Signal Processing Society's Signal Processing Magazine
- 'Speech-to-Singing Synthesis: Converting Speaking Voices to Singing Voices by Controlling Acoustic Features Unique to Singing Voices' at researchgate.net, a November 2007 paper published in the IEEE conference on Applications of Signal Processing to Audio and Acoustics
Timeline of synthetic human-like fakes[edit | edit source]
See the #SSFWIKI Mediatheque for viewing media that is, or probably is, to do with synthetic human-like fakes.
2020's synthetic human-like fakes[edit | edit source]
- 2023 | Real-time digital look-and-sound-alike crime | In April a man in northern China was defrauded of 4.3 million yuan by a criminal employing a digital look-and-sound-alike pretending to be his friend on a video call made with a stolen messaging service account.[3]
- 2023 | Election meddling with digital look-alikes | The w:2023 Turkish presidential election saw numerous deepfake controversies.
- "Ahead of the election in Turkey, President Recep Tayyip Erdogan showed a video linking his main challenger Kemal Kilicdaroglu to the militant Kurdish organization PKK." [...] "Research by DW's fact-checking team in cooperation with DW's Turkish service shows that the video at the campaign rally was manipulated by combining two separate videos with totally different backgrounds and content." reports dw.com
- 2023 | March 7th | science / demonstration | Microsoft researchers submitted a paper for publication outlining their Cross-lingual neural codec language modeling system at arxiv.org dubbed VALL-E X at microsoft.com, which extends VALL-E's capabilities to be cross-lingual and to maintain the same "emotional tone" from sample to fake.
- 2023 | January 5th | science / demonstration | Microsoft researchers announced VALL-E - Neural Codec Language Models are Zero-Shot Text to Speech Synthesizers (at microsoft.com), which is able to thieve a voice from only 3 seconds of sample and is also able to mimic the "emotional tone" of the sample the synthesis is produced of.[20]
- 2023 | January 1st | Law | Law on sexual offences in Finland 2023 is found in Chapter 20 of the Finnish Criminal Code titled "Seksuaalirikoksista" ("Sexual offences") and came into effect on Sunday 2023-01-01.[21]
The new law in Finland protects adults against sexual image based abuse be it real or synthetic in origin.
Other countries have also woken up to the problems of synthesis crime and have legislated laws against synthesis and other related crimes.
Relevant sections of Chapter 20
- 7 § Non-consensual dissemination of a sexual image criminalizes distribution of unauthorized real and synthetic sexual images without permission. (7 § Seksuaalisen kuvan luvaton levittäminen[21])
- 19 § Distribution of an image depicting a child in a sexual manner [21] criminalizes the distribution of real and synthetic child sexual abuse material (CSAM). Attempting this crime is also punishable. (19 § Lasta seksuaalisesti esittävän kuvan levittäminen[21])
- 20 § Aggravated distribution of an image depicting a child in a sexual manner [21] defines the parameters for aggravated form of the crime of making CSAM available. (20 § Törkeä lasta seksuaalisesti esittävän kuvan levittäminen[21])
- 21 § Possession of an image depicting a child in a sexual manner[21] criminalizes the possession of CSAM and acquiring access with the intent to access CSAM. (21 § Lasta seksuaalisesti esittävän kuvan hallussapito[21])
This 2023 update and consolidation of the Finnish Criminal Code's provisions on sexual offences was made upon the initiative of the 2019-2023 w:Marin Cabinet, was voted into law by the w:Members of the Parliament of Finland, 2019–2023 and came into effect on Sunday 2023-01-01.
Translation to English by the Ministry of Justice: Criminal Code (39/1889) - Chapter 20 - Sexual offences (translation) as .pdf at oikeusministerio.fi (subject to possible revisions)
- 2022 | science and demonstration | w:OpenAI(.com) published w:ChatGPT, a conversational AI accessible with a free account at chat.openai.com. The initial version was published on 2022-11-30.
- 2022 | brief report of counter-measures | Protecting world leaders against deep fakes using facial, gestural, and vocal mannerisms is a brief report by Matyáš Boháček and Hany Farid on their recent work, published on Wednesday 2022-11-23 in w:Proceedings of the National Academy of Sciences of the United States of America (PNAS). 'Protecting world leaders against deep fakes using facial, gestural, and vocal mannerisms' at pnas.org[1]
- 2022 | counter-measure | Detecting deep-fake audio through vocal tract reconstruction is an epic scientific work against fake human-like voices from the w:University of Florida, published to peers in August 2022.
The work Who Are You (I Really Wanna Know)? Detecting Audio DeepFakes Through Vocal Tract Reconstruction at usenix.org (presentation page, version included in the proceedings[7] and slides), by researchers of the Florida Institute for Cybersecurity Research (FICS) at fics.institute.ufl.edu in the w:University of Florida, received funding from the w:Office of Naval Research and was presented on 2022-08-11 at the 31st w:USENIX Security Symposium.
This work was done by PhD student Logan Blue, Kevin Warren, Hadi Abdullah, Cassidy Gibson, Luis Vargas, Jessica O’Dell, Kevin Butler and Professor Patrick Traynor.
The University of Florida Research Foundation Inc has filed for and received a US patent titled 'Detecting deep-fake audio through vocal tract reconstruction', registration number US20220036904A1 (link to patents.google.com), with 20 claims. The patent application was published on Thursday 2022-02-03. The patent was granted on 2023-07-04 and has an adjusted expiration date of Sunday 2041-12-29.
- PhD student Logan Blue and professor Patrick Traynor wrote an article for the general public on the work titled Deepfake audio has a tell – researchers use fluid dynamics to spot artificial imposter voices at theconversation.com[8], published Tuesday 2022-09-20 and permanently w:copylefted under Creative Commons Attribution-NoDerivatives (CC BY-ND). The work was presented to peers in August 2022 and to the general public in September 2022.
- 2022 | disinformation attack | In June 2022 a fake digital look-and-sound-alike in the appearance and voice of w:Vitali Klitschko, mayor of w:Kyiv, held fake video phone calls with several European mayors. The Germans determined that the video phone call was fake by contacting Ukrainian officials. This attempted covert disinformation attack was originally reported by w:Der Spiegel.[22][23]
- 2022 | science | w:DALL-E 2, a successor designed to generate more realistic images at higher resolutions that "can combine concepts, attributes, and styles" was published in April 2022.[24] (Wikipedia)
- 2022 | counter-measure | Protecting President Zelenskyy against deep fakes 'Protecting President Zelenskyy against Deep Fakes' at arxiv.org[25] by Matyáš Boháček of Johannes Kepler Gymnasium and w:Hany Farid, the dean and head of w:Berkeley School of Information at the University of California, Berkeley. This brief paper describes their automated digital look-alike detection system and evaluates its efficacy and reliability in comparison to humans with untrained eyes. Their work provides automated evaluation tools to catch so-called "deep fakes", and their motivation seems to have been to find automation armor against disinformation warfare targeting humans and humanity. Automated digital media forensics is a very good idea explored by many. The Boháček and Farid 2022 detection system works by evaluating both facial mannerisms and gestural mannerisms to tell the non-human fakes from footage that is human in origin. The preprint was published in February 2022 and submitted to w:arXiv in June 2022.
- 2022 | science / review of counter-measures | 'A Review of Modern Audio Deepfake Detection Methods: Challenges and Future Directions' at mdpi.com[26], a review of audio deepfake detection methods by researchers Zaynab Almutairi and Hebah Elgibreen of the w:King Saud University, Saudi Arabia, published in w:Algorithms (journal) by w:MDPI (Multidisciplinary Digital Publishing Institute) on Wednesday 2022-05-04. This article belongs to the Special Issue Commemorative Special Issue: Adversarial and Federated Machine Learning: State of the Art and New Perspectives at mdpi.com
- 2022 | science / counter-measure | 'Attacker Attribution of Audio Deepfakes' at arxiv.org, a pre-print presented at the Interspeech 2022 conference, organized by the w:International Speech Communication Association in Korea, September 18-22 2022.
- 2021 | Science and demonstration | At NeurIPS 2021, held virtually in December, researchers from Nvidia and w:Aalto University presented their paper Alias-Free Generative Adversarial Networks (StyleGAN3) at nvlabs.github.io and an associated implementation in w:PyTorch; the results are deceptively human-like in appearance. StyleGAN3 paper as .pdf at nvlabs-fi-cdn.nvidia.com
- 2021 | Entertainment | The Swedish pop band w:ABBA published an album in September and announced shows where the music is live and real, but the visuals are rejuvenated digital look-alikes of the band members, displayed to the fans with w:holography technology. ABBA used w:Industrial Light & Magic as the purveyor of technology. w:Industrial Light & Magic was acquired by w:The Walt Disney Company in 2012 as part of their acquisition of w:Lucasfilm.
- 2021 | Controversy | July 2021 saw the release of w:Roadrunner: A Film About Anthony Bourdain and soon controversy arose, as the director w:Morgan Neville admitted to w:Helen Rosner, a food writer for w:The New Yorker, that he had contracted an AI company to thieve w:Anthony Bourdain's voice and used it to insert audio that sounded like him, without declaring it as faked.[27][1st seen in 5]
- 2021 | Science | Voice Cloning: a Multi-Speaker Text-to-Speech Synthesis Approach based on Transfer Learning .pdf at arxiv.org, a paper submitted in Feb 2021 by researchers from the w:University of Turin.[1st seen in 6]
- 2021 | crime / fraud | The 2nd publicly known fraud done with a digital sound-alike[1st seen in 1] took place on Friday 2021-01-15. A bank in Hong Kong was manipulated to wire money to numerous bank accounts by using a voice stolen from one of their client company's directors. The fraudsters managed to defraud $35 million of the U.A.E.-based company's money.[15] This case came to light when Forbes saw a document in which the U.A.E. financial authorities were seeking administrative assistance from the US authorities towards recovering a small portion of the defrauded money that had been sent to bank accounts in the USA.[15]
Reporting on the 2021 digital sound-alike enabled fraud
- Fraudsters Cloned Company Director’s Voice In $35 Million Bank Heist, Police Find at forbes.com 2021-10-14 original reporting
- Deepfaked Voice Enabled $35 Million Bank Heist in 2020 at unite.ai[1st seen in 1] reporting updated on 2021-10-15
- USD 35m voice cloning heist at aiaaic.org, October 2021 AIAAIC repository entry
- 2021 | science and demonstration | DALL-E, a w:deep learning model developed by w:OpenAI to generate digital images from w:natural language descriptions, called "prompts" was published in January 2021. DALL-E uses a version of w:GPT-3 modified to generate images. (Adapted from Wikipedia)
- 2020 | counter-measure | The 'AI Incident Database' at incidentdatabase.ai was introduced on 2020-11-18 by the w:Partnership on AI.[28]
- 2020 | Controversy / Public service announcement | Channel 4 thieved the appearance of Queen Elizabeth II using deepfake methods. The product of synthetic human-like fakery originally aired on Channel 4 on 25 December at 15:25 GMT.[29] View in YouTube
- 2020 | reporting | "Deepfake porn is now mainstream. And major sites are cashing in" at wired.co.uk by Matt Burgess. Published August 2020.
- 2020 | demonstration | Moondisaster.org (full film embedded in website) project by the Center for Advanced Virtuality of the w:MIT published in July 2020, makes use of various methods of making a synthetic human-like fake. Alternative place to watch: In Event of Moon Disaster - FULL FILM at youtube.com
- 2020 | US state law | On January 1 2020[30] the w:California w:US state law "AB-602 Depiction of individual using digital or electronic technology: sexually explicit material: cause of action." came into effect in the civil code of the w:California Codes, banning the manufacturing and w:digital distribution of synthetic pornography without the w:consent of the people depicted. AB-602 provides victims of synthetic pornography with w:injunctive relief and poses legal threats of w:statutory and w:punitive damages on w:criminals making or distributing synthetic pornography without consent. The bill AB-602 was signed into law by California w:Governor w:Gavin Newsom on October 3 2019 and was authored by w:California State Assemblymember w:Marc Berman; an identical Senate bill was coauthored by w:California Senator w:Connie Leyva.[31][32] AB602 at trackbill.com
- 2020 | Chinese legislation | On Wednesday January 1 2020 a Chinese law requiring that synthetically faked footage should bear a clear notice about its fakeness came into effect. Failure to comply could be considered a w:crime, the w:Cyberspace Administration of China (cac.gov.cn) stated on its website. China announced this new law in November 2019.[33] The Chinese government seems to be reserving the right to prosecute both users and w:online video platforms failing to abide by the rules.[34]
2010s synthetic human-like fakes
- 2019 | science and demonstration | At the December 2019 NeurIPS conference, a novel AI method for making animated fakes of anything, First Order Motion Model for Image Animation (website at aliaksandrsiarohin.github.io), (paper) (github), was presented.[1st seen in 7]
- Reporting Memers are making deepfakes, and things are getting weird at technologyreview.com, 2020-08-28 by Karen Hao.
- 2019 | demonstration | In September 2019 w:Yle, the Finnish w:public broadcasting company, aired a result of experimental w:journalism, a deepfake of the President in office w:Sauli Niinistö in its main news broadcast for the purpose of highlighting the advancing disinformation technology and problems that arise from it.
- 2019 | US state law | On September 1 2019 the w:Texas Senate bill SB 751 - Relating to the creation of a criminal offense for fabricating a deceptive video with intent to influence the outcome of an election - w:amendments to the election code came into effect in the w:Law of Texas, giving w:candidates in w:elections a 30-day protection period prior to the elections during which making and distributing digital look-alikes or synthetic fakes of the candidates is an offense. The law text defines the subject of the law as "a video, created with the intent to deceive, that appears to depict a real person performing an action that did not occur in reality".[35] SB 751 was introduced to the Senate by w:Bryan Hughes (politician).[36]
- 2019 | US state law | Since July 1 2019[37] w:Virginia has criminalized the sale and dissemination of unauthorized synthetic pornography, but not the manufacture,[38] as section § 18.2-386.2 titled 'Unlawful dissemination or sale of images of another; penalty.' became part of the w:Code of Virginia.
The section § 18.2-386.2. Unlawful dissemination or sale of images of another; penalty. of Virginia is as follows:
A. Any w:person who, with the w:intent to w:coerce, w:harass, or w:intimidate, w:maliciously w:disseminates or w:sells any videographic or still image created by any means whatsoever that w:depicts another person who is totally w:nude, or in a state of undress so as to expose the w:genitals, pubic area, w:buttocks, or female w:breast, where such person knows or has reason to know that he is not w:licensed or w:authorized to disseminate or sell such w:videographic or w:still image is w:guilty of a Class 1 w:misdemeanor.
- For purposes of this subsection, "another person" includes a person whose image was used in creating, adapting, or modifying a videographic or still image with the intent to depict an actual person and who is recognizable as an actual person by the person's w:face, w:likeness, or other distinguishing characteristic.
B. If a person uses w:services of an w:Internet service provider, an electronic mail service provider, or any other information service, system, or access software provider that provides or enables computer access by multiple users to a computer server in committing acts prohibited under this section, such provider shall not be held responsible for violating this section for content provided by another person.
C. Venue for a prosecution under this section may lie in the w:jurisdiction where the unlawful act occurs or where any videographic or still image created by any means whatsoever is produced, reproduced, found, stored, received, or possessed in violation of this section.
D. The provisions of this section shall not preclude prosecution under any other w:statute.[38]
The identical bills were House Bill 2678, presented by w:Delegate w:Marcus Simon to the w:Virginia House of Delegates on January 14 2019, and Senate bill 1736, introduced to the w:Senate of Virginia by Senator w:Adam Ebbin three days later.
- 2019 | Science | Sample Efficient Adaptive Text-to-Speech .pdf at arxiv.org, a 2019 paper from Google researchers, published as a conference paper at w:International Conference on Learning Representations (ICLR)[1st seen in 6]
- 2019 | science and demonstration | 'Speech2Face: Learning the Face Behind a Voice' at arXiv.org, a system for generating likely facial features based on the voice of a person, presented by the w:MIT Computer Science and Artificial Intelligence Laboratory at the 2019 w:CVPR. Speech2Face at github.com This may develop into something that really causes problems. "Speech2Face: Neural Network Predicts the Face Behind a Voice" reporting at neurohive.io, "Speech2Face Sees Voices and Hears Faces: Dreams Come True with AI" reporting at belitsoft.com
- 2019 | crime | w:Fraud with digital sound-alike technology surfaced in 2019. See 'An artificial-intelligence first: Voice-mimicking software reportedly used in a major theft', a 2019 Washington Post article or 'A Voice Deepfake Was Used To Scam A CEO Out Of $243,000' at Forbes.com (2019-09-03)
- 2019 | demonstration | 'Which Face is real?' at whichfaceisreal.com is an easily unnerving game by Carl Bergstrom and Jevin West where you need to try to distinguish from a pair of photos which is real and which is not. A part of the "tools" of the Calling Bullshit course taught at the w:University of Washington. Relevancy: certain
- 2019 | demonstration | 'Thispersondoesnotexist.com' (since February 2019) by Philip Wang. It showcases a w:StyleGAN at the task of making an endless stream of pictures that look like no-one in particular, but are eerily human-like. Relevancy: certain
- 2019 | action | w:Nvidia w:open sources w:StyleGAN, a novel w:generative adversarial network.[39]
- 2018 | counter-measure | In September 2018 Google added “involuntary synthetic pornographic imagery” to its ban list, allowing anyone to request the search engine block results that falsely depict them as “nude or in a sexually explicit situation.”[40] Information on removing involuntary fake pornography from Google at support.google.com if it shows up in Google and the form to request removing involuntary fake pornography at support.google.com, select "I want to remove: A fake nude or sexually explicit picture or video of myself"
- 2018 | science and demonstration | The work 'Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis' (at arXiv.org) was presented at the 2018 w:Conference on Neural Information Processing Systems (NeurIPS). The pre-trained model is able to steal voices from a sample of only 5 seconds with almost convincing results.
- 2018 | science | Progressive Growing of GANs for Improved Quality, Stability, and Variation at arxiv.org (.pdf), colloquially known as ProGAN, was presented by Nvidia researchers at the 2018 w:International Conference on Learning Representations (ICLR).
- 2018 | demonstration | At the 2018 w:World Internet Conference in w:Wuzhen the w:Xinhua News Agency presented two digital look-alikes made to the resemblance of its real news anchors Qiu Hao (w:Chinese language)[41] and Zhang Zhao (w:English language). The digital look-alikes were made in conjunction with w:Sogou.[42] Neither the w:speech synthesis used nor the gesturing of the digital look-alike anchors were good enough to deceive the watcher to mistake them for real humans imaged with a TV camera.
- 2018 | action | Deep Fakes letter to the Office of the Director of National Intelligence at schiff.house.gov, a letter sent to the w:Director of National Intelligence on 2018-09-13 by congresspeople w:Adam Schiff, w:Stephanie Murphy and w:Carlos Curbelo requesting that a report be compiled on the synthetic human-like fakes situation, the threats it poses and the possible solutions.[1st seen in 8]
- 2018 | controversy / demonstration | The w:deepfakes controversy surfaced, in which porn videos were doctored utilizing w:deep machine learning so that the face of the actress was replaced by the software's estimate of what another person's face would look like in the same pose and lighting.
- 2017 | science | 'Synthesizing Obama: Learning Lip Sync from Audio' at grail.cs.washington.edu. At SIGGRAPH 2017 Supasorn Suwajanakorn et al. of the w:University of Washington presented an audio-driven digital look-alike of the upper torso of Barack Obama. It was driven only by a voice track as source data for the animation, after the training phase to acquire w:lip sync and wider facial information from w:training material consisting of 2D videos with audio had been completed.[43] Relevancy: certain
- 2016 | movie | w:Rogue One is a Star Wars film for which digital look-alikes of actors w:Peter Cushing and w:Carrie Fisher were made. In the film their appearance appears to be of the same age as the actors were during the filming of the original 1977 w:Star Wars (film).
- 2016 | science / demonstration | w:DeepMind's w:WaveNet, owned by w:Google, also demonstrated the ability to steal people's voices.
- 2016 | science and demonstration | w:Adobe Inc. publicly demonstrates w:Adobe Voco, a sound-like-anyone machine: '#VoCo. Adobe Audio Manipulator Sneak Peak with Jordan Peele | Adobe Creative Cloud' on Youtube. The original Adobe Voco required 20 minutes of sample audio to thieve a voice. Relevancy: certain.
- 2016 | science | 'Face2Face: Real-time Face Capture and Reenactment of RGB Videos' at Niessnerlab.org A paper (with videos) on the semi-real-time 2D video manipulation with gesture forcing and lip sync forcing synthesis by Thies et al., Stanford. Relevancy: certain
- 2016 | music video | 'Plug' by Kube at youtube.com - A 2016 music video by w:Kube (rapper) (w:fi:Kube), that shows deepfake-like technology this early. Video was uploaded on 2016-09-15 and is directed by Faruk Nazeri.
- 2015 | Science | 'Deep Learning Face Attributes in the Wild' at arxiv.org presented at the 2015 w:International Conference on Computer Vision
- 2015 | movie | In w:Furious 7, a digital look-alike of the actor w:Paul Walker, who died in an accident during the filming, was made by w:Weta Digital to enable the completion of the film.[44]
- 2014 | science | w:Ian Goodfellow et al. presented the principles of a w:generative adversarial network. GANs made the headlines in early 2018 with the w:deepfakes controversies.
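The adversarial principle named above can be illustrated with a few lines of arithmetic. This is a hedged sketch of the 2014 objective in its common binary cross-entropy form, not code from the paper: the discriminator D is trained to score real samples high and generated samples low, while the generator is trained (in the non-saturating variant) to make D score its output high.

```python
import numpy as np

def discriminator_loss(d_real, d_fake):
    """Discriminator objective: maximize log D(x) + log(1 - D(G(z))),
    written here as a loss to be minimized (its negation)."""
    return -(np.log(d_real) + np.log(1.0 - d_fake)).mean()

def generator_loss(d_fake):
    """Non-saturating generator objective from Goodfellow et al. 2014:
    maximize log D(G(z)), i.e. minimize -log D(G(z))."""
    return -np.log(d_fake).mean()

# At the theoretical equilibrium the discriminator outputs 0.5 everywhere,
# so its loss settles at log 4 ≈ 1.386.
d_half = np.array([0.5])
print(discriminator_loss(d_half, d_half))  # ≈ 1.386
```

The tug-of-war is visible in the signs: a fake that fools the discriminator (D(G(z)) near 1) raises the discriminator's loss while lowering the generator's, which is why training the two networks against each other pushes the generated samples toward the real data distribution.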
- 2013 | demonstration | At the 2013 SIGGRAPH, w:Activision and USC presented "Digital Ira", a w:real time computing digital face look-alike of Ari Shapiro, an ICT USC research scientist,[45] utilizing the USC light stage X by Ghosh et al. for both reflectance field and motion capture.[46] The end result, both precomputed and rendered in real time on a then-modern game w:GPU, was shown and looks fairly realistic.
- 2013 | demonstration | 'Scanning and Printing a 3D Portrait of President Barack Obama' at ict.usc.edu. A 3D model and a 3D bust were made of President Obama with his consent. Relevancy: certain
- 2011 | Law in Finland | Distribution, attempted distribution and possession of synthetic CSAM were criminalized on Wednesday 2011-06-01, upon the initiative of the w:Vanhanen II Cabinet. These protections against CSAM were moved into 19 §, 20 § and 21 § of Chapter 20 when the law on sexual offences in Finland was improved and gathered into Chapter 20 in 2023 upon the initiative of the w:Marin Cabinet.
2000s synthetic human-like fakes
- 2010 | movie | w:Walt Disney Pictures released a sci-fi sequel entitled w:Tron: Legacy with a digitally rejuvenated digital look-alike made of the actor w:Jeff Bridges playing the w:antagonist w:CLU.
- 2009 | movie | A digital look-alike of a younger w:Arnold Schwarzenegger was made for the movie w:Terminator Salvation though the end result was critiqued as unconvincing. Facial geometry was acquired from a 1984 mold of Schwarzenegger.
- 2009 | demonstration | Paul Debevec: 'Animating a photo-realistic face' at ted.com. Debevec et al. presented new digital likenesses, made by w:Image Metrics, this time of actress w:Emily O'Brien, whose reflectance was captured with the USC light stage 5. At 00:04:59 you can see two clips, one with the real Emily shot with a real camera and one with a digital look-alike of Emily shot with a simulation of a camera - which is which is difficult to tell. Bruce Lawmen was scanned using USC light stage 6 in a still position and was also recorded running there on a w:treadmill. Many digital look-alikes of Bruce are seen running fluently and looking natural in the ending sequence of the TED talk video.[47] The motion looks fairly convincing contrasted to the clunky run in the w:Animatrix: Final Flight of the Osiris, which was w:state-of-the-art in 2003, if photorealism was the intention of the w:animators.
- 2004 | movie | The w:Spider-man 2 (and w:Spider-man 3, 2007) films. Relevancy: The films include a digital look-alike made of actor w:Tobey Maguire by w:Sony Pictures Imageworks.[48]
- 2003 | short film | w:The Animatrix: Final Flight of the Osiris, a w:state-of-the-art attempt at human likeness that does not quite fool the watcher, made by w:Square Pictures.
- 2003 | movie(s) | The w:Matrix Reloaded and w:Matrix Revolutions films. Relevancy: First public display of digital look-alikes that are virtually indistinguishable from the real actors. 'Universal Capture - Image-based Facial Animation for "The Matrix Reloaded"' at researchgate.net (2003)
- 2002 | music video | 'Bullet' by Covenant on Youtube by w:Covenant (band) from their album w:Northern Light (Covenant album). Relevancy: Contains the best upper-torso digital look-alike of Eskil Simonsson (vocalist) that their organization could procure at the time. Here you can observe the classic "skin looks like cardboard"-bug (assuming this was not intended) that thwarted efforts to make digital look-alikes that pass human testing before the reflectance capture and dissection in 1999 by w:Paul Debevec et al. at the w:University of Southern California and the subsequent development of the "Analytical w:BRDF" (quote-unquote) by ESC Entertainment, a company set up for the sole purpose of making the cinematography for the 2003 films Matrix Reloaded and Matrix Revolutions possible, led by George Borshukov.
1990s synthetic human-like fakes
- 1999 | science | 'Acquiring the reflectance field of a human face' paper at dl.acm.org. w:Paul Debevec et al. of w:USC did the first known reflectance capture of the human face with their extremely simple w:light stage. They presented their method and results at w:SIGGRAPH 2000. The scientific breakthrough required finding the w:subsurface light component (the simulation models are glowing from within slightly), which can be found using the knowledge that light reflected from the oil-to-air layer retains its w:Polarization (waves) while the subsurface light loses its polarization. Thus, equipped only with a movable light source, a movable video camera, 2 polarizers and a computer program doing extremely simple math, the last piece required to reach photorealism was acquired.[5]
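The "extremely simple math" behind the separation can be sketched as follows. This is a hypothetical illustration of the polarization-difference idea, not Debevec et al.'s actual code, and the pixel values are made up: the cross-polarized capture contains only the depolarized subsurface light, while the parallel-polarized capture contains subsurface plus the polarization-preserving specular reflection, so a subtraction isolates the specular component.

```python
import numpy as np

# Hypothetical 2x2 grayscale captures of the same skin patch under the
# same light, differing only in the analyzer polarizer's orientation.
img_parallel = np.array([[0.9, 0.7],
                         [0.8, 0.6]])  # specular + subsurface light
img_cross    = np.array([[0.5, 0.4],
                         [0.6, 0.5]])  # subsurface light only

# Light bouncing off the oil-to-air layer keeps its polarization and is
# blocked by the crossed analyzer, so the difference is the specular part.
specular   = img_parallel - img_cross
subsurface = img_cross

print(specular)
```

Having the specular (surface oil) layer and the glowing subsurface layer as separate images is what lets a renderer relight each with its own model, which is the piece the text credits with reaching photorealism.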
- 1999 | institute founded | The w:Institute for Creative Technologies was founded by the w:United States Army at the w:University of Southern California. It collaborates with the w:United States Army Futures Command, w:United States Army Combat Capabilities Development Command, w:Combat Capabilities Development Command Soldier Center and w:United States Army Research Laboratory.[49] In 2016 w:Hao Li was appointed to direct the institute.
- 1997 | technology / science | 'Video rewrite: Driving visual speech with audio' at www2.eecs.berkeley.edu[2][1st seen in 9] Christoph Bregler, Michele Covell and Malcolm Slaney presented their work at the ACM SIGGRAPH 1997. Download video evidence of Video Rewrite: Driving visual speech with audio, Bregler et al. 1997, from dropbox.com, view the author's site at chris.bregler.com, paper at dl.acm.org, paper at researchgate.net
- 1994 | movie | w:The Crow (1994 film) was the first film production to make use of w:digital compositing of a computer simulated representation of a face onto scenes filmed using a w:body double. Necessity was the muse, as the actor w:Brandon Lee portraying the protagonist was tragically killed accidentally on set.
1970s synthetic human-like fakes
- 1976 | movie | w:Futureworld reused parts of A Computer Animated Hand on the big screen.
- 1972 | entertainment | 'A Computer Animated Hand' on Vimeo. w:A Computer Animated Hand by w:Edwin Catmull and w:Fred Parke. Relevancy: This was the first time that w:computer-generated imagery was used in film to animate moving human-like appearance.
- 1971 | science | 'Images de synthèse : palme de la longévité pour l’ombrage de Gouraud' (still photos). w:Henri Gouraud (computer scientist) made the first w:Computer graphics w:geometry w:digitization and representation of a human face. The model was his wife, Sylvie Gouraud. The 3D model was a simple w:wire-frame model and he applied w:Gouraud shading to produce the first known representation of human-likeness on computer.[50]
1960s synthetic human-like fakes
- 1961 | demonstration | The first singing by a computer was performed by an w:IBM 704 and the song was w:Daisy Bell, written in 1892 by British songwriter w:Harry Dacre. Go to Mediatheque#1961 to view.
1930s synthetic human-like fakes
- 1939 | demonstration | w:Voder (Voice Operating Demonstrator) from the w:Bell Telephone Laboratory was the first time that w:speech synthesis was done electronically by breaking it down into its acoustic components. It was invented by w:Homer Dudley in 1937–1938, building on his earlier work on the w:vocoder. (Wikipedia)
1770s synthetic human-like fakes
- 1791 | science | The Speaking Machine of w:Wolfgang von Kempelen of w:Pressburg, w:Hungary, described in a 1791 paper, was w:bellows-operated.[51] This machine added models of the tongue and lips, enabling it to produce w:consonants as well as w:vowels. (based on w:Speech synthesis#History)
- 1779 | science / discovery | w:Christian Gottlieb Kratzenstein won the first prize in a competition announced by the w:Russian Academy of Sciences for models he built of the human w:vocal tract that could produce the five long w:vowel sounds.[52] (Based on w:Speech synthesis#History)
Footnotes
- ↑ It is terminologically more precise, more inclusive and more useful to talk about 'terroristic synthetic pornography', if we want to talk about things with their real names, than 'synthetic rape porn', because synthesizing recordings of consensual-looking sex scenes can also be terroristic in intent.
- ↑ Whether a suspect can defend against faked synthetic speech that sounds like him/her depends on how up-to-date the judiciary is. If no information and instructions about digital sound-alikes have been given to the judiciary, they likely will not believe the defense of denying that the recording is of the suspect's voice.
Contact information of organizations
Please contact these organizations and tell them to work harder against the disinformation weapons
1st seen in
- ↑ 1.0 1.1 1.2 1.3 https://www.reddit.com/r/VocalSynthesis/
- ↑ 2.0 2.1 2.2 2.3 2.4 2.5 https://wordlift.io/blog/en/best-plagiarism-checkers-for-ai-generated-content/
- ↑ https://www.ucl.ac.uk/news/2016/aug/new-computer-programme-replicates-handwriting via Google search for "ai handwriting generator"
- ↑ https://seanvasquez.com/handwriting-generation redirects to Calligrapher.ai - seen in https://www.reddit.com/r/MachineLearning/comments/gh9cbg/p_generate_handwriting_with_an_inbrowser/
- ↑ Witness newsletter I subscribed to at https://www.witness.org/get-involved/
- ↑ 6.0 6.1 https://www.connectedpapers.com/main/8fc09dfcff78ac9057ff0834a83d23eb38ca198a/Transfer-Learning-from-Speaker-Verification-to-Multispeaker-TextToSpeech-Synthesis/graph
- ↑ https://www.technologyreview.com/2020/08/28/1007746/ai-deepfakes-memes/
- ↑ 'US Lawmakers: AI-Generated Fake Videos May Be a Security Threat' at uk.pcmag.com, 2018-09-13 reporting by Michael Kan
- ↑ 'Protecting President Zelenskyy against Deep Fakes' https://arxiv.org/pdf/2206.12043.pdf
References
- ↑ 1.0 1.1 Boháček, Matyáš; Farid, Hany (2022-11-23). "Protecting world leaders against deep fakes using facial, gestural, and vocal mannerisms". w:Proceedings of the National Academy of Sciences of the United States of America. 119 (48). doi:10.1073/pnas.2216035119. Retrieved 2023-01-05.
- ↑ 2.0 2.1 Bregler, Christoph; Covell, Michele; Slaney, Malcolm (1997-08-03). "Video Rewrite: Driving Visual Speech with Audio" (PDF). SIGGRAPH '97: Proceedings of the 24th annual conference on Computer graphics and interactive techniques: 353–360. doi:10.1145/258734.258880. Retrieved 2022-09-09.
- ↑ 3.0 3.1 "'Deepfake' scam in China fans worries over AI-driven fraud". w:Reuters.com. w:Reuters. 2023-05-22. Retrieved 2023-06-05.
- ↑
"You Won't Believe What Obama Says In This Video!". w:YouTube. w:BuzzFeed. 2018-04-17. Retrieved 2022-01-05.
We're entering an era in which our enemies can make anyone say anything at any point in time.
- ↑ 5.0 5.1 Debevec, Paul (2000). "Acquiring the reflectance field of a human face". Proceedings of the 27th annual conference on Computer graphics and interactive techniques - SIGGRAPH '00. ACM. pp. 145–156. doi:10.1145/344779.344855. ISBN 978-1581132083. Retrieved 2020-06-27.
- ↑ 6.0 6.1 "Inventing Entertainment: The Early Motion Pictures and Sound Recordings of the Edison Companies". Memory.loc.gov. w:Library of Congress. Retrieved 2020-12-09.
- ↑ 7.0 7.1 Blue, Logan; Warren, Kevin; Abdullah, Hadi; Gibson, Cassidy; Vargas, Luis; O’Dell, Jessica; Butler, Kevin; Traynor, Patrick (August 2022). "Detecting deep-fake audio through vocal tract reconstruction". Proceedings of the 31st USENIX Security Symposium: 2691–2708. ISBN 978-1-939133-31-1. Retrieved 2022-10-06.
- ↑ 8.0 8.1
Blue, Logan; Traynor, Patrick (2022-09-20). "Deepfake audio has a tell – researchers use fluid dynamics to spot artificial imposter voices". theconversation.com. w:The Conversation (website). Retrieved 2022-10-05.
By estimating the anatomy responsible for creating the observed speech, it’s possible to identify whether the audio was generated by a person or a computer.
- ↑ https://research.nvidia.com/publication/2024-11_fugatto-1-foundational-generative-audio-transformer-opus-1
- ↑ 10.0 10.1
Harwell, Drew (2020-04-16). "An artificial-intelligence first: Voice-mimicking software reportedly used in a major theft". w:washingtonpost.com. w:Washington Post. Retrieved 2019-07-22.
Researchers at the cybersecurity firm Symantec said they have found at least three cases of executives’ voices being mimicked to swindle companies. Symantec declined to name the victim companies or say whether the Euler Hermes case was one of them, but it noted that the losses in one of the cases totaled millions of dollars.
- ↑ 11.0 11.1 Stupp, Catherine (2019-08-30). "Fraudsters Used AI to Mimic CEO's Voice in Unusual Cybercrime Case". w:wsj.com. w:The Wall Street Journal. Retrieved 2022-01-01.
- ↑ 12.0 12.1
Damiani, Jesse (2019-09-03). "A Voice Deepfake Was Used To Scam A CEO Out Of $243,000". w:Forbes.com. w:Forbes. Retrieved 2022-01-01.
According to a new report in The Wall Street Journal, the CEO of an unnamed UK-based energy firm believed he was on the phone with his boss, the chief executive of the firm’s German parent company, when he followed the orders to immediately transfer €220,000 (approx. $243,000) to the bank account of a Hungarian supplier. In fact, the voice belonged to a fraudster using AI voice technology to spoof the German chief executive. Rüdiger Kirsch of Euler Hermes Group SA, the firm’s insurance company, shared the information with WSJ.
- ↑ "Fake voices 'help cyber-crooks steal cash'". w:bbc.com. w:BBC. 2019-07-08. Retrieved 2020-07-22.
- ↑ Franceschi-Bicchierai, Lorenzo (2020-07-23). "Listen to This Deepfake Audio Impersonating a CEO in Brazen Fraud Attempt". w:Vice.com. w:Vice (magazine). Retrieved 2022-01-03.
- ↑ 15.0 15.1 15.2 15.3 https://www.forbes.com/sites/thomasbrewster/2021/10/14/huge-bank-fraud-uses-deep-fake-voice-tech-to-steal-millions/
- ↑ Flatow, Ira (2008-04-04). "1860 'Phonautograph' Is Earliest Known Recording". NPR. Retrieved 2012-12-09.
- ↑ https://www.theverge.com/2023/2/7/23587454/microsoft-bing-edge-chatgpt-ai
- ↑ https://www.reddit.com/r/MachineLearning/comments/gh9cbg/p_generate_handwriting_with_an_inbrowser/
- ↑ "What is IWR? (Intelligent Word Recognition)". eFileCabinet. 2016-01-04. Retrieved 2021-09-21.
- ↑
Edwards, Benj (2023-01-10). "Microsoft's new AI can simulate anyone's voice with 3 seconds of audio". w:Arstechnica.com. Arstechnica. Retrieved 2023-05-05.
For the paper's conclusion, they write: "Since VALL-E could synthesize speech that maintains speaker identity, it may carry potential risks in misuse of the model, such as spoofing voice identification or impersonating a specific speaker. To mitigate such risks, it is possible to build a detection model to discriminate whether an audio clip was synthesized by VALL-E. We will also put Microsoft AI Principles into practice when further developing the models."
- ↑ 21.0 21.1 21.2 21.3 21.4 21.5 21.6 21.7
Authoritative up-to-date version of the Criminal Code chapter 20 On sexual offences can always be found at finlex.fi
Translation to English by the Ministry of Justice: Criminal Code (39/1889) - Chapter 20 - Sexual offences (translation) as .pdf at oikeusministerio.fi (subject to possible revisions)
- ↑ https://www.theguardian.com/world/2022/jun/25/european-leaders-deepfake-video-calls-mayor-of-kyiv-vitali-klitschko
- ↑ https://www.dw.com/en/vitali-klitschko-fake-tricks-berlin-mayor-in-video-call/a-62257289
- ↑ "DALL·E 2". OpenAI. Retrieved 2023-04-22.
- ↑ Boháček, Matyáš; Farid, Hany (2022-06-14). "Protecting President Zelenskyy against Deep Fakes". arXiv:2206.12043 [cs.CV].
- ↑
Almutairi, Zaynab; Elgibreen, Hebah (2022-05-04). "A Review of Modern Audio Deepfake Detection Methods: Challenges and Future Directions". w:Algorithms (journal). doi:10.3390/a15050155. Retrieved 2022-10-18.
- ↑
Rosner, Helen (2021-07-15). "A Haunting New Documentary About Anthony Bourdain". w:The New Yorker. Retrieved 2021-08-25.
- ↑ https://www.partnershiponai.org/aiincidentdatabase/
- ↑ https://www.bbc.com/news/technology-55424730
- ↑ Johnson, R.J. (2019-12-30). "Here Are the New California Laws Going Into Effect in 2020". KFI. iHeartMedia. Retrieved 2021-01-23.
- ↑ "AB 602 - California Assembly Bill 2019-2020 Regular Session - Depiction of individual using digital or electronic technology: sexually explicit material: cause of action". openstates.org. openstates.org. Retrieved 2021-03-24.
- ↑ Mihalcik, Carrie (2019-10-04). "California laws seek to crack down on deepfakes in politics and porn". w:cnet.com. w:CNET. Retrieved 2021-01-23.
- ↑ "China seeks to root out fake news and deepfakes with new online content rules". w:Reuters.com. w:Reuters. 2019-11-29. Retrieved 2021-01-23.
- ↑ Statt, Nick (2019-11-29). "China makes it a criminal offense to publish deepfakes or fake news without disclosure". w:The Verge. Retrieved 2021-01-23.
- ↑
"Relating to the creation of a criminal offense for fabricating a deceptive video with intent to influence the outcome of an election". w:Texas. 2019-06-14. Retrieved 2021-01-23.
In this section, "deep fake video" means a video, created with the intent to deceive, that appears to depict a real person performing an action that did not occur in reality
- ↑ https://capitol.texas.gov/BillLookup/History.aspx?LegSess=86R&Bill=SB751
- ↑ "New state laws go into effect July 1".
- ↑ 38.0 38.1 "§ 18.2-386.2. Unlawful dissemination or sale of images of another; penalty". w:Virginia. Retrieved 2021-01-23.
- ↑ "NVIDIA Open-Sources Hyper-Realistic Face Generator StyleGAN". Medium.com. 2019-02-09. Retrieved 2020-07-13.
- ↑
Harwell, Drew (2018-12-30). "Fake-porn videos are being weaponized to harass and humiliate women: 'Everybody is a potential target'". w:The Washington Post. Retrieved 2020-07-13.
In September [of 2018], Google added “involuntary synthetic pornographic imagery” to its ban list
- ↑ Kuo, Lily (2018-11-09). "World's first AI news anchor unveiled in China". Retrieved 2020-07-13.
- ↑ Hamilton, Isobel Asher (2018-11-09). "China created what it claims is the first AI news anchor — watch it in action here". Retrieved 2020-07-13.
- ↑ Suwajanakorn, Supasorn; Seitz, Steven; Kemelmacher-Shlizerman, Ira (2017), Synthesizing Obama: Learning Lip Sync from Audio, University of Washington, retrieved 2020-07-13
- ↑ Giardina, Carolyn (2015-03-25). "'Furious 7' and How Peter Jackson's Weta Created Digital Paul Walker". The Hollywood Reporter. Retrieved 2020-07-13.
- ↑ ReForm - Hollywood's Creating Digital Clones (YouTube). The Creators Project. Retrieved 2020-07-13.
- ↑ Debevec, Paul. "Digital Ira SIGGRAPH 2013 Real-Time Live". Retrieved 2017-07-13.
- ↑ In this TED talk video at 00:04:59 you can see two clips: one of the real Emily shot with a real camera, and one of a digital look-alike of Emily shot with a simulation of a camera. It is difficult to tell which is which. Bruce Lawmen was scanned using USC light stage 6 in a still position and was also recorded running on a w:treadmill. Many digital look-alikes of Bruce are seen running fluently and looking natural in the ending sequence of the TED talk video.
- ↑ Pighin, Frédéric. "Siggraph 2005 Digital Face Cloning Course Notes" (PDF). Retrieved 2020-06-26.
- ↑ https://ict.usc.edu/about/
- ↑ "Images de synthèse : palme de la longévité pour l'ombrage de Gouraud".
- ↑ Mechanismus der menschlichen Sprache nebst der Beschreibung seiner sprechenden Maschine ("Mechanism of the human speech with description of its speaking machine", J. B. Degen, Wien).
- ↑ History and Development of Speech Synthesis, Helsinki University of Technology. Retrieved 2006-11-04.