Editing Synthetic human-like fakes

When the [[Glossary#No camera|camera does not exist]], but the subject being imaged with a simulation of a (movie) camera deceives the watcher to believe it is some living or dead person it is a '''digital look-alike'''.

When it cannot be determined by human testing whether some fake voice is a synthetic fake of some person's voice, or is it an actual recording made of that person's actual real voice, it is a '''digital sound-alike'''. 


[[File:BlV1999-morphable-model-till-match-low-res-rip.png|thumb|right|460px|Image 2 (low resolution rip) 
<br/><br/>(1) Sculpting a morphable model to one single picture 
<br/><br/>(2) Produces 3D approximation 
<br/><br/>(4) Texture capture 
<br/><br/>(3) The 3D model is rendered back to the image with weight gain 
<br/><br/>(5) With weight loss 
<br/><br/>(6) Looking annoyed 
<br/><br/>(7) Forced to smile 

Image 2 by Blanz and Vettel – Copyright ACM 1999 – http://dl.acm.org/citation.cfm?doid=311535.311556 – Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page.]]

== Digital look-alikes ==

{{#ev:vimeo|16292363|640px|right|''[[w:A Computer Animated Hand|A Computer Animated Hand]]'' is a 1972 short film by [[w:Edwin Catmull|Edwin Catmull]] and [[w:Fred Parke|Fred Parke]]. This was the first time that [[w:computer-generated imagery|computer-generated imagery]] was used in film to animate likenesses of moving human appearance.}}

=== Introduction to digital look-alikes ===

[[File:The-diffuse-reflection-deducted-from-the-specular-reflection-Debevec-2000.png|thumb|right|260px|Subtraction of the diffuse reflection from the specular reflection yields the specular component of the model's reflectance.  
<br /><br />
[[:File:Deb-2000-reflectance-separation.png|Original picture]]  by [[w:Paul Debevec|Debevec]] et al. - Copyright ACM 2000 https://dl.acm.org/citation.cfm?doid=311779.344855]]

In the cinemas we have seen digital look-alikes for over 15 years. These digital look-alikes have "clothing" (a simulation of clothing is not clothing) or "superhero costumes" and "superbaddie costumes", and they don't need to care about the laws of physics, let alone laws of physiology. It is generally accepted that digital look-alikes made their public debut in the sequels of The Matrix i.e. [[w:The Matrix Reloaded]] and [[w:The Matrix Revolutions]] released in 2003. It can be considered almost certain, that it was not possible to make these before the year 1999, as the final piece of the puzzle to make a (still) digital look-alike that passes human testing, the [[Glossary#Reflectance capture|reflectance capture]] over the human face, was made for the first time in 1999 at the [[w:University of Southern California]] and was presented to the crème de la crème 
of the computer graphics field in their annual gathering SIGGRAPH 2000.<ref name="Deb2000">
{{cite book
  | last = Debevec
  | first = Paul
  | author-link = Paul Debevec
  | chapter = Acquiring the reflectance field of a human face
  | journal =
  | pages = 145–156
 | publisher = ACM
  | year = 2000
  | chapter-url = http://dl.acm.org/citation.cfm?id=344855 
  | chapter-format =
  | doi = 10.1145/344779.344855
  | accessdate =  2017-05-24| title = Proceedings of the 27th annual conference on Computer graphics and interactive techniques - SIGGRAPH '00
 | isbn = 978-1581132083
 }}</ref>


{{Q|Do you think that was [[w:Hugo Weaving|Hugo Weaving]]'s left cheekbone that [[w:Keanu Reeves|Keanu Reeves]] punched in with his right fist?|Trad|The Matrix Revolutions}}

=== The problems with digital look-alikes ===
[[File:Deb-2000-reflectance-separation.png|thumb|360px|right|Image 1: Separating specular and diffuse reflected light

<br/> <br />

(a) Normal image in dot lighting
<br/><br/> 
(b) Image of the diffuse reflection which is caught by placing a vertical polarizer in front of the light source and a horizontal in the front the camera
<br/><br/> 
(c) Image of the highlight specular reflection which is caught by placing both polarizers vertically
<br/><br/> 
(d) Subtraction of c from b, which yields the specular component
<br/><br/> 
Images are scaled to seem to be the same luminosity.
<br/><br/> 
Original image by Debevec et al. – Copyright ACM 2000 – https://dl.acm.org/citation.cfm?doid=311779.344855 – Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page.]]
Extremely unfortunately for the humankind, organized criminal leagues, that posses the '''weapons capability''' of making believable looking '''synthetic pornography''', are producing on industrial production pipelines '''synthetic terror porn'''<ref group="footnote" name="About the term synthetic terror porn">It is terminologically more precise, more inclusive and more useful to talk about 'synthetic terror porn', if we want to talk about things with their real names, than 'synthetic rape porn', because also synthesizing recordings of consentual looking sex scenes can be terroristic in intent.</ref> by animating digital look-alikes and distributing it in the murky Internet in exchange for money stacks that are getting thinner and thinner as time goes by.

These industrially produced pornographic delusions are causing great humane suffering, especially in their direct victims, but they are also tearing our communities and societies apart, sowing blind rage, perceptions of deepening chaos, feelings of powerlessness and provoke violence. This '''hate illustration''' increases and strengthens hate thinking, hate speech, hate crimes and tears our fragile social constructions apart and with time perverts humankind's view of humankind into an almost unrecognizable shape, unless we interfere with resolve.

For these reasons the bannable '''raw materials''' i.e. covert models, needed to produce this disinformation terror on the information-industrial production pipelines, '''[[Law proposals to ban covert modeling|should be prohibited by law]]''' in order to protect humans from arbitrary abuse by criminal parties.

=== List of possible naked digital look-alike attacks ===

* The classic "portrayal of as if in involuntary sex"-attack. (Digital look-alike "cries")
* "Sexual preference alteration"-attack. (Digital look-alike "smiles")

=== How to counter synthetic porn: Adequate Porn Watcher AI (transcluded) ===
{{:Adequate Porn Watcher AI}}
----

== Digital sound-alikes ==

Living people can defend<ref group="footnote" name="judiciary maybe not aware">Whether a suspect can defend against faked synthetic speech that sounds like him/her depends on how up-to-date the judiciary is. If no information and instructions about digital sound-alikes have been given to the judiciary, they likely will not believe the defense of denying that the recording is of the suspect's voice.</ref> themselves against digital sound-alike by denying the things the digital sound-alike says if they are presented to the target, but dead people cannot. Digital sound-alikes offer criminals new disinformation attack vectors and wreak havoc on provability. 

=== Timeline of digital sound-alikes ===

* In '''2016''' [[w:Adobe Inc.]]'s [[w:Adobe Voco|Voco]], an unreleased prototype, was publicly demonstrated in 2016. ([https://www.youtube.com/watch?v=I3l4XLZ59iw&t=5s View and listen to Adobe MAX 2016 presentation of Voco]) 

* In '''2016''' [[w:DeepMind]]'s [[w:WaveNet]] owned by [[w:Google]] also demonstrated ability to steal people's voices

* In '''2018''' [[w:Conference on Neural Information Processing Systems|Conference on Neural Information Processing Systems]] the work [http://papers.nips.cc/paper/7700-transfer-learning-from-speaker-verification-to-multispeaker-text-to-speech-synthesis 'Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis'] ([https://arxiv.org/abs/1806.04558 at arXiv.org]) was presented. The pre-trained model is able to steal voices from a sample of only '''5 seconds''' with almost convincing results
** Listen [https://google.github.io/tacotron/publications/speaker_adaptation/ 'Audio samples from "Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis"']
** View [https://www.youtube.com/watch?v=0sR1rU3gLzQ Video summary of the work at YouTube: 'This AI Clones Your Voice After Listening for 5 Seconds']

* As of '''2019''' Symantec research knows of 3 cases where digital sound-alike technology '''has been used for crimes'''.<ref name="WaPo2019">
https://www.washingtonpost.com/technology/2019/09/04/an-artificial-intelligence-first-voice-mimicking-software-reportedly-used-major-theft/</ref>

----
=== 'Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis' 2018 by Google Research (external transclusion) ===

The Iframe below is transcluded from [https://google.github.io/tacotron/publications/speaker_adaptation/ 'Audio samples from "Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis"' at google.gituhub.io], the audio samples of a sound-like-anyone machine presented as at the 2018 [[w:NeurIPS]] conference by Google researchers.
{{#Widget:Iframe - Audio samples from Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis by Google Research}}

=== Documented digital sound-alike attacks ===
* [https://www.washingtonpost.com/technology/2019/09/04/an-artificial-intelligence-first-voice-mimicking-software-reportedly-used-major-theft/?noredirect=on 'An artificial-intelligence first: Voice-mimicking software reportedly used in a major theft'], a 2019 Washington Post article
----
=== Possible legal response: Outlawing digital sound-alikes (transcluded) ===
Transcluded from [[User:Juho Kunsola/Law proposals#Law proposal to ban covert modeling of human voice|Juho's proposal on banning digital sound-alikes]]
{{#section-h:User:Juho Kunsola/Law proposals|Law proposal to ban covert modeling of human voice}}
----

=== Example of a hypothetical 4-victim digital sound-alike attack ===
A very simple example of a digital sound-alike attack is as follows: 

Someone puts a digital sound-alike to call somebody's voicemail from an unknown number and to speak for example illegal threats. In this example there are at least two victims:

# Victim #1 - The person whose voice has been stolen into a covert model and a digital sound-alike made from it to frame them for crimes
# Victim #2 - The person to whom the illegal threat is presented in a recorded form by a digital sound-alike that deceptively sounds like victim #1
# Victim #3 - It could also be viewed that victim #3 is our law enforcement systems as they are put to chase after and interrogate the innocent victim #1
# Victim #4 - Our judiciary which prosecutes and possibly convicts the innocent victim #1.

Thus it is high time to act and to '''[[Law proposals to ban covert modeling|criminalize the covert modeling of human appearance and voice!]]'''

=== Examples of speech synthesis software not quite able to fool a human yet ===
Some other contenders to create digital sound-alikes are though, as of 2019, their speech synthesis in most use scenarios does not yet fool a human because the results contain tell tale signs that give it away as a speech synthesizer.  

* '''[https://lyrebird.ai/ Lyrebird.ai]''' [https://www.youtube.com/watch?v=xxDBlZu__Xk (listen)] 
* '''[https://candyvoice.com/ CandyVoice.com]''' [https://candyvoice.com/demos/voice-conversion (test with your choice of text)] 
* '''[https://cstr-edinburgh.github.io/merlin/ Merlin]''', a [[w:neural network]] based speech synthesis system by the Centre for Speech Technology Research at the [[w:University of Edinburgh]]

The below video 'This AI Clones Your Voice After Listening for 5 Seconds' by '2 minute papers' describes the voice thieving machine presented by Google Research in [[w:NeurIPS|NeurIPS]] 2018.

{{#ev:youtube|0sR1rU3gLzQ|640px|right|Video 'This AI Clones Your Voice After Listening for 5 Seconds' by '2 minute papers' describes the voice thieving machine by Google Research in [[w:NeurIPS|NeurIPS]] 2018.}}

[[File:Spectrogram-19thC.png|thumb|right|640px|A [[w:spectrogram|spectrogram]] of a male voice saying 'nineteenth century']]


== Media perhaps about synthetic human-like fakes ==
This is a chronological listing of media that are probably to do with [[synthetic human-like fakes]].


=== 6th century BC ===
[[File:Daniel's vision of the four beasts from the sea and the Ancient of Days - Silos Apocalypse (1109), f.240 - BL Add MS 11695.jpg|thumb|right|360px|Image taken from Silos Apocalypse. Originally published/produced in Spain (Silos), 1109.<br/><br/> 
[[Biblical explanation - The books of Daniel and Revelations#Daniel 7|Daniel 7]], Daniel's vision of the three beasts <sup>[[Biblical explanation - The books of Daniel and Revelations#Daniel 7:1-6 - Three beasts|Dan 7:1-6]]</sup> and the fourth beast <sup>[[Biblical explanation - The books of Daniel and Revelations#Daniel 7:7-8 - The fourth beast|Dan 7:7-8]]</sup> from the sea and the [[w:Ancient of Days|Ancient of Days]]<sup>[[Biblical explanation - The books of Daniel and Revelations#The Ancient of Days|Dan 7:9-10]]</sup>]]

* [[w:6th century BC]] | scripture | '''[[w:Daniel (biblical figure)]]''' was in [[w:Babylonian captivity]] when he had his visions where God warned us of synthetic human-like fakes first. 
* His testimony was put into written form in the [[#3rd century BC]].


=== 3rd century BC ===
* [[w:3rd century BC]] | scripture | The '''[[w:Book of Daniel]]''' was put in writing. 
* See [[Biblical explanation - The books of Daniel and Revelations#Daniel 7|Biblical explanation - The books of Daniel and Revelations § Daniel 7]]. '''Caution''' to reader: contains '''explicit''' written information about the beasts.


=== 1st century ===
* '''[[w:1st century]]''' | scripture | '''[[w:Jesus]] teaches''' about things that are yet to come in
*# '''[[w:Matthew 24]]'''
*# '''[[w:The Sheep and the Goats]]''' and 
*# '''[[w:Mark 13]]'''.

*'''1st century''' | scripture | '''[[w:2 Thessalonians 2]]''' is the second chapter of the [[w:Second Epistle to the Thessalonians]]. It is traditionally attributed to [[w:Paul the Apostle]], with [[w:Saint Timothy]] as a co-author.  See [[Biblical explanation - The books of Daniel and Revelations#2 Thessalonians 2|Biblical explanation - The books of Daniel and Revelations § 2 Thessalonians 2]] '''Caution''' to reader: contains '''explicit''' written information about the beasts

*'''1st century''' | scripture | '''[[w:Book of Revelation]]'''. The task of writing down and smuggling out this early warning of what is to come is given by God to his servant John, who was imprisoned on the island of [[w:Patmos]].  See [[Biblical explanation - The books of Daniel and Revelations#Revelation 13|Biblical explanation - The books of Daniel and Revelations § Revelation 13]]. '''Caution''' to reader: contains '''explicit''' written information about the beasts.


=== 1770's ===

[[File:Kempelen Speakingmachine.JPG|right|thumb|300px|A replica of [[w:Wolfgang von Kempelen|Kempelen]]'s [[w:Wolfgang von Kempelen's Speaking Machine|speaking machine]], built 2007–09 at the Department of [[w:Phonetics|Phonetics]], [[w:Saarland University|Saarland University]], [[w:Saarbrücken|Saarbrücken]], Germany. This machine added models of the tongue and lips, enabling it to produce [[w:consonant|consonant]]s as well as [[w:vowel|vowel]]s]]


* '''1779''' | science / discovery | [[w:Christian Gottlieb Kratzenstein]] won the first prize in a competition announced by the [[w:Russian Academy of Sciences]] for '''models''' he built of the '''human [[w:vocal tract]]''' that could produce the five long '''[[w:vowel]]''' sounds.<ref name="Helsinki">
[http://www.acoustics.hut.fi/publications/files/theses/lemmetty_mst/chap2.html History and Development of Speech Synthesis], Helsinki University of Technology, Retrieved on November 4, 2006
</ref> (Based on [[w:Speech synthesis#History]])

* '''1791''' | science | '''[[w:Wolfgang von Kempelen's Speaking Machine]]''' of [[w:Wolfgang von Kempelen]] of [[w:Pressburg]], [[w:Hungary]], described in a 1791 paper was [[w:bellows]]-operated.<ref>''Mechanismus der menschlichen Sprache nebst der Beschreibung seiner sprechenden Maschine'' ("Mechanism of the human speech with description of its speaking machine", J. B. Degen, Wien).</ref> This machine added models of the tongue and lips, enabling it to produce [[w:consonant]]s as well as [[w:vowel]]s. (based on [[w:Speech synthesis#History]])

=== 1970's ===

* '''1971''' | science | '''[https://interstices.info/images-de-synthese-palme-de-la-longevite-pour-lombrage-de-gouraud/ 'Images de synthèse : palme de la longévité pour l’ombrage de Gouraud' (still photos)]'''. [[w:Henri Gouraud (computer scientist)]] made the first [[w:Computer graphics]] [[w:geometry]] [[w:digitization]] and representation of a human face. Modeling was his wife Sylvie Gouraud. The 3D model was a simple [[w:wire-frame model]] and he applied [[w:Gouraud shading]] to produce the '''first known representation''' of '''human-likeness''' on computer. <ref>{{cite web|title=Images de synthèse : palme de la longévité pour l'ombrage de Gouraud|url=http://interstices.info/jcms/c_25256/images-de-synthese-palme-de-la-longevite-pour-lombrage-de-gouraud}}</ref>

* '''1972''' | entertainment | '''[https://vimeo.com/59434349 'A Computer Animated Hand' on Vimeo]'''. [[w:A Computer Animated Hand]] by [[w:Edwin Catmull]] and [[w:Fred Parke]]. Relevancy: This was the '''first time''' that [[w:computer-generated imagery|computer-generated imagery]] was used in film to '''animate''' moving '''human-like appearance'''.

=== 1980's ===
* '''1983''' | music video | '''[https://www.youtube.com/watch?v=O0lIlROWro8 'Musique Non-Stop' by Kraftwerk on Youtube]''' made in 1983, but published only in in '''1986''' by [[w:Kraftwerk]] from album [[w:Electric Café]]. Relevancy: Contains state-of-the-art (for the era) '''[[digital look-alikes]]''' of the band members.

== Footnotes ==
<references group="footnote" /> 

== References ==
<references />