{"id":238,"date":"2019-06-16T08:50:34","date_gmt":"2019-06-16T15:50:34","guid":{"rendered":"http:\/\/codingrestart.com\/home\/?p=238"},"modified":"2019-06-17T15:44:50","modified_gmt":"2019-06-17T22:44:50","slug":"mona-lisas-video","status":"publish","type":"post","link":"https:\/\/codingrestart.com\/home\/mona-lisas-video\/","title":{"rendered":"Mona Lisa&#8217;s video"},"content":{"rendered":"\n<p>For centuries, people have wondered about Mona Lisa&#8217;s smile. Now they can stop wondering and just watch her videos.<\/p>\n\n\n\n<p>A group of AI researchers published a paper titled &#8220;<a href=\"https:\/\/arxiv.org\/pdf\/1905.08233v1.pdf\">Few-Shot Adversarial Learning of Realistic Neural Talking Head Models<\/a>&#8220;, where they describe a new algorithm to generate videos of peoples&#8217; heads (talking heads models). Methods to produce talking heads models using generative adversarial networks (GAN) were already published previously. GANs are essentially two neural networks combined into one system, where one NN is trained to produce samples and the second NN is trained to identify good examples.   <\/p>\n\n\n\n<p>However, the existing methods using GANs required long videos or large sets of photographs of each talking head to train GANs. The existing methods used various warping techniques; for an overview read the introduction in the &#8220;<a href=\"http:\/\/202.38.64.11\/~lfdong\/research\/SAPI50.pdf\">Facial Animation System Based on Image Warping Algorithm<\/a>&#8221; study. <\/p>\n\n\n\n<p>The above paper describes a new way of producing the talking heads using just a few training examples, possibly only a single photograph. Instead of warping, a direct synthesizing method is used. This method is called few-shot learning and relies on pre-trained models that were trained using a large number of videos of various people in different situations. In those models, a critical part of the training relies on the identification of face landmarks, like eyes, nose, mouth, and chin.<\/p>\n\n\n\n<p>The results of the new research are summarized in a 5 minutes <a href=\"https:\/\/www.youtube.com\/watch?time_continue=1&amp;v=p1b5aiTrGzY\">video<\/a> that shows how the properly trained GAN can produce short talking head videos from still images. A talking head created from the Mona Lisa painting was particularly impressive because it was trained on several human models and the differences in three facial expressions are easily recognizable. The process of video synthesis of a certain person based on the face landmarks of a different person is called puppeteering. <\/p>\n\n\n\n<figure class=\"wp-block-image\"><img data-recalc-dims=\"1\" decoding=\"async\" src=\"https:\/\/i0.wp.com\/thumbs-prod.si-cdn.com\/UKRbRGifr4f0jXWGQ7ySuSnKSa0%3D\/800x600\/filters%3Ano_upscale%28%29\/https%3A\/\/public-media.si-cdn.com\/filer\/a8\/87\/a8871c6e-0b0f-4f62-b86e-8dfd3efee7b2\/mona.gif?w=840&#038;ssl=1\" alt=\"\"\/><figcaption>Mona Lisa trained using three different training videos to give her three distinct personalities. Courtesy of Zakharov at al.<\/figcaption><\/figure>\n\n\n\n<hr class=\"wp-block-separator\"\/>\n\n\n\n<p>Talking head videos could be combined with the latest NLP improvements that I described in an earlier <a href=\"http:\/\/codingrestart.com\/home\/dangers-of-nlp\/\">post<\/a>. This would create highly realistic fake videos and text. If you were concerned about the proliferation of <a href=\"https:\/\/en.wikipedia.org\/wiki\/Deepfake\">deepfakes<\/a> before reading this post, this will only heighten your fears. And how are the authors of the above described few-shot adversarial learning algorithm responding to those concerns? The statement below their YouTube video that I linked above states: &#8220;Shifting a part of human life-like communication to the virtual and augmented worlds will have several positive effects. It will lead to a reduction in long-distance travel and short-distance commute. It will democratize education, and improve the quality of life for people with disabilities.&#8221; Noble enough. But considering that researchers are from Russia and Russia&#8217;s proven track of meddling in recent US\/EU elections, it is not far-fetched to assume that high-quality deepfakes will be common soon. <\/p>\n\n\n\n<p>How soon? Let&#8217;s look at the progress of <a href=\"https:\/\/arxiv.org\/pdf\/1406.2661.pdf\">images<\/a> generated by GANs over the past five years. <\/p>\n\n\n\n<figure class=\"wp-block-image\"><img data-recalc-dims=\"1\" loading=\"lazy\" decoding=\"async\" width=\"800\" height=\"324\" data-attachment-id=\"254\" data-permalink=\"https:\/\/codingrestart.com\/home\/mona-lisas-video\/gans\/\" data-orig-file=\"https:\/\/i0.wp.com\/codingrestart.com\/wp-content\/uploads\/2019\/06\/Gans.png?fit=800%2C324&amp;ssl=1\" data-orig-size=\"800,324\" data-comments-opened=\"1\" data-image-meta=\"{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}\" data-image-title=\"Gans\" data-image-description=\"\" data-image-caption=\"\" data-large-file=\"https:\/\/i0.wp.com\/codingrestart.com\/wp-content\/uploads\/2019\/06\/Gans.png?fit=800%2C324&amp;ssl=1\" src=\"https:\/\/i0.wp.com\/codingrestart.com\/wp-content\/uploads\/2019\/06\/Gans.png?resize=800%2C324\" alt=\"\" class=\"wp-image-254\" srcset=\"https:\/\/i0.wp.com\/codingrestart.com\/wp-content\/uploads\/2019\/06\/Gans.png?w=800&amp;ssl=1 800w, https:\/\/i0.wp.com\/codingrestart.com\/wp-content\/uploads\/2019\/06\/Gans.png?resize=300%2C122&amp;ssl=1 300w, https:\/\/i0.wp.com\/codingrestart.com\/wp-content\/uploads\/2019\/06\/Gans.png?resize=768%2C311&amp;ssl=1 768w\" sizes=\"auto, (max-width: 709px) 85vw, (max-width: 909px) 67vw, (max-width: 984px) 61vw, (max-width: 1362px) 45vw, 600px\" \/><figcaption>GANs results throughout the years. Please note that none of the above images is a real person. Courtesy of Gidi Shperber.<\/figcaption><\/figure>\n\n\n\n<p> I&#8217;ll let you extrapolate the progress into the future.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>For centuries, people have wondered about Mona Lisa&#8217;s smile. Now they can stop wondering and just watch her videos. A group of AI researchers published a paper titled &#8220;Few-Shot Adversarial Learning of Realistic Neural Talking Head Models&#8220;, where they describe a new algorithm to generate videos of peoples&#8217; heads (talking heads models). Methods to produce &hellip; <a href=\"https:\/\/codingrestart.com\/home\/mona-lisas-video\/\" class=\"more-link\">Continue reading<span class=\"screen-reader-text\"> &#8220;Mona Lisa&#8217;s video&#8221;<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"om_disable_all_campaigns":false,"_monsterinsights_skip_tracking":false,"_monsterinsights_sitenote_active":false,"_monsterinsights_sitenote_note":"","_monsterinsights_sitenote_category":0,"jetpack_post_was_ever_published":false,"_jetpack_newsletter_access":"","_jetpack_dont_email_post_to_subs":false,"_jetpack_newsletter_tier_id":0,"_jetpack_memberships_contains_paywalled_content":false,"_jetpack_memberships_contains_paid_content":false,"footnotes":"","jetpack_publicize_message":"","jetpack_publicize_feature_enabled":true,"jetpack_social_post_already_shared":true,"jetpack_social_options":{"image_generator_settings":{"template":"highway","default_image_id":0,"font":"","enabled":false},"version":2}},"categories":[4],"tags":[],"class_list":["post-238","post","type-post","status-publish","format-standard","hentry","category-machine-learning"],"jetpack_publicize_connections":[],"jetpack_featured_media_url":"","jetpack_sharing_enabled":true,"jetpack_shortlink":"https:\/\/wp.me\/paAAlH-3Q","jetpack-related-posts":[{"id":69,"url":"https:\/\/codingrestart.com\/home\/ml-msdn-and-xavier\/","url_meta":{"origin":238,"position":0},"title":"ML: MSDN and Xavier","author":"Viktor Sanek","date":"January 19, 2019","format":false,"excerpt":"Microsoft\u2019s MSDN magazine for its development community has been publishing quite a few introductory articles to Machine Learning (ML) over the past few months. January's issue emphasizes ML with another series of articles, albeit with differing qualities. I liked this quote from the editorial \"Advancing AI\": \u201dML is a huge\u2026","rel":"","context":"In &quot;Machine Learning&quot;","block_context":{"text":"Machine Learning","link":"https:\/\/codingrestart.com\/home\/category\/machine-learning\/"},"img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":233,"url":"https:\/\/codingrestart.com\/home\/dangers-of-nlp\/","url_meta":{"origin":238,"position":1},"title":"Dangers of NLP","author":"Viktor Sanek","date":"June 9, 2019","format":false,"excerpt":"Natural language processing (NLP) continues its rapid advance, leading some people to fear its latest results. The research organization OpenAI published a blog post titled \"Better Language Models and Their Implications\" summarizing its progress on \"predicting the next word, given all of the previous words within some text\". OpenAI calls\u2026","rel":"","context":"In &quot;Machine Learning&quot;","block_context":{"text":"Machine Learning","link":"https:\/\/codingrestart.com\/home\/category\/machine-learning\/"},"img":{"alt_text":"","src":"https:\/\/i0.wp.com\/codingrestart.com\/wp-content\/uploads\/2019\/06\/GPT-2.png?resize=350%2C200&ssl=1","width":350,"height":200,"srcset":"https:\/\/i0.wp.com\/codingrestart.com\/wp-content\/uploads\/2019\/06\/GPT-2.png?resize=350%2C200&ssl=1 1x, https:\/\/i0.wp.com\/codingrestart.com\/wp-content\/uploads\/2019\/06\/GPT-2.png?resize=525%2C300&ssl=1 1.5x, https:\/\/i0.wp.com\/codingrestart.com\/wp-content\/uploads\/2019\/06\/GPT-2.png?resize=700%2C400&ssl=1 2x"},"classes":[]},{"id":88,"url":"https:\/\/codingrestart.com\/home\/webassembly\/","url_meta":{"origin":238,"position":2},"title":"WebAssembly","author":"Viktor Sanek","date":"February 3, 2019","format":false,"excerpt":"Until recently, JavaScript was the only language supported by all web browsers. But it was difficult to compile it efficiently and the JavaScript applications typically run much slower than native applications. Then, programmers from the four main browser vendors designed a new language, sort of a machine code for the\u2026","rel":"","context":"In &quot;Web&quot;","block_context":{"text":"Web","link":"https:\/\/codingrestart.com\/home\/category\/web\/"},"img":{"alt_text":"","src":"https:\/\/i0.wp.com\/codingrestart.com\/wp-content\/uploads\/2019\/01\/WASM.png?resize=350%2C200&ssl=1","width":350,"height":200,"srcset":"https:\/\/i0.wp.com\/codingrestart.com\/wp-content\/uploads\/2019\/01\/WASM.png?resize=350%2C200&ssl=1 1x, https:\/\/i0.wp.com\/codingrestart.com\/wp-content\/uploads\/2019\/01\/WASM.png?resize=525%2C300&ssl=1 1.5x, https:\/\/i0.wp.com\/codingrestart.com\/wp-content\/uploads\/2019\/01\/WASM.png?resize=700%2C400&ssl=1 2x"},"classes":[]},{"id":51,"url":"https:\/\/codingrestart.com\/home\/really-old-geek\/","url_meta":{"origin":238,"position":3},"title":"Really Old Geek","author":"Viktor Sanek","date":"December 28, 2018","format":false,"excerpt":"The New York Times published an article about Donald Knuth \u201cThe Yoda of Silicon Valley\u201d. The article might idolize his work and impact, but provides a great overview of his work. Mr. Knuth is a living legend in the field of computer science, known mainly for his seminal book on\u2026","rel":"","context":"In &quot;C\/C++&quot;","block_context":{"text":"C\/C++","link":"https:\/\/codingrestart.com\/home\/category\/uncategorized\/cc\/"},"img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":171,"url":"https:\/\/codingrestart.com\/home\/theory-behind-anki\/","url_meta":{"origin":238,"position":4},"title":"Theory Behind Anki","author":"Viktor Sanek","date":"April 23, 2019","format":false,"excerpt":"This post covers the theory behind Anki and follows the previous post introducing Anki. The last post in this series describes best practices for using Anki. One of the best meta-analyses reviewing best approaches to learning is Improving Students\u2019 Learning With Effective Learning Techniques by Dunlosky et al. If I\u2026","rel":"","context":"In &quot;Anki&quot;","block_context":{"text":"Anki","link":"https:\/\/codingrestart.com\/home\/category\/anki\/"},"img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":180,"url":"https:\/\/codingrestart.com\/home\/anki-best-practices\/","url_meta":{"origin":238,"position":5},"title":"Anki: Best Practices","author":"Viktor Sanek","date":"April 28, 2019","format":false,"excerpt":"This post concludes the series of posts on Anki and follows the post introducing Anki and another one about the theory behind Anki. I have been using Anki for five years and achieved high consistency in completing daily reviews. Here are my best practices: Be selective about cards you create\u2026","rel":"","context":"In &quot;Anki&quot;","block_context":{"text":"Anki","link":"https:\/\/codingrestart.com\/home\/category\/anki\/"},"img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]}],"_links":{"self":[{"href":"https:\/\/codingrestart.com\/home\/wp-json\/wp\/v2\/posts\/238","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/codingrestart.com\/home\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/codingrestart.com\/home\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/codingrestart.com\/home\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/codingrestart.com\/home\/wp-json\/wp\/v2\/comments?post=238"}],"version-history":[{"count":11,"href":"https:\/\/codingrestart.com\/home\/wp-json\/wp\/v2\/posts\/238\/revisions"}],"predecessor-version":[{"id":263,"href":"https:\/\/codingrestart.com\/home\/wp-json\/wp\/v2\/posts\/238\/revisions\/263"}],"wp:attachment":[{"href":"https:\/\/codingrestart.com\/home\/wp-json\/wp\/v2\/media?parent=238"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/codingrestart.com\/home\/wp-json\/wp\/v2\/categories?post=238"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/codingrestart.com\/home\/wp-json\/wp\/v2\/tags?post=238"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}