Jason Hise Jordan Medina Scott Worley JJ Hepboin Pedro A Ortega Said Polat Chris Canal Nicholas Kees Dupuis James Richárd Nagyfi Phil Moyer Shevis Johnson Alec Johnson Lupuleasa Ionuț Clemens Arbesser Bryce Daifuku Allen Faure Simon Strandgaard Jonatan R Michael Greve The Guru Of Vision Julius Brash Tom O'Connor Erik de Bruijn Robin Green Laura Olds Jon Halliday Paul Hobbs Jeroen De Dauw Tim Neilson Eric Scammell Igor Keller Ben Glanton Robert Sokolowski anul kumar sinha Jérôme Frossard Sean Gibat Volotat andrew Russell Cooper Lawton Gladamas Sylvain Chevalier DGJono robertvanduursen Dmitri Afanasjev Brian Sandberg Marcel Ward Andrew Weir Ben Archer Scott McCarthy Kabs Tendayi Mawushe Jannik Olbrich Anne Kohlbrenner Jussi Männistö Mr Fantastic Wr4thon Archy de Berker Marc Pauly Joshua Pratt Andy Kobre Brian Gillespie Martin Wind Peggy Youell Poker Chen Kees Darko Sperac Truls Paul Moffat Anders Öhrt Marco Tiraboschi Michael Kuhinica Fraser Cain Robin Scharf Oren Milman John Rees Seth Brothwell Brian Goodrich Clark Mitchell Kasper Schnack Michael Hunter Klemen Slavic Patrick Henderson Long Nguyen Oct todo22 Melisa Kostrzewski Hendrik Daniel Munter Graham Henry Duncan Orr Andrew Walker Bryan Egan
A Response to Steven Pinker on AI | Robert Miles AI Safety | 2019-03-31 | Steven Pinker wrote an article on AI for Popular Science Magazine, which I have some issues with.
With enormous thanks to my wonderful patrons: - Tor Barstad - Timothy Lillicrap - Juan Benet - Sarah Howell - Kieryn - Mazianni - Scott Worley - Jason Hise - Clemens Arbesser - Francisco Tolmasky - David Reid - Andrew Blackledge - Cam MacFarlane - Olivier Coutu - CaptObvious - Ze Shen Chin - ikke89 - Isaac - Erik de Bruijn - Jeroen De Dauw - Ludwig Schubert - Eric James - Owen Campbell-Moore - Raf Jakubanis - Esa Koskinen - Nathan Metzger - Jonatan R - Gunnar - Laura Olds - Paul Hobbs - Bastiaan Cnossen - Eric Scammell - Alexare - Reslav Hollós - Jérôme Beaulieu - Nathan Fish - Taras Bobrovytsky - Jeremy - Vaskó Richárd - Andrew Harcourt - Chris Beacham - Zachary Gidwitz - Art Code Outdoors - Abigail Novick - Edmund Fokschaner - DragonSheep - Richard Newcombe - Joshua Michel - Richard - ttw - Sophia Michelle Andren - Alan J. Etchings - James Vera - Stumbleboots - Peter Lillian - Grimrukh - Colin Ricardo - DN - Mr Cats - Robert Paul Schwin - Roland G. McIntosh - Benjamin Mock - Emiliano Hodges - Maxim Kuzmich - Joanny Raby - Tom Miller - Eran Glicksman - CheeseBerry - Hoyskedotte - Alexey Malafeev - Jeff Starr - Justin - Liviu Macovei - Javier Soto - David Christal - Jam - Just Me - Sebastian Zimmer - Matt Thompson - Xan Atkinson - Andy - Albert Higgins - Alexander230 - Clay Upton - Alex Ander - Carolyn - Nathan Rogowski - David Morgan - little Bang - Chad M Jones - Dmitri Afanasjev - Christian Oehne - Marcel Ward - Andrew Weir - Miłosz Wierzbicki - Tendayi Mawushe - Kees - loopuleasa - Marco Tiraboschi - Fraser Cain - Patrick Henderson - Daniel Munter - Ian - James Fowkes - Len - Yuchong Li - Diagon - Puffjanga - Daniel Eickhardt - 14zRobot - Stuart Alldritt - DeepFriedJif - Garrett Maring - Stellated Hexahedron - Jim Renney - Edison Franklin - Piers Calderwood - Matt Brauer - Mihaly Barasz - Rajeen Nabid - Iestyn bleasdale-shepherd - Marek Belski - Luke Peterson - Eric Rogstad - Max Chiswick - slindenau - Nicholas Turner - Jannis Funk - This person's name is too 
hard to pronounce - Jon Wright - Andrei Trifonov - Bren Ehnebuske - Martin Frassek - Matthew Shinkle - Robby Gottesman - Ohelig - Sarah - Nikola Tasev - Tapio Kortesaari - Soroush Pour - Boris Badinoff - DangerCat - Jack Phelps - Kyle Green - Lexi X - John Slape - Joel Gardner - Christopher Creutzig - Johann Puzik - Pindex - RMR - Andrew Edstrom patreon.com/robertskmiles
Apply to Study AI Safety Now! #shorts | Robert Miles AI Safety | 2023-04-28 | Apply to SERI MATS at serimats.org by May 7th, and check out http://aisafety.training to stay up to date with events and programs!
Why Does AI Lie, and What Can We Do About It? | Robert Miles AI Safety | 2022-12-09 | How do we make sure language models tell the truth?
With thanks to my wonderful Patrons at http://patreon.com/robertskmiles : - Tor Barstad - Kieryn - AxisAngles - Juan Benet - Scott Worley - Chad M Jones - Jason Hise - Shevis Johnson - JJ Hepburn - Pedro A Ortega - Clemens Arbesser - Chris Canal - Jake Ehrlich - Kellen lask - Francisco Tolmasky - Michael Andregg - David Reid - Teague Lasser - Andrew Blackledge - Brad Brookshire - Cam MacFarlane - Olivier Coutu - CaptObvious - Girish Sastry - Ze Shen Chin - Phil Moyer - Erik de Bruijn - Jeroen De Dauw - Ludwig Schubert - Eric James - Atzin Espino-Murnane - Jaeson Booker - Raf Jakubanis - Jonatan R - Ingvi Gautsson - Jake Fish - Tom O'Connor - Laura Olds - Paul Hobbs - Cooper - Eric Scammell - Ben Glanton - Duncan Orr - Nicholas Kees Dupuis - Will Glynn - Tyler Herrmann - Reslav Hollós - Jérôme Beaulieu - Nathan Fish - Peter Hozák - Taras Bobrovytsky - Jeremy - Vaskó Richárd - Report Techies - Andrew Harcourt - Nicholas Guyett - 12tone - Oliver Habryka - Chris Beacham - Zachary Gidwitz - Nikita Kiriy - Art Code Outdoors - Andrew Schreiber - Abigail Novick - Chris Rimmer - Edmund Fokschaner - April Clark - John Aslanides - DragonSheep - Richard Newcombe - Joshua Michel - Quabl - Richard - Neel Nanda - ttw - Sophia Michelle Andren - Trevor Breen - Alan J. 
Etchings - Jenan Wise - Jonathan Moregård - James Vera - Chris Mathwin - David Shaffer - Jason Gardner - Devin Turner - Andy Southgate - Lorthock The Banisher - Peter Lillian - Jacob Valero - Christopher Nguyen - Kodera Software - Grimrukh - MichaelB - David Morgan - little Bang - Dmitri Afanasjev - Marcel Ward - Andrew Weir - Ammar Mousali - Miłosz Wierzbicki - Tendayi Mawushe - Wr4thon - Martin Ottosen - Alec Johnson - Kees - Darko Sperac - Robert Valdimarsson - Marco Tiraboschi - Michael Kuhinica - Fraser Cain - Patrick Henderson - Daniel Munter - And last but not least - Ian Reyes - James Fowkes - Len - Alan Bandurka - Daniel Kokotajlo - Yuchong Li - Diagon - Andreas Blomqvist - Qwijibo (James) - Zannheim - Daniel Eickhardt - lyon549 - 14zRobot - Ivan - Jason Cherry - Igor (Kerogi) Kostenko - Stuart Alldritt - Alexander Brown - Ted Stokes - DeepFriedJif - Chris Dinant - Johannes Walter - Garrett Maring - Anthony Chiu - Ghaith Tarawneh - Julian Schulz - Stellated Hexahedron - Caleb - Georg Grass - Jim Renney - Edison Franklin - Jacob Van Buren - Piers Calderwood - Matt Brauer - Mihaly Barasz - Mark Woodward - Ranzear - Rajeen Nabid - Iestyn bleasdale-shepherd - MojoExMachina - Marek Belski - Luke Peterson - Eric Rogstad - Caleb Larson - Max Chiswick - Sam Freedo - slindenau - Nicholas Turner - FJannis - Grant Parks - This person's name is too hard to pronounce - Jon Wright - Everardo González Ávalos - Knut - Andrew McKnight - Andrei Trifonov - Tim D - Bren Ehnebuske - Martin Frassek - Valentin Mocanu - Matthew Shinkle - Robby Gottesman - Ohelig - Slobodan Mišković - Sarah - Nikola Tasev - Voltaic - Sam Ringer - Tapio Kortesaari
#shorts #short
Free ML Bootcamp for Alignment #shorts | Robert Miles AI Safety | 2022-05-24 | Apply for the second MLAB (Machine Learning for Alignment Bootcamp)!
#Shorts
Apply to AI Safety Camp! #shorts | Robert Miles AI Safety | 2021-11-19 | Trying out #shorts. Applications are open for next year's AI Safety Camp! http://aisafety.camp
We Were Right! Real Inner Misalignment | Robert Miles AI Safety | 2021-10-10 | Researchers ran real versions of the thought experiments in the 'Mesa-Optimisers' videos! What they found won't shock you (if you've been paying attention).
Previous videos on the subject:
- The OTHER AI Alignment Problem: Mesa-Optimizers and Inner Alignment: youtu.be/bJLcIBixGj8
- Deceptive Misaligned Mesa-Optimisers? It's More Likely Than You Think...: youtu.be/IeWljQw3UgQ
With thanks to my wonderful Patrons at http://patreon.com/robertskmiles : - Gladamas - Timothy Lillicrap - Kieryn - AxisAngles - James - Jake Fish - Scott Worley - James Kirkland - James E. Petts - Chad Jones - Shevis Johnson - JJ Hepboin - Pedro A Ortega - Clemens Arbesser - Said Polat - Chris Canal - Jake Ehrlich - Kellen lask - Francisco Tolmasky - Michael Andregg - David Reid - Peter Rolf - Teague Lasser - Andrew Blackledge - Brad Brookshire - Cam MacFarlane - Craig Mederios - Jon Wright - CaptObvious - Brian Lonergan - Girish Sastry - Jason Hise - Phil Moyer - Erik de Bruijn - Alec Johnson - Ludwig Schubert - Eric James - Matheson Bayley - Qeith Wreid - jugettje dutchking - James Hinchcliffe - Atzin Espino-Murnane - Carsten Milkau - Jacob Van Buren - Jonatan R - Ingvi Gautsson - Michael Greve - Tom O'Connor - Laura Olds - Jon Halliday - Paul Hobbs - Jeroen De Dauw - Cooper Lawton - Tim Neilson - Eric Scammell - Igor Keller - Ben Glanton - Tor Barstad - Duncan Orr - Will Glynn - Tyler Herrmann - Ian Munro - Jérôme Beaulieu - Nathan Fish - Peter Hozák - Taras Bobrovytsky - Jeremy - Vaskó Richárd - Benjamin Watkin - Andrew Harcourt - Luc Ritchie - Nicholas Guyett - 12tone - Oliver Habryka - Chris Beacham - Nikita Kiriy - Andrew Schreiber - Steve Trambert - Braden Tisdale - Abigail Novick - Serge Var - Mink - Chris Rimmer - Edmund Fokschaner - April Clark - J - Nate Gardner - John Aslanides - Mara - ErikBln - DragonSheep - Richard Newcombe - Joshua Michel - P - Alex Doroff - BlankProgram - Richard - David Morgan - Fionn - Dmitri Afanasjev - Marcel Ward - Andrew Weir - Kabs - Ammar Mousali - Miłosz Wierzbicki - Tendayi Mawushe - Wr4thon - Martin Ottosen - Andy K - Kees - Darko Sperac - Robert Valdimarsson - Marco Tiraboschi - Michael Kuhinica - Fraser Cain - Robin Scharf - Klemen Slavic - Patrick Henderson - Hendrik - Daniel Munter - Alex Knauth - Kasper - Ian Reyes - James Fowkes - Tom Sayer - Len - Alan Bandurka - Ben H - Simon Pilkington - Daniel Kokotajlo - 
Yuchong Li - Diagon - Andreas Blomqvist - Iras - Qwijibo (James) - Zubin Madon - Zannheim - Daniel Eickhardt - lyon549 - 14zRobot - Ivan - Jason Cherry - Igor (Kerogi) Kostenko - ib_ - Thomas Dingemanse - Stuart Alldritt - Alexander Brown - Devon Bernard - Ted Stokes - Jesper Andersson - DeepFriedJif - Chris Dinant - Raphaël Lévy - Johannes Walter - Matt Stanton - Garrett Maring - Anthony Chiu - Ghaith Tarawneh - Julian Schulz - Stellated Hexahedron - Caleb - Clay Upton - Conor Comiconor - Michael Roeschter - Georg Grass - Isak Renström - Matthias Hölzl - Jim Renney - Edison Franklin - Piers Calderwood - Mikhail Tikhomirov - Matt Brauer - Mateusz Krzaczek - Artem Honcharov - Tomasz Gliniecki - Mihaly Barasz - Mark Woodward - Ranzear - Neil Palmere - Rajeen Nabid - Clark Schaefer - Olivier Coutu - Iestyn bleasdale-shepherd - MojoExMachina - Marek Belski - Luke Peterson - Eric Rogstad - Eric Carlson - Caleb Larson - Max Chiswick - Aron - Sam Freedo - slindenau - Johannes Lindmark - Nicholas Turner - Intensifier - Valerio Galieni - FJannis - Grant Parks - Ryan W Ammons - This person's name is too hard to pronounce - contalloomlegs - Everardo González Ávalos - Knut Løklingholm - Andrew McKnight - Andrei Trifonov - Aleks D - Mutual Information - Tim - A Socialist Hobgoblin - Bren Ehnebuske - Martin Frassek - Sven Drebitz - Quabl - Valentin Mocanu - Philip Crawford - Matthew Shinkle - Robby Gottesman - Juanchi
patreon.com/robertskmiles With thanks to my wonderful Patreon supporters: Gladamas Timothy Lillicrap Kieryn AxisAngles James Nestor Politics Scott Worley James Kirkland James E. Petts Chad Jones Shevis Johnson JJ Hepboin Pedro A Ortega Said Polat Chris Canal Jake Ehrlich Kellen lask Francisco Tolmasky Michael Andregg David Reid Peter Rolf Teague Lasser Andrew Blackledge Frank Marsman Brad Brookshire Cam MacFarlane Craig Mederios Jon Wright CaptObvious Brian Lonergan Jason Hise Phil Moyer Erik de Bruijn Alec Johnson Clemens Arbesser Ludwig Schubert Eric James Matheson Bayley Qeith Wreid jugettje dutchking Owen Campbell-Moore Atzin Espino-Murnane Johnny Vaughan Carsten Milkau Jacob Van Buren Jonatan R Ingvi Gautsson Michael Greve Tom O'Connor Laura Olds Jon Halliday Paul Hobbs Jeroen De Dauw Cooper Lawton Tim Neilson Eric Scammell Igor Keller Ben Glanton Tor Barstad Duncan Orr Will Glynn Tyler Herrmann Ian Munro Joshua Davis Jérôme Beaulieu Nathan Fish Peter Hozák Taras Bobrovytsky Jeremy Vaskó Richárd Benjamin Watkin Andrew Harcourt Luc Ritchie Nicholas Guyett James Hinchcliffe 12tone Oliver Habryka Chris Beacham Zachary Gidwitz Nikita Kiriy Andrew Schreiber Steve Trambert Braden Tisdale Abigail Novick Serge Var Mink Chris Rimmer Edmund Fokschaner J Nate Gardner John Aslanides Mara ErikBln DragonSheep Richard Newcombe Joshua Michel Alex Altair P David Morgan Fionn Dmitri Afanasjev Marcel Ward Andrew Weir Kabs Ammar Mousali Miłosz Wierzbicki Tendayi Mawushe Jake Fish Wr4thon Martin Ottosen Robert Hildebrandt Andy Kobre Kees Darko Sperac Robert Valdimarsson loopuleasa Marco Tiraboschi Michael Kuhinica Fraser Cain Klemen Slavic Patrick Henderson Oct todo22 Melisa Kostrzewski Hendrik Daniel Munter Alex Knauth Kasper Ian Reyes James Fowkes Tom Sayer Len Alan Bandurka Ben H Simon Pilkington Daniel Kokotajlo Yuchong Li Diagon Andreas Blomqvist Bertalan Bodor Qwijibo (James) Zubin Madon Zannheim Daniel Eickhardt lyon549 14zRobot Ivan Jason Cherry Igor (Kerogi) Kostenko ib_ 
Thomas Dingemanse Stuart Alldritt Alexander Brown Devon Bernard Ted Stokes Jesper Andersson DeepFriedJif Chris Dinant Raphaël Lévy Johannes Walter Matt Stanton Garrett Maring Anthony Chiu Ghaith Tarawneh Julian Schulz Stellated Hexahedron Caleb Scott Viteri Clay Upton Conor Comiconor Michael Roeschter Georg Grass Isak Renström Matthias Hölzl Jim Renney Edison Franklin Piers Calderwood Mikhail Tikhomirov Matt Brauer Jaeson Booker Mateusz Krzaczek Artem Honcharov Michael Walters Tomasz Gliniecki Mihaly Barasz Mark Woodward Ranzear Neil Palmere Rajeen Nabid Christian Epple Clark Schaefer Olivier Coutu Iestyn bleasdale-shepherd MojoExMachina Marek Belski Luke Peterson Eric Eldard Eric Rogstad Eric Carlson Caleb Larson Max Chiswick Aron Sam Freedo slindenau A21 Johannes Lindmark Nicholas Turner Intensifier Valerio Galieni FJannis Grant Parks Ryan W Ammons This person's name is too hard to pronounce kp contalloomlegs Everardo González Ávalos Knut Løklingholm Andrew McKnight Andrei Trifonov Aleks D Mutual Information Tim A Socialist Hobgoblin Bren Ehnebuske Martin Frassek Sven Drebitz patreon.com/robertskmilesDeceptive Misaligned Mesa-Optimisers? Its More Likely Than You Think...Robert Miles AI Safety2021-05-23 | The previous video explained why it's *possible* for trained models to end up with the wrong goals, even when we specify the goals perfectly. This video explains why it's *likely*.
Timothy Lillicrap Kieryn James Scott Worley James E. Petts Chad Jones Shevis Johnson JJ Hepboin Pedro A Ortega Said Polat Chris Canal Jake Ehrlich Kellen lask Francisco Tolmasky Michael Andregg David Reid Peter Rolf Teague Lasser Andrew Blackledge Frank Marsman Brad Brookshire Cam MacFarlane Craig Mederios Jon Wright CaptObvious Jason Hise Phil Moyer Erik de Bruijn Alec Johnson Clemens Arbesser Ludwig Schubert Allen Faure Eric James Matheson Bayley Qeith Wreid jugettje dutchking Owen Campbell-Moore Atzin Espino-Murnane Johnny Vaughan Jacob Van Buren Jonatan R Ingvi Gautsson Michael Greve Tom O'Connor Laura Olds Jon Halliday Paul Hobbs Jeroen De Dauw Lupuleasa Ionuț Cooper Lawton Tim Neilson Eric Scammell Igor Keller Ben Glanton anul kumar sinha Tor Duncan Orr Will Glynn Tyler Herrmann Ian Munro Joshua Davis Jérôme Beaulieu Nathan Fish Peter Hozák Taras Bobrovytsky Jeremy Vaskó Richárd Benjamin Watkin Andrew Harcourt Luc Ritchie Nicholas Guyett James Hinchcliffe 12tone Oliver Habryka Chris Beacham Zachary Gidwitz Nikita Kiriy Andrew Schreiber Steve Trambert Mario Lois Braden Tisdale Abigail Novick Сергей Уваров Bela R Mink Chris Rimmer Edmund Fokschaner Grant Parks J Nate Gardner John Aslanides Mara ErikBln DragonSheep Richard Newcombe David Morgan Fionn Dmitri Afanasjev Marcel Ward Andrew Weir Kabs Miłosz Wierzbicki Tendayi Mawushe Jake Fish Wr4thon Martin Ottosen Robert Hildebrandt Andy Kobre Kees Darko Sperac Robert Valdimarsson Marco Tiraboschi Michael Kuhinica Fraser Cain Robin Scharf Klemen Slavic Patrick Henderson Oct todo22 Melisa Kostrzewski Hendrik Daniel Munter Alex Knauth Kasper Ian Reyes James Fowkes Tom Sayer Len Alan Bandurka Ben H Simon Pilkington Daniel Kokotajlo Diagon Andreas Blomqvist Bertalan Bodor Zannheim Daniel Eickhardt lyon549 14zRobot Ivan Jason Cherry Igor (Kerogi) Kostenko ib_ Thomas Dingemanse Stuart Alldritt Alexander Brown Devon Bernard Ted Stokes James Helms Jesper Andersson DeepFriedJif Chris Dinant Raphaël Lévy Johannes Walter Matt 
Stanton Garrett Maring Anthony Chiu Ghaith Tarawneh Julian Schulz Stellated Hexahedron Caleb Scott Viteri Clay Upton Conor Comiconor Michael Roeschter Georg Grass Isak Matthias Hölzl Jim Renney Edison Franklin Piers Calderwood Mikhail Tikhomirov Richard Otto Matt Brauer Jaeson Booker Mateusz Krzaczek Artem Honcharov Michael Walters Tomasz Gliniecki Mihaly Barasz Mark Woodward Ranzear Neil Palmere Rajeen Nabid Christian Epple Clark Schaefer Olivier Coutu Iestyn bleasdale-shepherd MojoExMachina Marek Belski Luke Peterson Eric Eldard Eric Rogstad Eric Carlson Caleb Larson Max Chiswick Aron David de Kloet Sam Freedo slindenau A21 Johannes Lindmark Nicholas Turner Tero K Valerio Galieni FJannis M I Ryan W Ammons Ludwig Krinner This person's name is too hard to pronounce kp contalloomlegs Everardo González Ávalos Knut Løklingholm Andrew McKnight Andrei Trifonov Aleks D Mutual Information
patreon.com/robertskmiles
The OTHER AI Alignment Problem: Mesa-Optimizers and Inner Alignment | Robert Miles AI Safety | 2021-02-16 | This "Alignment" thing turns out to be even harder than we thought.
# Other Media
The Simpsons Season 5 Episode 19: "Sweet Seymour Skinner's Baadasssss Song"
1970s Psychology study of imprinting in ducks. Behaviorism: http://youtu.be/2xd7o3z957c
With thanks to my excellent Patreon supporters: patreon.com/robertskmiles - Timothy Lillicrap - Gladamas - James - Scott Worley - Chad Jones - Shevis Johnson - JJ Hepboin - Pedro A Ortega - Said Polat - Chris Canal - Jake Ehrlich - Kellen lask - Francisco Tolmasky - Michael Andregg - David Reid - Peter Rolf - Teague Lasser - Andrew Blackledge - Frank Marsman - Brad Brookshire - Cam MacFarlane - Jason Hise - Phil Moyer - Erik de Bruijn - Alec Johnson - Clemens Arbesser - Ludwig Schubert - Allen Faure - Eric James - Matheson Bayley - Qeith Wreid - jugettje dutchking - Owen Campbell-Moore - Atzin Espino-Murnane - Johnny Vaughan - Jacob Van Buren - Jonatan R - Ingvi Gautsson - Michael Greve - Tom O'Connor - Laura Olds - Jon Halliday - Paul Hobbs - Jeroen De Dauw - Lupuleasa Ionuț - Cooper Lawton - Tim Neilson - Eric Scammell - Igor Keller - Ben Glanton - anul kumar sinha - Duncan Orr - Will Glynn - Tyler Herrmann - Tomas Sayder - Ian Munro - Joshua Davis - Jérôme Beaulieu - Nathan Fish - Taras Bobrovytsky - Jeremy - Vaskó Richárd - Benjamin Watkin - Sebastian Birjoveanu - Andrew Harcourt - Luc Ritchie - Nicholas Guyett - James Hinchcliffe - 12tone - Oliver Habryka - Chris Beacham - Zachary Gidwitz - Nikita Kiriy - Parker - Andrew Schreiber - Steve Trambert - Mario Lois - Abigail Novick - Сергей Уваров - Bela R - Mink - Fionn - Dmitri Afanasjev - Marcel Ward - Andrew Weir - Kabs - Miłosz Wierzbicki - Tendayi Mawushe - Jake Fish - Wr4thon - Martin Ottosen - Robert Hildebrandt - Poker Chen - Kees - Darko Sperac - Paul Moffat - Robert Valdimarsson - Marco Tiraboschi - Michael Kuhinica - Fraser Cain - Robin Scharf - Klemen Slavic - Patrick Henderson - Oct todo22 - Melisa Kostrzewski - Hendrik - Daniel Munter - Alex Knauth - Kasper - Ian Reyes - James Fowkes - Tom Sayer - Len - Alan Bandurka - Ben H - Simon Pilkington - Daniel Kokotajlo - Peter Hozák - Diagon - Andreas Blomqvist - Bertalan Bodor - David Morgan - Zannheim - Daniel Eickhardt - lyon549 - Ihor Mukha - 14zRobot - 
Ivan - Jason Cherry - Igor (Kerogi) Kostenko - ib_ - Thomas Dingemanse - Stuart Alldritt - Alexander Brown - Devon Bernard - Ted Stokes - James Helms - Jesper Andersson - DeepFriedJif - Chris Dinant - Raphaël Lévy - Johannes Walter - Matt Stanton - Garrett Maring - Anthony Chiu - Ghaith Tarawneh - Julian Schulz - Stellated Hexahedron - Caleb - Scott Viteri - Conor Comiconor - Michael Roeschter - Georg Grass - Isak - Matthias Hölzl - Jim Renney - Edison Franklin - Piers Calderwood - Krzysztof Derecki - Mikhail Tikhomirov - Richard Otto - Matt Brauer - Jaeson Booker - Mateusz Krzaczek - Artem Honcharov - Michael Walters - Tomasz Gliniecki - Mihaly Barasz - Mark Woodward - Ranzear - Neil Palmere - Rajeen Nabid - Christian Epple - Clark Schaefer - Olivier Coutu - Iestyn bleasdale-shepherd - MojoExMachina - Marek Belski - Luke Peterson - Eric Eldard - Eric Rogstad - Eric Carlson - Caleb Larson - Braden Tisdale - Max Chiswick - Aron - David de Kloet - Sam Freedo - slindenau - A21 - Rodrigo Couto - Johannes Lindmark - Nicholas Turner - Tero K patreon.com/robertskmilesQuantilizers: AI That Doesnt Try Too HardRobert Miles AI Safety2020-12-13 | How do you get an AI system that does better than a human could, without doing anything a human wouldn't?
Timothy Lillicrap Gladamas James Scott Worley Chad Jones Shevis Johnson JJ Hepboin Pedro A Ortega Said Polat Chris Canal Jake Ehrlich Kellen lask Francisco Tolmasky Michael Andregg David Reid Peter Rolf Teague Lasser Andrew Blackledge Frank Marsman Brad Brookshire Cam MacFarlane Vivek Nayak Jason Hise Phil Moyer Erik de Bruijn Alec Johnson Clemens Arbesser Ludwig Schubert Allen Faure Eric James Matheson Bayley Qeith Wreid jugettje dutchking Owen Campbell-Moore Atzin Espino-Murnane Johnny Vaughan Jacob Van Buren Jonatan R Ingvi Gautsson Michael Greve Tom O'Connor Laura Olds Jon Halliday Paul Hobbs Jeroen De Dauw Lupuleasa Ionuț Cooper Lawton Tim Neilson Eric Scammell Igor Keller Ben Glanton anul kumar sinha Duncan Orr Will Glynn Tyler Herrmann Tomas Sayder Ian Munro Jérôme Beaulieu Nathan Fish Taras Bobrovytsky Jeremy Vaskó Richárd Benjamin Watkin Sebastian Birjoveanu Andrew Harcourt Luc Ritchie Nicholas Guyett James Hinchcliffe 12tone Chris Beacham Zachary Gidwitz Nikita Kiriy Parker Andrew Schreiber Steve Trambert Mario Lois Abigail Novick heino hulsey-vincent Fionn Dmitri Afanasjev Marcel Ward Richárd Nagyfi Andrew Weir Kabs Miłosz Wierzbicki Tendayi Mawushe Jannik Olbrich Jake Fish Wr4thon Martin Ottosen Robert Hildebrandt Andy Kobre Poker Chen Kees Darko Sperac Paul Moffat Robert Valdimarsson Marco Tiraboschi Michael Kuhinica Fraser Cain Robin Scharf Klemen Slavic Patrick Henderson Oct todo22 Melisa Kostrzewski Hendrik Daniel Munter Alex Knauth Kasper Rob Dawson Ian Reyes James Fowkes Tom Sayer Len Alan Bandurka Ben H Simon Pilkington Daniel Kokotajlo Diagon Andreas Blomqvist Bertalan Bodor David Morgan Zannheim Daniel Eickhardt lyon549 HD Ihor Mukha 14zRobot Ivan Jason Cherry Igor (Kerogi) Kostenko ib_ Thomas Dingemanse Stuart Alldritt Alexander Brown Devon Bernard Ted Stokes James Helms Jesper Andersson Jim T DeepFriedJif Chris Dinant Raphaël Lévy Johannes Walter Matt Stanton Garrett Maring Anthony Chiu Ghaith Tarawneh Julian Schulz Stellated Hexahedron Caleb 
Scott Viteri Clay Upton Conor Comiconor Michael Roeschter Georg Grass Isak Matthias Hölzl Jim Renney Edison Franklin Piers Calderwood Krzysztof Derecki Mikhail Tikhomirov Richard Otto Matt Brauer Jaeson Booker Mateusz Krzaczek Artem Honcharov Michael Walters Tomasz Gliniecki Mihaly Barasz Mark Woodward Ranzear Neil Palmere Rajeen Nabid Christian Epple Clark Schaefer Olivier Coutu Iestyn bleasdale-shepherd MojoExMachina Marek Belski Eric Eldard Eric Rogstad Eric Carlson Caleb Larson Braden Tisdale Max Chiswick Phillip Brandel
patreon.com/robertskmiles
Sharing the Benefits of AI: The Windfall Clause | Robert Miles AI Safety | 2020-07-06 | AI might create enormous amounts of wealth, but how is it going to be distributed?
Gladamas Scott Worley JJ Hepboin Pedro A Ortega Said Polat Chris Canal Jake Ehrlich Kellen lask Francisco Tolmasky Michael Andregg David Reid Peter Rolf Chad Jones Teague Lasser Andrew Blackledge Frank Marsman Brad Brookshire Cam MacFarlane Jason Hise Erik de Bruijn Alec Johnson Clemens Arbesser Ludwig Schubert Bryce Daifuku Allen Faure Eric James Matheson Bayley Qeith Wreid jugettje dutchking Owen Campbell-Moore Atzin Espino-Murnane Phil Moyer Jacob Van Buren Jonatan R Ingvi Gautsson Michael Greve Julius Brash Tom O'Connor Shevis Johnson Laura Olds Jon Halliday Paul Hobbs Jeroen De Dauw Lupuleasa Ionuț Tim Neilson Eric Scammell Igor Keller Ben Glanton anul kumar sinha Sean Gibat Duncan Orr Cooper Lawton Will Glynn Tyler Herrmann Tomas Sayder Ian Munro Jérôme Beaulieu Nathan Fish Taras Bobrovytsky Jeremy Vaskó Richárd Benjamin Watkin Euclidean Plane Andrew Harcourt Luc Ritchie Nicholas Guyett James Hinchcliffe Oliver Habryka Chris Beacham Zachary Gidwitz Nikita Kiriy Andrew Schreiber Dmitri Afanasjev Marcel Ward Andrew Weir Ben Archer Kabs Miłosz Wierzbicki Tendayi Mawushe Jannik Olbrich Jake Fish Jussi Männistö Wr4thon Martin Ottosen Archy de Berker Andy Kobre Poker Chen Kees Paul Moffat Robert Valdimarsson Anders Öhrt Marco Tiraboschi Michael Kuhinica Fraser Cain Robin Scharf Klemen Slavic Patrick Henderson Oct todo22 Melisa Kostrzewski Hendrik Daniel Munter Alex Knauth Leo Rob Dawson Bryan Egan Robert Hildebrandt James Fowkes Len Alan Bandurka Ben H Tatiana Ponomareva Michael Bates Simon Pilkington Daniel Kokotajlo Fionn Diagon Andreas Blomqvist Bertalan Bodor David Morgan Ben Schultz Zannheim Daniel Eickhardt lyon549 HD Ihor Mukha 14zRobot Ivan Jason Cherry Igor (Kerogi) Kostenko ib_ Thomas Dingemanse Stuart Alldritt Alexander Brown Devon Bernard Ted Stokes Jesper Andersson Jim T Kasper DeepFriedJif Chris Dinant Raphaël Lévy Marko Topolnik Johannes Walter Matt Stanton Garrett Maring Mo Hossny Anthony Chiu Frank Kurka Ghaith Tarawneh Josh Trevisiol Julian Schulz 
Stellated Hexahedron Caleb Scott Viteri 12tone Clay Upton Brent ODell Conor Comiconor Michael Roeschter Georg Grass Isak Matthias Hölzl Jim Renney Michael V brown Martin Henriksen Edison Franklin Daniel Steele Piers Calderwood Krzysztof Derecki Mikhail Tikhomirov Richárd Nagyfi Richard Otto Alston Sleet Matt Brauer Jaeson Booker Mateusz Krzaczek Artem Honcharov Evan Ward Michael Walters Tomasz Gliniecki Mihaly Barasz Mark Woodward Ranzear Neil Palmere Rajeen Nabid
patreon.com/robertskmiles
10 Reasons to Ignore AI Safety | Robert Miles AI Safety | 2020-06-04 | Why do some ignore AI Safety? Let's look at 10 reasons people give (adapted from Stuart Russell's list).
With thanks to my excellent Patreon supporters: patreon.com/robertskmiles Gladamas James Scott Worley JJ Hepboin Pedro A Ortega Said Polat Chris Canal Jake Ehrlich Kellen lask Francisco Tolmasky Michael Andregg David Reid Peter Rolf Chad Jones Frank Kurka Teague Lasser Andrew Blackledge Vignesh Ravichandran Jason Hise Erik de Bruijn Clemens Arbesser Ludwig Schubert Bryce Daifuku Allen Faure Eric James Qeith Wreid jugettje dutchking Owen Campbell-Moore Atzin Espino-Murnane Jacob Van Buren Jonatan R Ingvi Gautsson Michael Greve Julius Brash Tom O'Connor Shevis Johnson Laura Olds Jon Halliday Paul Hobbs Jeroen De Dauw Lupuleasa Ionuț Tim Neilson Eric Scammell Igor Keller Ben Glanton anul kumar sinha Sean Gibat Duncan Orr Cooper Lawton Will Glynn Tyler Herrmann Tomas Sayder Ian Munro Jérôme Beaulieu Nathan Fish Taras Bobrovytsky Jeremy Vaskó Richárd Benjamin Watkin Sebastian Birjoveanu Euclidean Plane Andrew Harcourt Luc Ritchie Nicholas Guyett James Hinchcliffe Oliver Habryka Chris Beacham Nikita Kiriy robertvanduursen Dmitri Afanasjev Marcel Ward Andrew Weir Ben Archer Kabs Miłosz Wierzbicki Tendayi Mawushe Jannik Olbrich Anne Kohlbrenner Jussi Männistö Wr4thon Martin Ottosen Archy de Berker Andy Kobre Brian Gillespie Poker Chen Kees Darko Sperac Paul Moffat Anders Öhrt Marco Tiraboschi Michael Kuhinica Fraser Cain Klemen Slavic Patrick Henderson Oct todo22 Melisa Kostrzewski Hendrik Daniel Munter Leo Rob Dawson Bryan Egan Robert Hildebrandt James Fowkes Len Alan Bandurka Ben H Tatiana Ponomareva Michael Bates Simon Pilkington Daniel Kokotajlo Fionn Diagon Parker Lund Russell schoen Andreas Blomqvist Bertalan Bodor David Morgan Ben Schultz Zannheim Daniel Eickhardt lyon549 HD Ihor Mukha 14zRobot Ivan Jason Cherry Igor (Kerogi) Kostenko ib_ Thomas Dingemanse Alexander Brown Devon Bernard Ted Stokes Jesper Andersson Jim T Kasper DeepFriedJif Daniel Bartovic Chris Dinant Raphaël Lévy Marko Topolnik Johannes Walter Matt Stanton Garrett Maring Mo Hossny Anthony Chiu 
Ghaith Tarawneh Josh Trevisiol Julian Schulz Stellated Hexahedron Caleb Scott Viteri 12tone Nathaniel Raddin Clay Upton Brent ODell Conor Comiconor Michael Roeschter Georg Grass Isak Matthias Hölzl Jim Renney Michael V brown Martin Henriksen Edison Franklin Daniel Steele Piers Calderwood Krzysztof Derecki Zachary Gidwitz Mikhail Tikhomirov
patreon.com/robertskmiles
9 Examples of Specification Gaming | Robert Miles AI Safety | 2020-04-29 | AI systems do what you say, and it's hard to say exactly what you mean. Let's look at a list of real-life examples of specification gaming!
Gladamas James Steef Scott Worley Chad Jones Chris Canal David Reid Francisco Tolmasky Frank Kurka Jake Ehrlich JJ Hepboin Kellen lask Michael Andregg Pedro A Ortega Peter Rolf Said Polat Teague Lasser Allen Faure Bryce Daifuku Clemens Arbesser Eric James Erik de Bruijn Jason Hise jugettje dutchking Ludwig Schubert Qeith Wreid Andrew Harcourt anul kumar sinha Ben Glanton Benjamin Watkin Cooper Lawton Duncan Orr Eric Scammell Euclidean Plane Ian Munro Igor Keller Ingvi Gautsson James Hinchcliffe Jeroen De Dauw Jon Halliday Jonatan R Julius Brash Jérôme Beaulieu Laura Olds Luc Ritchie Lupuleasa Ionuț Michael Greve Nathan Fish Nicholas Guyett Paul Hobbs Sean Gibat Sebastian Birjoveanu Shevis Johnson Taras Bobrovytsky Tim Neilson Tom O'Connor Tomas Sayder Tyler Herrmann Vaskó Richárd Will Glynn 12tone 14zRobot Alan Bandurka Alexander Brown Anders Öhrt Andreas Blomqvist Andrew Weir Andy Kobre Anne Kohlbrenner Anthony Chiu Archy de Berker Ben Archer Ben H Ben Schultz Bertalan Bodor Brian Gillespie Bryan Egan Caleb Chris Dinant Daniel Bartovic Daniel Eickhardt Daniel Kokotajlo Daniel Munter Darko Sperac David Morgan DeepFriedJif Devon Bernard Diagon Dmitri Afanasjev Fionn Fraser Cain Garrett Maring Ghaith Tarawneh HD Hendrik ib_ Igor (Kerogi) Kostenko Ihor Mukha Ivan James Fowkes Jannik Olbrich Jason Cherry Jeremy Jesper Andersson Jim T Johannes Walter Josh Trevisiol Julian Schulz Jussi Männistö Kabs Kasper Kasper Schnack Kees Klemen Slavic Leo lyon549 Marc Pauly Marcel Ward Marco Tiraboschi Marko Topolnik Martin Ottosen Matt Stanton Melisa Kostrzewski Michael Bates Michael Kuhinica Miłosz Wierzbicki Mo Hossny Nathaniel Raddin Oct todo22 Owen Campbell-Moore Parker Lund Patrick Henderson Paul Moffat Poker Chen Rob Dawson Robert Hildebrandt robertvanduursen Robin Scharf Russell schoen Scott Viteri Simon Pilkington Stellated Hexahedron Tatiana Ponomareva Ted Stokes Tendayi Mawushe Thomas DingemanseTraining AI Without Writing A Reward Function, with Reward ModellingRobert 
Miles AI Safety2019-12-13 | How do you get a reinforcement learning agent to do what you want, when you can't actually write a reward function that specifies what that is?
Thanks to my wonderful patrons: patreon.com/robertskmiles James Gladamas Steef Scott Worley Jordan Medina Simon Strandgaard JJ Hepboin Pedro A Ortega Said Polat Chris Canal Jake Ehrlich Kellen lask Francisco Tolmasky Michael Andregg David Reid Robert Daniel Pickard Peter Rolf Chad Jones Richárd Nagyfi Jason Hise Phil Moyer Shevis Johnson Erik de Bruijn Alec Johnson Clemens Arbesser Ludwig Schubert Bryce Daifuku Allen Faure Eric James Qeith Wreid Jonatan R Ingvi Gautsson Michael Greve Julius Brash Tom O'Connor Robin Green Laura Olds Jon Halliday Paul Hobbs Jeroen De Dauw Lupuleasa Ionuț Tim Neilson Eric Scammell Igor Keller Ben Glanton anul kumar sinha Sean Gibat Cooper Lawton Will Glynn Tyler Herrmann Tomas Sayder Ian Munro Jérôme Beaulieu Nathan Fish Taras Bobrovytsky Anne Buit Vaskó Richárd Sebastian Birjoveanu Euclidean Plane Andrew Harcourt DGJono robertvanduursen Dmitri Afanasjev Marcel Ward Andrew Weir Ben Archer Kabs Miłosz Wierzbicki Tendayi Mawushe Jannik Olbrich Anne Kohlbrenner Jussi Männistö Wr4thon Martin Ottosen Archy de Berker Marc Pauly Andy Kobre Brian Gillespie Poker Chen Kees Darko Sperac Truls Paul Moffat Anders Öhrt Marco Tiraboschi Michael Kuhinica Fraser Cain Robin Scharf Seth Brothwell Kasper Schnack Klemen Slavic Patrick Henderson Oct todo22 Melisa Kostrzewski Hendrik Daniel Munter Graham Henry Duncan Orr Bryan Egan Robert Hildebrandt James Fowkes Alan Bandurka Ben H Tatiana Ponomareva Michael Bates Simon Pilkington Dion Gerald Bridger Petr Smital Daniel Kokotajlo Fionn Yuchong Li Diagon Parker Lund Paul Emmerich Russell schoen Andreas Blomqvist Bertalan Bodor David Morgan Jeremy Ben Schultz Zannheim Daniel Eickhardt lyon549 HD Ihor Mukha 14zRobot Ivan Arne Strasser Jason Cherry Igor (Kerogi) Kostenko Isaac Boates Thomas Dingemanse Davy Ker Alexander Brown Devon Bernard Ted Stokes James Helms Matheson Bayley patreon.com/robertskmilesAI That Doesnt Try Too Hard - Maximizers and SatisficersRobert Miles AI Safety2019-08-23 | Powerful AI 
systems can be dangerous in part because they pursue their goals as strongly as they can. Perhaps it would be safer to have systems that don't aim for perfection, and stop at 'good enough'. How could we build something like that?
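As a rough illustration of the difference between aiming for perfection and stopping at "good enough", here is a minimal sketch. The function names, the toy utility function, and the threshold are all hypothetical choices for illustration; this is not taken from the video and leaves out the complications it discusses.

```python
from typing import Callable, Iterable, Optional, TypeVar

A = TypeVar("A")


def maximize(actions: Iterable[A], utility: Callable[[A], float]) -> A:
    """Maximizer: always pick the highest-utility action available."""
    return max(actions, key=utility)


def satisfice(
    actions: Iterable[A], utility: Callable[[A], float], threshold: float
) -> Optional[A]:
    """Satisficer: take the first action that is 'good enough', then stop looking."""
    for action in actions:
        if utility(action) >= threshold:
            return action
    return None  # no action meets the threshold


# Hypothetical toy setting: each action is "collect n items", utility is n.
candidate_actions = [0, 10, 100, 10**6]
best = maximize(candidate_actions, utility=float)
good_enough = satisfice(candidate_actions, utility=float, threshold=100)
```

In this toy setting the maximizer picks the most extreme option, while the satisficer stops at the first option that clears the bar. The result depends on the order in which actions are considered, which is one reason a sketch like this is not a complete answer.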
Scott Worley Jordan Medina Simon Strandgaard JJ Hepboin Lupuleasa Ionuț Pedro A Ortega Said Polat Chris Canal Nicholas Kees Dupuis Jake Ehrlich Mark Hechim Kellen lask Francisco Tolmasky Michael Andregg Alexandru Dobre David Reid Robert Daniel Pickard Peter Rolf Chad Jones Truthdoc James Richárd Nagyfi Jason Hise Phil Moyer Shevis Johnson Alec Johnson Clemens Arbesser Ludwig Schubert Bryce Daifuku Allen Faure Eric James Jonatan R Ingvi Gautsson Michael Greve Julius Brash Tom O'Connor Erik de Bruijn Robin Green Laura Olds Jon Halliday Paul Hobbs Jeroen De Dauw Tim Neilson Eric Scammell Igor Keller Ben Glanton Robert Sokolowski anul kumar sinha Jérôme Frossard Sean Gibat Cooper Lawton Tyler Herrmann Tomas Sayder Ian Munro Jérôme Beaulieu Taras Bobrovytsky Anne Buit Tom Murphy Vaskó Richárd Sebastian Birjoveanu Gladamas Sylvain Chevalier DGJono Dmitri Afanasjev Brian Sandberg Marcel Ward Andrew Weir Ben Archer Scott McCarthy Kabs Miłosz Wierzbicki Tendayi Mawushe Jannik Olbrich Anne Kohlbrenner Jussi Männistö Mr Fantastic Wr4thon Martin Ottosen Archy de Berker Marc Pauly Joshua Pratt Andy Kobre Brian Gillespie Martin Wind Peggy Youell Poker Chen Kees Darko Sperac Truls Paul Moffat Anders Öhrt Marco Tiraboschi Michael Kuhinica Fraser Cain Robin Scharf Oren Milman John Rees Seth Brothwell Clark Mitchell Kasper Schnack Michael Hunter Klemen Slavic Patrick Henderson Long Nguyen Melisa Kostrzewski Hendrik Daniel Munter Graham Henry Volotat Duncan Orr Marin Aldimirov Bryan Egan James Fowkes Frame Problems Alan Bandurka Benjamin Hull Tatiana Ponomareva Aleksi Maunu Michael Bates Simon Pilkington Dion Gerald Bridger Steven Cope Marcos Alfredo Núñez Petr Smital Daniel Kokotajlo Fionn Yuchong Li Nathan Fish Diagon Parker Lund Russell schoen Andreas Blomqvist Bertalan Bodor David Morgan Ben Schultz Zannheim Daniel Eickhardt lyon549 HD
patreon.com/robertskmiles
Is AI Safety a Pascal's Mugging? | Robert Miles AI Safety | 2019-05-16 | An event that's very unlikely is still worth thinking about, if the consequences are big enough. What's the limit though?
Do we have to devote all of our resources to any outcome that might give infinite payoffs, even if it seems basically impossible? Does the case for AI Safety rely on this kind of Pascal's Wager argument? Watch this video to find out that the answer to these questions is 'No'.
Correction: At 6:34 the embedded video says 3^^^3 has 3.6 trillion digits, but that's actually only the size of 3^^4. 3^^^3 is enormously larger.
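The arithmetic behind this correction can be checked with a short sketch. It assumes Knuth's up-arrow notation, where 3^^n is a power tower of n threes and 3^^^3 = 3^^(3^^3). The helper name `tetrate` is made up for this example.

```python
import math


def tetrate(base: int, height: int) -> int:
    """Compute base^^height (a power tower); only feasible for tiny inputs."""
    result = 1
    for _ in range(height):
        result = base ** result
    return result


# 3^^3 = 3^(3^3) = 3^27, which is small enough to compute exactly.
three_tet_3 = tetrate(3, 3)

# 3^^4 = 3^(3^^3) is far too large to write out, but its digit count
# follows from logarithms: digits = floor(exponent * log10(3)) + 1.
digits_of_three_tet_4 = math.floor(three_tet_3 * math.log10(3)) + 1

print(f"3^^3 = {three_tet_3}")
print(f"3^^4 has about {digits_of_three_tet_4:.2e} digits")
```

By contrast, 3^^^3 is a power tower of threes whose height is 3^^3 itself, so even its number of digits cannot be written out in this way.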
Jason Hise Jordan Medina Scott Worley JJ Hepboin Pedro A Ortega Said Polat Chris Canal Nicholas Kees Dupuis Jake Ehrlich Mark Hechim Kellen lask Francisco Tolmasky Michael Andregg James Richárd Nagyfi Phil Moyer Shevis Johnson Alec Johnson Lupuleasa Ionuț Clemens Arbesser Bryce Daifuku Allen Faure Simon Strandgaard Jonatan R Michael Greve Julius Brash Tom O'Connor Erik de Bruijn Robin Green Laura Olds Jon Halliday Paul Hobbs Jeroen De Dauw Tim Neilson Eric Scammell Igor Keller Ben Glanton Robert Sokolowski anul kumar sinha Jérôme Frossard Sean Gibat A.Russell Cooper Lawton Tyler Herrmann Tomas Sayder Ian Munro Jérôme Beaulieu Gladamas Sylvain Chevalier DGJono robertvanduursen Dmitri Afanasjev Brian Sandberg Marcel Ward Andrew Weir Ben Archer Scott McCarthy Kabs Tendayi Mawushe Jannik Olbrich Anne Kohlbrenner Jussi Männistö Mr Fantastic Wr4thon Archy de Berker Marc Pauly Joshua Pratt Andy Kobre Brian Gillespie Martin Wind Peggy Youell Poker Chen Kees Truls Paul Moffat Anders Öhrt Marco Tiraboschi Michael Kuhinica Fraser Cain Robin Scharf Oren Milman John Rees Seth Brothwell Brian Goodrich Kasper Schnack Michael Hunter Klemen Slavic Patrick Henderson Long Nguyen Melisa Kostrzewski Hendrik Daniel Munter Graham Henry Volotat Duncan Orr Bryan Egan James Fowkes Frame Problems Alan Bandurka Benjamin Hull Dave Tapley Tatiana Ponomareva Aleksi Maunu Michael Bates Simon Pilkington Dion Gerald Bridger Steven Cope Petr Smital Daniel Kokotajlo Joshua Davis Fionn Tyler LaBean Roger Yuchong Li Nathan Fish Diagon Giancarlo Pace
patreon.com/robertskmiles
How to Keep Improving When You're Better Than Any Teacher - Iterated Distillation and Amplification | Robert Miles AI Safety | 2019-03-11 | [2nd upload] AI systems can be trained using demonstrations from experts, but how do you train them to out-perform those experts? Can this still be done even without clear win/loss criteria? And how do you do it safely?
With thanks to my wonderful Patrons: ( patreon.com/robertskmiles ) Steef Jason Strack Jordan Medina Jason Hise Scott Worley JJ Hepboin Pedro A Ortega Said Polat Chris Canal Nicholas Kees Dupuis James Richárd Nagyfi Phil Moyer Alec Johnson Clemens Arbesser Bryce Daifuku Simon Strandgaard Jonatan R Michael Greve The Guru Of Vision Volodymyr David Tjäder Julius Brash Tom O'Connor Erik de Bruijn Robin Green Laura Olds Jon Halliday Paul Hobbs Jeroen De Dauw Tim Neilson Eric Scammell Igor Keller Ben Glanton Robert Sokolowski anul kumar sinha Jérôme Frossard Sean Gibat Sun Sun andrew Russell Cooper Lawton Gladamas Sylvain Chevalier DGJono robertvanduursen Dmitri Afanasjev Brian Sandberg Einar Ueland Marcel Ward Andrew Weir Taylor Smith Ben Archer Scott McCarthy Kabs Kabs Kabs Tendayi Mawushe Jannik Olbrich Anne Kohlbrenner Bjorn Nyblad Jussi Männistö Mr Fantastic Wr4thon Archy de Berker Marc Pauly Joshua Pratt Shevis Johnson Andy Kobre Brian Gillespie Martin Wind Peggy Youell Poker Chen Kees Darko Sperac Truls Paul Moffat Jelle Langen Anders Öhrt Marco Tiraboschi Michael Kuhinica Fraser Cain Robin Scharf Oren Milman John Rees Shawn Hartsock Seth Brothwell Brian Goodrich Clark Mitchell Kasper Schnack Michael Hunter Klemen Slavic Patrick Henderson Long Nguyen Oct todo22 Melisa Kostrzewski Hendrik Daniel Munter Graham Henry Duncan OrrWhy Not Just: Think of AGI Like a Corporation?Robert Miles AI Safety2018-12-23 | Corporations are kind of like AIs, if you squint. How hard do you have to squint though, and is it worth it? In this video we ask: Are corporations artificial general superintelligences?
Media Sources: "SpaceX - How Not to Land an Orbital Rocket Booster" (youtu.be/bvim4rsNHkQ) Undertale - Turbosnail Clerks (1994) Zootopia (2016) AlphaGo (2017) Ready Player One (2018)
Jordan Medina Jason Hise Pablo Eder Scott Worley JJ Hepboin Pedro A Ortega James McCuen Richárd Nagyfi Phil Moyer Alec Johnson Bobby Cold Clemens Arbesser Simon Strandgaard Jonatan R Michael Greve The Guru Of Vision David Tjäder Julius Brash Tom O'Connor Erik de Bruijn Robin Green Laura Olds Jon Halliday Paul Hobbs Jeroen De Dauw Tim Neilson Eric Scammell Igor Keller Ben Glanton Robert Sokolowski Jérôme Frossard Sean Gibat Sylvain Chevalier DGJono robertvanduursen Scott Stevens Dmitri Afanasjev Brian Sandberg Marcel Ward Andrew Weir Ben Archer Scott McCarthy Kabs Kabs Kabs Tendayi Mawushe Jannik Olbrich Anne Kohlbrenner Jussi Männistö Mr Fantastic Wr4thon Dave Tapley Archy de Berker Kevin Marc Pauly Joshua Pratt Gunnar Guðvarðarson Shevis Johnson Andy Kobre Brian Gillespie Martin Wind Peggy Youell Poker Chen Kees Darko Sperac Truls Paul Moffat Anders Öhrt Lupuleasa Ionuț Marco Tiraboschi Michael Kuhinica Fraser Cain Robin Scharf Oren Milman John Rees Shawn Hartsock Seth Brothwell Brian Goodrich Michael S McReynolds Clark Mitchell Kasper Schnack Michael Hunter Klemen Slavic Patrick Henderson
patreon.com/robertskmiles
Safe Exploration: Concrete Problems in AI Safety Part 6 | Robert Miles AI Safety | 2018-09-21 | To learn, you need to try new things, but that can be risky. How do we make AI systems that can explore safely?
AI Safety Gridworlds: youtu.be/CGTkoUidQ8I
Why Would AI Want to do Bad Things? Instrumental Convergence: youtu.be/ZeecOKBus3Q
Scalable Supervision: Concrete Problems in AI Safety Part 5: youtu.be/nr1lHuFeq5w
The Evolved Radio and its Implications for Modelling the Evolution of Novel Sensors: https://people.duke.edu/~ng46/topics/evolved-radio.pdf
Jason Hise Steef Jason Strack Stefan Skiles Jordan Medina Scott Worley JJ Hepboin Alex Flint Pedro A Ortega James McCuen Richárd Nagyfi Alec Johnson Clemens Arbesser Simon Strandgaard Jonatan R Michael Greve The Guru Of Vision Alexander Hartvig Nielsen Volodymyr David Tjäder Julius Brash Tom O'Connor Ville Ahlgren Erik de Bruijn Robin Green Maksym Taran Laura Olds Jon Halliday Bobby Cold Paul Hobbs Jeroen De Dauw Tim Neilson Eric Scammell christopher dasenbrock Igor Keller Ben Glanton Robert Sokolowski Vlad D Jérôme Frossard Lupuleasa Ionuț Sylvain Chevalier DGJono robertvanduursen Scott Stevens Dmitri Afanasjev Brian Sandberg Einar Ueland Marcel Ward Andrew Weir Taylor Smith Ben Archer Scott McCarthy Kabs Phil Moyer Tendayi Mawushe Anne Kohlbrenner Bjorn Nyblad Jussi Männistö Mr Fantastic Matanya Loewenthal Wr4thon Dave Tapley Archy de Berker Pablo Eder Kevin Marc Pauly Joshua Pratt Gunnar Guðvarðarson Shevis Johnson Andy Kobre Manuel Weichselbaum Brian Gillespie Martin Wind Peggy Youell Poker Chen Kees Darko Sperac Paul Moffat Jelle Langen Lars Scholz Anders Öhrt Marco Tiraboschi Michael Kuhinica Fraser Cain Robin Scharf Oren Milman John Rees Gladamas Shawn Hartsock Seth Brothwell Brian Goodrich Michael S McReynolds
Media Sources:
"DashCam Russia - Crazy Drivers and Car Crashes 2018" (youtu.be/h50TQ3i9k5I)
Optimist Prime
"Hapless Boston Dynamics robot in shelf-stacking fail" (youtu.be/JzlsvFN_5HI)
"The Simpsons - Bart Gets Famous" (c) Fox 1994
"Donald Duck - Cured Duck" (c) Disney 1945
"Vase Breaking Slow Motion" (youtu.be/IJNPc_anP7U)
"Fastest quadcopter i've ever flown + Most Destructive Crash" (youtu.be/OKT4cx7UKsk)
"An athlete uses physics to shatter world records - Asaf Bar-Yosef" (youtu.be/RaGUW1d0w8g)
"Uber self-driving car crash in Tempe, Arizona" (youtu.be/XtTB8hTgHbM)
"Quadcopter Fx Simulator" (youtu.be/-6si8WkRtaY)
"Fallout - New Vegas by progamingwithed in 24:00 - AGDQ 2017 - Part 59" (youtu.be/nuzDif16_nc)
"Far Cry 5 out of 5 Physics Simulation" (youtu.be/4My0Bt30pX0)
Friend or Foe? AI Safety Gridworlds extra bit | Robert Miles AI Safety | 2018-06-24 | The last video about the AI Safety Gridworlds paper. How does an agent detect and adapt to friendly and adversarial intentions in the environment?
Jason Hise Steef Cooper Lawton Jason Strack Chad Jones Stefan Skiles Jordan Medina Manuel Weichselbaum Scott Worley JJ Hepboin Alex Flint Justin Courtright Pedro A Ortega James McCuen Richárd Nagyfi Ville Ahlgren Alec Johnson Clement Chiris Simon Strandgaard Joshua Richardson Jonatan R Michael Greve The Guru Of Vision Alexander Hartvig Nielsen Volodymyr David Tjäder Julius Brash Tom O'Connor Gunnar Guðvarðarson Shevis Johnson Erik de Bruijn Robin Green Alexei Vasilkov Maksym Taran Laura Olds Jon Halliday Robert Werner Paul Hobbs Jeroen De Dauw Enrico Ros Tim Neilson Eric Scammell christopher dasenbrock Igor Keller Morten Jelle Ben Glanton Robert Sokolowski Vlad D William Hendley DGJono robertvanduursen Scott Stevens Emilio Alvarez Dmitri Afanasjev Brian Sandberg Einar Ueland Marcel Ward Andrew Weir Taylor Smith Ben Archer Scott McCarthy Kabs Phil Tendayi Mawushe Anne Kohlbrenner Jake Fish Bjorn Nyblad Jussi Männistö Mr Fantastic Matanya Loewenthal Wr4thon Dave Tapley Archy de Berker Kevin Marc Pauly Joshua Pratt Andy Kobre Brian Gillespie Martin Wind Peggy Youell Poker Chen Kees Darko Sperac Paul Moffat Jelle Langen Lars Scholz Anders Öhrt Lupuleasa Ionuț Marco Tiraboschi Michael Kuhinica Fraser Cain Robin Scharf Oren Milman John Rees Shawn Hartsock Seth Brothwell
patreon.com/robertskmiles
AI Safety Gridworlds | Robert Miles AI Safety | 2018-05-25 | Got an AI safety idea? Now you can test it out! A recent paper from DeepMind sets out some environments for evaluating the safety of AI systems, and the code is on GitHub.
- Jason Hise - Steef - Cooper Lawton - Jason Strack - Chad Jones - Stefan Skiles - Jordan Medina - Manuel Weichselbaum - Scott Worley - JJ Hepboin - Alex Flint - Justin Courtright - James McCuen - Richárd Nagyfi - Ville Ahlgren - Alec Johnson - Simon Strandgaard - Joshua Richardson - Jonatan R - Michael Greve - The Guru Of Vision - Fabrizio Pisani - Alexander Hartvig Nielsen - Volodymyr - David Tjäder - Paul Mason - Ben Scanlon - Julius Brash - Mike Bird - Tom O'Connor - Gunnar Guðvarðarson - Shevis Johnson - Erik de Bruijn - Robin Green - Alexei Vasilkov - Maksym Taran - Laura Olds - Jon Halliday - Robert Werner - Paul Hobbs - Jeroen De Dauw - Enrico Ros - Tim Neilson - Eric Scammell - christopher dasenbrock - Igor Keller - William Hendley - DGJono - robertvanduursen - Scott Stevens - Michael Ore - Dmitri Afanasjev - Brian Sandberg - Einar Ueland - Marcel Ward - Andrew Weir - Taylor Smith - Ben Archer - Scott McCarthy - Kabs Kabs - Phil - Tendayi Mawushe - Gabriel Behm - Anne Kohlbrenner - Jake Fish - Bjorn Nyblad - Jussi Männistö - Mr Fantastic - Matanya Loewenthal - Wr4thon - Dave Tapley - Archy de Berker - Kevin - Marc Pauly - Joshua Pratt - Andy Kobre - Brian Gillespie - Martin Wind - Peggy Youell - Poker Chen - pmilian - Kees - Darko Sperac - Paul Moffat - Jelle Langen - Lars Scholz - Anders Öhrt - Lupuleasa Ionuț - Marco Tiraboschi - Peter Kjeld Andersen - Michael Kuhinica - Fraser Cain - Robin Scharf - Oren MilmanExperts Predictions about the Future of AIRobert Miles AI Safety2018-03-31 | When will AI systems surpass human performance? I don't know, do you? No you don't. Let's see what 352 top AI researchers think.
[CORRECTION: I mistakenly stated that the survey was before AlphaGo beat Lee Sedol. The 12 year prediction was for AI to outperform humans *after having only played as many games as a human plays in their lifetime*]
Jason Hise Steef Jason Strack Chad Jones Stefan Skiles Jordan Medina Manuel Weichselbaum 1RV34 Scott Worley JJ Hepboin Alex Flint James McCuen Richárd Nagyfi Ville Ahlgren Alec Johnson Simon Strandgaard Joshua Richardson Jonatan R Michael Greve The Guru Of Vision Fabrizio Pisani Alexander Hartvig Nielsen Volodymyr David Tjäder Paul Mason Ben Scanlon Julius Brash Mike Bird Tom O'Connor Gunnar Guðvarðarson Shevis Johnson Erik de Bruijn Robin Green Alexei Vasilkov Maksym Taran Laura Olds Jon Halliday Robert Werner Paul Hobbs Jeroen De Dauw Konsta William Hendley DGJono robertvanduursen Scott Stevens Michael Ore Dmitri Afanasjev Brian Sandberg Einar Ueland Marcel Ward Andrew Weir Taylor Smith Ben Archer Scott McCarthy Kabs Kabs Phil Tendayi Mawushe Gabriel Behm Anne Kohlbrenner Jake Fish Bjorn Nyblad Jussi Männistö Mr Fantastic Matanya Loewenthal Wr4thon Dave Tapley Archy de Berker Kevin Vincent Sanders Marc Pauly Andy Kobre Brian Gillespie Martin Wind Peggy Youell Poker Chen Kees Darko Sperac Paul Moffat Noel Kocheril Jelle Langen Lars ScholzWhy Would AI Want to do Bad Things? Instrumental ConvergenceRobert Miles AI Safety2018-03-24 | How can we predict that AGI with unknown goals would behave badly by default?
Jason Hise Steef Jason Strack Chad Jones Stefan Skiles Jordan Medina Manuel Weichselbaum 1RV34 Scott Worley JJ Hepboin Alex Flint James McCuen Richárd Nagyfi Ville Ahlgren Alec Johnson Simon Strandgaard Joshua Richardson Jonatan R Michael Greve The Guru Of Vision Fabrizio Pisani Alexander Hartvig Nielsen Volodymyr David Tjäder Paul Mason Ben Scanlon Julius Brash Mike Bird Tom O'Connor Gunnar Guðvarðarson Shevis Johnson Erik de Bruijn Robin Green Alexei Vasilkov Maksym Taran Laura Olds Jon Halliday Robert Werner Paul Hobbs Jeroen De Dauw Konsta William Hendley DGJono robertvanduursen Scott Stevens Michael Ore Dmitri Afanasjev Brian Sandberg Einar Ueland Marcel Ward Andrew Weir Taylor Smith Ben Archer Scott McCarthy Kabs Kabs Phil Tendayi Mawushe Gabriel Behm Anne Kohlbrenner Jake Fish Bjorn Nyblad Jussi Männistö Mr Fantastic Matanya Loewenthal Wr4thon Dave Tapley Archy de Berker Kevin Vincent Sanders Marc Pauly Andy Kobre Brian Gillespie Martin Wind Peggy Youell Poker Chen Kees Darko Sperac Paul Moffat Noel Kocheril Jelle Langen Lars ScholzSuperintelligence Mod for Civilization VRobert Miles AI Safety2018-02-13 | Let's play this new mod for Civ 5 that makes AGI an available technology! Can we guide humanity to a utopian AI future, or will we destroy ourselves?
Jason Hise Steef Jason Strack Chad Jones Stefan Skiles Jordan Medina Manuel Weichselbaum 1RV34 Scott Worley JJ Hepboin James McCuen Richárd Nagyfi Ville Ahlgren Alec Johnson Trevor Alexander Nestor Clement Chiris Simon Strandgaard Joshua Richardson Jonatan R Michael Greve The Guru Of Vision Fabrizio Pisani Alexander Hartvig Nielsen Volodymyr David Tjäder Paul Mason Ben Scanlon Julius Brash Mike Bird Tom O'Connor Gunnar Guðvarðarson Shevis Johnson Erik de Bruijn Robin Green Alexei Vasilkov Maksym Taran Laura Olds Jon Halliday Robert Werner Paul Hobbs Jeroen De Dauw Roman Nekhoroshev Konsta William Hendley DGJono robertvanduursen Scott Stevens Emilio Alvarez Michael Ore Dmitri Afanasjev Brian Sandberg Einar Ueland Lo Rez Marcel Ward Andrew Weir Taylor Smith Ben Archer Scott McCarthy Kabs Kabs Phil Tendayi Mawushe Gabriel Behm Anne Kohlbrenner Jake Fish Bjorn Nyblad Stefan Laurie Jussi Männistö Cameron Kinsel Matanya Loewenthal Wr4thon Dave Tapley Archy de Berker Kevin Vincent Sanders Marc Pauly Andy Kobre Brian Gillespie Martin Wind Peggy Youell Poker Chen Kees Boon Darko Sperac Paul Moffat Noel Kocheril Jelle Langen
patreon.com/robertskmiles
Intelligence and Stupidity: The Orthogonality Thesis | Robert Miles AI Safety | 2018-01-11 | Can highly intelligent agents have stupid goals? A look at The Orthogonality Thesis and the nature of stupidity.
patreon.com/robertskmiles With thanks to my wonderful Patreon supporters: - Steef - Sara Tjäder - Jason Strack - Chad Jones - Stefan Skiles - Ziyang Liu - Jordan Medina - Jason Hise - Manuel Weichselbaum - 1RV34 - James McCuen - Richárd Nagyfi - Ammar Mousali - Scott Zockoll - Ville Ahlgren - Alec Johnson - Simon Strandgaard - Joshua Richardson - Jonatan R - Michael Greve - robertvanduursen - The Guru Of Vision - Fabrizio Pisani - Alexander Hartvig Nielsen - Volodymyr - David Tjäder - Paul Mason - Ben Scanlon - Julius Brash - Mike Bird - Tom O'Connor - Gunnar Guðvarðarson - Shevis Johnson - Erik de Bruijn - Robin Green - Alexei Vasilkov - Maksym Taran - Laura Olds - Jon Halliday - Robert Werner - Roman Nekhoroshev - Konsta - William Hendley - DGJono - Matthias Meger - Scott Stevens - Emilio Alvarez - Michael Ore - Dmitri Afanasjev - Brian Sandberg - Einar Ueland - Lo Rez - Marcel Ward - Andrew Weir - Taylor Smith - Ben Archer - Scott McCarthy - Kabs Kabs - Phil - Tendayi Mawushe - Gabriel Behm - Anne Kohlbrenner - Jake Fish - Bjorn Nyblad - Stefan Laurie - Jussi Männistö - Cameron Kinsel - Matanya Loewenthal - Wr4thon - Dave Tapley - Archy de Berker - Kevin - Vincent Sanders - Marc Pauly - Andy Kobre - Brian Gillespie - Martin Wind - Peggy Youell - Poker Chen patreon.com/robertskmilesScalable Supervision: Concrete Problems in AI Safety Part 5Robert Miles AI Safety2017-11-29 | Why can't we just have humans overseeing our AI systems?
patreon.com/robertskmiles With thanks to my wonderful Patreon supporters: - Steef - Sara Tjäder - Jason Strack - Chad Jones - Stefan Skiles - Ziyang Liu - Jordan Medina - Jason Hise - Heavy Empty - Manuel Weichselbaum - James McCuen - Richárd Nagyfi - Ammar Mousali - Scott Zockoll - Charles Miller - Joshua Richardson - Jonatan R - Michael Greve - robertvanduursen - The Guru Of Vision - Fabrizio Pisani - Alexander Hartvig Nielsen - Volodymyr - David Tjäder - Paul Mason - Ben Scanlon - Julius Brash - Mike Bird - Taylor Winning - Ville Ahlgren - Johannes David - Andrew Pearce - Gunnar Guðvarðarson - Shevis Johnson - Erik de Bruijn - Robin Green - Alexei Vasilkov - Roman Nekhoroshev - Peggy Youell - Konsta - William Hendley - Almighty Dodd - DGJono - Matthias Meger - Scott Stevens - Emilio Alvarez - Michael Ore - Robert Bridges - Dmitri Afanasjev - Brian Sandberg - Einar Ueland - Lo Rez - Stephen Paul - Marcel Ward - Andrew Weir - Pontus Carlsson - Taylor Smith - Ben Archer - Ivan Pochesnev - Scott McCarthy - Kabs Kabs Kabs - Phil - Christopher Askin - Tendayi Mawushe - Gabriel Behm - Anne Kohlbrenner - Jake Fish - David Rasmussen - Bjorn Nyblad - Stefan Laurie - Tom O'Connor - pmilian - Jussi Männistö - Cameron Kinsel - Matanya Loewenthal - Wr4thon - Dave Tapley - Archy de Berker - Kevin - Vincent Sanders - Marc Pauly - Andy Kobre - Brian Gillespie patreon.com/robertskmilesAI Safety at EAGlobal2017 ConferenceRobert Miles AI Safety2017-11-16 | I attended a charity conference to learn about AI Safety!
Correction: Allan Dafoe is funded by a grant from the Open Philanthropy Project, but does not work for them.
With thanks to my Patrons! (patreon.com/robertskmiles) Steef Sara Tjäder Jason Strack Chad Jones Stefan Skiles Katie Byrne Ziyang Liu Jordan Medina Kyle Scott Jason Hise Heavy Empty James McCuen Richárd Nagyfi Ammar Mousali Scott Zockoll Charles Miller Joshua Richardson Jonatan R Michael Greve robertvanduursen The Guru Of Vision Fabrizio Pisani Alexander Hartvig Nielsen Volodymyr David Tjäder Paul Mason Ben Scanlon Julius Brash Mike Bird Taylor Winning Ville Ahlgren Johannes David Andrew Pearce Gunnar Guðvarðarson Shevis Johnson Erik de Bruijn Robin Green Roman Nekhoroshev Peggy Youell Konsta William Hendley Adam Dodd DGJono Matthias Meger Scott Stevens Michael Ore Robert Bridges Dmitri Afanasjev Brian Sandberg Einar Ueland Lo Rez Stephen Paul Marcel Ward Andrew Weir Pontus Carlsson Taylor Smith Ben Archer Ivan Pochesnev Scott McCarthy Kabs Kabs Kabs Phil Christopher Askin Tendayi Mawushe Gabriel Behm Anne Kohlbrenner Jake Fish David Rasmussen Filip Bjorn Nyblad Stefan Laurie Tom O'Connor pmilian Jussi Männistö Cameron Kinsel Matanya Loewenthal Wr4thon Dave Tapley Archy de Berker
patreon.com/robertskmiles
AI learns to Create ̵K̵Z̵F̵ ̵V̵i̵d̵e̵o̵s̵ Cat Pictures: Papers in Two Minutes #1 | Robert Miles AI Safety | 2017-10-29 | Some beautiful new GAN results have been published, so let's have a quick look at the pretty pictures. More AI Safety coming soon of course.
Steef Sara Tjäder Jason Strack Chad Jones Stefan Skiles Katie Byrne Ziyang Liu Jordan Medina Kyle Scott Jason Hise David Rasmussen Heavy Empty James McCuen Richárd Nagyfi Ammar Mousali Scott Zockoll Charles Miller Joshua Richardson Jonatan R Michael Greve robertvanduursen The Guru Of Vision Fabrizio Pisani Alexander Hartvig Nielsen Volodymyr David Tjäder Paul Mason Ben Scanlon Julius Brash Mike Bird Taylor Winning Ville Ahlgren Roman Nekhoroshev Peggy Youell Konsta William Hendley Almighty Dodd DGJono Matthias Meger Scott Stevens Michael Ore Robert Bridges Dmitri Afanasjev Brian Sandberg Einar Ueland Lo Rez Stephen Paul Marcel Ward Andrew Weir Pontus Carlsson Taylor Smith Ben Archer Ivan Pochesnev Scott McCarthy Kabs Kabs Kabs Kabs Phil Christopher Askin Tendayi Mawushe Gabriel Behm Anne Kohlbrenner Jake Fish Filip Bjorn Nyblad Stefan Laurie Tom O'Connor pmilian Jussi Männistö Cameron Kinsel Matanya Loewenthal Wr4thon Dave Tapley Archy de Berker
patreon.com/robertskmiles
What can AGI do? I/O and Speed | Robert Miles AI Safety | 2017-10-17 | Suppose we make an algorithm that implements general intelligence as well as the brain. What could that system do? It might have better input and output than a human, and probably could be run faster...
Steef Sara Tjäder Jason Strack Chad Jones Stefan Skiles Katie Byrne Ziyang Liu Jordan Medina Kyle Scott Jason Hise David Rasmussen Heavy Empty James McCuen Richárd Nagyfi Ammar Mousali Scott Zockoll Charles Miller Joshua Richardson Jonatan R Øystein Flygt Michael Greve robertvanduursen The Guru Of Vision Fabrizio Pisani Alexander Hartvig Nielsen Volodymyr David Tjäder Paul Mason Ben Scanlon Julius Brash Mike Bird Taylor Winning Ville Ahlgren
Roman Nekhoroshev Peggy Youell Konstantin Shabashov William Hendley Adam Dodd DGJono Matthias Meger Scott Stevens Michael Ore Robert Bridges Dmitri Afanasjev Brian Sandberg Einar Ueland Lo Rez Stephen Paul Marcel Ward Andrew Weir Pontus Carlsson Taylor Smith Ben Archer Ivan Pochesnev Scott McCarthy Kabs Phil Christopher Tendayi Mawushe Gabriel Behm Anne Kohlbrenner Jake Fish Jennifer Autumn Latham Filip Bjorn Nyblad Stefan Laurie Tom O'Connor pmilian Jussi Männistö Cameron Kinsel Matanya Loewenthal Wr4thon Dave Tapley Archy de Berker
patreon.com/robertskmiles
What Can We Do About Reward Hacking?: Concrete Problems in AI Safety Part 4 | Robert Miles AI Safety | 2017-09-24 | Three different approaches that might help to prevent reward hacking.
Steef Sara Tjäder Jason Strack Chad Jones Stefan Skiles Katie Byrne Ziyang Liu Jordan Medina Kyle Scott Jason Hise David Rasmussen Heavy Empty James McCuen Richárd Nagyfi Ammar Mousali Scott Zockoll Charles Miller Joshua Richardson Fabian Consiglio Jonatan R Øystein Flygt Björn Mosten Michael Greve robertvanduursen The Guru Of Vision Fabrizio Pisani A Hartvig Nielsen Volodymyr David Tjäder Paul Mason Ben Scanlon Julius Brash Mike Bird Taylor Winning Roman Nekhoroshev Peggy Youell Konstantin Shabashov Dodd Almighty DGJono Matthias Meger Scott Stevens Emilio Alvarez Michael Ore Robert Bridges Dmitri Afanasjev Brian Sandberg Einar Ueland Lo Rez C3POehne Stephen Paul Marcel Ward Andrew Weir Pontus Carlsson Taylor Smith Ben Archer Ivan Pochesnev Scott McCarthy Kabs Kabs Kabs Phil Philip Alexander Christopher Tendayi Mawushe Gabriel Behm Anne Kohlbrenner Jake Fish Jennifer Autumn Latham Filip Bjorn Nyblad Stefan Laurie Tom O'Connor Krethys PiotrekM Jussi Männistö Matanya Loewenthal Wr4thonReward Hacking Reloaded: Concrete Problems in AI Safety Part 3.5Robert Miles AI Safety2017-08-29 | Goodhart's Law, Partially Observed Goals, and Wireheading: some more reasons for AI systems to find ways to 'cheat' and get more reward than we intended.
Steef Sara Tjäder Jason Strack Chad Jones Ichiro Dohi Stefan Skiles Katie Byrne Ziyang Liu Jordan Medina Kyle Scott Jason Hise David Rasmussen James McCuen Richárd Nagyfi Ammar Mousali Scott Zockoll Charles Miller Joshua Richardson Fabian Consiglio Jonatan R Øystein Flygt Björn Mosten Michael Greve robertvanduursen The Guru Of Vision Fabrizio Pisani Alexander Hartvig Nielsen Volodymyr David Tjäder Paul Mason Ben Scanlon Julius Brash Mike Bird Taylor Winning Roman Nekhoroshev Peggy Youell Konstantin Shabashov Almighty Dodd DGJono Matthias Meger Scott Stevens Emilio Alvarez Benjamin Aaron Degenhart Michael Ore Robert Bridges Dmitri Afanasjev Brian Sandberg Einar Ueland Lo Rez C3POehne Stephen Paul Marcel Ward Andrew Weir Pontus Carlsson Taylor Smith Ben Archer Ivan Pochesnev Scott McCarthy Kabs Kabs Phil Philip Alexander Christopher Tendayi Mawushe Gabriel Behm Anne KohlbrennerThe other Killer Robot Arms Race Elon Musk should worry aboutRobert Miles AI Safety2017-08-22 | Elon Musk is in the news, talking to the UN about autonomous weapons. This seems like a good time to explain one area where we don't quite agree about AI Safety.
Steef Sara Tjäder Jason Strack Chad Jones Ichiro Dohi Stefan Skiles Katie Byrne Ziyang Liu Jordan Medina Kyle Scott Jason Hise David Rasmussen James McCuen Richárd Nagyfi Ammar Mousali Scott Zockoll Joshua Richardson Fabian Consiglio Jonatan R Øystein Flygt Björn Mosten Michael Greve robertvanduursen The Guru Of Vision Fabrizio Pisani Alexander Hartvig Nielsen Volodymyr David Tjäder Paul Mason Ben Scanlon Julius Brash Mike Bird Taylor Winning Peggy Youell Konstantin Shabashov Almighty Dodd DGJono Matthias Meger Scott Stevens Emilio Alvarez Benjamin Aaron Degenhart Michael Ore Robert Bridges Dmitri Afanasjev Brian Sandberg Einar Ueland Lo Rez C3POehne Stephen Paul Marcel Ward Andrew Weir Pontus Carlsson Taylor Smith Ben Archer Ivan Pochesnev Scott McCarthy Kabs Kabs Phil Philip Alexander Christopher Tendayi Mawushe Gabriel Behm Anne Kohlbrenner Jake Fish Jennifer Autumn Latham Filip Bjorn Nyblad Stefan Laurie Tom O'Connor KrethysReward Hacking: Concrete Problems in AI Safety Part 3Robert Miles AI Safety2017-08-12 | Sometimes AI can find ways to 'cheat' and get more reward than we intended by doing something unexpected.
Jordan Medina FHI's own Kyle Scott Jason Hise David Rasmussen James McCuen Richárd Nagyfi Ammar Mousali Joshua Richardson Fabian Consiglio Jonatan R Øystein Flygt Björn Mosten Michael Greve robertvanduursen The Guru Of Vision Fabrizio Pisani Alexander Hartvig Nielsen Volodymyr David Tjäder Paul Mason Ben Scanlon Julius Brash Mike Bird Peggy Youell Konstantin Shabashov Almighty Dodd DGJono Matthias Meger Scott Stevens Emilio Alvarez Benjamin Aaron Degenhart Michael Ore Robert Bridges Dmitri Afanasjev Brian Sandberg Einar Ueland Lo Rez C3POehne Stephen Paul Marcel Ward Andrew Weir Pontus Carlsson Taylor Smith Ben Archer Ivan Pochesnev Scott McCarthy Kabilan Kabilan Kabilan Kabilan Phil Philip Alexander Christopher Tendayi Mawushe Gabriel Behm Anne Kohlbrenner Jake Fish Jennifer Autumn LathamWhy Not Just: Raise AI Like Kids?Robert Miles AI Safety2017-07-22 | Newly made Artificial General Intelligences are basically like children, right? So we already know we can teach them how to behave, right? Wrong.
Thanks to my amazing Patreon Supporters: Sara Tjäder Jason Strack Chad Jones Ichiro Dohi Stefan Skiles Katie Byrne Ziyang Liu Jordan Medina James McCuen Joshua Richardson Fabian Consiglio Jonatan R Øystein Flygt Björn Mosten Michael Greve robertvanduursen The Guru Of Vision Fabrizio Pisani Alexander Hartvig Nielsen Volodymyr Peggy Youell Konstantin Shabashov Almighty Dodd DGJono Matthias Meger Scott Stevens Emilio Alvarez Benjamin Aaron Degenhart Michael Ore Robert Bridges Dmitri Afanasjev Brian Sandberg Einar Ueland Lo Rez C3POehne
patreon.com/robertskmiles
Empowerment: Concrete Problems in AI Safety part 2 | Robert Miles AI Safety | 2017-07-09 | Maybe AI systems would be safer if they avoid gaining too much control over their environment? How might that work?
Thanks to my amazing Patreon Supporters: Sara Tjäder Jason Strack Chad Jones Ichiro Dohi Stefan Skiles Katie Byrne Ziyang Liu Jordan Medina James McCuen Joshua Richardson Fabian Consiglio Jonatan R Øystein Flygt Björn Mosten Michael Greve robertvanduursen The Guru Of Vision Fabrizio Pisani Alexander Hartvig Nielsen Volodymyr Peggy Youell Konstantin Shabashov Almighty Dodd DGJono Matthias Meger Scott Stevens Emilio Alvarez Benjamin Aaron Degenhart Michael Ore Robert Bridges Dmitri Afanasjev Brian Sandberg Einar Ueland Lo Rez C3POehne patreon.com/robertskmiles
Avoiding Positive Side Effects: Concrete Problems in AI Safety part 1.5 | Robert Miles AI Safety | 2017-06-25 | This is a follow-up to this earlier video: youtu.be/lqJUIqZNzP8 There's another problem with minimising side effects...
Thanks to my amazing Patreon Supporters: Chad Jones Ichiro Dohi Stefan Skiles Katie Byrne Ziyang Liu James McCuen Joshua Richardson Fabian Consiglio Jonatan R Øystein Flygt Björn Mosten Michael Greve robertvanduursen The Guru Of Vision Fabrizio Pisani Alexander Hartvig Nielsen Volodymyr Peggy Youell Konstantin Shabashov The Dodd DGJono Matthias Meger Scott Stevens Emilio Alvarez Benjamin Aaron Degenhart Michael Ore Robert Bridges Dmitri Afanasjev Brian Sandberg Einar Ueland Lo Rez patreon.com/robertskmiles
Avoiding Negative Side Effects: Concrete Problems in AI Safety part 1 | Robert Miles AI Safety | 2017-06-18 | We can expect AI systems to accidentally create serious negative side effects - how can we avoid that? The first of several videos about the paper "Concrete Problems in AI Safety".
Thanks again to all my wonderful Patreon Supporters: - Chad Jones - Ichiro Dohi - Stefan Skiles - Katie Byrne - Ziyang Liu - Joshua Richardson - Fabian Consiglio - Jonatan R - Øystein Flygt - Björn Mosten - Michael Greve - robertvanduursen - The Guru Of Vision - Fabrizio Pisani - Alexander Hartvig Nielsen - Volodymyr - Peggy Youell - Konstantin Shabashov - Adam Dodd - DGJono - Matthias Meger - Scott Stevens - Emilio Alvarez - Benjamin Aaron Degenhart - Michael Ore - Robert Bridges - Dmitri Afanasjev patreon.com/robertskmiles
Robert Miles Live Stream | Robert Miles AI Safety | 2017-06-17 | ...
Are AI Risks like Nuclear Risks? | Robert Miles AI Safety | 2017-06-10 | Concerns about AI cover a really wide range of possible problems. Can we make progress on several of these problems at once?
With thanks to my Patreon supporters: - Ichiro Dohi - Stefan Skiles - Chad Jones - Joshua Richardson - Fabian Consiglio - Jonatan R - Øystein Flygt - Björn Mosten - Michael Greve - robertvanduursen - The Guru Of Vision - Fabrizio Pisani - Alexander Hartvig Nielsen - Peggy Youell - Konstantin Shabashov - The Dodd - DGJono - Matthias Meger - Scott Stevens - Emilio Alvarez patreon.com/robertskmiles
Respectability | Robert Miles AI Safety | 2017-05-27 | It can be hard to get people to take AI Safety concerns seriously, but it's a lot easier now than it used to be.
With thanks to my wonderful Patreon Supporters: - Ichiro Dohi - Stefan Skiles - Chad Jones - Joshua Richardson - Fabian Consiglio - Jonatan R - Øystein Flygt - Björn Mosten - Michael Greve - robertvanduursen - The Guru Of Vision - Fabrizio Pisani - Peggy Youell - Konstantin Shabashov - Adam Dodd - DGJono - Matthias Meger http://patreon.com/robertskmiles
Predicting AI: RIP Prof. Hubert Dreyfus | Robert Miles AI Safety | 2017-05-18 | It's hard to predict what AI will be like in the future. Many tried in the past, and all failed to some extent. In this video we look at Professor Hubert Dreyfus, and one of his reasons for thinking AI couldn't be done.
Some of Dreyfus' work:
"What Computers Can't Do": archive.org/details/whatcomputerscan017504mbp
"Alchemy and Artificial Intelligence": https://courses.csail.mit.edu/6.803/pdf/dreyfussummaryandconclusion.pdf
Here's that paper criticising him: "The Artificial Intelligence of Hubert L. Dreyfus: A Budget of Fallacies": https://dspace.mit.edu/handle/1721.1/6084
With thanks to my excellent Patreon Supporters: - Ichiro Dohi - Chad Jones - Joshua Richardson - Fabian Consiglio - Jonatan R - Øystein Flygt - Björn Mosten - Peggy Youell - Konstantin Shabashov - Almighty Dodd - DGJono
patreon.com/robertskmiles
Whats the Use of Utility Functions? | Robert Miles AI Safety | 2017-04-27 | A lot of our problems with AI seem to relate to its utility function. Why do we need one of those, anyway?
Footage from The Simpsons, Copyright FOX, used under Fair Use
With thanks to everyone who gave their suggestions and feedback about drafts of this video. And thanks to my Patreon Supporters: Chad Jones Peggy Youell Sylvain Chevalier
Thanks to Computerphile for permission to use the clips. Just about every time I say "I made" in this video, I mean "Computerphile made, using footage of me". youtube.com/user/Computerphile
Support me on Patreon, if you like! http://patreon.com/robertskmiles I just set it up, cause people asked for it, and I'm not sure how that all works, but let's figure it out together :)
Let me know what you want to see on this channel!
Status Report | Robert Miles AI Safety | 2017-03-18 | Videos on the way as soon as I can get a computer that can edit videos
Channel Introduction | Robert Miles AI Safety | 2017-02-28 | Welcome to the channel! I'm new to making YouTube videos myself, so sorry for the bad editing.