Transcript
i0UyKsAEaNI • How to Build AGI? (Ilya Sutskever) | AI Podcast Clips
/home/itcorpmy/itcorp.my.id/harry/yt_channel/out/lexfridman/.shards/text-0001.zst#text/0382_i0UyKsAEaNI.txt
Kind: captions Language: en what do you think it takes to let's talk about AGI a little bit what do you think it takes to build a system of human level intelligence we talked about reasoning we talked about long term memory but in general what does it take you think well I can't be sure but I think the deep learning plus may be another small idea do you think self play will be involved sort of like you've spoken about the powerful mechanism of self play where systems learn by sort of exploring the world in a competitive setting against other entities that are similarly skilled as them and so incrementally improving this way do you think self play will be a component of building an AGI system yeah so what I would say to build AGI I think is going to be deep learning plus some ideas and I think self play will be one of those ideas I think that that is a very self play has this amazing property that it can surprise us in truly novel ways for example like we I mean pretty much every self plays system both our daughter bought I don't know if opening I had a release about multi-agent where you had the two little agents were playing hide and seek and of course also alpha zero they were all produced surprising behaviors they all produce behaviors that we didn't expect they are creative solutions to problems and that seems like an important part of AGI that our systems don't exhibit routinely right now and so that's why I like this area I like this direction because of its ability to surprises surprises and an age age a system would surprise is fun yes but and to be precise not just not just a random surprise but to find a surprising solution to a problem it's also useful right now a lot of the self play mechanisms have been used in the game context or at least in simulation context how much how much do you have far along the path to eg I do you think we'll be done in simulation how much faith promise do you have in simulation versus having to have a system that operates in the real world with whether it's the real world of digital real world data or real world like actual physical world of the robotics I don't think it's an either/or I think simulation is a tool and it helps you has certain its strengths and certain weaknesses and we should use it yeah but okay I understand that that's uh that's true but one of the criticisms of self play one of the criticisms are reinforcement learning is one of the the its current power its current results while amazing have been demonstrated in a simulated environments or very constrained physical environments do you think it's possible to escape them escape the simulated environments and be able to learn in non simulated environments or do you think it's possible to also just similarly in the photorealistic and physics realistic way the real world in a way that we can solve real problems with self play in simulation so I think that transfer from simulation to the real world is definitely possible and as what has been exhibited many times in by many different groups it's been especially successful in vision also open AI in the summer has demonstrated a robot hand which was trained entirely in simulation in a certain way that allowed for seem to real transfer to occur this is Safin for the Rubik's Cube yes right and I don't know where that was trained in simulation was trained in simulation entirely really so what it wasn't in the physical that the hand wasn't trained no 100% of the training was done in simulation and the policy that was learned in simulation was trained to be very adaptive so adaptive that when you transfer it it could very quickly adapt to the physical to the physical world so the kind of perturbations with the giraffe or whatever the heck it was those weren't were those part of the simulation well the simulation was generally so the simulation was trained to be robust to many different things but not the kind of perturbations we've had in the video so it's never been trained with the glove it's never been trained really there's a stuffed giraffe so in theory these are novel perturbation correct it's not a theory in practice and pray that those are novel perturbation well that's okay doesn't matter that's a clean small-scale but clean example the transfer from the simulated world to the physical world yeah and I will also say that I expect the transfer capabilities of deep learning to increase in general and the better the transfer capabilities are the more useful simulation will become because today and you could take you could experience something in simulation and then learn a moral of the story which you could then carry with you to the real world right as humans do all the time and the play computer games so let me ask sort of an embodied question staying an AGI for a sec and do you think a GI system we need to have a body we need to have some of those human elements of self awareness consciousness sort of fear of mortality sort of self-preservation in the physical space which comes with having a body I think having a body will be useful I don't think it's necessary but I think it's very useful to have a body for sure because you can learn a whole new you can learn things which cannot be learned without a body but at the same time I think that you can call if you don't have a body you could compensate for it and still succeed you think so yes well there is evidence for this for example there are many people who were born deaf and blind and they were able to compensate for the lack of modalities I'm thinking about Helen Keller specifically so even if you're not able to physically interact with the world and if you're not able to I mean I actually was getting it maybe let me ask on the more particularly I'm not sure if it's connected to having a body or not but the idea of consciousness and a more constrained version of that is self-awareness do you think an EGR system should have consciousness it's what we can't define Kyle whatever the heck you think consciousness is yeah hard question to answer given how hard is to find do you think it's useful to think about I mean it's it's definitely interesting it's fascinating I think it's definitely possible that our systems will be conscious do you think that's an emergent thing that just comes from do you think consciousness could emerge from the representation that's stored within your networks so like that it naturally just emerges when you become more and more you able to represent more and more of the world well let's say I'd make the following argument which is humans are conscious and if you believe that artificial neural Nets are sufficiently similar to the brain then there should at least exist artificial neural Nets you should be conscious to you're leaning on that existence proof pretty heavily okay but this is that that's that's the best answer I can give no I I know I know I know there's still an open question if there's not some magic in the brain that were not I mean I don't mean a non materialistic magic but that that the brain might be a lot more complicated and interesting that would give it credit for if that's the case then it should show up and at some point at some point if you'll find out if you can't continue to make progress it I think it I think it's unlikely so we talk about consciousness but let me talk about another poorly defined concept of intelligence again we've talked about reasoning I've talked about memory what do you think is a good test of intelligence for you are you impressed by the test that Alan Turing formulated with the imitation game of that with natural language is there something in your mind that you will be deeply impressed by if a system was able to do I mean lots of things there's certain through certain frontier there is a certain frontier of capabilities today yeah and there exist things outside of the frontier and I would be impressed by any such thing for example I would be impressed by deep learning system which solves a very pedestrian you know pedestrian tasks like machine translation or computer vision tasks or something which never makes mistake a human wouldn't make under any circumstances I think that is something which have not yet been demonstrated and I would find it very impressive yes so right now they make mistakes in different they might be more accurate than human beings but they still they make a different set of mistakes so my my I would guess a lot of the skepticism that some people have about deep learning is when they look at their mistakes and they say well those mistakes they make no sense like if you understood the concept you wouldn't make that mistake us and I think that changing that would be we would would do that would that would inspire me that would be yes this is this this is this is progress yeah that's a really nice way to put it but I also just don't like that human instinct to criticize a model is not intelligent that's the same instinct as we do when we criticize any group of creatures as the other because it's very possible that GPT two is much smarter than human beings and many things definitely true it has a little more breadth of knowledge yes breadth of knowledge and even and even perhaps depth on certain topics it's kind of hard to judge what depth means but there's definitely a sense in which humans don't make mistakes that these models do yes the same is applied to autonomous vehicles the same is probably going to continue being applied to a lot of artificial intelligence systems we find this is the annoying this is the process of in the 21st century the process of analyzing the progress of AI is the search for one case where the system fails in a big way where humans would not and then many people writing articles about it and then broadly as a the public generally gets convinced that the system is not intelligent and we like pacify ourselves but I think it's not intelligent because of this one anecdotal case and seems to continue happening yeah I mean there is truth to that though there is people although I'm sure that plenty of people are also extremely impressed by the system that exists today but I think this connects to the earlier point we discussed that it's just confusing to judge progress in AI yeah and you know you have a new robot demonstrating something how impressed should you be and I think that people will start to be impressed once AI starts to really move the needle on the GDP so you're one of the people that might be able to create an AGR system here not you but you and open the AI if if you do create an AI system and you get the sponge sort of the evening with it him/her what would you talk about do you think the very first time first time well the first time I'll just oh just ask all kinds of questions and try to make it to get it to make a mistake and ever be amazed that it doesn't make mistakes and just keep keep asking broad okay what kind of questions do you think would they be factual or would they be personal emotional psychological what do you think all of the above would you ask for advice definitely I mean why would I limit myself talking to a system of this now again let me emphasize the fact that you truly are one of the people that might be in the room where this happens so let me ask a sort of a profound question about I just talked to Stalin his story I've been talking to a lot of people who are studying power Abraham Lincoln said nearly all men can stand adversity but if you want to test a man's character give him power I would say the power of the 21st century maybe a 22nd but hopefully the 21st would be the creation of an AGI system and the people who have control direct possession and control the AGI system so what do you think after spending that evening having a discussion with the AGI system what do you think you would do well the ideal world would like to imagine is one where humanity I like the board the board members of a company where they GI is the CEO so it would be I would like the picture which I would imagine is you have some kind of different entities different countries of cities and the people that live there vote for what the AGI that represents them should do and then AJ that represents them goes and does it I think a picture like that I find very appealing and you could have multiple ad you have an AJ for a city for a country and it would be it would be trying to in effect take the democratic process to the next level and the board can always fire the CEO essentially press the reset button say we randomized the parameters in well let me sort of that's actually okay that that's a beautiful vision I think as long as it's possible to press the reset button do you think will always be possible to press the reset button so things that it's definite definitely really possible to build so you're talking so the question that I I really understand from you is will the real humans or we humans people have control over the AI systems at the build yes and my answer is it's definitely possible to build AI systems which will want to be controlled by their humans Wow that's part of their so it's not that just they can't help but be controlled but that's that's the they exist the one of the objectives of their existence is to be controlled in the same way that human parents generally want to help their children they want their children to succeed it's not a burden for them they are excited to help the children to feed them and to dress them and to take care of them and I believe with hyster conviction that the same will be possible for an AGI it will be possible to program an AGI to design it in such a way that it will have a similar deep drive that it will be delighted to fulfill and the drive will be to help humans flourish but let me take a step back to that moment where you create the AGI system I think this is a really crucial moment and between that moment and the the Democratic board members with the AGI at the head there has to be a relinquish enough power so it's George Washington despite all the bad things he did one of the big things he did is you relinquish power he first of all didn't want to be President and even when he became president he gave he didn't keep just serving as most dictators do for indefinitely do you see yourself being able to relinquish control over an AGI system given how much power you can have over the world at First Financial just make a lot of money right and then control by having possession and say GI system I find it trivial to do that I'd finally trivial to little really really this kind of pot I mean you know there's a kind of scenario you are describing sounds terrifying to me that's all I would absolutely not want to be in that position do you think you represent the majority or the minority of people in the I community well I mean it's an open question an important one our most real good is another way to ask it so I don't know if most people are good but I think that when it really counts people can be better than we think that's beautifully put yeah are there specific mechanism you can think of aligning AIG values to human values is that do you think about these problems of continued alignment as we develop the AI systems yeah definitely in some sense the kind of question which you are asking is so if I were to translate the question to today's terms yes it would be a question about how to get an RL agent that's optimizing a value function which itself is learned and if you look at humans humans are like that because the reward function the value function of humans is not external it is internal right and there are definite ideas of how to train a value function basically an objective you know an as objective as possible perception system that will be trained separately to recognize to internalize human judgments on different situations and then that component wouldn't be integrated as the value as the base value function for some more more capable RL system you could imagine a process like this I'm not saying this is the process I'm saying this is an example of the kind of thing you could do you