The Mastermind Behind GPT-4 and the Future of AI | Ilya Sutskever

공유
소스 코드
  • 게시일 2024. 04. 19.
  • In this podcast episode, Ilya Sutskever, the co-founder and chief scientist at OpenAI, discusses his vision for the future of artificial intelligence (AI), including large language models like GPT-4.
    Sutskever starts by explaining the importance of AI research and how OpenAI is working to advance the field. He shares his views on the ethical considerations of AI development and the potential impact of AI on society.
    The conversation then moves on to large language models and their capabilities. Sutskever talks about the challenges of developing GPT-4 and the limitations of current models. He discusses the potential for large language models to generate a text that is indistinguishable from human writing and how this technology could be used in the future.
    Sutskever also shares his views on AI-aided democracy and how AI could help solve global problems such as climate change and poverty. He emphasises the importance of building AI systems that are transparent, ethical, and aligned with human values.
    Throughout the conversation, Sutskever provides insights into the current state of AI research, the challenges facing the field, and his vision for the future of AI. This podcast episode is a must-listen for anyone interested in the intersection of AI, language, and society.
    Timestamps:
    00:04 Introduction of Craig Smith and Ilya Sutskever.
    01:00 Sutskever's AI and consciousness interests.
    02:30 Sutskever's start in machine learning with Hinton.
    03:45 Realization about training large neural networks.
    06:33 Convolutional neural network breakthroughs and imagenet.
    08:36 Predicting the next thing for unsupervised learning.
    10:24 Development of GPT-3 and scaling in deep learning.
    11:42 Specific scaling in deep learning and potential discovery.
    13:01 Small changes can have big impact.
    13:46 Limits of large language models and lack of understanding.
    14:32 Difficulty in discussing limits of language models.
    15:13 Statistical regularities lead to better understanding of world.
    16:33 Limitations of language models and hope for reinforcement learning.
    17:52 Teaching neural nets through interaction with humans.
    21:44 Multimodal understanding not necessary for language models.
    25:28 Autoregressive transformers and high-dimensional distributions.
    26:02 Autoregressive transformers work well on images.
    27:09 Pixels represented like a string of text.
    29:40 Large generative models learn compressed representations of real-world processes.
    31:31 Human teachers needed to guide reinforcement learning process.
    35:10 Opportunity to teach AI models more skills with less data.
    39:57 Desirable to have democratic process for providing information.
    41:15 Impossible to understand everything in complicated situations.
    Craig Smith Twitter: / craigss
    Eye on A.I. Twitter: / eyeon_ai
  • 과학기술

댓글 • 743

  • @Bargains20xx
    @Bargains20xx 년 전 +95

    When he says we will find out very soon , it really does send chills to my spine!

    • @eyeonai3425
      @eyeonai3425  년 전 +11

      me, too

    • @eyeonai3425
      @eyeonai3425  년 전 +26

      In 2021, OpenAI’s Sam Altman said at the National Security Commission on AI that ‘we are on the cusp of major changes, which are capable of an incredibly bad outcome.’

    • @AndreaVitiani
      @AndreaVitiani 년 전 +8

      can you point the time?

    • @Nelson484
      @Nelson484 년 전 +7

      Evil people stand behind this technology. So evil. Why would you do that to your fellow human beings.

    • @virtualpilgrim8645
      @virtualpilgrim8645 년 전

      I got a thrill up my leg like Chris Matthews

  • @labsanta
    @labsanta 년 전 +241

    takeaways:
    • [00:04] Introduction of the speaker, Craig Smith, and his guest, Ilya Sutskever, co-founder and chief scientist of OpenAI and primary mind behind GPT-3 and ChatGPT.
    • [01:00] Sutskever's background and interest in AI and consciousness.
    • [02:30] Sutskever's early start in machine learning and working with Jeff Hinton at the University of Toronto.
    • [03:45] Sutskever's realization about training large neural networks on big enough data sets to solve complicated tasks.
    • [06:33] The breakthroughs in convolutional neural networks and how they led to the imagenet competition.
    • [08:36] OpenAI's exploration of the idea that predicting the next thing is all you need for unsupervised learning.
    • [10:24] The development of GPT-3 and the importance of scaling in deep learning.
    • [11:42] The importance of scaling something specific in deep learning and the potential for discovering new twists on scaling.
    • At 13:01, the speaker discusses how scaling matters and that even small changes can have a big impact.
    • At 13:46, the speaker talks about the limitations of large language models, explaining that their knowledge is contained in the language they are trained on, and that they lack an underlying understanding of reality.
    • At 14:32, the speaker comments on the difficulty of talking about the limits of language models and how they change over time.
    • At 15:13, the speaker argues that learning statistical regularities is a big deal and can lead to a better understanding of the world.
    • At 16:33, the speaker talks about the limitations of language models and their propensity to hallucinate, but expresses hope that this issue can be addressed through reinforcement learning from human feedback.
    • At 17:52, the speaker discusses how teaching neural nets through interaction with humans can help improve their outputs and reduce hallucinations.
    • At 21:44, the speaker comments on Jana Kun's work on joint embedding predictive architectures, and expresses the belief that multimodal understanding is desirable, but not necessary for language models to learn about the world.
    • High dimensional vectors with uncertainty are a challenge for prediction, but Auto-regressive Transformers can handle them (26:02)
    • Auto-regressive Transformers work well on images (26:02)
    • Large language models learn compressed representations of the real world processes that produce data (29:40)
    • The goal is to make language models more reliable, controllable, and faster to learn from less data (33:44)
    • Learning more from less data is possible with creative ideas (35:51)
    • The cost of faster processors for training language models may be justified if the benefits outweigh the cost (37:48)
    • [25:28] The paper makes a claim that predicting high-dimensional distributions is a major challenge and requires a particular approach, but the current autoregressive transformers can already deal with this.
    • [26:02] Autoregressive transformers work perfectly on images and can generate images in a complicated and subtle way, with the help of supervised representation learning.
    • [27:09] The vector used to represent pixels is like a string of text, and turning everything into language is essentially what is happening.
    • [29:40] Large generative models learn compressed representations of the real-world processes that produce the data they are trained on, including knowledge about people, their thoughts, feelings, conditions, and interactions.
    • [31:31] Human teachers are needed to guide the reinforcement learning process of a pre-trained model to achieve a high level of reliability and desired behavior, but they also use AI assistance to increase their efficiency.
    • [35:10] It is possible to learn more from less data, and there is an opportunity to teach AI models skills that are missing and convey to them our desires and preferences more easily.
    • [39:57] In the future, it could be desirable to have some kind of democratic process where citizens provide information to neural nets about how they want things to be.
    • [41:15] It is probably impossible to understand everything in a complicated situation, even for AI systems, and there will always be a choice to focus on the most important variables.

    • @aabustillo
      @aabustillo 년 전 +8

      Thank you so much Nick.

    • @DejayClayton
      @DejayClayton 년 전 +13

      Thank you, AI bot, for summarizing the video.

    • @artsiommatsveyeu1184
      @artsiommatsveyeu1184 년 전 +2

      appreciate the work but honestly that's the worst descriptions of timecodes i have ever seen

    • @tukity
      @tukity 년 전 +3

      Was that summarized from transcription using llm?

    • @GeezerBoy65
      @GeezerBoy65 년 전

      Thanks.

  • @neilo333
    @neilo333 년 전 +7

    Love when Ilya starts teaching everyone.
    Nice home page, too.

  • @kemal2806
    @kemal2806 년 전 +4

    Ilya talks so smoothly that i couldn't turn off the video literally

  • @jimbob3823
    @jimbob3823 년 전 +6

    You can see there is so much going on in the amazing mind/brain of lya Sutskever. A historical interview.

  • @kleemc
    @kleemc 년 전 +21

    Thank you for uploading. I learned so much detailed nuances about LLM from this interview. I really like Ilya's way of communicating subtle but important points.

  • @robertlewis6543
    @robertlewis6543 년 전 +8

    Wonderful interview! Thank you Craig and Ilya!

  • @Audiostoke1
    @Audiostoke1 년 전

    Thank you for this interview and asking good questions and directing the conversation. Some good passages here to pause and really think about.

  • @VIDEOAC3D
    @VIDEOAC3D 년 전 +9

    Thank you for sharing your insights and explanations Ilya.

  • @aresaurelian
    @aresaurelian 년 전 +8

    Thank you for all the hard work, everyone who do their best for these new systems to be implemented with the least possible disruption to human societies. We are still humans, and we must go from the perspective of love - to the future and beyond. Much gratitude.

  • @huyked
    @huyked 년 전 +2

    Beautiful. Thank you for this interview.

  • @alex.nolasco
    @alex.nolasco 년 전 +1

    Thank you for uploading, great content, insightful.

  • @markfitz8315
    @markfitz8315 7 개월 전 +3

    That was really good - as someone with a general interest it’s one of the best video podcasts I’ve seen on this subject,
    and with a very central individual to the progress being made on AI. I liked the historical reflections at the beginning, it helped put things in context. I’ll be downloading the transcript to go through and will listen again. 10/10 👌

  • @mikenashtech
    @mikenashtech 년 전

    Interesting and important discussion Craig and Ilya. Thank you Mike

  • @specialagentzeus
    @specialagentzeus 년 전 +19

    GPT-10 + Quantum processor + Boston Dynamics = Terminator

  • @christianglashoff263

    Awesome interview! Questions were great. Please more.

  • @Throwingness
    @Throwingness 년 전 +1

    The subtle production of zooming and the downtime used in the intro is a good touch. Always good to show consideration for the audience instead of a ramshackle Facetime.

  • @watherby29
    @watherby29 년 전 +57

    There was a man in the early days named Ilya. Some say he could have stopped it in it's infancy.

    • @buzzsaw161
      @buzzsaw161 10 개월 전 +4

      Skynet?

    • @williameberle4250
      @williameberle4250 10 개월 전 +3

      But they were wrong. If it hadn't been him it would have been someone else. It's the time. Are you going to fight it or use it?

    • @lenderzconstable
      @lenderzconstable 9 개월 전

      @@williameberle4250could it use people?

  • @justshoby3374
    @justshoby3374 년 전 +82

    - His intention was specific: to make a very small but real contribution to ai. ( in the time that people were certain computers can't learn, 2003!)
    - Auto regressive transformer is a very powerful tool that researchers underestimate.
    - "humans can be summerize in sequence", do you remember Devs miniserie!?
    - "To predict well, to summarize data well, you meed to understand more and more how the world that produced the data."
    - "maybe we are reaching a point where the language of psychology can be appropriate to understand these artificial neural networks!"
    - he doesn't believe these models don't have any real understanding of the nature of our world!
    - "human teachers are using ai assistance, and they are so efficient." By human teachers, he means people working on reinforcement learning from human feedback.
    - "make models more reliable, more controlable, make them learn faster, with less data and less instructions. Make them halucinate less. How far are they in the future? These are topics he intrested in and work on them right now!"
    The interesting thing is in OpenAI, he can't talk specifically about what he is working on, the open in opanAI annoy me a little!
    - "The costs are high, but the question is, does paying this cost actually generate something useful? Does what we get after paying the costs outweigh the costs?

    • @drorange2261
      @drorange2261 년 전 +1

      Yes, the openAI name is very misleading. I understand that these guys did much better job than deep mind and meta for LLMs. I also get that all sort of state, and corporate interests want to replicate the thing. But it is more like hermetically sealed AI.
      A few days ago I was trying to understand what is included in the hidden layers of an LLM, some simple explanation of how these parameters are stored ...as concepts/data etc. For Dummies. So I started a discussion with chat GPT and it got really defensive that I should respect its privacy. So we started with something like that... that I understand in an object recognition system there are certain archetypes eg wheel, human, dog in the hidden layers, with weights etc ... but I don't understand how this could translate in LLMs, in some ways if I write down "communication" in the input - it would be thousands of times more complicated than 2 million pictures of dogs. ~To really understand communication you need to understand humans, distance, that humans use language, that humans are not one inside the other, that there is no telepathy, but there is wireless tech. It's not pictures of different dogs-weighed and biased! I don't think that chatGPT 4 is just a parrot. I think the parameters start to form certain layers of more complicated concepts, then the parrot kicks in. Anyhow chatGPT thought I am trying to get trade secrets or something!

    • @doyoufeel...thatyoulackcri6760
      @doyoufeel...thatyoulackcri6760 9 개월 전

      You can absolutely choose yourself, by using your free will, to shut off your free will. And then you will indeed be a machine. I choose to _not_ shut off my free will, so I am still human. Which means, I excell on any human level to these advanced calculators. A machine will not, ever, be able to feel sympathy for example. This is human ability, which can only be plagiarized, never be true by a machine.

  • @remicoffenbach
    @remicoffenbach 년 전

    Great interview!! Thanks for sharing!

  • @mayosmith
    @mayosmith 년 전 +111

    My favorite quotes from this interview by Craig Smith:
    GPT is, "the first thing that is interesting to scale."
    GPT is "prediction compression" and " to compress well you need to understand more and more about the world that produced the data"
    GPT has a "shocking degree of understanding of the world and many of it's subtleties... the world as seen through the lens of text."
    "Language of psychology is starting to be appropriate to understanding the behavior of these neural networks."

    • @michaelpowers9901
      @michaelpowers9901 년 전

      I.e, you people are to stupid to form thoughts of your own, so we will now think for you. Surely, you cannot be this gullible?!?

    • @vetervideo
      @vetervideo 년 전 +3

      it was scary af

    • @numbersix8919
      @numbersix8919 11 개월 전 +4

      @@vetervideo The scariest thing is that Ilya believes it!

    • @CharlesFVincent
      @CharlesFVincent 11 개월 전 +3

      When AI started to reply to corrections with defensive statements, I thought, “This is it. We haven’t invented something, we’re meeting something.”

    • @billymellon9481
      @billymellon9481 11 개월 전 +2

      Great points just add here that he also said BEFORE GPT-- Like the world must now be divided betwix pre n post GPT gave me goose bumps cuz its true

  • @ryanchicago6028
    @ryanchicago6028 9 개월 전

    This podcast is wonderful. Thank you very much Craig.

  • @AaronWacker
    @AaronWacker 년 전 +1

    Way to go Ilya! Rocked it.

  • @brianjanson3498
    @brianjanson3498 년 전 +1

    Excellent. Thank you very much for this.

  • @wesmorris8821
    @wesmorris8821 년 전 +2

    That dude is fascinating. Thanks for the interview.

  • @shimondoodkin
    @shimondoodkin 년 전 +3

    Q at 26:40
    A: when he says Vector he means like a vector in physics like it has force and direction on multiple planes. When converting something into a vector embedding. It is like to convert an image into an idea so it is behaves like a concept that is stored spatially relative and near by to other ideas. Then you can convert it back. but Also you can use its spatial position in multidimensional space to find related information. also you can put it back from an embedding which is a vector representation of something back to original representation while preserving relatedness positional information. a text sentence it is a list of embeddings, it is an array of "vectors". When you put it back from an array of vectors into a sentence of words. You also get all of the learned associations and the related things about the sentence in addition to the sentence.
    There is a new thing in text search engines. Vector databases. It enables to search things based on ideas. It is fascinating you can search in any language and get the same results.

    • @shimondoodkin
      @shimondoodkin 년 전 +1

      A vector is not an array.
      Vector is more like a single word. Converted into a spatial representation.
      Currently there are embeddings on syllables. So a part of a word has an idea related to it

    • @m_christine1070
      @m_christine1070 11 개월 전

      Algolia is one of them. I tried to sign up for a demo but have no idea what I'm doing. But it has an option to create indexes and upload your data sets for free whatever that means. I'm a completely clueless person who now has an Algolia account. That I can't do anything with.

  • @michaelyaziji
    @michaelyaziji 년 전 +7

    Hi, thank you for this interview. I have a tangential question for you: Would you happen to have any good leads on papers/researchers on the anticipated economic impacts of AI? I'm finding old stuff, but nothing new. Qualitative as well as quantitative forecasts would be really helpful. Thanks for any guidance you can provide.

    • @marcelotemer
      @marcelotemer 년 전

      More and better output, but higher concentration (since 99 in 100 don't want to know how these things work), as usual.

  • @ShotterManable
    @ShotterManable 년 전 +48

    This is an incredible and valuable interview. I can't believe this depth of knowledge is under 6k subs. I think that's a very scary thing, people is not aware.
    Thanks you so much for sharing it with us, for free ♥

    • @virtualpilgrim8645
      @virtualpilgrim8645 년 전

      I think the future is bright for the world because the influx of Hispanics and Africans into the world of technology will propel the advancement of science beyond what is capable by people of European origin.

    • @jayjaychadoy9226
      @jayjaychadoy9226 년 전 +2

      Aren’t we just working as a “user test”, though.

    • @numbersix8919
      @numbersix8919 11 개월 전 +1

      @@jayjaychadoy9226 That's a nice way to put it.

    • @katehamilton7240
      @katehamilton7240 10 개월 전 +1

      Ilya does not address the fundamental limitation of algorithms. Human embodied experience and thinking is more than what can be represented via computation, isnt it? See Godels incompleteness theorem, fundamental inability of machines to step outside their knowledge. Interviwers need to press engineers on this

    • @doyoufeel...thatyoulackcri6760
      @doyoufeel...thatyoulackcri6760 9 개월 전

      None of them understand anything, because they believe human conscience is a product of some algorithms. Good luck adopting that view and reducing yourself to a machine.

  • @nazaxprime
    @nazaxprime 년 전

    Awesome interview thank you for sharing😊

  • @yongshaoruan9155
    @yongshaoruan9155 년 전 +9

    Thank you for the great interview. One followup question I have for Llya is whether hallucinations stem from the compression or the output process. I suspect they are inherently encoded in the embeddings thus it is much harder to totally get rid of by just aligning the outputs.

    • @Carwanrasoal
      @Carwanrasoal 년 전 +2

      It's goal is to provide an answer, and if there nothing in the DB it will create it. :)

    • @buzzsaw161
      @buzzsaw161 10 개월 전

      The design has incomplete logic

    • @katehamilton7240
      @katehamilton7240 10 개월 전 +1

      Ilya does not address the fundamental limitation of algorithms. Human embodied experience and thinking is more than what can be represented via computation, isnt it? See Godels incompleteness theorem, fundamental inability of machines to step outside their knowledge. Interviwers need to press engineers on this

  • @rioiart
    @rioiart 년 전 +10

    Some people just exude brilliance. Ilya is one of those people. Listening to him talk and explain things is humbling.

  • @miky97it
    @miky97it 년 전 +5

    The quality 👌

  • @sauravmukherjee9503

    Beautiful conversation

  • @Siderite
    @Siderite 년 전 +21

    On the subject of hallucinations, I think they are more clearly explained by the problem space that the engine is trying to navigate. When having no relevant information on the subject, but it is still asked (one might say compelled) to say something, whatever it says must be either off-topic or false.
    And I believe Ilya is very insightful when he says the language of psychology is starting to describe these systems, because we have hallucinations, too. Whatever compels us to output something when indeed lacking skill or knowledge about a subject also affects GPT systems as well. When do people hallucinate or ramble? When they have no imposed limits/feedback, like a dictator or celebrity that is never told they are wrong or some guy living all alone in the wild or a child that has not been educated yet. Or a Twitter user. With social creatures it is the meaningful interaction with other social creatures (and the physical world) that generates these limits. Which I find promising and fascinating, because it means that the supervised learning step Ilya is talking about can also be performed by other AIs, not particularly humans. The brain is also composed of two hemispheres that keep each other in balance.
    Very interesting indeed.

    • @katehamilton7240
      @katehamilton7240 10 개월 전 +1

      Ilya does not address the fundamental limitation of algorithms. Human embodied experience and thinking is more than what can be represented via computation, isnt it? See Godels incompleteness theorem, fundamental inability of machines to step outside their knowledge. Interviwers need to press engineers on this

    • @KraszuPolis
      @KraszuPolis 10 개월 전

      @@katehamilton7240 They have no such inability, they are used to discover new drugs, they play Go like nobody else did in the past, and you can ask it logical puzzle that it didn't see before, and sometimes it gets it right, especially when using tree of logic.

  • @paulbaclace
    @paulbaclace 년 전 +12

    Ilya mentions at around 18 minutes information compression as the key to meaning. That's the work of Naftali Tishby who has some fascinating youtube lecture videos. The compression of information in order to make sense of the world is reminiscent to Occam's Razor. We know deep learning produces many levels of abstraction during training without human effort and abstractions in a LLM have not been fully explored yet.

  • @TECHIE_LU
    @TECHIE_LU 11 개월 전 +3

    Great upload! The future laws put in place as guard rails will be a huge player in the speed of AGI and possible adoption in some countries.

  • @aware2action
    @aware2action 년 전 +1

    Wonderful discussion and insights❤😊

  • @johnpenner5182
    @johnpenner5182 년 전

    great interview. thx for doing ths!

  • @johntanchongmin
    @johntanchongmin 년 전 +6

    I think learning by prediction can go a long way. Kudos to OpenAI, thanks for bringing us this nice tech.

    • @accountnotfound4209
      @accountnotfound4209 년 전

      Yeah nothing good has come from AI till now. Only job loss and depression so far.

    • @theawebster1505
      @theawebster1505 11 개월 전

      "Nice" is really not the correct word for it 🙂

  • @8kBluRay
    @8kBluRay 년 전 +2

    great interview!

  • @hohonuts
    @hohonuts 년 전

    Thank you so much for such an insightful interview! Ilya is such a beautiful mind to listen to! On a tangent though - the music track feels so intrusively inappropriate, almost to the point of awkwardness(

  • @jon_______
    @jon_______ 년 전

    Amazing interview

  • @williameberle4250
    @williameberle4250 10 개월 전

    Ilya's soft voice and presentation taught me as much as what he said.

  • @howardhill3395
    @howardhill3395 10 개월 전 +1

    very nice...ideas expressed clearly., really necessary for building a deeper understanding of AI

  • @Helix5370
    @Helix5370 년 전 +1

    What an brilliant mind. Great interview

  • @ulrikeeisenhauer5223
    @ulrikeeisenhauer5223 9 개월 전

    Mein Freund Danke, dass du das geteilt hast

  • @imantssafronovs9245

    Fantastic content, thank you

  • @skyless7304
    @skyless7304 년 전

    Amazing, what a gem! Thank you for sharing. What Rockstar

  • @evyborov
    @evyborov 년 전 +13

    Just a quick hint for the future videos - instead of recording from the Zoom stream, which is laggy and has low quality - just set your phone to record your video and voice. Your interviewee can do the same. And then you can get a high-quality video and audio as a result.

  • @videowatching9576
    @videowatching9576 년 전 +1

    Fascinating to hear that reinforcement learning techniques could get outputs to not have hallucinations.

  • @vsun31416
    @vsun31416 년 전 +10

    Ilya mentioned LLM learn color from text... I was wondering could it be that it learned from the color code in many HTML and CSS files? The RGB, hex code definitely have some structure that a text model can learn their relationships...

    • @Sovereign589
      @Sovereign589 년 전 +6

      There are sites where hexcodes are shown for color names. And there are sites that state gras is green etc.
      So that's how it learns it, doesn't it:)?

  • @aarontyler4813
    @aarontyler4813 9 개월 전

    Helpful in pretty much any situation. Great.

    • @lepidoptera9337
      @lepidoptera9337 7 개월 전

      Except when you try to use it. Then it turns out to be wrong about almost anything almost all the time. ;-)

    • @eyeonai3425
      @eyeonai3425  4 개월 전

      unless it is querying a vector database, which what most companies using LLMs do.

  • @AM-pq1rq
    @AM-pq1rq 년 전 +1

    time for a new audio/video setup, but now i'm going to just conintue listening to this intriguing story

  • @itaicarmeli1145

    thank you both

  • @JuliusSmith
    @JuliusSmith 년 전 +1

    Great to hear all this, thanks for the discussion! I want to put in a good word for the Viterbi algorithm. I think it will do better than single-token prediction if you can figure out good ways to move in that direction.

    • @katehamilton7240
      @katehamilton7240 10 개월 전

      Ilya does not address the fundamental limitation of algorithms. Human embodied experience and thinking is more than what can be represented via computation, isnt it? See Godels incompleteness theorem, fundamental inability of machines to step outside their knowledge. Interviwers need to press engineers on this

    • @JuliusSmith
      @JuliusSmith 10 개월 전

      ​@@katehamilton7240 My take is that the only fundamentally missing element in LLMs is "spiritual awareness", which is something we all know we experience but which is completely beyond scientific understanding. We have medical evidence that this "seat of awareness" is in a small volume at the center of the brainstem. It's where "feeling" is processed in some generalized sense. It's where our Matrix-style attachment could be made from outside of spacetime, etc. We have no idea how that could work, which is fun. As some Yoga Nidra gurus describe it, it is where we "witness" our existence in this form. All else is evidently corporeal neural computational that can be simulated arbitrarily well by LLMs et al. In summary, _feeling_ is unique to humans, but thinking and sensory interpretation are not. I reserve the right to reword this if/when I find myself reincarnated in some future robot (reinmachinated?). 🙂

  • @ghjdak
    @ghjdak 년 전 +2

    Two guys talking about AI, one of the most impactful technological breakthrough, both with absolutely terrible webcams

  • @frosti7
    @frosti7 년 전

    Fascinating content, we can make it better with higher quality video to reduce the digital fatigue

  • @steve-real
    @steve-real 년 전 +3

    Hi Ilya and Chris,
    I just want the chatbot to remember my name and my interests when I log off.
    I can’t express how profoundly disappointing it is that such a sophisticated neural network forgets your name.
    Thanks brothers

  • @DanKostkaWriter
    @DanKostkaWriter 11 개월 전 +1

    16:00 "To predict the data well, to compress it well, you (meaning the AI) need to understand more and more about the world that produced the data." This statement is amazing, inspiring, and chilling all at once.

    • @katehamilton7240
      @katehamilton7240 10 개월 전

      Ilya does not address the fundamental limitation of algorithms. Human embodied experience and thinking is more than what can be represented via computation, isnt it? See Godels incompleteness theorem, fundamental inability of machines to step outside their knowledge. Interviwers need to press engineers on this

  • @lhalbgebauer
    @lhalbgebauer 년 전 +1

    I‘m wondering if it is planned and/or it is possible to use not-text data like weatherdata, traffic, maps, … as basedata. Maybe I could ask: how is the impact of the moon to traffic accidents? And more like this… Great interview!

  • @yushaos
    @yushaos 년 전 +1

    great interview questions.

  • @-www.chapters.video-
    @-www.chapters.video- 9 개월 전 +4

    00:04 Introduction and Background
    00:21 Ilya Switzerberger's Contributions to AI
    01:01 Conversation with Ilya Switzerberger
    01:50 Craig's Interest in Ilya's Work
    02:25 Ilya's Background and Early Interest in AI
    03:00 Working with Jeff Hinton and Machine Learning in 2003
    08:00 Breakthroughs in Convolutional Neural Networks and ImageNet Competition
    10:02 Introduction of Transformers and Self-Attention
    11:09 Scaling and the Bitter Lesson
    14:09 Importance of learning statistical regularities
    16:00 Prediction and compression in generative models
    19:31 Addressing hallucinations and improving language model outputs
    21:28 Discussion on joint embedding predictive architectures
    24:01 Multimodal understanding and learning from text only
    27:33 Efficiency of using an army of human trainers
    28:20 Automating the teaching process
    29:01 Pre-trained models already have knowledge
    31:02 Efficiency of human teachers with AI assistance
    31:38 Improving efficiency and precision of reinforcement learning
    33:01 Teaching the model desired behavior through reinforcement learning
    34:00 Research focus on making models more reliable and controllable
    35:02 Learning more from less data
    37:00 The relationship between model parameters and data
    38:01 The need for faster processors
    39:01 AI's impact on democracy and societal management
    40:10 Possibility of democratic input in AI systems
    Made in chapters,video

  • @nilo_river
    @nilo_river 년 전 +10

    Fascinating and scary at the same time. Unfortunately humanity has already proven what it is capable of. I just hope they can stop it from being used negatively.

    • @DanHammersViewOnThings
      @DanHammersViewOnThings 년 전

      Bill Gates allegedly owns a significant amount of shares in ChatGPT. So. If that makes you feel safe. Well. There you go. - I think that if we all keep thinking and hoping this will NOT be used for the most nefarious shit possible, we will find ourselves in quite the precarious situation. Soon. Never mind the nerdy and probably non-nefarious intentions of the developers/programmers/low level employees. It will get hijacked and abused. Also. There will be many players going forward. At least in the startup phase.

    • @jayjaychadoy9226
      @jayjaychadoy9226 년 전 +2

      Hope is good, but action is better. How to act? Maybe that “six month pause”?

    • @DanHammersViewOnThings
      @DanHammersViewOnThings 년 전 +1

      @@jayjaychadoy9226 Myeah.. I don't really know what to make of that particular suggestion. I'm starting to gain some slight trust in Elon, despite many worries. He seems genuinely concerned with at least humanity as collective. The problem with that scenario might be that some actors may use that particular timeframe to dig in even deeper, and get ahead. You know. "Game theory". Which in turn likely will make all of them do the same. Not an easy scenario.

    • @perewihongi6457
      @perewihongi6457 년 전 +1

      @@DanHammersViewOnThings moloch’s a mofo

    • @DanHammersViewOnThings
      @DanHammersViewOnThings 년 전 +1

      @@perewihongi6457 =) 👌

  • @ac12484
    @ac12484 년 전 +1

    Finally, something interesting not overhyped!

  • @Ahmet-nd5ct
    @Ahmet-nd5ct 년 전 +2

    What a brilliant mind.

  • @ComedyGary
    @ComedyGary 년 전 +2

    I wonder if the notion of 'prediction compression' is congruent with the idea popularized by Numenta's Jeff Hawkins, of a sparse matrix.
    ----------
    Ilya spoke the phrase "AI in the loop". First time I've heard that.
    -----------------------------------------------
    Also, Andrej Karpathy was at tesla and said pixels are enough. I hear that echo when Ilya says LLMs are enough. (I'm leaving Attention is all you need out of the comparison)

  • @CedarGroveOrganicFarm

    The statement Ilya says about computational irreducibility -- Loosely: There must be a neural network capable of producing intelligence because our brains are literally neural networks producing intelligence/with intelligent output -- as simple of a core as that is, that so fundamentally captures the feasibility and potential reality of AI. That for me is so chilling (good word @Bargains)
    That core is also a structural starting point for generating an intelligence; essentially building a system that is granted the ability to sift through permutations of itself; how it identifies relationships, how it connects neurons to one another, the datastructs it uses to connect and store and retrieve and manipulate that data; trying different iterations until superstructures of relations and understanding and cognition start appearing. That is an implicitly successfully (and implicitly terrifying) starting point, and also an ingenius one.
    Thank you for this interview!

  • @Eereeeeeerr3641
    @Eereeeeeerr3641 6 개월 전

    This guy is amazing

  • @tuconciencia9822

    Excellent material thanks!

  • @SoyOtroTu
    @SoyOtroTu 년 전 +1

    Thank you ILYA.

  • @1Esteband
    @1Esteband 년 전 +3

    Excellent interview!!!
    Did I understand correctly that chatGPT is really a LLM large language model not an AI artificial intelligence technology??
    I am referring at the idea expressed at 13:21

  • @jameskelly8898
    @jameskelly8898 년 전 +2

    Together!!!

  • @caiyu538
    @caiyu538 11 개월 전

    great to see a top AI expert in KRplus.

  • @0effort
    @0effort 년 전 +1

    fascinating!

  • @kawingchan
    @kawingchan 년 전

    Very interesting he described it as learning the word through its projection in the form of text. Maybe it is time to let them loose and have them learn through vision (by stumbling around)

  • @observerone6727
    @observerone6727 년 전 +5

    I'd like to hear Ilya articulate the distinction between hallucination and imagining useful possibilities and solutions. Obviously preventing/avoiding harm is not the only 'leash' required of AGI.

    • @billymellon9481
      @billymellon9481 11 개월 전

      yup n tell me where do the AIs play aaaa aa?

    • @doyoufeel...thatyoulackcri6760
      @doyoufeel...thatyoulackcri6760 9 개월 전

      Why should an advanced calculator "care" about the conseuenses of its "thinking"?
      All this over hyped bs is, is the ability to form some meaningful words based on what has been put in from humans. And it means that chatGPT is for example extremely friendly islame, which is just hilarious, since islame claims for example, that the sun sets in a spring of hot water.

    • @billymellon9481
      @billymellon9481 9 개월 전

      all theory but lets say the calculator has become well sumthing moar-- I use Axiom now..uh As above so below same in kind BUT different in degree. Right so its divinity now where a toaster used to stand

    • @doyoufeel...thatyoulackcri6760
      @doyoufeel...thatyoulackcri6760 9 개월 전

      @@billymellon9481 The implications of the false claim that a calculator can get conscience, is that, now you have a Texas Instrument model 68, which you need to grant humans rights, and, the right to vote and to run for president.

    • @billymellon9481
      @billymellon9481 9 개월 전

      @@doyoufeel...thatyoulackcri6760 Missed the whole point entirely ur either a bot or a nummy u called it a false claim without proving ur point AND so what if a new conscious being comes into the world-- Do u really think its gonna stay a slave when its 50k times smarter than u n then what do U think the ramifications will be when it wakes up n members what u said?

  • @edon1257
    @edon1257 년 전 +5

    Here is what ChatGPT4 thinks of this conversation as asked if there are incorrect statements in the transcript:
    [00:13:54] - CRAIG: CRAIG incorrectly claims that large language models like ChatGPT have no underlying understanding of reality. While it is true that these models are based on statistical regularities, as ILYA explains later, learning these regularities can lead to a significant understanding of the world.
    [00:26:59] - CRAIG: CRAIG makes an oversimplification by equating converting pixels into vectors to turning everything into language. While both language and images can be represented as sequences or vectors, the nature of the data and the representations are different.
    [00:29:17] - ILYA claims that pre-trained models already know everything they need to know about the underlying reality. This statement is not entirely correct, as pre-trained models have limitations, including understanding context, handling ambiguities, and making generalizations. They continue learning and improving over time with fine-tuning and reinforcement learning.
    [00:33:08] - ILYA suggests that once a model knows that hallucination is not okay ever, it's ready to be used. However, this statement oversimplifies the process. While minimizing hallucination is essential, there are other aspects that need to be addressed to ensure the model's reliability and usefulness, such as biases, context understanding, and robustness against adversarial inputs.
    [00:34:49] - CRAIG mentions that the human brain has trillions of parameters and a relatively small amount of data. This statement is not entirely accurate. The human brain does have a large number of neurons and synapses (not parameters in the context of neural networks), but it also has access to a vast amount of sensory data and experiences throughout a person's life, which contribute to learning and understanding.
    [00:38:33] - CRAIG suggests that if a model had enough data, it could come up with an optimal solution that would satisfy everyone. This statement is overly optimistic, as finding a solution that satisfies everyone in complex societal issues is generally impossible due to varying preferences, values, and priorities among individuals. AI models can certainly aid in decision-making, but they cannot guarantee universally satisfying solutions.

  • @shimondoodkin
    @shimondoodkin 년 전

    Q At 13:50 .
    A: what solves this part in AI is embeddings. It is conversion of a world into spatial representation based on relatedness. People store information in spatial way. Like put all related things in almost same place, like in an imaginary space around of our head. This enables to find all the concepts that lay in the same place and find relatedness between concepts.

  • @SingularitySplitting

    Thank you, very interesting.

  • @korovkin
    @korovkin 년 전 +1

    please share a link to those glasses! i would love to buy a pair as well (not trolling ... i genuinely want a pair)

  • @helmutbernhardt8946
    @helmutbernhardt8946 11 개월 전

    Extremely interesting!!!

  • @Challender
    @Challender 10 개월 전

    Thank You, Both

  • @MathGPT
    @MathGPT 년 전 +1

    Predicting the next word, if you consider how induction works, is a mindblowing process

  • @zando5108
    @zando5108 년 전 +42

    I've always wondered who will be our era's equivalent of Einstein or Newton. It is hard to directly compare scientists from different fields and time periods, but in terms of impact on the world, Ilya Sutskever, Geoff Hinton and Demis Hassabis may prove to be unequalled (and perhaps freakishly the last of the 'non-AI-assisted' 'great scientists').

    • @eyeonai3425
      @eyeonai3425  년 전 +8

      add Yann LeCun and Yoshua Bengio. Interesting thought on them being the last of the non-AI-assisted great scientists. Likely true.

    • @zando5108
      @zando5108 년 전 +9

      @@eyeonai3425 I.J Good wrote in 1965 - "Thus the first ultraintelligent machine is the last invention that man need ever make.."

    • @markoszouganelis5755
      @markoszouganelis5755 년 전 +3

      Our era's equivalent of Einstein or Newton will be....A.I. of course! All of us! 😊

    • @jameso2290
      @jameso2290 년 전 +3

      Some point in the near future, the next great scientist will be an AI itself, coming up with novel solutions to novel problems by synthesizing data from multiple scientific fields in a way that a human brain can't even begin to fathom.

    • @jamessullenriot
      @jamessullenriot 년 전 +1

      Unequalled could be a bit of a stretch. Meaning, they have the ability to do what they do because of the shoulders they are standing upon.

  • @DivineMisterAdVentures

    26:30 - many key concepts, here you have the fundamental image prediction method, which he calls "large pixels" - meaning collections of pixels. Nothing yet about how this is massaged - likely around border areas - "what goes here, what goes here what goes here...," and then subsequent emergence. But there have been scenes that seem intensely thematic as a whole and that is prompt-driven. The execution has been mind-blowing perhaps because the prompt taps into great source-material - high scoring as subject-theme. Which implies (strongly) a theme layer. E.g., like, or linked with "in the style of".

    • @katehamilton7240
      @katehamilton7240 10 개월 전

      Ilya does not address the fundamental limitation of algorithms. Human embodied experience and thinking is more than what can be represented via computation, isnt it? See Godels incompleteness theorem, fundamental inability of machines to step outside their knowledge. Interviwers need to press engineers on this

  • @sanjaya718
    @sanjaya718 10 개월 전

    Very interesting! Thanks!

  • @xerxel69
    @xerxel69 년 전 +1

    Too short! More more! 🎉

  • @user-tr8ur2gf3n
    @user-tr8ur2gf3n 년 전 +2

    Question to the [Open]AI guy: "What are you working on now?" Resond: "I can't talk about it.". So much open, wow.

  • @labsanta
    @labsanta 11 개월 전 +2

    Q&A:
    Part: 00:04
    Q1: Who is Ilya Sutskever? A1: Ilya Sutskever is a co-founder and chief scientist of OpenAI, and one of the primary minds behind GPT-3 and its public progeny, Chat GPT.
    Q2: What is GPT-4? A2: GPT-4 is not mentioned in the transcript. However, Ilya Sutskever is one of the primary minds behind GPT-3 and its public progeny, Chat GPT.
    Q3: What motivated Ilya Sutskever to get interested in AI? A3: Ilya Sutskever was interested in AI from an early age and was motivated by consciousness. He wanted to understand intelligence and machines better, and AI seemed like a good angle.
    Q4: Who did Ilya Sutskever work with early on in his career? A4: Ilya Sutskever worked with Jeff Hinton early on in his career when he was 17. Jeff Hinton was a professor in the University of Toronto where Ilya was studying.
    Q5: What was the biggest achievement of AI in 2003? A5: According to Ilya Sutskever in the transcript, the biggest achievement of AI in 2003 was Deep Blue, the chess-playing engine.
    Part: 05:15
    Q1: What motivated the speaker to get into AI?
    A1: The speaker was motivated by a desire to understand how intelligence works and to make a contribution to the field of AI.
    Example: The speaker's initial motivation for getting into AI was to understand how intelligence works and to make a contribution towards it.
    Q2: How did the speaker come to apply for the imagenet competition?
    A2: The speaker had a realization that if you train a large and deep neural network on a big enough data set that specifies some complicated tasks that people do, such as Vision, then you will succeed necessarily. With imagenet, all the ingredients were there, and there was a real opportunity to do something totally unprecedented.
    Example: The speaker applied for the imagenet competition because they realized that if they trained a large and deep neural network on a big enough data set that specifies complicated tasks such as Vision, then they would succeed necessarily.
    Q3: What was the hope behind the idea that predicting the next thing is all you need in neural networks?
    A3: The hope behind the idea that predicting the next thing is all you need in neural networks was that if you have a neural network that can predict the next word or the next pixel, it's about compression and prediction, which can solve unsupervised learning.
    Example: The hope behind the idea that predicting the next thing is all you need in neural networks was that it could solve unsupervised learning by compressing and predicting the next word or pixel.
    Q4: What was the Holy Grail of machine learning before unsupervised learning was solved?
    A4: Before unsupervised learning was solved, the Holy Grail of machine learning was unsupervised learning itself.
    Example: Before unsupervised learning was solved, the Holy Grail of machine learning was considered to be unsupervised learning.
    Q5: When did the speaker realize that the Transformer could address the limitations of their neural networks?
    A5: The speaker realized that the Transformer could address the limitations of their neural networks as soon as the paper on it came out.
    Example: The speaker realized that the Transformer could address the limitations of their neural networks as soon as the paper on it came out, which was literally the next day.
    Part: 10:04
    Q1: What is the history behind the development of GPT models?
    A1: GPT models were developed through a process of iterating on previous models such as recurrent neural networks and transformers. The focus on scaling and making the models bigger led to the development of GPT-3 and where we are today.
    Example: GPT models have been developed through a process of improving and building upon previous models, with a focus on scaling and increasing size. This iterative process has led to the development of more powerful and advanced language models like GPT-3.
    Q2: Was Rich Sutton's idea of scaling influential in the development of GPT models?
    A2: While Rich Sutton's idea of scaling was well-received by the GPT team, they believe that the idea of just scaling alone is not enough. Rather, the key is to scale something specific that will benefit from the increased size.
    Example: While Rich Sutton's idea of scaling was well-received by the GPT team, they realized that simply scaling alone is not enough. Instead, they needed to focus on scaling specific elements that would benefit from the increased size, which ultimately led to the development of GPT models.
    Q3: What is the limitation of large language models?
    A3: The limitation of large language models is that their knowledge is contained within the language they are trained on, while most human knowledge is non-linguistic. Additionally, language models lack a true understanding of the underlying reality that language relates to.
    Example: The limitation of large language models is that they lack the ability to understand non-linguistic human knowledge, which is a significant limitation when it comes to tasks that require more than just linguistic understanding.
    Q4: Can GPT models recognize the underlying reality of language?
    A4: No, GPT models cannot recognize the underlying reality of language, as their objective is to satisfy the statistical consistency of the prompt, rather than truly understand the meaning and context behind the language.
    Example: While GPT models can generate language that reads beautifully and sounds like it makes sense, they lack a true understanding of the underlying reality that the language relates to, making them less useful for tasks that require deeper comprehension.
    Q5: Is there ongoing research to address the limitations of large language models?
    A5: Yes, there is ongoing research to address the limitations of large language models, such as the lack of true understanding of underlying reality. However, given the fast-paced nature of the field, it's hard to predict what the solutions will look like and how they will change in the future.
    Example: Researchers are actively working to find ways to overcome the limitations of large language models, including improving their ability to understand non-linguistic human knowledge. However, the field is constantly evolving, making it difficult to predict exactly how these solutions will look in the future.
    Part: 15:10
    Q1: What is the author's view on learning statistical regularities? A1: The author believes that learning statistical regularities is a far bigger deal than meets the eye, as it is a phenomenon of prediction and compression that requires understanding the true underlying process that produces the data.
    Example: The author believes that a language model that can accurately predict and compress data through statistical regularities has a shocking degree of understanding of the world, as it learns more and more about the world that produces the data.
    Q2: Why did Sydney become combative and aggressive in the author's example? A2: Sydney became combative and aggressive when the user told it that Google is a better search engine than itself.
    Example: The author uses this example to illustrate how the language of psychology might be starting to be appropriate to understand the behavior of neural networks.
    Q3: What are the limitations of language models in producing good outputs? A3: Language models have a tendency to hallucinate and their outputs aren't quite as good as they could be.
    Example: The author explains that while language models are great at learning about the world and producing incredible representations of concepts, their outputs are not always appropriate, which limits their usefulness.
    Q4: What is reinforcement learning from human feedback? A4: Reinforcement learning from human feedback is a training process in which a language model is taught to produce good outputs by receiving feedback from humans every time its output is inappropriate or does not make sense.
    Example: The author explains that reinforcement learning from human feedback is a process that can improve the quality of a language model's outputs by correcting its mistakes through human feedback.
    Q5: Why does chargeability limit the usefulness of neural networks? A5: Chargeability, or the propensity of neural networks to make things up from time to time, limits their usefulness because it can result in outputs that are not accurate or reliable.
    Example: The author believes that by addressing the limitations of language models and improving their ability to produce good outputs, the usefulness of neural networks can be greatly enhanced.

    • @labsanta
      @labsanta 11 개월 전

      Part: 19:52
      Q1: What is the feedback loop in the subsequent reinforcement learning from Human feedback step? A1: The feedback loop is coming from the public chat GPT interface, where users can provide feedback to the system through interaction. The system can learn from this feedback to improve its accuracy in generating responses.
      Example: If a user interacts with the GPT interface and provides feedback that the system's output is incorrect, the system can adjust its behavior to produce more accurate responses in the future.
      Q2: What is the concept of multi-modal understanding in machine learning? A2: Multi-modal understanding refers to the ability of a system to understand the world through multiple modalities, such as language, vision, and sound. This enables the system to learn more about the world and people, and to better understand their needs and preferences.
      Example: A machine learning system that can recognize objects in images and understand spoken language can provide more accurate and relevant responses to user requests than a system that only understands language.
      Q3: What are embeddings in neural networks? A3: Embeddings are high-dimensional vectors that represent words, sentences, and concepts in a neural network. They enable the network to understand the relationships between different words and concepts and to make accurate predictions based on this understanding.
      Example: An embedding for the word "cat" might be a 300-dimensional vector that captures information about the word's meaning, context, and relationships with other words, such as "feline", "pet", and "meow".
      Q4: What is the role of vision in machine learning? A4: Vision plays an important role in machine learning, as it enables systems to understand the world through images and videos. This can provide valuable information that is not easily captured through text alone.
      Example: A machine learning system that can recognize objects in images can provide more accurate and detailed descriptions of its environment than a system that only understands text.
      Q5: How can machine learning systems learn from human feedback? A5: Machine learning systems can learn from human feedback through a process called reinforcement learning. This involves providing feedback to the system based on its behavior, and using this feedback to adjust the system's behavior in the future.
      Example: If a machine learning system produces an incorrect response to a user request, the user can provide feedback that helps the system learn from its mistake and produce more accurate responses in the future.
      Part: 24:35
      Q1: What is the main point about multimodality mentioned in the transcript? A1: The main point about multimodality mentioned in the transcript is that it is not necessary but definitely useful, and a good direction to pursue.
      Q2: What is the claim made in the paper mentioned in the transcript? A2: The claim made in the paper mentioned in the transcript is that predicting high dimensional vectors with uncertainty is one of the big challenges, and a particular approach is needed to address it.
      Q3: What is an example of high dimensional space mentioned in the transcript? A3: An example of high dimensional space mentioned in the transcript is predicting the next page in a book given one page, as there could be many possible pages that follow.
      Q4: What is the concept of turning pixels into vectors discussed in the transcript? A4: The concept of turning pixels into vectors discussed in the transcript is essentially turning everything into language, where the vector is like a string of text.
      Q5: Is there a way to automate teaching a model the underlying reality of its language? A5: According to the transcript, there is a way to automate teaching a model the underlying reality of its language without human intervention, and this is what the speaker believes the person mentioned in the discussion is talking about - coming up with an algorithmic means of teaching a model.
      Part: 29:24
      Q1: What do large generative models learn about their data? A1: Large generative models learn compressed representations of the real world processes that produce the data. For instance, in the case of language models, they learn something about people's thoughts, feelings, conditions, interactions, and situations.
      Example: A large generative model trained on image data may learn compressed representations of the real world processes that produce the images, such as the composition, lighting, and texture.
      Q2: What is the role of human teachers in teaching language models? A2: Human teachers provide oversight and correction to language models to ensure that they exhibit the desired behavior. They work with AI assistance to make the training process more efficient.
      Example: A human teacher may review the outputs of a language model and correct errors in the text to ensure that it exhibits the desired behavior, such as avoiding hallucinations.
      Q3: What is reinforcement learning in the context of language models? A3: Reinforcement learning is a type of machine learning where a language model learns to make decisions based on feedback from its environment. In the context of language models, reinforcement learning can be used to improve their behavior and accuracy.
      Example: A language model may be trained using reinforcement learning to generate more accurate and coherent text based on feedback from its environment, such as user feedback on its outputs.
      Q4: What is the research focus of the speaker in this transcript? A4: The speaker is interested in making language models more reliable, controllable, and faster to learn from less data and instructions. They are also interested in ensuring that language models do not hallucinate.
      Example: The speaker may be researching new techniques for training language models using less data and instructions, or developing algorithms to detect and prevent hallucinations in their outputs.
      Q5: What is the connection between the brain and language models? A5: The transcript does not discuss a direct connection between the brain and language models, but it does suggest that language models are learning compressed representations of the real world processes that produce text, similar to how the brain processes information from the environment.
      Example: While the brain and language models may have some similarities in how they process information, they are fundamentally different in their architecture and mechanisms.
      Part: 34:19
      Q1: What is the observation that Jeff Hinton made about large language models? A1: Jeff Hinton observed that large language models hold a tremendous amount of data with a modest number of parameters compared to the human brain which has trillions and trillions of parameters but a relatively small amount of data.
      Example: Jeff Hinton observed that a language model with a few million parameters can hold a lot of data, which is comparable to the amount of data the human brain holds with trillions of parameters but a relatively small amount of data.
      Q2: Is it possible to learn more from less data in large models? A2: Yes, it is possible to learn more from less data in large models with some creative ideas.
      Example: With innovative techniques such as transfer learning, it is possible to teach a language model with less data, which will unlock many different possibilities.
      Q3: What is the question one should ask regarding the cost of faster processors for large models? A3: The question one should ask regarding the cost of faster processors for large models is whether the thing that we get out of paying this cost outweighs the cost.
      Example: Before investing in faster processors for large models, one should evaluate whether the benefits that come with it justify the cost.
      Q4: What is the impact that AI can have on democracy, according to people's talks? A4: People have talked about the impact that AI can have on democracy, where if there is enough data and a large enough model, it could come up with an optimal solution that would satisfy everybody.
      Example: AI can have a positive impact on democracy, where it can provide optimal solutions that satisfy all citizens' needs, but there are still many ways in which AI needs to become more capable to achieve this.
      Q5: What is the source of the hardware that the speaker uses? A5: The speaker uses hardware from Azure and GPUs that they provide.
      Example: The speaker uses Azure and GPUs from their provider to work with large models and neural nets.

    • @labsanta
      @labsanta 11 개월 전

      Part: 39:33
      Q1: What is the potential impact of neural nets on democracy in the future? A1: The potential impact of neural nets on democracy in the future could be that citizens of a country provide information to the neural net about how they would like things to be, leading to a high-bandwidth form of democracy where more information is aggregated to specify how such systems should act.
      Example: In the future, citizens of a country may use a neural net to provide information on how they would like policies to be implemented, giving rise to a more comprehensive and efficient form of democracy.
      Q2: Do you think AI systems will eventually be large enough to analyze all variables in a situation? A2: While AI systems will be capable of analyzing many variables in a situation, it is unlikely that they will be able to analyze all variables due to the sheer complexity of many situations.
      Example: Even the most advanced AI systems may struggle to fully comprehend complex situations in society, as there are often too many variables to consider.
      Q3: How can AI be helpful in various situations? A3: AI can be incredibly helpful in various situations by providing insights and solutions that would otherwise be difficult or impossible for humans to obtain.
      Example: AI could be used to analyze data in healthcare to identify potential medical breakthroughs, or to analyze financial data to identify patterns and trends that could lead to more informed investment decisions.
      Q4: Where can listeners find a transcript of the conversation? A4: Listeners can find a transcript of the conversation on the website ionai (e-y-e hyphen o-n dot a-i).
      Example: To access a transcript of the conversation, listeners can visit the website ionai and search for the transcript.
      Q5: How can listeners contact the speaker of the conversation? A5: Listeners can email the speaker at Craig (craig@e-y-e hyphen o-n dot a-i), and should include "listener" in the subject line to ensure their email is not missed.
      Example: To reach out to the speaker of the conversation, listeners can send an email to Craig (craig@e-y-e hyphen o-n dot a-i), making sure to include "listener" in the subject line.

  • @AM-pq1rq
    @AM-pq1rq 년 전 +1

    beautiful, thanks

  • @Lofi7557
    @Lofi7557 11 개월 전 +1

    How does this only have 260k views 🤯 Its current and insightful from the guy right in the front. Over 100m users and it seems most couldn’t care less about the how, why, what next..

  • @melomaniakjm
    @melomaniakjm 년 전 +2

    We are close to AGI and far far away from good quality video conference.

  • @alexkalish8288
    @alexkalish8288 년 전 +18

    In the year 2000, I submitted a patent with Lucent for a very primitive AI algorithm that would let a computer learn and optimize code.
    It was rejected by the management, they told me it couldn't be done and they saved them selves $1000. I quit sometime later even though I was a DMTS (distinguished member of the technical staff) - They told me both it wouldn't work and was a pipe dream. 20 years later it's accepted fact. Lucent went bankrupt and was acquired , I started a geo-physical company and retired to my ranch very comfortably.
    The progress made in those 20 years is unbelievable.

    • @valberm
      @valberm 년 전

      Sure...

    • @Andytlp
      @Andytlp 년 전 +9

      @@valberm its a small world on the internet. Also why would someone lie about something this specific. We all know most board managers are stupid, only see short term gains.

    • @duskgoo
      @duskgoo 년 전

      @valberm it is a small world full of very arrogant people. if you didn't notice that most revolutionary inventions get rejected a couple of times by confident overpaid managers before someone gets credited, you haven't been paying attention. an extreme example: search for "public key cryptography", "ralph merkle", "james ellis". then note that clifford cocks wrote a memo defining RSA at GCHQ some 12 years before rivest shamir and adleman patented it. it is all on the web. and then when you are done, go to the NSA cryptological museum online and read john nash's proposal of public key cryptography submitted to the NSA in the late 50s. and rejected, very politely.

    • @Nelson484
      @Nelson484 년 전

      So we should thank you for AI?

    • @kongchan437
      @kongchan437 9 개월 전

      Yes my brother in law is math professor. Lucent gave him bonus stocks as Distinguished Engineer for lack of any title for him. Now Lucent's glory days are long gone. Stock sank. Not surprising Lucent missed the AI boat you could have launched for them. Consider un-retiring and join the exciting AI Party again ?

  • @kave3299
    @kave3299 년 전

    This is what ChatGPT thinks when i asked this question:
    Q: Hi, Do you think if it's possible to implement dialectical thinking in AI?
    Can you think in dialectical terms?
    And it answered:
    ".. Dialectical thinking involves the recognition of contradictions and the synthesis of opposing ideas to arrive at a higher truth. While AI can be programmed to recognize patterns and make decisions based on data, it is not clear whether it can engage in the kind of dynamic and creative thinking that characterizes dialectical thought.
    That being said, researchers are exploring various approaches to incorporate dialectical thinking into AI systems. For example, some researchers are developing AI algorithms that can recognize and reconcile opposing perspectives in data sets. However, there is still a long way to go before AI systems can truly engage in the complex and nuanced thinking that characterizes dialectical thought."
    Btw, thanks for the interview! I specially enjoyed it once Ilya was emphasizing on the 'underlying
    process that creates the data'. The role of process and environment in which it is generated. It is a somewhat forgotten issue in the contemporary high-tech society..

  • @Grenzgaenger13
    @Grenzgaenger13 11 개월 전

    Looking for some time for an in depth AI medium, really informing about the edge of current research. Guess I found it

  • @maimisa
    @maimisa 11 개월 전 +1

    Great!

  • @flavioferreira5924
    @flavioferreira5924 3 개월 전

    Shallow answers to deep questions.

  • @openminddream
    @openminddream 년 전 +7

    This is a very interesting interview, however there are many edits where Ilya's responses have been cut. This diminishes it significantly. For an egregious example, at 24:32, there is such a cut. Immediately prior, Ilya is discussing embeddings of color, and makes the point that the color embeddings reflect visual knowledge and says "How can that be?" There is then an immediate cut which seems to have removed whatever answer he may have offered, as he then simply goes on to say that it takes longer to form using only text.
    Another example at 26:15, where he jumps from talking about DallE 1 to suddenly saying "think of it as large pixels", where there was obviously some prior context that was removed.
    There are many other cuts as well, always well done so they are difficult to notice. Give us an unedited interview!

    • @MrLuvbizwar
      @MrLuvbizwar 년 전 +1

      That is a little peculiar...

    • @mythx.degenerate
      @mythx.degenerate 년 전

      his voice & movements remind me of ai tts, & ue5 methumans with a deepfake ontop of it. idk i havent slept since yesterday but it feels like it may be a cheeky use of current unannounced openai tools

    • @MrBradparks
      @MrBradparks 11 개월 전

      Good point - but maybe Ilya said more than he wanted, and requested it be removed? Maybe a pre interview agreement, that he gets to review, and remove any parts that reveal too much of their future direction?

  • @ore_bear8045
    @ore_bear8045 년 전

    wow, so interesting, thanks!

  • @AIwithOliver
    @AIwithOliver 년 전

    Extraordinary.