Diffusion models explained in 4-difficulty levels

Diffusion models explained in 4-difficulty levels

AssemblyAI

2 года назад

117,681 Просмотров

In this video, we will take a close look at diffusion models. Diffusion models are being used in many domains but they are most famous for image generation. You might have seen diffusion models at work through Dall-e 2 and Imagen.

Let's look into how diffusion models learn and manage to create high-resolution, realistic images.

Check out the blog post for a more detailed look at diffusion models. https://www.assemblyai.com/blog/diffusion-models-for-machine-learning-introduction/

Get your Free Token for AssemblyAI Speech-To-Text API 👇https://www.assemblyai.com/?utm_source=youtube&utm_medium=referral&utm_campaign=yt_mis_30
▬▬▬▬▬▬▬▬▬▬▬▬ CONNECT ▬▬▬▬▬▬▬▬▬▬▬▬

🖥️ Website: https://www.assemblyai.com
🐦 Twitter: https://twitter.com/AssemblyAI
🦾 Discord: https://discord.gg/Cd8MyVJAXd
▶️ Subscribe: https://www.youtube.com/c/AssemblyAI?sub_confirmation=1
🔥 We're hiring! Check our open roles: https://www.assemblyai.com/careers

▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬

#MachineLearning #DeepLearning
Ссылки и html тэги не поддерживаются


Комментарии:

@saraebrahimi3795
@saraebrahimi3795 - 18.12.2023 08:43

that was aweeeeessssommmmmmeeeee

Ответить
@alaad1009
@alaad1009 - 15.12.2023 14:10

Thank You !

Ответить
@inetmiguel
@inetmiguel - 12.12.2023 18:41

Nice explanation! I feel like the video title is misleading, it is just one explanation going deeper and not complete without the deeper levels of knowledge and differs a lot from other videos that start from zero the explanation at different levels. This is more like 4 shades of Diffussion :D Thanks for sharing!

Ответить
@randomaccessofshortvideos6214
@randomaccessofshortvideos6214 - 09.12.2023 10:38

❤🎉 amazing lecture

Ответить
@sotasearcher
@sotasearcher - 22.11.2023 19:27

Such a great video to dive in! I'm live streaming learning about Diffusion, right now!

Ответить
@sinsernadeesoyo
@sinsernadeesoyo - 20.11.2023 07:27

This video was awesome! Well done :) and thank you

Ответить
@terjeoseberg990
@terjeoseberg990 - 13.11.2023 22:36

Wow! There’s another video if yours below this one, and your hair is so different that I didn’t recognize that it’s you.

Ответить
@harshadmane8785
@harshadmane8785 - 10.11.2023 14:42

Great

Ответить
@John-eq8cu
@John-eq8cu - 09.11.2023 08:21

I want to understand diffusion models so I can understand how it's possible for artificial intelligence to produce an image. Your explanation helps. A bit.

Ответить
@zenchiassassin283
@zenchiassassin283 - 24.10.2023 18:23

Any level >= 5 ?

Ответить
@AIMLDLNLP-TECH
@AIMLDLNLP-TECH - 21.10.2023 09:38

Appreciate your explanation skill.
Q. What is diffusion model
Ans. Let's say you tell your best friend, Sarah, about this amazing new flavor. Sarah gets excited and tells her friend, Tom. Then Tom tells his cousin, Emily. Emily, in turn, tells her family, and the news keeps spreading from person to person, creating a chain reaction. This process of your ice cream flavor information spreading from one person to another is like how a drop of ink spreads in water. At first, it's just a small spot, but then it spreads out and covers more and more area as time goes on.

In the diffusion model, experts study how things, whether it's information, ideas, or products like your ice cream flavor, spread through a community of people. They try to understand how fast it spreads, how many people it reaches, and what factors influence its spread. By understanding these patterns, they can learn a lot about how people share and adopt new things!

Ответить
@malikfahadsarwar2281
@malikfahadsarwar2281 - 23.09.2023 10:48

It would be good if you also explain the reverse process in detail as you explained the forward process

Ответить
@alirezaakhavi9943
@alirezaakhavi9943 - 04.08.2023 08:18

really amazing video thank you very much! subbed! :)

Ответить
@yaruuvva
@yaruuvva - 12.07.2023 22:53

I need level 5/6/7 of explanations

Ответить
@cipherxen2
@cipherxen2 - 03.07.2023 22:15

She might not have technical background. No technical person will mispronounce variance as variation.

Ответить
@xgalarion8659
@xgalarion8659 - 22.06.2023 04:51

Good explanation but i do hate when papers add needless maths and physics which are tangential at best when they should be describing their model in a simple way.

Ответить
@yousufmamsa
@yousufmamsa - 18.06.2023 04:33

Great explanation of diffusion models. Thank you.

Ответить
@Democracy_Manifest
@Democracy_Manifest - 09.06.2023 16:05

This is an excellent video. Love the format. Well done, more please!

Ответить
@spadaacca
@spadaacca - 31.05.2023 21:30

bewbs

Ответить
@thobeycampion5387
@thobeycampion5387 - 31.05.2023 00:53

wow someone finally pulled this off

Ответить
@truejim
@truejim - 30.05.2023 05:27

In the level 1 explanation, what’s the point of introducing the phrase “thermodynamic equilibrium”? Most lay people understand what it means when we say food coloring diffuses into clear water. Reminding the viewer why that happens from a physics standpoint makes the level 1 explanation less clear, not more clear.

Ответить
@kaushiks7303
@kaushiks7303 - 26.05.2023 11:13

Thank you so much for the elegant explanation.

Ответить
@kaiboshvanhortonsnort359
@kaiboshvanhortonsnort359 - 01.05.2023 23:14

I dunno about all that, I just type in 'boobs' and the thing delivers. Whatever math those silicon wafers decide to subject themselves to, that's on them.

Ответить
@ahmedsinger9435
@ahmedsinger9435 - 15.04.2023 08:26

Tysm <3

Ответить
@Gurugurustan
@Gurugurustan - 14.04.2023 16:51

Can someone explain why do we need to know the initial position of the ink in water if we already knew where the ink was first introduced?

Ответить
@potrishead
@potrishead - 05.04.2023 07:44

Sorry, but this video is very frustrating. Nothing was explained in terms of either the technique for reversing or how it relates to new image creation when prompting, which is obviously what we are mostly interested on.

Ответить
@paramino
@paramino - 27.03.2023 22:21

This is very good intro for quick understanding of the concept 👍

Ответить
@ivuvu4065
@ivuvu4065 - 23.03.2023 11:28

6 minutes explaining nothing and at the end.. blablabla super fast about convolution... and nothing clear :/

Ответить
@al-aminibrahim1394
@al-aminibrahim1394 - 10.03.2023 18:03

thanks for this

Ответить
@jonathaningram8157
@jonathaningram8157 - 27.02.2023 05:41

there is the whole language part missing.

Ответить
@targetdexter
@targetdexter - 18.02.2023 15:38

Great explanation!

Ответить
@BogdanEchoMilosevic
@BogdanEchoMilosevic - 15.02.2023 21:18

Having just watched 5 videos on this, umm, "topic?" I feel as if I have been in a coma for 25 years. I am looking for the simplest possible explanation on how this whole AI thing works, yet there don't seem to be any videos that can explain that without using already established terminology that, to me, is completely foreign. Your video is obviously well made, and you are good at explaining this, especially with the example of a drop of paint in water, but I am obviously so far from even beginning anything beyond. Apart from understanding "noise", I have no clue as to what "diffusion", or "model" or anything means. I could always watch videos on any topic, i.e. quantum physics, rocket science, robotics, or anything, and get the basic idea, but this time I feel like I'm years behind... If you could make a video explaining this as if you would explain it to someone in kindergarten, I would definitely come back and watch

Ответить
@kartikpodugu
@kartikpodugu - 03.02.2023 17:16

Learnt a lot of new things from this video.

Why it is called as UNet
Why it is called as diffusion model.
What diffusion model does and how it does.
Thanks

Ответить
@MistereXMachina
@MistereXMachina - 26.01.2023 05:11

Can we take a moment to appreciate how silly it is to say, "we're gonna explain this in 4 levels - 1 being the easiest, 4 being the hardest"

and immediately starting level 1 with: "diffusion models were inspired by non-equilibrium thermodynamics from physics and as you can understand from the name this field deals with system d that are not in thermodynamic equilibrium"

next time ask ChatGPT to write it for you lmao, imagine going up to a five year old and being like,
"Hey kid, you're familiar with thermodynamic equilibrium right? Well the area of machine learning concerned with image generation using diffusion models takes that principle, but is inspired by its inverse."

Ответить
@retroathlete5814
@retroathlete5814 - 13.01.2023 18:11

Fine, you add noise to an image and then restore it. VERY simple concepts (even if very hard in practice). But the magic of DALL-E, Midjourney & Stable Diffusion is the creation of NEW images. This is the third video I'm watching that explains the same trivial diffusion concept. Guess I'll have to ask ChatGPT instead.

Ответить
@akrammekbal8936
@akrammekbal8936 - 05.01.2023 13:35

diffusion model can add noise to image1 and then in the revers process it make a different image (not the same) ?????? plz rpns?

Ответить
@pi5549
@pi5549 - 03.01.2023 10:23

'3/4/5-levels' looks like a very powerful way of explaining concepts. I'd like to see the higher levels be longer, and really drill down into the heart of the matter. So that the final level is communicating at an expert level. +1 / subbed.

Ответить
@chaneydw
@chaneydw - 29.12.2022 05:36

A very confusing, yet somehow great explanation of diffusion models. Thank you!

Ответить
@francescorizzo5027
@francescorizzo5027 - 21.12.2022 12:11

what role do images in the training set play? are diffusion models violating copyright or not?

Ответить
@paulatreides1354
@paulatreides1354 - 13.12.2022 18:27

meanwhile some artist have their art been trained as a SD models....having their art stolen in a way , people are loosing jobs , replaced by ai already ..i 'm waiting for this tech to be used for fake news , propaganda , terrorism , .;maybe some laws will appear instead of screwing up artists

Ответить
@mib32
@mib32 - 08.12.2022 20:26

Beautiful woman, can you share you instagram? :D)

Ответить
@abraruralam3534
@abraruralam3534 - 18.11.2022 09:24

This feels like it should not be possible...then again, its not too different from us humans imagining faces in the clouds. Computers just take this hallucination to the next level.

Ответить
@adammason1587
@adammason1587 - 18.11.2022 06:20

@AssemblyAI

Why 255 (Probability Density Graph), does it have to do with binary? Network Engineer here, and I am trying to draw correlations between IP address ranges being 255 and subnet ranges being 255 and the graph you displayed. They all have binary masks in common hence why I am asking.

Ответить
@0xeb-
@0xeb- - 16.11.2022 07:52

Good job.

Ответить
@BartoszBielecki
@BartoszBielecki - 09.11.2022 14:16

Regarding level 3. Is every single pixel diffused at each step, or there is a subset that is randomly chosen? Is the sampling separate for every pixel or we take one value and then multiply it by each pixel? Subsequent diffusions work on the already diffused value, I guess (we don't try to remember what was the mean of the original pixel, but just use the new one)?

Ответить
@S.Mullen
@S.Mullen - 07.11.2022 08:59

The explanation of adding noise was well done, but the reverse process--by far the strangest process--was not really explained at all. You introduced, but did not explain, some learning process. This unexplained process "somehow" gets back the image. Every "explanation" of SD always skips over this step! Why? (Also skipped, how the text prompt is "combined" with the image. Folks mumble about CL??, but never clearly explain it.) You are a very very good presenter. Please take 15 minutes to "explain" SD.

Ответить
@jhanolaer8286
@jhanolaer8286 - 30.10.2022 03:06

Beautiful❤

Ответить
@user-wr4yl7tx3w
@user-wr4yl7tx3w - 22.10.2022 02:29

This was so helpful. Love this format of starting easier and add layers of explanations.

Ответить
@Grifter
@Grifter - 14.10.2022 15:02

Fascinating stuff, Great explanation.

Ответить
@andikafaishal2230
@andikafaishal2230 - 13.10.2022 17:05

my brain cannot handle this

Ответить