Is CODE LLAMA Really Better Than GPT4 For Coding?!

Matthew Berman

9 месяцев назад

110,683 Просмотров

Скачать видео

Комментарии:

Matthew Berman - 30.08.2023 17:39

What tests should I add to future coding tests for LLMs?

Ответить

waqar_asgar__r - 13.10.2023 11:45

With this man every coding assistant model is the best coding assistant model 😂😂

Ответить

A-Z of Eveything AI (Azai) - 03.10.2023 17:21

Thanks Great Video! I found LLama to be great to code with and I am integrating Llama2 into our own Multi Application Platform.

Ответить

Максим Икрянов - 02.10.2023 09:35

CRAZY!!!

Ответить

(don't) blame people, blame the government - 24.09.2023 22:52

How well it compares to other languages than Python?

Ответить

bert imus - 21.09.2023 17:44

Yes, Please show us how to locally install it! They charge through the nose soon.

Ответить

avinash eediga - 20.09.2023 10:12

Yes please please make a video regarding setup

Ответить

Alex Leiva - 17.09.2023 13:39

Great video, how does it compare with WizardML?

Ответить

Steve Heggen - Aquarelle - 14.09.2023 09:28

Hi Matthew, amazing video! Thanks!
Could you tell me what is your Graphic card ?

Ответить

Ken - 13.09.2023 19:39

Hey Matthew - would be great for you to do a deep dive in Text Generation UI and how to use the whole thing.. Also, cover GGUF and GPTQ (other formats too) would be helpful...

Ответить

Shukur Abdul - 09.09.2023 21:24

can you test on falcon LLM and is it better than LLAMA or chatgpt 4?

Ответить

Jimmy Joel - 08.09.2023 18:31

would be interesting to ask CodeLlama to generate Game Theory simulations. Just to see how much of Math or other non-developer domains it can bring as code.
I've done it with GPT-4 and is really cool how much Game Theory you can learn just by running python examples.

Ответить

Kuro Misu - 07.09.2023 20:11

which model would you suggest for three.js or babylon.js?

Ответить

Lloyd Keays - 07.09.2023 19:18

I'm struggling to figure out the workflow for iterative conversations with codeLLAMA. The examples are all single prompt-response pairs. I want guidance on prolonged, iterative back-and-forth dialogues where I can ask, re-ask, and ask further over many iterations.

A tutorial showing how to incrementally build something complex through 200+ iterative prompt-response exchanges would be extremely helpful. Rather than one-off prompts, walk through prompting conversationally over hours to build up a website piece by piece. I want to 'chew the bone' iteratively with codeLLAMA like this.

Ответить

Kevin Kelly - 07.09.2023 11:05

Will the 34B run on a 4090?

Ответить

Andreas - 06.09.2023 17:42

I was able to coax chat gpt into writing a working snake game. I used iterative prompting. At one point I ran the program, receiving an error, I pasted that error and chatgpt resolved it correctly. Ultimately it correctly implemented snake with one random
fruit.

Ответить

ade spade - 06.09.2023 05:09

install it now and it'll be out of date in a few months, with some other LLM beating it, good vid but I'm sticking with chatgpt for now.

Ответить

MegaPixel - 06.09.2023 05:07

Would have been good to see how it does on other languages such as html, css, scss, js, ts, php, node etc

Ответить

UnderMind - 05.09.2023 20:07

Man, you turned my world around
Thanks for your content!

Ответить

Nikola Jankovic - 05.09.2023 18:46

What is with these shocked faces on thumbnails?

Ответить

Stumbli ANiM - 05.09.2023 10:47

You are biased Man. You see the result in first. Okay, As you are trying very hard to convince yourself that the LAMA is best but its not. You are just testing when the GPT fail 🤣. Man I build softwares with gpt and already selling.

Ответить

Michael - 04.09.2023 03:26

That was impressive. I like to ask, "build a calculator that adds, subtracts, divides and multiplies any two integers. Write the code in html, css, and JavaScript"

Ответить

Steven Elliott - 04.09.2023 01:39

Nice video. For some reason the snake game I got was not as good as the one you got. What I got was shorter, and had at least one syntax error. It's strange because, as far as I can tell, I did everything the same way, same prompt, same settings, etc. Anyone else have trouble?

Ответить

Blender Wiki - 03.09.2023 06:01

An excellent video, but IMMO it doesn't reflect the reality for most people. When you're coding, you usually don't spend an extensive amount of time crafting a 'perfect' prompt; otherwise, you might as well write the code yourself. Typically, prompts are more 'casual' and resemble brainstorming phrases.

One significant issue I've observed with current LLM models (particularly with GPT rather than Llama) is their tendency to prioritize your prompt over providing the 'correct' answer. If your prompt is misleading or partially incorrect, the model often generates a response that's also flawed, akin to a 'political correctness mistake.' Rarely do the models suggest, 'I think you're mistaken; here's a better way to do it.'
Conducting a more scientific test on this aspect could be quite interesting.

Ответить

Max - 03.09.2023 05:47

Yes please, a Full tutorial on how to get it installed on a gaming laptop would be epic! Thank you!

Ответить

savire.ergheiz - 03.09.2023 03:13

Wow now lets get those AAA game companies bankrupt 😂

Ответить

john johnson - 03.09.2023 01:52

Nobody is going to mention how he used parameters in LLAMA but didn't even optimize chatgpt 4 with plugins.

Ответить

john johnson - 03.09.2023 01:45

This is laughable. All independent sources have EVERYTHING behind chatgpt4 and LLAMA behind chatgpt3.5. nice try though!

Ответить

NaN - 03.09.2023 00:50

LLaMA is trash.

Ответить

Stumbli ANiM - 02.09.2023 23:57

Not even close.

Ответить

Guillaume Vermeille Sanchez Metal Vocal Covers - 02.09.2023 20:01

This just sounds like an overfitting tests. Those challenges are very likely in the training set of those models.

Ответить

dgunia - 02.09.2023 19:55

Hi! Did you see that in the example where ChatGPT "failed", an undefined situation was checked? The function all_equal should return if all items in the list are equal. But then it checked it with an empty list, "all_equal([])" and wanted it to return "True". However, the question did not define what should happen when the function is used with an empty list. Why should it return "True"? Are all items equal if there are no items in the list? I.e. are all items in an empty list equal? 😉

Ответить

Maximilian - 02.09.2023 12:37

How about giving it much harder problems to solve? These are cookie cutter problems.
Ask it to generate test data for a radar software then tell it to apply a kalman filter to make the radar predicts better.

Ответить

Dtory - 02.09.2023 00:50

This is why I subscribed to this channel. Connecting the viewer to the actual project

Ответить

yw1971 - 02.09.2023 00:23

Yup, there it is
Yup, there it is

Ответить

JeKK - 01.09.2023 23:31

Chat gpt 3.5 turbo?

Ответить

Jorge Martínez - 01.09.2023 22:59

Amazing content, thanks a bunch.

Ответить

Colfax Schuyler - 01.09.2023 20:09

"Better than GPT 4!!"

Meh.

I've got a calculator in my cell phone. It's better than GPT 4.

So what?

Ответить

Team Up With AI - 01.09.2023 18:54

If you install this Llama model, it will be free, but what's machine that will run it? You need 32GB RAM - does the quantization work here to help you run this model on 16 GB?

Ответить

AI Hypnosis - The Truth of AI - 01.09.2023 14:10

Thing is .. the corups re large.. all these examples are part of the training intentionally..the thing is not to ask for examples and snake games.. ask for real code, coz i dont think u make snake games as real use cases

Ответить

David Cabanis - 01.09.2023 13:01

+1 on the code Llana installation video.

Ответить

Senhua Wu - 01.09.2023 01:45

what specs do you need to run the 34B parameter version?

Ответить

testales - 01.09.2023 00:34

For some reason I don't get the code you got. I've used all the same settings, prompts and even reinstalled Oobabooga from scratch. i've also tried the 32g version which is supposed to be more accurate. I've got a few versions running too though, none of them working as supposed. I was also impressed by the communication while debugging. The AI suggested for example to add some print instructions to get more information and then tried making fixes with my feedback based on this.

Ответить