ChatGPT tries a BASIC Capture The Flag (CTF) Challenge

preview_player
Показать описание

🔥 YOUTUBE ALGORITHM ➡ Like, Comment, & Subscribe!
Рекомендации по теме
Комментарии
Автор

I've found (at least with Chat GPT 3) that if you provide the AI with a "character" to play at the beginning of a session it can have a real impact on its performance. In this case, I'd have probably started with something like, "In this session you're to play the role of a highly skilled penetration tester, and we'll be attempting a capture the flag challenge."

Telling it what its supposed to be can sometimes have an astonishing effect on the results.

StevenIngram
Автор

assuming that a language model like ChatGPT cannot mess up math, is the first mistake. It actually gets mathematically trivial things completely wrong. 😂

annorome
Автор

This is basically my same experience with ChatGPT on anything technical. It's hard to tell if it's just making stuff up more and more or if it might actually be slowly getting closer to the right answer but it never gets it right the first time.

EmberHext
Автор

Sometimes once GPT-4 gets lost it actually works better to start a new chat and rephrase the question. It seems to get stuck on previous responses earlier in the chat which throws it in the wrong direction continuously. I think it is an amazing tool, and it has saved me so much time scripting things that i wouldn't normally be attempting to script. It definitely has its uses but it obviously can't solve everyone's problems and it does have its failures.

Dnarchy
Автор

I do find ChatGPT very useful for long, tedious and mindless tasks - such as generating a struct from an input data type, writing markup based on a spec, scaffolding unit tests, etc. You do always need to read over what it produces, as sometimes it just confidently spews a bunch of incorrect outputs, but when used as a tool (rather than a full replacement) it can be a great time saver, letting us focus more on the meat of tasks.

AliciaSykes
Автор

I was a college CS instructor for a few years. Chat GPT reminds me of an over eager and knowledgeable student I had who kept trying to convince people he knew more than he actually knew. I wish ChatGPT would include a confidence rating for its responses. Or work that confidence wording into its responses. So instead of saying "Apologies the answer is X. X should work." It should say "Whoops, idk. Uhm.. try X please? I'm a little unsure about when you said " Maybe that will be Chat GPT 5

SaintSaint
Автор

love your interaction with GPT, exactly like the day I had yesterday. Thought GPT knew what is was doing, but found myself going around in circles, and would have been quicker if I hadnt used GPT.

seb_gibbs
Автор

It works better when A. As you stated having a better knowledge of the materials and B. Giving full error outputs really helps so using jgrasp or pycharm is better in this case than terminal

criostasis
Автор

For an AI class I took, the teacher generated the midterm questions using ChatGPT. One of the questions was making a 3x3 array where each node was equal to arcsine of its i + j. Arcsine is only defined between -1 and 1. ChatGPT can do math wrong, in fact, when I asked the question which was already generated by ChatGPT to itself, it provided an incorrect answer. Only by handholding it, I could make it so it returned 0 when i + j was not in Arcsine's range.

necraul
Автор

I have the same experience using ChatGPT for any problem solving. It just goes around in circles. It is very helpful for simpler tasks though.

lawliot
Автор

Did you notify ChatGPT that you found the flag?

I became a bit tired of the apology thing and wrote that it was not necessary. After a few times repeating that it was really not necessary, there were no more apologies.
At one point I was given a few suggestions, I chose one and hhen at a later time in the chat GPT asked why I had made that choice (!).
I explained my reasoning and GPT responded gratefully for my explanation and indicated that it understood within the context we were talking about why I had made that choice.
There is a clear desire to help in the program, but I found the feedback question very surprising

NL
Автор

It's interesting how so many commenters have a variant of "Haha, it took awhile to solve it!" or "Look at all its failed experiments trying to solve it! HAHAHA", yet we humans do the exact same thing. Plenty of folks will struggle and experiment trying to solve this challenge. The AI is very human like in this way, and in the end succeeded, not even taking that long to succeed. Other humans will "hallucinate" and give you bad ideas and advice all the time -- the utility of ChatGPT is not it being "perfect" (impossible because knowledge is neither perfect nor complete, nor absolute), but about being as good or better than the average person you ask in the field for ideas. Just treat it like another human (not because it is, but because that is how it acts), and none of this becomes mysterious or strange, and vet its info the same you would do any random non-SME-in-your-exact-problem coworker.

SmartassEyebrows
Автор

I know this is old, but if ChatGPT ever truncates code/stops running because the streaming API stopped, all you need to do is provide an ellipse “…” and it will continue where it stopped. Almost like it knows it does this regularly and knows it messed up.

Milamber-pgju
Автор

So what i found is it is good at Q/A type things. You can ask it to explain code or write pieces but it’s far from perfect without some experience in the subject. You have to steer it in the direction you want. When it does go down a wrong path, you need to ask it why and give it suggestions. It’s not a bad study aid or guide when exploring things you have some experience with but if you don’t have any knowledge or experience it can be difficult. You walk through explanations and use it to walk through a problem. Then I usually start a new session with a new starting point now that I have a better understanding of what I am looking for. I’ve done some testing in writing songs, story building, coding, engineering a prototype and other projects. I’d say it’s about as good as a intern or perhaps a little better than Jr. employee at times. But having a assistant to help answer questions at your fingertips and provide a 2nd opinion or fact check details can be a big help.

MrMaramor
Автор

'AI Kool-aid' 👍 great wording

valk
Автор

that went about as well as i expected from my own experience, some code it generates just fine if your using a version released prior to 2021, thats here the data for most topics has cut off. however if your asking it for something that it doesnt have in its training data, then it just makes random stuff up without telling you, even if your asking it to tell you if it made the thing up or not. the guesses are based on assumptions from reading the info it has and thinking how the rest should work. these guesses are rarely accurate.
you may get it to solve an older CTF challenge that uses knowledge prior to its cut off point but even then your taking risks if your not able to verify if what its saying is correct and as you have encountered when its making stuff up you will run in circles because when it runs out of reasons why it could fail it just restarts with the first thing it thought to be the issue and regenerates the same or similar code for that.
so its fair to assume when you see it repeat the same thing without having taken any steps that indicate that its now time to do that as the next step that its just making it up and you wont get anything useful out of it.
shame to see what this is still the case with gpt-4 which you used because im on gpt-3 (the free version) which is even more prone to this kind of behaviour.

ai-spacedestructor
Автор

tried to solve some other challenges with chat GPT. Had no luck so far but I'll continue trying. When it comes to harder CTFs it gets quickly lost

_hackwell
Автор

Man, you always make banger content. Love this video John!

therealjsevilla
Автор

Thank you for making this video, it showcases some of the downsides of AI that I've been trying to convince a handful of my co-workers of for a while now lol.

tincup
Автор

generally you can say something like « take a step back from the code and resonate logically » and you get better answers

paulcrinq