filmov
tv
PhD Thesis in 1 Day (300$): Open-Source AI
Показать описание
I show you HOW-TO generate a new scientific publication or your PhD Thesis in 1 day with open-source AI code. Utilizing GPT-4o for complexity tasks and idea generation, Claude 3.5 Sonnet for Software Engineering and AIDER as my parallel interface to generate a scientific report /PhD thesis with new results from autonomously run computer simulations by AI.
This synthetic science report includes our newly generated images and multiple graphs, bar charts, trend charts plus a complete LATEX file for a state of the art science publication (eg in computer science).
Of course another dedicated AI machine evaluated this synthetic research report according to official rules you can specify and then re-evaluate the scientific quality of this synthetic science report as an "independent" AI evaluator. Smile.
Ethical note: I do not recommend to generate a synthetic science report (or a theoretical PhD thesis) by AI (even with open source Ai code) but enjoy the PhD learning phase and do all the research yourself. Your time at the university is precious and you have the opportunity to learn how to learn. OR, alternatively, if you teach at the university, allow synthetic science reports for students - but increase the level of complexity to an extreme amount, comparable to a department output.
For industrial AI applications the computer simulation time-frame shortens significantly, more testing and more simulation for less costs, but you'll need the scientists for designing and trouble shooting any AI multi-agent complexity.
The central theme is the feasibility of writing a PhD thesis within a single day using AI. The video introduces tools like the new SWE benchmark, developed by OpenAI, to evaluate software engineering performance, highlighting GPT-4 Omni's impressive achievement of resolving 38% of the difficult "SWE-Bench Verified" subset. This benchmark not only measures the success rate of coding agents but also demonstrates the growing competence of AI in automating complex tasks. It is interesting to see how these benchmarks and AI tools can accelerate the research process (based on complex computer simulations - PyTorch, TensorFlow or JAX), making it possible to generate significant scientific contributions in a very short time.
We then explore the practical application of these AI tools in real-world research, focusing on a specific workflow that leverages AI for generating novel research ideas, coding implementations, and drafting scientific papers. I show you how a tool called AIDER can be used to automate the coding process by pairing with LLMs like GPT-4 Omni. The new system can iteratively refine code, simulate experiments, and produce visualizations using tools like Matplotlib. This automation extends to the entire research process, including literature review and novelty checking against current publications, all managed through a structured pipeline that ensures the quality and novelty of the research output.
00:00 SWE Bench Verified at 20% performance
05:18 AIDER for programming with any LLM
09:13 Harvard Business Rev: Accenture CTO AI Scientist
11:25 Sakana AI Scientist pre-print of 185 pages
26:19 Hyperparameters of AI Scientist
27:44 AI generates new Science & research ideas
34:36 How much does this new synthetic AI publication cost?
36:57 GitHub Open Source Code
37:36 AI designs new experiments for GROKKING LLMs
42:43 AI process notes (GitHub)
43:50 Create our own template for synthetic science
49:52 Synthetical AI generated science publications
50:55 Limitations and safety rules
All right with the authors of:
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery
Open-source code:
The LATEX template for a synthetic science paper on GROKKING:
The EXPERIMENT.py file for running AIDER on it:
3 synthetic science reports /publications on the topic of AI research:
#airesearch
#coding
#science
#phd
#newtechnology
This synthetic science report includes our newly generated images and multiple graphs, bar charts, trend charts plus a complete LATEX file for a state of the art science publication (eg in computer science).
Of course another dedicated AI machine evaluated this synthetic research report according to official rules you can specify and then re-evaluate the scientific quality of this synthetic science report as an "independent" AI evaluator. Smile.
Ethical note: I do not recommend to generate a synthetic science report (or a theoretical PhD thesis) by AI (even with open source Ai code) but enjoy the PhD learning phase and do all the research yourself. Your time at the university is precious and you have the opportunity to learn how to learn. OR, alternatively, if you teach at the university, allow synthetic science reports for students - but increase the level of complexity to an extreme amount, comparable to a department output.
For industrial AI applications the computer simulation time-frame shortens significantly, more testing and more simulation for less costs, but you'll need the scientists for designing and trouble shooting any AI multi-agent complexity.
The central theme is the feasibility of writing a PhD thesis within a single day using AI. The video introduces tools like the new SWE benchmark, developed by OpenAI, to evaluate software engineering performance, highlighting GPT-4 Omni's impressive achievement of resolving 38% of the difficult "SWE-Bench Verified" subset. This benchmark not only measures the success rate of coding agents but also demonstrates the growing competence of AI in automating complex tasks. It is interesting to see how these benchmarks and AI tools can accelerate the research process (based on complex computer simulations - PyTorch, TensorFlow or JAX), making it possible to generate significant scientific contributions in a very short time.
We then explore the practical application of these AI tools in real-world research, focusing on a specific workflow that leverages AI for generating novel research ideas, coding implementations, and drafting scientific papers. I show you how a tool called AIDER can be used to automate the coding process by pairing with LLMs like GPT-4 Omni. The new system can iteratively refine code, simulate experiments, and produce visualizations using tools like Matplotlib. This automation extends to the entire research process, including literature review and novelty checking against current publications, all managed through a structured pipeline that ensures the quality and novelty of the research output.
00:00 SWE Bench Verified at 20% performance
05:18 AIDER for programming with any LLM
09:13 Harvard Business Rev: Accenture CTO AI Scientist
11:25 Sakana AI Scientist pre-print of 185 pages
26:19 Hyperparameters of AI Scientist
27:44 AI generates new Science & research ideas
34:36 How much does this new synthetic AI publication cost?
36:57 GitHub Open Source Code
37:36 AI designs new experiments for GROKKING LLMs
42:43 AI process notes (GitHub)
43:50 Create our own template for synthetic science
49:52 Synthetical AI generated science publications
50:55 Limitations and safety rules
All right with the authors of:
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery
Open-source code:
The LATEX template for a synthetic science paper on GROKKING:
The EXPERIMENT.py file for running AIDER on it:
3 synthetic science reports /publications on the topic of AI research:
#airesearch
#coding
#science
#phd
#newtechnology
Комментарии