Bioinformatics Project from Scratch - Drug Discovery Part 1 (Data Collection and Pre-Processing)

Do you want to collect your very own original biology dataset that you can use in your Data Science Project? In this video, I will show you how to download and pre-process biological activity data from the ChEMBL database that you can use to perform Computational Drug Discovery. The dataset comprises compounds (molecules) that have been biologically tested for their activity against a target organism or protein of interest. This video is Part 1 of a multi-part series on building a bioinformatics project.
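As a rough illustration of the pre-processing step (a sketch, not the code from the video), the snippet below cleans a handful of bioactivity records in plain Python. The field names `molecule_chembl_id`, `canonical_smiles`, and `standard_value` follow ChEMBL's activity records, but the records themselves are made-up toy data, and the activity cutoffs (1,000 nM and 10,000 nM) and the helper names `label_activity` and `preprocess` are assumptions chosen for the example.

```python
# Toy records mimicking fields of a ChEMBL activity export.
# The IDs, SMILES, and values are made up for illustration only.
records = [
    {"molecule_chembl_id": "TOY_1", "canonical_smiles": "CCO", "standard_value": "250.0"},
    {"molecule_chembl_id": "TOY_2", "canonical_smiles": "c1ccccc1", "standard_value": None},
    {"molecule_chembl_id": "TOY_3", "canonical_smiles": "CC(=O)O", "standard_value": "50000"},
    {"molecule_chembl_id": "TOY_1", "canonical_smiles": "CCO", "standard_value": "300.0"},
    {"molecule_chembl_id": "TOY_4", "canonical_smiles": "CCN", "standard_value": "5000"},
]

# Assumed cutoffs in nM; pick thresholds that suit your own target.
ACTIVE_MAX_NM = 1000.0
INACTIVE_MIN_NM = 10000.0


def label_activity(value_nm):
    """Map a potency value (nM) to a coarse bioactivity class."""
    if value_nm <= ACTIVE_MAX_NM:
        return "active"
    if value_nm >= INACTIVE_MIN_NM:
        return "inactive"
    return "intermediate"


def preprocess(rows):
    """Drop rows with no measured value, keep the first row per molecule,
    and attach a bioactivity class."""
    seen = set()
    cleaned = []
    for row in rows:
        raw_value = row.get("standard_value")
        if raw_value in (None, ""):
            continue  # no measurement to work with
        mol_id = row["molecule_chembl_id"]
        if mol_id in seen:
            continue  # duplicate molecule, keep the first occurrence
        seen.add(mol_id)
        value_nm = float(raw_value)
        cleaned.append({
            "molecule_chembl_id": mol_id,
            "canonical_smiles": row["canonical_smiles"],
            "standard_value": value_nm,
            "bioactivity_class": label_activity(value_nm),
        })
    return cleaned


cleaned = preprocess(records)
for row in cleaned:
    print(row["molecule_chembl_id"], row["standard_value"], row["bioactivity_class"])
```

Keeping only the first row per molecule is the simplest deduplication policy; averaging replicate measurements is another reasonable choice, depending on the dataset.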

#dataprofessor #bioinformatics #drugdiscovery #drugdesign #chembl #cheminformatics #bioinformaticsproject #bioinformaticproject #drug #drugs #molecule #molecules #machinelearning #lecture #bigdata #QSAR #QSPR #datascienceproject #randomforest #decisiontree #svm #neuralnet #neuralnetwork #supportvectormachine #python #learnpython #pythonprogramming #datascience #datamining #datascienceworkshop #dataminingworkshop #dataminingtutorial #datasciencetutorial #ai #artificialintelligence #tutorial #dataanalytics #dataanalysis #machinelearningmodel
Comments

This reflection comes thanks to the discussion with Shweta in this comment section. Back in the day, 7 years ago, we manually compiled the bioactivity data of more than 2000 compounds from hundreds of research articles. The whole process took 6 months, and then we spent a few more months manually curating the data and double-checking it again and again for consistency. Fast forward to today, and we can do the same thing in less than 10 minutes, as shown in this video. I am thankful for the generosity of the data providers in making these APIs available, as well as for libraries such as pandas (imagine handling hundreds of Excel files and manually curating those) and scikit-learn (imagine optimizing learning parameters manually on 50 computers via the GUI of data mining software such as Weka). Coding is indeed a real superpower. If you are wondering whether to learn coding, my recommendation is yes! It will be one of the best decisions for your career and hobby 😃

DataProfessor

Can't wait for part 2! I know your subscribers have been asking for this series!

KenJee_ds

Dear Data Professor,
I cannot even express how grateful I am for your content and dedication to your subscribers!
I come from a biological background and I am new to the bioinformatics world.
You give me motivation and great advice to continue studying in this incredible field.

Great work,
Greetings from Brazil!

soukisama

As a Bioinformatics MSc student I found this so interesting

khaifea

This video is a treasure that I have found. Probably the first video ever on ChEMBL data collection. I wish this video had been out when my paper was under review last year. Luckily, I was able to resolve the reviewer's query on ChEMBL.

shwetaredkar

Wow!!! Now I can learn DS and Bioinformatics from a Thai Professional. Thanks Prof for helping me get out of my SandBox prison mindset.😅😅😅

danieltoo

I want to show my sincere appreciation for how you made data science so simple for me and interesting

ElijahErureh

What an awesome explanation
I finally managed to find a channel to walk me through step by step.
I sincerely thank you

Yoursleepassistant

sir....thank you so much for simply sharing your knowledge.

manabendraborah

This is incredible. Thank you so much for sharing your knowledge and experience with us!

saraalm

Professor, this is an indispensable resource! You are the best!

gauravbhattacharjee

I watched this in May 2021, and I am sooo excited about this amazing project video. What great content, thanks Prof!

FarisIzzaturRahman

Perfect lesson, thank you Professor!
Greetings from Italy!

CostanzoPadovano

I would really appreciate it if you explained your lines of code and how they work. Otherwise, I am not learning why I'm typing what I'm typing, or why it's necessary, etc.

I just watched another series by a different guy who thoroughly explained every line of code he wrote in R and I walked away with a much better understanding of why each line of code is being typed or why certain arguments were used.

delilahjones

I am new to your profile. I was interested to learn about drug discovery, and I have recently become interested in bioinformatics. Sadly, I think I chose the wrong course: I am a biomedical science student in my second year, and now I can see that I should have gone the bioinformatics way! It was a very straightforward lesson, Professor! Thank you so much

avoniadevile

This was a useful, informative video! It was straightforward and very interesting to learn about. You've inspired me to pursue a bioinformatics program, and I plan on starting my own project using the same tools demonstrated. Thanks, Data Professor!

raedhanoon

Yess! Finally! The niche teach professor!

mr.harambae

You are so amazing, Data Professor. Thanks for always sharing your expertise and knowledge

alvinmodales

This is the only condensed resource, thanks!

onkarkumbhar

I followed you here from the Data rockie channel. Data science for bioinformatics, very much so! I will be waiting to buy it 👍

flowstateofmind