Node.js Web Crawler from Scratch | Full Tutorial

preview_player
Показать описание


Рекомендации по теме
Комментарии
Автор

Just awesome😁 you not only helped creating a crawler but also taught how to use testcases and code. Thank you soo much 🥳🥳

CodewithAbhi
Автор

that was pretty great, helped me get more familiar with the URL class and string manipulation and parsing in general, also finally got me learning testing which I had been ignoring for quite a while now.

leonss
Автор

This was great, i learnt more than just crawling the internet... Am experimenting with TDD with Jest. thanks a banch.

arnoldasiimwe
Автор

Ok. It works. It's actually amazing seeing it work (since, from my experience, most code tutorials in you tube, at some point, don't work).
I learnt some Node.js, (mostly Express) to make REST Apps (CRUD). But that was it. A server, some routes, some controllers; Sequelize to post stuff into a Postgres Data Base, and that's it.
This is another level. I just was able to follow the tutorial, but I would be lying if say I understood everything you did. Yes, you import some modules, you install some packages from npm, you tested some functions... yep. And it works, and I don't know how.
How can I learn what you do? I know you are a Backend Developer, but, (at least with Node.js), how did you learnt all that? It's awesome, it really is.

matiassomoza
Автор

Nice video, clear sound, more information and very helpful
Thank you so much for this working hard
We need more of these projects of nodejs

DC from Sudan

mspurim
Автор

Subscribed ! You're other videos seem interesting, I'm checking em out soon. Nice content 👍

hsider
Автор

I ran into an issue concerning Jest. Everything passes no issue with just 2 pages concerning sorting for the report. I get an error with any more than 2 pages, stating the output didn't match the expected. I feel this has something to do with the a, b hits function, but cannot for the life of me figure out what. The project works flawlessly in production, it only fails when trying to test with more than 2 pages with Jest. Any ideas on this?

(Edit!)
I just figured it out, for some reason it required me to put the pages in the expected variable in the exact opposite order the input variable pages were ordered and the test passed. A bug from a recent update perhaps? Either way thank you for the knowlege!

bryanarycode
Автор

Hello, Do you know any way to create a web crawling for multiple websites with different structures?

vinhngotrung
Автор

Brilliant video and your teaching style is very clear! Is this code available in a GitHub repo or Gist somewhere that I can use for reference at all? Thank you

michaelpumo
Автор

bro can this crawl all websites like complex ones like amazon or other ecommerce website pls reply bro ??

amt.rambo
Автор

Thankyou very much.😊
Watched the complete video.
Please post videos like this more🤗

miyamotomusashi
Автор

@bootdotdev, for the function getURLfromHTMl test, you are checking both the arrays are equal or not using toEqual< I'm unable to do that donno why, and the other thing is isn't giving the output array, I debugged in my case and found it to be .forEach(linkElement=>{
} this thing, bt still I tried resolving the test error multiple times using toMatch or new Set(actual), but nothing provide me with a solution....i hope there will be a reply

karthiksharma
Автор

ok so i created a spider, and i'm crawling this website, my spider goes page by page, but it's very slow, half a second for each page, why is that ?

exe.mdnght
Автор

this can be turned into a weapon hahahaha

exe.mdnght
Автор

In sortPages you are creating aHits and bHits but actually dont use them :P .. great tutorial thank you.

thomasbabinsky
Автор

bro hate working with or || operator 😆

DeveloperMan_
Автор

Did he really change a totally clear name of "input" "output" "expected" into "AcTuAl"? I'm pretty sure I have never named anything in my life "Actual" i did do "currentString" etc.

randomdamian