Python Tutorial - Data extraction from raw text

preview_player
Показать описание

This tutorial focuses on very basic yet powerful operations in Python, to extract meaningful information from junk data.

The overall video is covers these 4 points.
1. Basic string operations for data extraction
2. How to open a text file
3. How to read rows line by line
4. Data extraction from junk

Feel free to write to me with suggestions and feedback. Stay connected!
Рекомендации по теме
Комментарии
Автор

Here, Wagner, from Brazil. Thank you very much!!!

wagnerbarbosa
Автор

Thank you for this video - I have been looking for a simple explanation about working with raw txt files and this is EXACTLY what I needed. Thanks again

SilveringOfRose
Автор

Thank you for your video, but can you please tell me how you can extract data like this from every second or third line ?

iKSDE
Автор

Thank you, how to extract specific IP host 10.147. From text file dynamically

yebetesebchewata
Автор

Python 2.7? Why sir why?
Given the multitude of string manipulation functions from 3.6 and even more from 3.9 onward this video is obsolete. You should consider making an update.

SensiStarToaster
Автор

the type of document am working with is utf-8 and unfortunately it prints the whole data even after am telling it to print specific line, works good in usual encoding though idk why

studiospan
Автор

hi sir can please explain how to extract particular name in a paragraph with out mention the name generally i want that name like the paragraph contains company same like there 3 paragraphs each paragraph as different company names how to extract those with same logic at a time

manikrishnajenula
Автор

Sir,
I have a text file which contains data as key value pair like every line a python dictionary. how can I make a data frame using this txt file ?

My text file have content like this-
{"Id":123, "Name":"xyz"}
{"Id":124, "Name":"abc"}
{"Id":125, "Name":"kaggle"}

My desired output-
Id Name
123 xyz
124 abc
125 Kaggle

Please help me how can I achieve this output from above mentioned text file.

VipinKumar-ugei
Автор

Thank you
This is what I was searching for 2 hours lol

cydev
Автор

Thanks in advance


if data is unorganized then you want to extract data like below and also extract Received from, Time, Subject


so i want extract "murder ([unix socket])
" before 'by' and after 'Received: from' in first two lines so can you please explain how?






"Received: from murder ([unix socket])
by mail.umich.edu (Cyrus v2.2.12) with LMTPA;
Sat, 05 Jan 2008 09:14:16 -0500
Received: from holes.mr.itd.umich.edu (holes.mr.itd.umich.edu [141.211.14.79])
by flawless.mail.umich.edu () with ESMTP id m05EEFR1013674;
Sat, 5 Jan 2008 09:14:15 -0500
Received: FROM paploo.uhi.ac.uk (app1.prod.collab.uhi.ac.uk [194.35.219.184])
BY holes.mr.itd.umich.edu ID 477F90B0.2DB2F.12494 ;
5 Jan 2008 09:14:10 -0500
Received: from paploo.uhi.ac.uk (localhost [127.0.0.1])
by paploo.uhi.ac.uk (Postfix) with ESMTP id 5F919BC2F2;
Sat, 5 Jan 2008 14:10:05 +0000 (GMT)
Mime-Version: 1.0"
Subject: [sakai] svn commit: r39771 - in bundle
text/plain; charset=UTF-8
X-Content-Type-Message-Body: text/plain; charset=UTF-8
Content-Type: text/plain; charset=UTF-8
X-DSPAM-Result: Innocent



aniruddhmishra
Автор

How can I List Outer Most field from that text documents???

UniverseGames
Автор

is it possible to do text processing for multiple columns in the dataset ?

bhanupriyatham
Автор

hi, I want to print an entire line if any string in the line matches with the user input. How to do this?

jagan
Автор

What python editor used in this video and how did you run the program?

BraveBoyZs
Автор

Thanks for was very useful
I tried this code for one of my project
Outfile=open("results. txt", "W")
With open('output.txt') as fo:
For rec in fo:
Outfile.write(rec.split(' ')[1])

My result.txt is printing continuously without new need to print in each line in new
Please help me....
Thanks in advance

swapanalibhaskar
Автор

Video is good but properly visible so if u do recording by zooming the screen

creation
Автор

what does fo mean here? I know it is alias but I am not getting, why should we use alias here?

martonine
Автор

how to get input from 1000 files which are inside a folder..?

genghisda
Автор

Hi, I am trying to extract email ids from xml or text file using python version 3.8.3 in mac. Could you please help me.

anandraj
Автор

after making this changes in file how we can save it in a new text file

nourarifi