Building an Igbo Dictionary API from Converting and Parsing a PDF File

I wanted to see if I could build an Igbo dictionary API that serves Igbo terms and definitions. Since there wasn't anything like this out on the Internet, I had to work a little harder to grab the data I needed to power the API.
In this video, I'll show you how I converted a PDF into an HTML file that contains words, word classes, definitions, phrases, and examples from the Igbo language. Then I'll show how I identified the pattern in the original PDF dictionary table and parsed that data to build a JSON object of dictionary entries.
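As a rough illustration of the parsing step, here is a minimal sketch that turns table rows into a JSON object keyed by word. It is not the code from the video: the sample HTML, the three-column order (word, word class, definition), and the names `cellText` and `parseDictionaryTable` are all assumptions made for the example, and HTML produced by a real PDF converter is usually messier and may call for a proper HTML parser instead of regular expressions.

```javascript
// Hypothetical sample input. The real converted HTML will look different.
const sampleHtml = `
<table>
  <tr><td>word-a</td><td>noun</td><td>first definition</td></tr>
  <tr><td>word-b</td><td>verb</td><td>second definition</td></tr>
  <tr><td>word-a</td><td>verb</td><td>another definition</td></tr>
</table>`;

// Strip nested tags and collapse whitespace inside a single cell.
function cellText(cellHtml) {
  return cellHtml.replace(/<[^>]*>/g, '').replace(/\s+/g, ' ').trim();
}

// Assumed column order: word, word class, definition.
function parseDictionaryTable(html) {
  const entries = new Map();
  const rowPattern = /<tr\b[^>]*>([\s\S]*?)<\/tr>/gi;
  const cellPattern = /<td\b[^>]*>([\s\S]*?)<\/td>/gi;

  for (const rowMatch of html.matchAll(rowPattern)) {
    const cells = [...rowMatch[1].matchAll(cellPattern)].map((m) => cellText(m[1]));
    // Skip rows that do not fit the assumed three-column pattern.
    if (cells.length < 3 || cells[0] === '') continue;

    const [word, wordClass, definition] = cells;
    if (!entries.has(word)) entries.set(word, []);
    entries.get(word).push({ wordClass, definition });
  }

  // Convert to a plain object so it can be serialized as JSON.
  return Object.fromEntries(entries);
}

const dictionary = parseDictionaryTable(sampleHtml);
console.log(JSON.stringify(dictionary, null, 2));
```

Keying the object by word means repeated rows for the same word end up in one array, which is one simple way to serve several word classes or definitions per term.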
Timestamps:
0:00 Introduction
0:09 What I'm Working On
0:28 Why Work on an Igbo Dictionary API?
1:27 Step 1: Find an Igbo-English Dictionary PDF
2:19 Step 2: Parsing the HTML File
2:50 Step 3: Identify Table Structure
5:01 Issues to Address in the Dictionary JSON files
5:11 Issue: Uncleaned terms were added to the dictionary
5:32 Issue: Similarly Spelled Terms were not batched
5:58 More Dictionary Files
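The two issues listed above could be approached along these lines. This is only a sketch under stated assumptions, not the fix shown in the video: it assumes "uncleaned" means stray digits, punctuation, and whitespace around a term, and it assumes "similarly spelled" means terms that differ only in tone marks (grave, acute, macron). The helpers `cleanTerm`, `groupKey`, and `batchTerms` are hypothetical names, and the sample terms are illustrative input only. Dotted letters such as ọ are deliberately left intact, since they are distinct letters rather than tone variants.

```javascript
// Remove extraction debris around a term (assumed: digits, punctuation, spaces).
// Hyphens are kept on purpose, in case the source marks roots with them.
function cleanTerm(raw) {
  return raw
    .normalize('NFC')
    .replace(/\s+/g, ' ')
    .replace(/^[\s\d.,;:]+|[\s\d.,;:]+$/g, '');
}

// Build a grouping key by dropping only tone marks:
// U+0300 (grave), U+0301 (acute), U+0304 (macron).
function groupKey(term) {
  return term
    .normalize('NFD')
    .replace(/[\u0300\u0301\u0304]/g, '')
    .normalize('NFC')
    .toLowerCase();
}

// Batch cleaned terms under their shared key, skipping duplicates and empties.
function batchTerms(rawTerms) {
  const batches = new Map();
  for (const raw of rawTerms) {
    const term = cleanTerm(raw);
    if (term === '') continue;
    const key = groupKey(term);
    if (!batches.has(key)) batches.set(key, []);
    if (!batches.get(key).includes(term)) batches.get(key).push(term);
  }
  return Object.fromEntries(batches);
}

// Illustrative input with debris around some terms.
const batched = batchTerms(['  12. ákwá ,', 'àkwà', 'ákwà;', 'àkwà', ' ... ']);
console.log(JSON.stringify(batched, null, 2));
```

Keeping the original tone-marked spellings inside each batch, rather than replacing them with the key, preserves the distinctions between the terms while still letting a lookup on the unmarked form find all of them.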
== Let's Connect ==
== Who I Am ==
I'm Ijemma, a frontend software engineer currently working in New York City. I make videos on all things JavaScript and, more broadly, frontend web development.
#javascript #webdevelopment #softwareengineer