Question Answering on Tabular Data with HuggingFace Transformers Pipeline & TAPAS

In this video, I'll show you how to use the table-question-answering pipeline from HuggingFace Transformers to answer questions about a table.
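As a rough illustration of what that pipeline call can look like, here is a minimal sketch. The table contents, the helper names `to_string_table` and `ask`, and the choice of the `google/tapas-base-finetuned-wtq` checkpoint are assumptions made for this example, not necessarily what the video uses. It also assumes a transformers version that ships the table-question-answering task, with pandas and a PyTorch backend installed.

```python
def to_string_table(columns):
    """Cast every cell to str, since the TAPAS tokenizer works on cell text."""
    return {name: [str(value) for value in values] for name, values in columns.items()}


def ask(table, query, model_name="google/tapas-base-finetuned-wtq"):
    """Run one question against one table and return the raw pipeline result."""
    # Imported lazily so the table helper above works without transformers installed.
    from transformers import pipeline

    tqa = pipeline("table-question-answering", model=model_name)
    return tqa(table=table, query=query)


# Hypothetical, made-up data used only to show the expected table shape.
table = to_string_table(
    {
        "Player": ["Player A", "Player B", "Player C"],
        "Runs": [120, 95, 143],
    }
)
```

Calling `ask(table, "Which player has the most runs?")` downloads the checkpoint on first use and should return a dictionary with the keys `answer`, `coordinates`, `cells` and `aggregator`.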

The TAPAS model was proposed in "TAPAS: Weakly Supervised Table Parsing via Pre-training" by Jonathan Herzig, Paweł Krzysztof Nowak, Thomas Müller, Francesco Piccinno and Julian Martin Eisenschlos. It’s a BERT-based model specifically designed (and pre-trained) for answering questions about tabular data. Compared to BERT, TAPAS uses relative position embeddings and has 7 token types that encode tabular structure. TAPAS is pre-trained on the masked language modeling (MLM) objective on a large dataset comprising millions of tables from English Wikipedia and corresponding texts.

For question answering, TAPAS has 2 heads on top: a cell selection head and an aggregation head, for (optionally) performing aggregations (such as counting or summing) among selected cells. TAPAS has been fine-tuned on several datasets: SQA (Sequential Question Answering by Microsoft), WTQ (Wiki Table Questions by Stanford University) and WikiSQL (by Salesforce). It achieves state-of-the-art results on both SQA and WTQ, while performing comparably to the state of the art on WikiSQL, with a much simpler architecture.
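Because the aggregation head only predicts which operator to apply, the pipeline result reports the operator and the selected cells (an answer string such as `SUM > ...`) rather than the computed number. The sketch below shows one way to finish that arithmetic from the `aggregator` and `cells` fields of a result. The function name `apply_aggregation` is hypothetical, the cell values are made up, and the handling of thousands separators is an assumption about how numeric cells are written.

```python
def apply_aggregation(aggregator, cells):
    """Turn a predicted operator plus selected cells into a final answer.

    `aggregator` is expected to be one of "NONE", "SUM", "AVERAGE", "COUNT".
    `cells` is the list of selected cell strings.
    """
    if aggregator == "NONE":
        # No aggregation: the selected cells are the answer.
        return ", ".join(cells)
    if aggregator == "COUNT":
        return len(cells)
    if not cells:
        raise ValueError("cannot aggregate an empty cell selection")
    # Assumption: numeric cells may contain thousands separators like "18,426".
    values = [float(cell.replace(",", "")) for cell in cells]
    if aggregator == "SUM":
        return sum(values)
    if aggregator == "AVERAGE":
        return sum(values) / len(values)
    raise ValueError(f"unknown aggregator: {aggregator}")


# Made-up cell selections for illustration.
total = apply_aggregation("SUM", ["10", "20", "30"])
single_average = apply_aggregation("AVERAGE", ["183"])
```

With a single selected cell, `AVERAGE` simply returns that cell's value, which is one plausible reading of answers that show an aggregation prefix in front of a lone number.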

If you have any questions about what we covered in this video, feel free to ask in the comment section below & I'll do my best to answer them.

If you enjoy these tutorials & would like to support them, the easiest way is to like the video & give it a thumbs up. It's also a huge help to share these videos with anyone you think would find them useful.

Please consider clicking the SUBSCRIBE button to be notified of future videos & thank you all for watching.

#huggingface #NLP

Comments

I'm working on a natural language parser for database queries as part of a placement project. I decided to use TAPAS from HuggingFace, and what are the odds that on the day I'm about to start working you upload this amazing video that makes my life so much easier. Keep up the great work!

randomdudewithnovidz

Hi Bhavesh, I have a use case where I need to do the reverse of this, like "Update score 189 for Virat Kohli. Change his team to Australia." Which model should I use here? Instead of score one can also use runs.
Please help.

vbarai

Are there any limitations on the size of the CSV file? Say I have a CSV of 3 GB?

SmartAzan

Does it work on any database query, using a chat interface and a generic database? This one is CSV based, not a full-fledged table.

ditchtech

Is there any model that supports both German and English for table question answering?

chandrantwins

Can we take the query as user input, so a user can ask whatever specific question he wants?

buu

How to work with a table size > 512 tokens?

kunalkasodekar

Hi, this is awesome.
How do I fine-tune it to perform other tasks?

al-aminibrahim

Do we always need to pass the table for every query we ask? How does it scale for larger tables, did you give it a try?

RavikiranBhonagiri

This works for a very small data set of merely 30 rows. If I have a data set with, say, 1000 rows, it gives the error Out of Range.

adityakaran

Thanks for the video....from transformers import pipeline doesnt work when i idea whats the issue?

David-rmwn

Can anyone tell me how I get the total sum of Runs?
This is the output I am getting when I run the query
"what is the sum of Runs?"
SUM > 18426, 14234, 13704, 13430, 12650, 11867, 11739, 11579, 11363, 10889

mohandas

Brilliant.
Please make a video on hypothesis testing, chi-squared test, p-value, t-test.
Would like to hear it from you.

charmilam

Amazing video as usual. My question might sound silly, but can you please let me know why you used 'q' while installing transformers and 'f' while installing torch-scatter?

rohitjagdale

How many rows can it handle at a time?
And with what time complexity?

krisskad

SQL is far from gone. It will be there for at least the next decade and is the most important skill to become a data scientist. More important than NLP or deep learning.

sahil

Why did it show the answer for Virat Kohli's highest score as "AVERAGE > 183" and not "183" ?

akshaysarbhukan

Take AI otherwise It will take your job

shivamkumar-qpjm
