Tablesaw: The EASIEST way to process tables in Java (Data Frames / Pandas like)

preview_player
Показать описание
The world runs on CSV files so it's always great to have multiple methods for processing them in your toolbelt. In this video, I'm going to talk about a library called tablesaw that provides an ergonomic interface for processing tabular datasets. It's so ergonomic that it's quite usable in interactive mode -- which is huge for a language that is as verbose as Java. With tablesaw you can perform most of the operations that you would typically use an SQL database for - sort, aggregate, query/filter, group by - but on an in memory dataset and without the need for a separate query language.

While not necessary, if you enjoyed this video, buying me a coffee is greatly appreciated!

00:00 Intro
01:21 Reading a CSV File
01:29 Pretty Printing
01:56 Inspecting the Table's structure
02:22 Removing Columns
02:42 Saving Tables to a CSV File
03:39 Mapping and Selection Methods
04:06 Example: Mapping Methods
04:51 Example: Selection Methods
05:50 Converting a String Column to a Long Column
08:25 Extracting a YearMonth column from the Date Column
09:10 countBy
09:38 isGreaterThan
10:25 Using and() to combine selection methods
10:54 summarize(), AggregateFunctions and (group) by() example

Tablesaw Home Page and GIT Repo:

Jupyter kernel I used:

Icon Credits:
Рекомендации по теме
Комментарии
Автор

Hi! I discovered that library some few months back.As you said is a pandas-like library for Java. I wanna see more videos about Tablesaw! Your explanation and demo are great! I didn't know to set up jupiter and Java. Thanks!!!

leonardonam
Автор

Great video. Gonna have to give tables away a try soon,

TheNeoanomally
Автор

Hi, how are you running Java in JupyterLab. How did you install the Java Kernel in Jupyterlab environment.

picobarco
Автор

nice, do you make videos on multithreading ?

abhinavbatta
visit shbcf.ru