ACL 2021 invited talk: Learning-to-learn through Model-based Optimization, by Prof Eric Xing

Title:

Learning-to-learn through Model-based Optimization: HPO, NAS, and Distributed Systems

Abstract:

In recent years we have seen rapid progress in developing modern NLP applications, either by building omni-purpose systems via training massive language models such as GPT-3 on big data, or by building industrial solutions for specific real-world use cases by composing pre-made modules. In both cases, a bottleneck developers often face is the effort required to determine the best way to train the model: how to tune the optimal configuration of hyper-parameters of the model(s), big or small, single or multiple; how to choose the best structure of a single large network or of a pipeline of multiple model modules; or even how to dynamically pick the best learning rate and gradient-update transmission/synchronization scheme to achieve the best “Goodput” of training on a cluster. This is a special area of meta-learning that concerns the question of “learning to learn”. However, many existing methods remain rather primitive, including random search, simple line or grid (or hyper-grid) search, and genetic algorithms, which suffer limitations in optimality, efficiency, scalability, adaptability, and the ability to leverage domain knowledge.
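
To make the contrast concrete, here is a minimal sketch of the two baselines the abstract calls primitive: grid search and random search over a single learning rate. The objective validation_score is a hypothetical stand-in for a full training-and-evaluation run, and the search range and budget are illustrative assumptions, not details from the talk.

import numpy as np

# Hypothetical objective: validation accuracy as a function of log10(lr).
# A real system would train a model and evaluate it on held-out data.
def validation_score(log_lr):
    return -(log_lr + 3.0) ** 2

# Grid search: exhaustively sweep a fixed, pre-chosen set of values.
grid = np.linspace(-6.0, 0.0, 13)
best_grid = max(grid, key=validation_score)

# Random search: spend the same trial budget on uniformly random samples.
rng = np.random.default_rng(0)
samples = rng.uniform(-6.0, 0.0, size=13)
best_random = max(samples, key=validation_score)

print(f"grid best log10(lr): {best_grid:.2f}, random best: {best_random:.2f}")

Neither baseline uses the outcomes of earlier trials to choose the next one, which is exactly the gap that model-based optimization, discussed next, is meant to fill.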

In this talk, we present a learning-to-learn methodology based on model-based optimization (MBO), which leverages machine learning models that take actions to gather information and provide recommendations to efficiently improve performance. This approach exhibits several advantages over existing alternatives: 1) it provides adaptive/elastic algorithms that improve performance online; 2) it can incorporate domain knowledge into the models for improved recommendations; and 3) it facilitates more data-efficient automatic learning-to-learn, or Auto-ML. We show applications of Auto-ML via MBO in three main tasks: hyper-parameter tuning, neural architecture search, and Goodput optimization in distributed systems. We argue that such applications can improve the productivity and performance of NLP systems across the board.
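
As an illustration of the MBO loop described above, here is a minimal sketch of model-based hyper-parameter tuning with a Gaussian-process surrogate and an expected-improvement acquisition function, one common instantiation of the idea rather than the specific method of the talk. As before, validation_score, the search range, and the budget are hypothetical assumptions.

import numpy as np
from scipy.stats import norm
from sklearn.gaussian_process import GaussianProcessRegressor
from sklearn.gaussian_process.kernels import Matern

rng = np.random.default_rng(0)

# Hypothetical objective: noisy validation accuracy vs. log10(lr).
def validation_score(log_lr):
    return -(log_lr + 3.0) ** 2 + rng.normal(scale=0.05)

candidates = np.linspace(-6.0, 0.0, 200).reshape(-1, 1)

# Seed the surrogate with a few random evaluations.
X = rng.uniform(-6.0, 0.0, size=(3, 1))
y = np.array([validation_score(x[0]) for x in X])

# The small alpha noise term keeps the fit stable under noisy observations.
gp = GaussianProcessRegressor(kernel=Matern(nu=2.5), alpha=1e-3, normalize_y=True)
for _ in range(10):
    # Model the objective, then pick the candidate that maximizes expected
    # improvement: explore uncertain regions, exploit promising ones.
    gp.fit(X, y)
    mu, sigma = gp.predict(candidates, return_std=True)
    z = (mu - y.max()) / np.maximum(sigma, 1e-9)
    ei = (mu - y.max()) * norm.cdf(z) + sigma * norm.pdf(z)
    x_next = candidates[np.argmax(ei)]
    X = np.vstack([X, x_next])
    y = np.append(y, validation_score(x_next[0]))

print(f"best log10(lr): {X[np.argmax(y), 0]:.2f}, score: {y.max():.3f}")

Each new trial is placed where the surrogate model predicts the most useful information, which is how MBO gains the data efficiency and online adaptivity argued for above.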