tinyML Talks: The Multilingual Spoken Words Corpus, a Massive Keyword Spotting Dataset

Uploaded By: Myvideo

Published on 21 Jan 2022


Mark Mazumder, PhD Student, Harvard University

This talk will present the Multilingual Spoken Words Corpus (MSWC), a speech dataset of over 340,000 spoken words in 50 languages, with over 23 million audio examples. MSWC has many use cases, ranging from voice-enabled consumer devices to call center automation. The dataset is CC-BY licensed and free for academic research and commercial use.

We will introduce applications of MSWC for few-shot keyword spotting and spoken term search tasks in low-resource languages, and share a brief tutorial on getting started with the dataset. We will also discuss how we automated the construction of our dataset and our self-supervised approach for detecting outlier samples.


