From the course: Large Language Models: Text Classification for NLP using BERT
Unlock the full course today
Join today to access over 24,800 courses taught by industry experts.
The Datasets library - Python Tutorial
From the course: Large Language Models: Text Classification for NLP using BERT
The Datasets library
- As we're going to be training a model in the next few videos, we want to change the run time to use a hardware accelerator, like a GPU. So head over to run time, change run time type, and select GPU from the hardware accelerator and save. As I'm going to be using the same notebook over several videos, I won't be running all the cells at the start but I'm going to go through them individually. So I can run the cells by clicking on the play icon on the left, or I can go ahead and just hit Shift and Enter. So we've just gone and installed two libraries created by the Hugging Face team. That's the transformers and the dataset libraries. Now, datasets is a library for easily accessing, well, datasets. So, these could be NLP tasks or computer vision or audio. We can go ahead and list some of the datasets currently available. There are instructions on the Hugging Face website about how you can contribute a dataset to this…
Practice while you learn with exercise files
Download the files the instructor uses to teach the course. Follow along and learn by watching, listening and practicing.