From the course: Advanced RAG Applications with Vector Databases
Unlock the full course today
Join today to access over 24,700 courses taught by industry experts.
Introduction to embeddings
From the course: Advanced RAG Applications with Vector Databases
Introduction to embeddings
- [Instructor] Now that we've wrapped up chunking, let's cover embeddings. Before vector embeddings, we didn't have a way to compare unstructured data. With embedding models, we do. Embedding models are machine learning models, almost always deep neural networks, that turn your text, images, videos, audio, whatever kind of data you have into vectors or vector embeddings. Vectors are the tools we use to quantitatively compare unstructured data. Remember that it's important to use the correct embedding models to embed whatever data you have. In most contexts, that refers to embedding models trained on your data type. For example, using ResNet50 for image embeddings, using Sentence Transformers for your text, or using Whisper for your audio. In this context, we are primarily concerned with embedding text. The rise in popularity of large language models late in 2022 and all of 2023 showed us that text is one of the most important mediums for AI to work with. As such, there are now…
Contents
-
-
-
(Locked)
Introduction to preprocessing for RAG4m 57s
-
Chunking considerations5m 12s
-
(Locked)
Chunking examples4m 32s
-
(Locked)
Introduction to embeddings9m 50s
-
(Locked)
Embedding examples2m 57s
-
(Locked)
Metadata3m 12s
-
(Locked)
Demo: Chunking2m 32s
-
(Locked)
Demo: Metadata1m 23s
-
(Locked)
Demo: Embed and store2m
-
(Locked)
Demo: Querying1m 8s
-
(Locked)
Demo: Adding the LLM2m 1s
-
(Locked)
Challenge: Cite your document sources47s
-
(Locked)
Solution: Cite your document sources59s
-
(Locked)
Challenge: Change the chunk size44s
-
(Locked)
Solution: Change the chunk size55s
-
(Locked)
-
-
-