One of the hardest things about building AI systems is that the things humans care about (words, sentences, images, ideas, etc) aren’t naturally something a computer can do math on. A computer doesn’t inherently know that “happy” and “joyful” are similar, or that a photo of a dog and the word “dog” are related. It just sees raw data.
Embeddings are the solution to that problem.