Writing

Recent Advancements in AI

A 2019 snapshot of major AI breakthroughs across text, image, audio, and video generation, with practical startup use cases.

Snapshot
2019
Scope
Generative AI
Areas
Text/Image/Audio/Video
Source
Medium
Generated media examples from 2019 AI advancements.
This Waifu Does Not Exist

If you are looking to build an AI startup or want to see where AI currently stands, this article might be for you.

AI needs no introduction. It is in the news all the time. In 2019, many of us were waiting for a headline like “DeepMind or OpenAI finally developed AGI.” Some people believed AGI was around the corner, and others believed it would take much longer. Who knows.

While waiting for AGI, I wanted to share a list of recent breakthroughs that were already transforming industries, especially around content generation.

The AGI Question

In 2019, a lot of people were waiting for a headline like “DeepMind or OpenAI finally developed AGI.” While that did not arrive, the field was already moving quickly across commercially useful content-generation systems.

This list is a historical snapshot, not a current survey, but it captures where practical AI felt headed in 2019.

Text Generation

I do not want to talk about text generation without mentioning OpenAI’s GPT-2. It could generate convincingly realistic text, and despite the hype and ethical debates, it opened up many commercial possibilities.

Image Generation

Images make up a large part of the internet. Designers, artists, game developers, and creators all use images to communicate and earn a livelihood, so image generation has obvious commercial value.

Pix2Pix: From Sketch to Reality

Pix2Pix sketch-to-image example
Pix2Pix transforming a sketch-like input into a generated image.

GauGAN: Turning Sketches into Landscapes

GauGAN turning simple sketches into landscape images
NVIDIA GauGAN turned rough scene sketches into photorealistic landscapes.

Audio Generation

Audio is another kind of content we consume every day, whether it is speech, podcasts, or music.

MuseNet Demo

OpenAI’s MuseNet showed how AI could compose multi-instrument music under stylistic constraints.

Video and Animation Generation

Video is the most consumed content format online, and AI was already moving into video editing, animation, and generation workflows.

That Is It?

No. This barely scratches the surface. I did not even touch game playing, virtual YouTubers, virtual models, self-driving systems, and many other areas.

The 2019 AI Revolution in Summary

From text generation that could write stories, to image synthesis creating faces that did not exist, to music composition and video style transfer, 2019 already looked like the start of a content creation shift across gaming, entertainment, fashion, photography, and media.

Thanks for reading. If you have suggestions or would add something to the list, let me know.