NLP

Has the AI-Era come to video games already?

With the big noises made by ChatGPT, many different industries have noticed the value of LLM technologies. Unsurprisingly, the video game industry is one of them. In this blog, I introduce several cool demos/WIPs that I’ve recently found, and share my opinions on why they might have profound influences on the future of video game industry.

Shaojie Jiang

Aug 13, 2023 8 min read Deep Learning, NLP, LLM, video games

Has the AI-Era come to video games already?

One source of LLM hallucination is exposure bias

With the release of closed-source ChatGPT, GPT-4, and open-source LLaMa models, the LLM development has seen tremendous improvements in recent months. While we are hyped with the fact that these LLMs are capable of many tasks, we have also noticed again and again that these LLMs hallucinate content.

Shaojie Jiang

Aug 9, 2023 3 min read paper reading notes, Deep Learning, NLP, LLM, hallucination, information retrieval

One source of LLM hallucination is exposure bias

Transformer Align Model

Jointly Learning to Align and Translate with Transformer Models

Shaojie Jiang

May 16, 2020 2 min read paper reading notes, Deep Learning, NLP

Compressive Transformers

Built on top of Transformer-XL, Compressive Transformer1 condenses old memories (hidden states) and stores them in the compressed memory buffer, before completely discarding them. This model is suitable for long-range sequence learning but may cause too much computational burden for tasks that only have short sequences.

Shaojie Jiang

May 12, 2020 3 min read paper reading notes, Deep Learning, NLP

Compressive Transformers

What's New in XLNet?

In this post, I will try to understand what makes XLNet better than BERT.

Shaojie Jiang

Last updated on Jul 3, 2019 4 min read NLP, Deep Learning

What's New in XLNet?