Shaojie Jiang ☕️

Machine Learning Scientist

Huawei R&D Amsterdam

I am a Senior Machine Learning Scientist at Huawei R&D Center Amsterdam. I work on training Huawei’s ChatGPT models and am responsible for reinforcement learning algorithm development and model training. More broadly, my work involves chatbots, information retrieval, and conversational QA systems. In parallel, I am completing my PhD at the University of Amsterdam under the supervision of Prof. Maarten de Rijke. Previously, I completed internships at Replika.ai and Amazon.com.

Download my CV in PDF

Skills

Python

90%

PyTorch

90%

Git

90%

Bash

70%

Photography

60%

Rust

50%

Unity

10%

Experience

Senior Machine Learning Scientist
Huawei R&D Amsterdam
July 2022 – Present Amsterdam, the Netherlands

Responsibilities include:

  • Reward model and PPO algorithm development and model training (core of the ChatGPT model)
  • Designer and main contributor (70% of commits) of Chair, a Python research toolkit that can train SFT, RM, and PPO models with a single command
  • Lead of the ChatEval challenge project at DSTC 11
  • Contact person for the Conversational AI project
  • Contact person for DReaMS Lab work package 3

Research Intern
Replika
February 2021 – July 2021 Amsterdam, the Netherlands
This was a remote internship due to COVID-19. During this period, I worked on improving the engagingness of the Replika chatbot. It was a great experience to get my hands on a chatbot system that has real users (as opposed to the lab environment of my day-to-day research).

Applied Science Intern
Amazon
July 2020 – September 2020 Amsterdam, the Netherlands
This was a remote internship because the COVID-19 pandemic prevented my travel to Berlin, the originally agreed location. During this period, I worked on a review summarisation task with the Subjective NLP team (based in Barcelona and Berlin).

Poster presentation at TheWebConf
May 2019 – May 2019 San Francisco, California
  • Presented my poster for our paper accepted at TheWebConf ’19
  • Thanks to all the coauthors
  • Superb coastal views
  • The Golden Gate Bridge is glorious

Poster presentation at SCAI Workshop
October 2018 – November 2018 Brussels, Belgium
  • Presented my poster for our paper accepted at SCAI Workshop
  • Awarded student travel grant (€400)
  • A nice city where Dutch, French, and other architectural styles blend together

NLP Summit
Google
June 2019 – June 2019 Zurich, Switzerland
  • Thanks to Google for paying for everything
  • Great opportunity to communicate with peers and Googlers
  • Presented my WWW ’19 poster
  • Nice crystal-clear water everywhere!

PhD student
University of Amsterdam
October 2017 – Present Amsterdam, the Netherlands

Master of Engineering
Northwest A&F University
September 2014 – June 2017 Yangling, China
  • Thesis title: Research on Feature Representation and Optimization Methods in Structured Object Tracking
  • Supervisor: Jifeng Ning

Bachelor of Engineering
Northwest A&F University
September 2010 – June 2014 Yangling, China
  • Thesis title: Implementation of Single Image Haze Removal Using Dark Channel Prior
  • Advisors: Dr. Yaojun Geng and Prof. Jifeng Ning

Recent Publications

(2023). Weakly Supervised Turn-level Engagingness Evaluator for Dialogues. Proceedings of the 2023 Conference on Human Information Interaction and Retrieval.

(2022). A Simple Contrastive Learning Objective for Alleviating Neural Text Degeneration. arXiv preprint arXiv:2205.02517.

(2020). TLDR: Token Loss Dynamic Reweighting for Reducing Repetitive Utterance Generation. arXiv preprint arXiv:2003.11963.

(2019). Improving Neural Response Diversity with Frequency-Aware Cross-Entropy Loss. The World Wide Web Conference.

(2018). Why are Sequence-to-Sequence Models So Dull? Understanding the Low-Diversity Problem of Chatbots. Proceedings of the 2018 EMNLP Workshop SCAI: The 2nd International Workshop on Search-Oriented Conversational AI.

Upcoming Events

If you’re interested in AI, Web3, or game development, join me at these events around Amsterdam!
