About

I’m Jonathan Lim Siu Chi (@jonathanlimsc), sc are the romanized initials of my Chinese first name, 修齐.

I am passionate about AI and building scalable AI-driven products that impact real-world users.

On the more research side of things, I am broadly interested in improving the generalizability of ML models, looking at various AI sub-fields such as reinforcement learning, neuro-AI and multi-modal models.

In my 4.5 years of work experience, I have built:

  • Convolutional Neural Network models for tumor detection and disease classification in MRI brain scans
  • Real-time OCR pipeline based on Attention-OCR to extract text information from user identity cards, productionized with docker-compose. For monitoring model prediction distribution, I deployed the Elasticsearch-Logstash-Kibana (ELK) stack.
  • Sentiment classification of customer service chat-logs, using RoBERTa trained from scratch on in-house datasets.
  • Real-time Machine translation between >15 language pairs including English, Chinese, Thai, Vietnamese, Bahasa Indonesia, using FairSeq, deployed on GCP GPU instances, serving >200QPS per second, managed using Kubernetes.
  • Recommendation system based on Graph Neural Network (LightGCN) to recommend brands to customers based on domain browsing behaviour extracted from telco logs.