Hi, I'm Valerie!

Final year NUS Data Science & Analytics student

I'm currently exploring the fields of data science and analytics, with hands-on experience in projects involving Retrieval-Augmented Generation (RAG), knowledge graphs, OCR, and prompt engineering.

Outside of tech, you'll probably find me playing table tennis or figuring out piano chords by ear.

If you're curious about my work or just want to connect, feel free to reach out!

Valerie Oh

Experience

Technical Experience

Data Science Intern

SynapxeJan 2025 - Present

Redesigned a clinical search engine using LLM-based entity extraction, Neo4j knowledge graph, and Azure OCR to improve ICD code retrieval accuracy.

Research Assistant

NUS Faculty of ScienceDec 2024 - Jan 2025

Developed a React-based revision platform using LLMs to generate linear algebra questions, enhancing learning for 600+ students through interactive design and data-driven insights.

Data Analyst Intern

ASTARMay 2024 - Dec 2024

Built a multimodal NLP pipeline using LLaMA, HuggingFace, and prompt engineering to extract sustainability insights, speeding up article processing by 1.5× and supporting data-driven evaluations with Tableau.

Game Data Compliance Intern

TencentJan 2024 - Apr 2024

Ensured quality and compliance of cookie implementations, conducted research on gaming industry products in international markets, and assisted with project management.

Leadership Experience

Welfare Director (JCRC)

NUS Raffles HallAY24/25

Led 8 welfare-focused committees to support residents' well-being through internal initiatives and external volunteer programs for special needs, elderly, children, and migrant workers.

Production Manager

Raffles Hall Musical ProductionAY23/24

Led the administration, publicity, and marketing efforts for an annual musical production involving 200 people, managing communications and logistics for opening, closing, and show day.

Table Tennis Captain

NUS Raffles HallAY23/24

Led a team of 20 as Table Tennis Captain to achieve 2nd place in Inter-Hall Games, while organizing regular trainings and team bonding sessions.

President, Volunteer Corps

NUS Raffles HallJan 2023 - May 2024

Led the Special Needs volunteer program, planning and executing 7 adhoc events and weekly volunteering sessions with organizations like Mindsville and Genesis School.

Featured Projects

Knowledge Graph Query Engine for SNOMED to ICD Code Mapping
Knowledge Graph Query Engine for SNOMED to ICD Code Mapping
Neo4j
GPT-4
RAG
Knowledge Graph
Python
React
A clinical search engine combining Retrieval-Augmented Generation (RAG) with Neo4j knowledge graphs to improve medical code mapping accuracy. Implemented entity extraction and semantic search to achieve a 5% boost in retrieval precision.
Drug Information OCR Pipeline
Drug Information OCR Pipeline
Azure
OCR
Document Intelligence
Python
SQL
An intelligent OCR pipeline that extracts structured drug information from scanned documents using Azure Document Intelligence, enabling efficient search and analysis of medication data.
AI-Powered Linear Algebra Platform
AI-Powered Linear Algebra Platform
React
OpenAI
LLM
Figma
Python
Analytics
An interactive web platform that revolutionized linear algebra revision for 600+ students through personalized LLM-powered question generation and comprehensive analytics.
Sustainability Journal Analysis Platform
Sustainability Journal Analysis Platform
LLaMA
HuggingFace
Python
Tableau
NLP
A comprehensive data analysis platform that processes sustainability journals using LLaMA and HuggingFace transformers, providing actionable insights through Tableau visualizations.
ED Triage Prediction App – DataRobot Deployment
ED Triage Prediction App – DataRobot Deployment
Docker
Python
Streamlit
Java
MLOps
Deployed a hybrid ED triage prediction system by containerizing a multi-language pipeline (Python + Java) and integrating it with DataRobot's platform, complete with a Streamlit frontend for real-time predictions.

Skills & Technologies

🐍

Python

📊

R

💾

SQL

📝

TypeScript

🧠

GPT-4

🤖

LLMs

🧩

RAG

🤗

HuggingFace

💬

Prompt Engineering

🔄

Neo4j

📊

Knowledge Graphs

📈

Tableau

📊

Data Analytics

🐼

Pandas

🔢

NumPy

☁️

Azure

📝

OCR

🐳

Docker

📦

Git

⚛️

React

Next.js

🎨

Figma

💨

TailwindCSS

Let's Connect!

I'm always excited to discuss data science, analytics, or just have a friendly chat about tech and innovation.