Liangjun (Lance) Song - Senior ML Engineer CV

Tip: use Print - Save as PDF with Paper A4, Margins None, Scale 100%, and Background graphics ON.

CONTACT

TEL0432 203 278

LOCMelbourne, VIC, Australia

Status:Australian Permanent Resident

PROFILE

Senior ML Engineer with a PhD specialising in search and recommendation systems (RMIT). 7+ years designing, building, and operating production-grade ML systems — search ranking, recommendation, content understanding, and LLM/agentic applications — across the full model lifecycle: feature pipelines, offline evaluation, A/B testing, deployment, and monitoring. Ships LLM systems with governance built in: guardrails, PII filtering, human-in-the-loop oversight, and cost observability. Research published in the VLDB Journal, DASFAA, and ADC.

EDUCATION

PhD — Search & Recommendation Systems

RMIT University

2015 – 2020

B.Sc. in Computer Science

Harbin Institute of Technology

2009 – 2013

SKILLS

Production ML Systems:Search, recommendation, content classification, feature pipelines, offline evaluation, model lifecycle management, monitoring & observability

Retrieval & LLM Products:LangGraph, LangChain, RAG architecture, Azure OpenAI, vector search, guardrails

Infrastructure:Docker, GitHub Actions, dev containers, readiness checks, cloud deployment, production debugging

Languages:Python (Expert), SQL, TypeScript, C++

Backend Services:FastAPI, Pydantic, PostgreSQL, AsyncIO, REST APIs

ML Frameworks:PyTorch, TensorFlow, Keras

Cloud & MLOps:Azure, GCP Vertex AI, Vercel, Supabase, CI/CD

Data Science Libraries:NumPy, Pandas, SciPy, Matplotlib

WORK EXPERIENCE

WiseTech Global

Apr 2025 – Jun 2026

AI Engineer - AI/ML Group

Architected and optimized end-to-end LLM systems for two production support products — CWBot, a support chatbot, and TriageAgent, an automated ticket-triage agent — combining LangGraph orchestration, FastAPI services, Azure OpenAI, enterprise APIs, and durable PostgreSQL-backed state.
Designed RAG workflows with dynamic question rewriting, custom retriever factories, Azure AI Search integration, and heterogeneous internal data sources to improve how users find and act on operational knowledge.
Implemented guardrails for PII filtering, response steering, and human-in-the-loop review, plus checkpointing and recovery patterns for reliable multi-turn product experiences.
Led container-first CI/CD modernization for AI services, establishing Docker dev containers, GitHub Actions pipelines, readiness checks, and environment parity across local and deployed workflows.
Instrumented the production layer around the model — request tracing, structured logging, and token/cost accounting — to keep behavior and spend observable after deployment.
Maintained test coverage across backend, frontend, and system test sets, incorporating linting, type checks, quality gates, and release-readiness checks.

SGLang Framework

Feb 2025 – Present

Open Source Contributor

Contributing core optimizations to the SGLang framework (LMSYS / UC Berkeley) to accelerate LLM inference pipelines for state-of-the-art models including DeepSeek R1, Llama 3, and Qwen.
Working on backend runtime, distributed serving, and frontend prompt-language features to improve throughput, contextual routing, and controllability.
Leveraging AI coding agents (Claude Code) to navigate the large codebase, draft PRs, and review diffs, demonstrating an effective human-in-the-loop open-source contribution workflow.

Australia IT Group · AI Engineer Bootcamp

Feb 2026 – May 2026

Guest Instructor & Curriculum Designer — RAG, Multi-Agent Systems, Fine-tuning

Delivered 12+ sessions on production RAG, embeddings & vector search, multi-agent systems (LangGraph, AutoGen, CrewAI), QLoRA fine-tuning, RAGAS evaluation, and applied AI (UI, PDF parsing, Text-to-SQL); all notebooks and slides committed to the cohort repo.
Designed Dispatch.AI, a 6-week hands-on course project in which 13 students built a production AI booking assistant incrementally — Pydantic state + Redis persistence, LangGraph agents, MCP tool server, multi-agent routing, NeMo Guardrails, RAGAS evaluation, and Docker + Render capstone; delivered the reference implementation, scaffold, and CI/CD grading pipeline.

Redbubble

Mar 2023 – Jan 2025

Data Scientist - Search & Recommendation Team

Led search and recommendation enhancements with Marqo vector search and GCP Vertex AI MLOps pipelines, improving add-to-cart rate by 0.5% and CTR by 10%.
Owned production ML workflows end to end, from analysis and offline evaluation through experiment design, deployment coordination, and post-launch metric review.
Designed ML/data infrastructure processing 100M+ user events for feature extraction, search relevance analysis, and downstream serving workflows.
Drove GA4 analytics migration to ensure data reliability across internal dashboard-driven A/B testing frameworks.
Optimized search relevance for long-tail queries, balancing retrieval quality, user behavior signals, and measurable business impact.

Redbubble

Jan 2021 – Mar 2023

Data Scientist - Content & Discovery

Shipped production content-classification and moderation systems using language-image models, reducing intellectual-property moderation risk and improving operational throughput.
Built image duplicate-detection and data-quality/anomaly-detection pipelines, and ran content tagging, taxonomy and SEO experiments to improve product discovery.

SELECTED ML / AI SYSTEMS

Termly (GitHub)

2026

Creator & Lead Developer — forward-deployed onsite with healthcare customers

Built a document AI system for Australian medical contract automation: scanned PDF extraction, OCR correction, structured clause/entity extraction, validation, and risk analysis.
Implemented FastAPI/Pydantic services integrating Claude Sonnet, Azure Document Intelligence, Tesseract OCR, Docker, and healthcare-specific validation logic.

CallForMe (GitHub collaboration)

2026

Co-Creator & Lead Developer

Collaborating with a partner developer on an AI phone assistant MVP — call records, transcripts and summaries, AI risk analysis, and interactive hold-for-me call flows — built on Next.js/TypeScript with WebSocket real-time services.

Ultra OT (GitHub)

2026

Creator & Lead Developer — forward-deployed onsite with an occupational therapy practice

Built the practice's internal operations system onsite, iterating directly with the team: case flow, Jira-style tickets, calendar scheduling, geospatial clustering of clients by driving distance (Google Routes API) for visit-route planning, intake imports, and time/finance visibility (Next.js, TypeScript, Supabase).

Agentic Workbench (Remote Agent Workbench + AgentForge + BranchFlow · GitHub)

2026

Creator & Lead Developer

Built a multi-agent coding-agent collaboration system: Remote Agent Workbench (browser orchestration for Claude Code and Codex with live PTY terminal streaming, task persistence, and allowed-directory safety boundaries) and AgentForge (control plane running coding agents in parallel across isolated git worktrees, with PM2 management and auto-commit scheduling).
Built BranchFlow, a React/TypeScript nonlinear writing and prompt-orchestration workbench with branching story state, React Flow mind maps, and Obsidian Canvas export.

Voice AI Companions (RageZone · Elder Companion · GitHub)

2026

Creator & Prototype Builder

Built two voice-first AI companions: Elder Companion, an Expo voice app with ElevenLabs speech, a family dashboard, and a daily summary workflow; and RageZone, a WeChat Mini Program real-time communication assistant with Deepgram speech recognition, plus a corpus and evaluation scripts for conversation quality.

RESEARCH EXPERIENCE

Microsoft Research Asia

Jul 2012 – Jun 2013

Research Intern - Web Search and Data Mining Group

Developed Autosub, a collaborative project for generating video subtitles using advanced speech-to-text APIs.
Contributed to web page mining projects using data mining and time-series analysis techniques.

PUBLICATIONS

Towards Efficient Personalized Ranking

PhD Thesis, RMIT University

2020

Incremental Preference Adjustment: a Graph Theoretical Approach

The VLDB Journal (Core Rank A*)

2020

Continuous Summarization over Microblog Threads

DASFAA (Best Student Paper Award Runner-Up)

2017