Ali Ansari

Software Engineer & ML Researcher · MSc Advanced Computing Science @ University of East Anglia

Building at the intersection of production systems engineering and applied AI research. 3+ years delivering high-availability infrastructure in FinTech, now combining that with active ML research, from CUDA kernel benchmarking to rail intelligence systems powered by LLMs and real-time data pipelines.

Technical Expertise

🤖

Machine Learning & AI

  • LLMs, NLP & Prompt Engineering
  • CUDA / GPU Programming
  • Scikit-learn & Random Forest
  • pgvector & RAG Pipelines
  • AI Evaluation Frameworks
  • Claude Code (Certified)
🏗️

Infrastructure & Cloud

  • Kubernetes (EKS, bare-metal, Cilium)
  • Docker & Helm
  • AWS (EC2, EKS, S3, IAM)
  • VMware vSphere (vSAN, vMotion)
  • Oracle Linux
  • Linux (12+ years daily use)
⚙️

Automation & DevOps

  • Terraform & Ansible (IaC)
  • ArgoCD & GitLab CI/CD
  • Python, TypeScript & Bash
  • pytest & Playwright
  • C++ & OpenMP
  • CIS Security Hardening
📊

Data & Databases

  • Oracle Database (admin & tuning)
  • PostgreSQL (production scale)
  • Kafka & Debezium (CDC)
  • Prometheus & Grafana
  • ELK Stack
  • Redis

Projects

RailSense

Rail + AI

Agentic NLP and ML platform for the rail domain. Routes queries across three integrated capabilities: live ticket search via the National Rail OJP SOAP API, a Random Forest delay prediction engine, and an LLM-powered contingency advisor for station staff backed by a pgvector RAG layer. Fully containerised with Docker Compose including voice interface support.

Python LLM + RAG Random Forest pgvector Streamlit Docker
View on GitHub ↗

LLM Evaluation Framework

NLP Research

LLM-as-a-Judge evaluation system built with Dr. F. F. Liza at UEA. Automated Python pipelines ingest and process large volumes of unstructured text, then use a generative model to systematically score other LLMs for reasoning capability, logic consistency, and semantic accuracy, providing a rigorous framework for measuring AI output quality.

Python LLMs NLP Evaluation Data Pipelines

AI vs Human: CUDA Kernels

MSc Dissertation

MSc dissertation supervised by Prof. Stephen Laycock, benchmarking state-of-the-art generative AI models against expert human engineers on low-level, hardware-aware CUDA kernel development. Designing rigorous experimental frameworks to evaluate execution efficiency, memory throughput, and code correctness across a range of GPU computational workloads.

CUDA C++ LLMs Benchmarking GPU Programming

English PDF Analyzer

Application

PDF analysis tool with CEFR level classification, vocabulary highlighting, and glossary generation for language learners. Fully deployed and publicly accessible.

Python NLP Docker Nginx
View on GitHub ↗

Get In Touch

Finishing my MSc at UEA and actively looking for Software Engineering, ML Engineering, or Systems roles in London from September 2026. Always happy to talk about interesting technical problems.