cv Download CV

Basics

Name Andrei Kanavalau
Title PhD Candidate, Electrical Engineering
Expected graduation: June 2026
Email askanavalau@gmail.com
Url https://kanavalau.com
Summary PhD candidate in Electrical Engineering at Stanford working across LLMs/Transformers, constrained optimization, and control.

Work

  • Graduate Researcher (PhD)
    2020.09 - Present
    Stanford University (Lall Group)
    Research at the intersection of machine learning, optimization, and control.
    • Developed TaperNorm, a gated normalization layer that tapers to a foldable affine map, enabling removal of per-token normalization inside Transformer blocks and faster inference after weight folding (up to 1.22× throughput on an H100 microbenchmark).
    • Learned explicit MPC policies with hard closed-loop safety constraints by training neural network controllers via an augmented Lagrangian / primal-dual loop; achieved constant-time policy evaluation (~0.035 ms worst case) with zero safety violations in simulation test sets.
    • Developed and analyzed a primal-dual Adam-style method for equality-constrained optimization used for constrained neural network training, clarifying how step sizes interact with constraint geometry.
  • Member of Technical Staff Intern (Research)
    2025.06 - 2025.09
    Inflection AI
    Owned an end-to-end research project testing whether a tokenizer-free Byte-Latent Transformer (BLT) can be adapted to new languages by retraining only the ~4% "interface" modules while keeping the latent transformer fixed.
    • Fine-tuned BLT by training a new entropy model and updating the local encoder/decoder while freezing the latent transformer (and hash n-gram embeddings), training ~4% of parameters.
    • Built training and evaluation pipeline (FineWeb-2) and benchmarked on Belebele; analyzed entropy-threshold patching as an explicit compute/quality knob.
    • Implemented character-preserving patch boundaries (UTF-8 safe) and studied patch-count efficiency vs. accuracy.
  • Computational Lithography Intern
    2024.06 - 2024.09
    TSMC
    Injected domain expertise into deep learning models (CNNs) via invariance constraints with exact-fit requirements on critical samples.
    • Developed a training method to enforce expert invariances by requiring exact fit on a designated subset of rare-but-critical samples while maintaining performance on the full distribution.
    • Built custom training and evaluation tooling to measure constraint satisfaction and robustness across regimes.
  • Advanced Development Intern
    2023.06 - 2023.09
    KLA Corporation (FaST Division)
    Inverse problem tooling and algorithm optimization for semiconductor metrology.
    • Developed a MATLAB GUI prototype for the Axion tool to run a new inverse algorithm and integrate it into an existing workflow.
    • Improved algorithm accuracy by 2× and reduced runtime by 3× by optimizing signal processing and regression components.
  • Systems Analytics Engineer Intern
    2022.06 - 2022.09
    Applied Materials (AIx Team)
    Fault detection and analytics for semiconductor manufacturing using process data.
    • Analyzed process traces to translate qualitative failure modes into concrete definitions and measurable features.
    • Engineered fault detection algorithms for an advanced data analytics product and contributed to pipeline development.

Education

Teaching

  • Teaching Assistant — EE263/CME263: Matrix Methods
    2025.09 - 2025.12
    Stanford University
    TA for a core linear algebra course focused on SVD, least squares, and matrix methods used throughout ML/control.
    • Held regular office hours and helped students build intuition for SVD, conditioning, and least squares through worked examples.
    • Delivered a number of lectures to ~80 students, including prepared slides and in-class problem solving.
    • Designed and ran midterm and final exams (writing problems, coordinating logistics, and grading).

Undergraduate Research

  • Summer Undergraduate Research Fellow
    2018.06 - 2018.09
    California Institute of Technology
    Numerical simulations studying interactions between acoustics and laminar flames.
  • Undergraduate Researcher
    2017.06 - 2017.09
    University of Cambridge
    Identification of metal-organic frameworks with desired structures in the Cambridge Structural Database.

Skills

LLMs & Deep Learning
LLMs (transformers, custom architectures and optimization, training dynamics)
PyTorch + HuggingFace Transformers (experiment design, custom training loops, ablations)
Math & Modeling Foundations
Numerical linear algebra (SVD, eigenvalues, conditioning)
Probabilistic modeling (graphical models, exact and approximate inference)
Signal and information theory (Fourier/FFT, filtering, sampling, entropy, mutual information)
Optimization, Control & Safety
Constrained optimization (duality/KKT, LP/QP/SDP), augmented Lagrangian and primal-dual methods
Optimal control (dynamic programming, trajectory optimization, MPC, implicit vs explicit policies)
Safety/stability (reachability/HJI, control-invariant sets, closed-loop constraint enforcement)
Software & Compute
Python (PyTorch/NumPy/SciPy), CUDA
HPC (Slurm), Git
MATLAB/Simulink, ROS

Publications

Awards

  • T.R.C. Fox Prize
    2019
    University of Cambridge
    Ranked #1/60 in 4th year (Chemical Engineering Tripos).
  • Foundation Scholarship & Foundress Prize
    2017,2018,2019
    Pembroke College, University of Cambridge
    Awarded annually for First Class Honours (Years 2–4).
  • BP Prize
    2017
    University of Cambridge
    Top 5 ranking in 2nd year examinations.
  • College Scholarship
    2016
    Pembroke College, University of Cambridge
    First Class Honours in 1st year examinations.
  • Peter May Prize
    2018
    Pembroke College, University of Cambridge
    Excellence in both Tripos examinations and University-level sport.
  • Best Academic Poster & Presentation
    2019
    University of Cambridge
    Best poster and presentation at the 4th-year student conference.