Skip to content
View TheLustriVA's full-sized avatar
🖥️
Coordinating Development Priorities
🖥️
Coordinating Development Priorities

Highlights

  • Pro

Block or report TheLustriVA

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
TheLustriVA/README.md

Kieran Bicheno

AI Strategy & Technology Leadership | Available for Strategic Roles

Kieran Bicheno Banner

Technical Proficiency

Kieran Bicheno combines deep expertise in AI/ML development, mass media, and data-driven infrastructure with a background in digital news leadership and economics. He has more than ten years' experience designing, deploying, and scaling workflow systems and data pipelines, with applied experience in AI and MLOps going back to 2018.

Key Technical Competencies

  • Machine Learning & AI:
    • MLOps architecture, automated model training pipelines, and continuous-integration workflows
    • Research contributions in dataset documentation (co-author of “The Pile Datasheet”)[^1]
    • Implementation of local LLM deployments and agentic coding systems (e.g., Llama.cpp, Mistral)
  • Infrastructure & DevOps:
    • Design and management of self-hosted GPU clusters (NVIDIA RTX-class hardware) on Ubuntu/Linux
    • Containerization with Docker and orchestration using docker-compose and Kubernetes
    • Secrets-management (Infisical), PostgreSQL configuration, and systemd service automation
  • Programming & Automation:
    • Advanced Python development (data pipelines, audio-processing, Google Apps Script)
    • Shell scripting and CLI tooling for ffmpeg, yt-dl integration, and workflow automation (n8n, PM2/Bun)
  • Data Engineering:
    • Large-scale economic and cosmological data ingestion, statistical analysis, and time-series forecasting
    • API design for Stable Diffusion and cost-push inflation models

Career Highlights

  • Led AI augmentation strategy and digital transformation in high-stress news environments at News Corp Australia, pioneering a Google Apps Script tool that cut a critical workflow from 2 hours to 12 minutes.
  • Co-authored the influential “Datasheet for The Pile” paper on arXiv, establishing metadata standards for large language-model datasets.[^1]
  • Architected and optimized self-hosted inference environments for LLMs, overcoming PyTorch compatibility issues with RTX 5090 GPUs.
  • Directed GenFactory.io’s industrial-scale Stable Diffusion API platform, integrating data-driven image generation at production scale.

Open-Source Contributions

GitHub profile showcases a broad range of repositories spanning AI/ML prototypes, tooling for self-hosting, data-analysis libraries, and multimedia processing utilities. Highlights include:

  • MLOps pipeline templates for training and deploying transformer models
  • Kubernetes manifests and Helm charts for GPU-accelerated inference
  • Audio-visual production scripts and video-processing workflows

Selected Publications & Projects

  • “Datasheet for The Pile” – co-author, arXiv:2201.07311
  • Independent MLOps research on autonomous systems control and workforce augmentation (2020–2021)
  • Cosmological data analysis using radio-telescope datasets (2020–2021)

Connect

Pinned Loading

  1. Croissant-TOML Croissant-TOML Public

    Crossiant tools and specifications for a more accessible TOML format

    Python

  2. efdata efdata Public

    Australian economic data integration platform - automated RBA/ABS data collection with circular flow validation

    Python

  3. ComfyUI-Image-Size-Tools ComfyUI-Image-Size-Tools Public

    A ComfyUI node for setting image sizes based on the model being used.

    Python 5 1

  4. portainer-templates-Nov-2022-collection portainer-templates-Nov-2022-collection Public

    A collection of 488 templates for Portainer v2.0 sourced from various repositories

    Python 118 18