Skip to content
View wzdnzd's full-sized avatar

Block or report wzdnzd

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
wzdnzd/README.md

πŸ™‹ Hey! πŸ‘‹ Nice to See You

πŸ‘¨β€πŸ’» About Me

Backend engineer and AI infrastructure enthusiast focused on building highly scalable and intelligent systems. I specialize in Cloud Native Architectures, ML Engineering, and MLOps to drive robust and efficient AI-powered applications.

πŸ”§ Skills

  • Programming Languages: Python, Go, Java and more.

  • Cloud-Native & Distributed Systems: Kubernetes(CRI & CNI & CSI & Scheduler & Operator), Service Mesh(Istio & Linkerd), Container(CGroups & Namespaces & UnionFS), Ray, Spark, Distributed System Design, etc.

  • AI/ML Engineering & Platforms: RecSys, RAG, Text2SQL, NLP, MLOps, PyTorch, DeepSpeed, Triton.

🎯 Expertise

  • End-to-End ML Platform Engineering: Architecting and building production-grade, Kubernetes-native ML platforms that integrate cloud-native infrastructure (service mesh, observability, security) with complete ML lifecycle automation from distributed data processing and elastic training to model versioning and intelligent deployment strategies.

  • AI Application Development: Building and optimizing high-performance applications for search, retrieval, and recommendation, with proven implementations of RAG, Text-to-SQL, and hybrid search solutions.

🌱 Exploring

  • LLM Inference & Performance Tuning: Deep diving into runtime optimization techniques (PagedAttention, FlashAttention, 3D Parallelism, etc.) and advanced attention mechanisms to maximize throughput and reduce latency.

  • Advanced Retrieval & Recommendation: Advancing the application of RAG, generative recommendation systems, and Text2SQL to solve real-world problems.

  • AI Agent Architectures: Designing and developing autonomous agents and multi-agent systems for complex, multi-step task automation.

wzdnzd's Github Stats wzdnzd's Top Languages

Pinned Loading

  1. harvester harvester Public

    Intelligent data acquisition framework for GitHub and web sources

    Python 488 96

  2. batches batches Public

    Command Line Tools for Windows

    Batchfile 36 5

  3. bigdata-notes bigdata-notes Public

    BigData Learning Notes

    Java 51 42

  4. aggregator aggregator Public

    One-stop Proxies Crawling and Aggregation Platform

    Python 5.6k 5.1k

  5. snippets snippets Public

    some miscellaneous code

    Python 2 1

  6. leetcode leetcode Public

    LeetCode Solutions

    Java 7 2