Skip to content
View arijitroy003's full-sized avatar

Block or report arijitroy003

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
arijitroy003/README.md

Hey there! I'm Arijit Kumar Roy πŸ‘‹

Typing SVG

Profile Views GitHub Followers GitHub Stars

πŸš€ About Me

class ArijitRoy:
    def __init__(self):
        self.name = "Arijit Kumar Roy"
        self.role = "Senior Software Engineer"
        self.company = "Red Hat"
        self.location = "Bangalore, India"
        self.experience = "7+ years"
        self.bio = "That Data guy who's good with Numbers"
        
    def current_focus(self):
        return [
            "Building self-service Data Platform Frameworks",
            "AI/ML Engineering & LLM Applications", 
            "Open Source Contributions",
            "Kubernetes & Cloud-Native Solutions"
        ]
    
    def specializes_in(self):
        return {
            "data_engineering": ["Snowflake", "dbt", "Spark", "Databricks"],
            "ai_ml": ["LangChain", "OpenAI", "Claude", "Embeddings"],
            "cloud": ["AWS", "Azure", "Kubernetes", "GitOps"], 
            "backend": ["Python", "Golang", "APIs", "Microservices"],
            "databases": ["SQL", "NoSQL", "Delta Lake", "Vector DBs"]
        }

🎯 Current Focus Areas

πŸ€– AI Engineering

  • Building MCP servers for AI tools
  • LLM application development
  • Agentic framework design
  • AI observability & monitoring

πŸ—οΈ Platform Engineering

  • Self-service data platforms
  • GitOps & Infrastructure as Code
  • Kubernetes operators
  • Cloud-native architectures

🌍 Open Source

  • Contributing to data tools
  • MCP protocol implementations
  • AI/ML frameworks
  • Developer productivity tools

πŸ› οΈ Tech Arsenal

Core Technologies

Python Databricks Snowflake AWS Azure GCP

Kubernetes Docker

AI/ML & Data

LangChain OpenAI Apache Spark dbt

Programming & Frameworks

Go JavaScript TypeScript Node.js React Angular Django React Native MongoDB DynamoDB Lambda


πŸ’Ό Professional Journey

πŸ”΄ Red Hat (Current)

Senior Software Engineer | Data & AI Platform
Apr 2024 - Present

  • πŸ—οΈ Building self-service Data Platform Framework (future open-source)
  • πŸ€– Metadata Completeness for Agentic frameworks & MCP servers
  • πŸ”§ Tech Stack: GitOps, Snowflake, dbt-Core, AWS, Golang, K8s

🐝 BEEM

Senior Data Engineer | FinTech
Nov 2023 - Mar 2024

  • πŸ’° AI-First Fintech for US working class (50M+ users)
  • 🧠 LLM-powered Personal Finance Management tools
  • πŸ’Έ Secured $16k funding from Databricks with $24k commitments
  • πŸ”§ Tech Stack: AWS, Python, Databricks, Mixpanel, Metabase, LLMs

🏒 Tata Digital

Senior Software Engineer | E-commerce
Aug 2021 - Oct 2023

  • πŸ›’ Conversational AI & Big Data for 120M+ users
  • πŸ€– Built Generative AI Product Search & Recommendation Engine
  • ⚑ Optimized operations: 75% latency reduction, 80% cost savings
  • πŸ”§ Tech Stack: Azure OpenAI, Mistral, LangChain, Embeddings, Vector DBs (Chroma/Milvus), Python, Spark, Databricks, Delta Lake, ADLS, Azure DevOps, KQL/SQL

πŸ§ͺ Gnosis Lab

Founding Engineer | AI Automation
Jun 2019 - May 2021

  • πŸ€– Built AI bot for social media marketing automation
  • πŸ—οΈ MEAN Stack developer with 50+ APIs
  • πŸ“š Learning Management System Platform development
  • πŸ”§ Tech Stack: Python, AWS, Serverless, Web Scraping, TypeScript, Angular, Lambda, DynamoDB, MongoDB, Docker

πŸ“Š GitHub Analytics

🌐 Public Contributions

Public Contribution Graph

🏒 Work & Private Contributions

Red Hat GitLab Contributions Graph

πŸ“Š Weekly Development Breakdown

Python       12 hrs 45 mins  β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘   48.5%
Go           5 hrs 23 mins   β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘   20.4%
SQL          3 hrs 12 mins   β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘   12.2%
YAML         2 hrs 8 mins    β–ˆβ–ˆβ–ˆβ–ˆβ–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘   8.1%
JavaScript   1 hr 45 mins    β–ˆβ–ˆβ–ˆβ–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘   6.7%
Other        1 hr 5 mins     β–ˆβ–ˆβ–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘   4.1%

🌐 Let's Connect!

LinkedIn GitHub Email


πŸ’­ Random Dev Quote

Quote


Pinned Loading

  1. arijitroy003.github.io arijitroy003.github.io Public

    Personal portfolio website showcasing my work in Data Engineering, AI/ML, and Software Development

    JavaScript

  2. datadiff datadiff Public

    High-performance CLI tool for semantic diffing of tabular data (CSV, Excel, Parquet, JSON) with Git integration

    Rust

  3. linkedin-mcp-server linkedin-mcp-server Public

    LinkedIn automation MCP server wrapping the unofficial linkedin-api

    Python

  4. snap-a-miro snap-a-miro Public

    Convert whiteboard photos into interactive Miro boards using AI vision analysis

    JavaScript

  5. duckdb duckdb Public

    Forked from duckdb/duckdb

    DuckDB is an analytical in-process SQL database management system

    C++

  6. genai-toolbox genai-toolbox Public

    Forked from googleapis/mcp-toolbox

    MCP Toolbox for Databases is an open source MCP server for databases.

    Go