Skip to content
View Siddhesh2377's full-sized avatar
🪨
Eating Stones
🪨
Eating Stones

Block or report Siddhesh2377

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Siddhesh2377/README.md

Siddhesh Sonar


Tech Stack

Languages: Kotlin · Java · C++ (JNI/NDK) · Python · Rust (exploring)

Android: Jetpack Compose · AOSP · NDK · AIDL · Accessibility Services · Android KeyStore

AI/ML:

  • On-device inference: llama.cpp · ONNX Runtime · TensorFlow Lite · Whisper STT · Sherpa-ONNX
  • Model formats: GGUF · ONNX · LoRA adapters · OTA weight delivery
  • Techniques: GBNF · Multi-modal · Tool Calling · LoRA fine-tuning · Adapter switching · Model merging (TIES, DARE, SLERP)
  • Training: SFT with LoRA · On-device PEFT (experimental) · Federated learning (exploring)

GPU / NPU Systems:

  • Vulkan compute pipelines · ggml-vulkan submission model · timeline semaphores · GPU watchdog behavior
  • UMA (Unified Memory Architecture) on mobile SoCs · Heterogeneous compute (CPU + GPU + NPU scheduling)
  • GGML kernel paths: MMLA · I8MM · instruction-level runtime specialization
  • Adreno GPU scheduling · low-priority compute queues · DEVICE LOST debugging
  • Workload shape analysis: CLIP vs UNet dispatch behavior · conditional ramp tuning for mobile
  • Smart ML-op offloading across CPU / GPU / NPU

Security & Cryptography: AES-256 · RSA · Hardware-backed Android KeyStore · Secure IPC · Encrypted pipelines

Architecture: Plugin SDK design · Modular runtime systems · MVVM · Clean Architecture · Dependency Injection

DevOps: GitHub Actions (CI/CD) · Gradle KTS · JUnit · Espresso · Crashlytics · Linux


What I Build

Privacy-first, offline AI systems that run entirely on-device — no cloud, no compromise.

Not just apps. Ecosystems: plugin runtimes, inference cores, OTA adapter delivery, agent workflows, and the low-level GPU plumbing that makes it all actually run on mobile hardware.


Connect

📧 siddheshsonar2377@gmail.com 💼 LinkedIn

Open to: Startup roles · On-device AI research · SDK/tooling engineering · Cofounder conversations


Building AI that doesn't need the cloud to be intelligent.

Pinned Loading

  1. ToolNeuron ToolNeuron Public

    Complete offline AI ecosystem for Android: Chat (GGUF/LLMs), Images (Stable Diffusion 1.5), Voice (TTS/STT), and Knowledge (RAG Data-Packs), zero subscriptions, no data harvesting. Open-source priv…

    Kotlin 222 18

  2. Ai-Systems-New Ai-Systems-New Public

    C++ 1

  3. structured-prompt-builder structured-prompt-builder Public

    A lightweight, browser‑first tool for designing well‑structured AI prompts with a clean UI, live previews, a local Prompt Library, and optional Gemini‑powered prompt optimization.

    HTML 310 35

  4. Web-Plugin Web-Plugin Public

    Kotlin 1

  5. HRM HRM Public

    Enhanced Hierarchical Reasoning Model (HRM) An AI system extending HRM with scalable memory, JSON metadata, and tag-based retrieval for layered, context-aware reasoning and persistent knowledge int…

    Kotlin 10 1

  6. Canves Canves Public

    Advance UI designing Canves For Android

    Java 3 1