Skip to content
View nikshepsvn's full-sized avatar
🪄
creating
🪄
creating

Block or report nikshepsvn

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
nikshepsvn/README.md

Hi there, I'm Nik 👋

About Me

My passion is to accelerate human progress with the ultimate goal of creating a better world where everyone can thrive and express their creativity. Currently, I'm focused on building an innovative experience that I believe will unlock and democratize human creative potential.

I have been closely following the AI/LLM/Diffusion space since it's inception (RealmPlay, my first "company" was built on a fine-tuned, block-merged version of Llama1, productionized by building an API on top of exllama, a framework meant for fast inference using consumer GPUs. I used vast.ai, renting multiple 3090's to serve production traffic, using a Digital Ocean Droplet and nginx as a multi-region load balancer -- this was well before projects like vLLM and other serving frameworks existed. I scaled the context from 4K to 16K with SuperHOT the day kaiokendev had the breakthrough of discovering RoPE scaling, which is now commonly used to extend context).

Funfact: I actually started building RealmPlay before Llama1 and was about to call it quits since all the models prior to it (GPT-J, Pythia, GPT-NeoX-20B etc.) lacked coherency -- just about as I was about to call it quits, the OSS LLM lords at Meta saved the day with the Llama release!

Some other projects I've explored in the space include:

  1. SoulBazaar - an LLM fine-tune community and marketplace powered by LoRAX
  2. ThumbGen - a YouTube Thumbnail Design Copilot using tool-calling & diffusion models
  3. Misc hacking (blog writer agent, video understanding via frame-extraction etc.)

While my professional background is largely in the Data / Infrastructure / FullStack space, my personal experiences, along with my consistent interest (following /r/LocalLLama, /r/StableDiffusion and X religiously) have led to me building an extremely strong foundation in AI engineering -- with a deep understanding of multimodal models (ssm + transformers + diffusion), fine-tuning, syntethic data generation, vector databases / RAG, prompt engineering, agentic frameworks, structured output, generative UI and other subject matter.

I love staying up to date with the cutting edge developments in the space and would love to chat about related topics -- feel free to reach out for a chat at either [email protected] or [email protected]

Visitors

Pinned Loading

  1. ScatterFly ScatterFly Public

    An attempt to improve user privacy by intelligent data obfuscation.

    Python 49 6

  2. instacart-delivery-availability-check instacart-delivery-availability-check Public

    Tiny python script that check's instacart's delivery availability and notifies you if a slot opens up.

    Python 21 10

  3. ShatterDrive ShatterDrive Public

    An open source implementation of an Interplanetary File System.

    Python 3

  4. Giftory Giftory Public

    Convert any piece of text into an entertaining GIF movie!

    JavaScript 3 1

  5. GDAX-Price-Fetch GDAX-Price-Fetch Public

    API that let's you immediately get the price that you can buy/sell large amounts of crypto for.

    JavaScript

  6. Pyclic Pyclic Public

    Cyclic JSON chain checker for Python

    Python