VibeBuilders.ai Logo
VibeBuilders.ai

Loop

Explore resources related to loop to help implement AI solutions for your business.

Online Reputation AI - Startup got stuck
reddit
LLM Vibe Score0
Human Vibe Score0.6
kyr0x0This week

Online Reputation AI - Startup got stuck

Hi, I‘m one of 3 co-founders of a startup that built an AI-driven SaaS and App product this year. We‘re coming from an SaaS background, two of us senior developers (in the 3% of highest earning freelancers in Germany) and expert in our fields. The third is a seasoned sales strategist. We have a minor 4th co-founder (legal advisor). The company is self-funded, no investors. Our tech is owned by us, built by us and the product was already operational after a few months. We basically solve three data science/NLP issues in a generalized way: understand customer feedback to improve your business. Analyzes online review with context and explains it with a drill down, aggregation, charts (AI insights, timeframe reports); evidence driven, agentic LLM and ETL processes drive this. respond to customer feedback, half-automated, human in the loop, but AI supported. In the tone of your brand, any language. And context-aware, with your customer support signature etc. competitor analysis. Because we do 1 for you, we can do 1. for all of your competitors and compare the results, yielding insights like „oh, this happens to everyone in November to December, so I should focus on something else“ — etc. Now, after a huge sales effort we got only one paying customer. This customer is petty happy with the product. They tell us that they use our product daily, it‘s better than all the other solutions out there (better than TrustYou, etc.) However, after cold calling/emailing hundreds of leads, we almost always hear that „what we have is good enough“. Or that they don‘t have budget. I‘m the introverted tech part of the startup. I‘m good with algorithms. Give me any tech issue and I will solve it for you quickly and efficiently. I make stuff work. But with my startups I never had commercial luck. People always tell me about my stellar potential, because I can build things almost nobody else can. I come from a poor families background, worked my way up the very hard way. I just love tech and programming. I wrote a book for O’Reilly once. I‘m not doing bad economically, but I‘m probably not the best sales person. After founding a few startups with amazing tech, people using the products and loving them, but no commercial success, I truly question myself and if I‘m just unlucky with the fact that I‘m located in Europe, targeting the wrong industries, or are just unlucky somehow? I won‘t blame my co-founders here. They definitely did the best they could. I‘m just a bit resignated. I recently thought about valuing my own lifetime more and only building software for myself anymore. Basically not focusing on what problems other people face and trying to solve them, but solely focusing on what I enjoy doing most — e.g. coding algorithms for a music visualizer. Because in the end, my time is my most valuable resource. If I waste any second on something that isn‘t contributing to „my life“ and how I define success, then it would be a rather stupid deed? I don‘t want to derail too much here. I‘m confused and seeking for advice. Burn me if you like, but please be aware that you are talking to a broadly educated nerd.

After building an AI Co-founder to solve my startup struggles, I realized we might be onto something bigger. What problems would you want YOUR AI Co-founder to solve?
reddit
LLM Vibe Score0
Human Vibe Score0
Consistent_Yak6765This week

After building an AI Co-founder to solve my startup struggles, I realized we might be onto something bigger. What problems would you want YOUR AI Co-founder to solve?

A few days ago, I shared my entrepreneurial journey and the endless loop of startup struggles I was facing. The response from the community was overwhelming, and it validated something I had stumbled upon while trying to solve my own problems. In just a matter of days, we've built out the core modules I initially used for myself, deep market research capabilities, automated outreach systems, and competitor analysis. It's surreal to see something born out of personal frustration turning into a tool that others might actually find valuable. But here's where it gets interesting (and where I need your help). While we're actively onboarding users for our alpha test, I can't shake the feeling that we're just scratching the surface. We've built what helped me, but what would help YOU? When you're lying awake at 3 AM, stressed about your startup, what tasks do you wish you could delegate to an AI co-founder who actually understands context and can take meaningful action? Of course, it's not a replacement for an actual AI cofounder, but using our prior entrepreneurial experience and conversations with other folks, we understand that OUTREACH and SALES might actually be a big problem statement we can go deeper on as it naturally helps with the following: Idea Validation - Testing your assumptions with real customers before building Pricing strategy - Understanding what the market is willing to pay Product strategy - Getting feedback on features and roadmap Actually revenue - Converting conversations into real paying customers I'm not asking you to imagine some sci-fi scenario, we've already built modules that can: Generate comprehensive 20+ page market analysis reports with actionable insights Handle customer outreach Monitor competitors and target accounts, tracking changes in their strategy Take supervised actions based on the insights gathered (Manual effort is required currently) But what else should it do? What would make you trust an AI co-founder with parts of your business? Or do you think this whole concept is fundamentally flawed? I'm committed to building this the right way, not just another AI tool or an LLM Wrapper, but an agentic system that can understand your unique challenges and work towards overcoming them. Whether you think this is revolutionary or ridiculous, I want to hear your honest thoughts. But more importantly, I want to hear your unfiltered feedback in the comments. What would make this truly valuable for YOU? Edit 1: The AI cofounder will take no equity in your startup.

After building an AI Co-founder to solve my startup struggles, I realized we might be onto something bigger. What problems would you want YOUR AI Co-founder to solve?
reddit
LLM Vibe Score0
Human Vibe Score0
Consistent_Yak6765This week

After building an AI Co-founder to solve my startup struggles, I realized we might be onto something bigger. What problems would you want YOUR AI Co-founder to solve?

A few days ago, I shared my entrepreneurial journey and the endless loop of startup struggles I was facing. The response from the community was overwhelming, and it validated something I had stumbled upon while trying to solve my own problems. In just a matter of days, we've built out the core modules I initially used for myself, deep market research capabilities, automated outreach systems, and competitor analysis. It's surreal to see something born out of personal frustration turning into a tool that others might actually find valuable. But here's where it gets interesting (and where I need your help). While we're actively onboarding users for our alpha test, I can't shake the feeling that we're just scratching the surface. We've built what helped me, but what would help YOU? When you're lying awake at 3 AM, stressed about your startup, what tasks do you wish you could delegate to an AI co-founder who actually understands context and can take meaningful action? Of course, it's not a replacement for an actual AI cofounder, but using our prior entrepreneurial experience and conversations with other folks, we understand that OUTREACH and SALES might actually be a big problem statement we can go deeper on as it naturally helps with the following: Idea Validation - Testing your assumptions with real customers before building Pricing strategy - Understanding what the market is willing to pay Product strategy - Getting feedback on features and roadmap Actually revenue - Converting conversations into real paying customers I'm not asking you to imagine some sci-fi scenario, we've already built modules that can: Generate comprehensive 20+ page market analysis reports with actionable insights Handle customer outreach Monitor competitors and target accounts, tracking changes in their strategy Take supervised actions based on the insights gathered (Manual effort is required currently) But what else should it do? What would make you trust an AI co-founder with parts of your business? Or do you think this whole concept is fundamentally flawed? I'm committed to building this the right way, not just another AI tool or an LLM Wrapper, but an agentic system that can understand your unique challenges and work towards overcoming them. Whether you think this is revolutionary or ridiculous, I want to hear your honest thoughts. But more importantly, I want to hear your unfiltered feedback in the comments. What would make this truly valuable for YOU? Edit 1: The AI cofounder will take no equity in your startup.

The Birth of My First (and Hilariously Flawed) Voice Agent: A Tale of No-Code Chaos
reddit
LLM Vibe Score0
Human Vibe Score0.778
No-Understanding5609This week

The Birth of My First (and Hilariously Flawed) Voice Agent: A Tale of No-Code Chaos

Okay Reddit, buckle up. I'm about to tell you the saga of how I birthed my very first voice agent, a chaotic and frankly, slightly embarrassing journey involving Retell.ai, Make.com, and Zapier. Looking back, it's equal parts hilarious and traumatizing. The Naive Dream: Back then (it feels like ages ago!), I was convinced I could easily whip up a voice agent that would take restaurant orders over the phone. Elegant, efficient, and completely automated! I envisioned a world where my clients' restaurant never missed a beat, all thanks to my coding prowess... or rather, my no-code prowess. How wrong I was. The Gauntlet Begins: Retell.ai's Murky Depths Retell.ai was the starting point, the "voice" of my operation. Getting the phone number hooked up felt like a small victory, quickly overshadowed by the realization that their documentation was... well, let's just say it wasn't written for complete novices. I spent what felt like an eternity staring at API keys, convinced I'd entered them correctly, only to be greeted by cryptic error messages. The sheer frustration I felt wrestling with that initial setup is something I'll never forget. Make.com: From Pretty Picture to Painful Puzzle Then came Make.com, the orchestra conductor of my workflow. It looked so beautiful, so user-friendly! Drag and drop, visual modules... what could go wrong? Oh, so much could go wrong. Trying to decipher the JSON data stream from Retell was like trying to understand a foreign language I only knew a few words of. Mapping that data to a Google Sheet? A complete and utter disaster. I remember spending hours just trying to get the correct fields to populate, each failed attempt fueling my growing despair. Zapier: Briefly Considered, Quickly Dismissed I flirted with the idea of using Zapier instead, seduced by its simplicity. But its limitations became glaringly obvious when I tried to build the complex, multi-step process I needed. Make.com was the only real option, which meant diving headfirst into a whole new world of modules, triggers, and data transformations. The Infernal Testing Loop: The absolute WORST part of the entire process was the testing. Picture this: Calling the agent, rambling through a mock order, waiting for the workflow to execute, only to discover (yet another) error. Then, tweaking the scenario, pushing "save," and repeating the entire agonizing process. Each test call felt like a mini-marathon, a grueling race against time and my own dwindling patience. The AI's... Quirks: And then there was the AI itself. It was... let's just say it had a personality of its own. Sometimes, it perfectly understood my order. Other times, it decided I wanted to order 500 pizzas with extra anchovies. Debugging the AI's interpretation felt like negotiating with a stubborn toddler. Lessons Hard-Learned (And Forever Etched in My Memory): Start absurdly small: I tried to build a fully functional system right away. A HUGE mistake. If I could go back, I would have focused on just extracting one piece of information (like, say, just the quantity) and gotten that rock solid before adding anything else. JSON is your friend (or should be): Back then, JSON felt like alien code. Now, I have a slightly better grasp on it. Trust me, learn JSON. It will save you so much pain. Test like your sanity depends on it: Because it does. After every. Single. Change. Test the entire flow. It's tedious, but it's the only way to catch errors before they snowball into a catastrophe. Don't suffer in silence: I tried to be a lone wolf, figuring everything out myself. Big mistake. Retell.ai's forums and Make.com's documentation are goldmines. Use them! Embrace the struggle: This is the most important lesson. Building a voice agent, especially your first one, is hard. It's frustrating. It will test your limits. But don't give up. The feeling of finally making it work (even partially) is worth it. The Bot That (Barely) Lived: In the end, I did create a voice agent that could take orders and log them into a spreadsheet. It wasn't pretty. It was buggy. It occasionally ordered things that didn't make any sense. But it was mine. And it was the first step on a long and winding road. Looking back, I laugh (and cringe) at my naivety. But I also appreciate the lessons I learned and the sheer grit it took to bring my little AI Frankenstein to life. Anyone else have a similar "first bot" story? Let's hear them! Misery (and laughter) loves company. #RetellAI #Makecom #Zapier #FirstBot #NoCodeFail #VoiceAgentStruggles #StoryTime

𝐁𝐮𝐢𝐥𝐝 𝐋𝐋𝐌𝐬 𝐟𝐫𝐨𝐦 𝐬𝐜𝐫𝐚𝐭𝐜𝐡
reddit
LLM Vibe Score0
Human Vibe Score1
Ambitious-Fix-3376This week

𝐁𝐮𝐢𝐥𝐝 𝐋𝐋𝐌𝐬 𝐟𝐫𝐨𝐦 𝐬𝐜𝐫𝐚𝐭𝐜𝐡

“ChatGPT” is everywhere—it’s a tool we use daily to boost productivity, streamline tasks, and spark creativity. But have you ever wondered how it knows so much and performs across such diverse fields? Like many, I've been curious about how it really works and if I could create a similar tool to fit specific needs. 🤔 To dive deeper, I found a fantastic resource: “Build a Large Language Model (From Scratch)” by Sebastian Raschka, which is explained with an insightful YouTube series “Building LLM from Scratch” by Dr. Raj Dandekar (MIT PhD). This combination offers a structured, approachable way to understand the mechanics behind LLMs—and even to try building one ourselves! https://preview.redd.it/35sdlxdb2m0e1.jpg?width=1037&format=pjpg&auto=webp&s=dd228136fbf7cbdeeae253118ee7a46b04948c24 While AI and generative language models architecture shown in the figure can seem difficult to understand, I believe that by taking it step-by-step, it’s achievable—even for those without a tech background. 🚀 Learning one concept at a time can open the doors to this transformative field, and we at Vizuara.ai are excited to take you through the journey where each step is explained in detail for creating an LLM. For anyone interested, I highly recommend going through the following videos:  Lecture 1: Building LLMs from scratch: Series introduction https://youtu.be/Xpr8D6LeAtw?si=vPCmTzfUY4oMCuVl  Lecture 2: Large Language Models (LLM) Basics https://youtu.be/3dWzNZXA8DY?si=FdsoxgSRn9PmXTTz  Lecture 3: Pretraining LLMs vs Finetuning LLMs https://youtu.be/-bsa3fCNGg4?si=j49O1OX2MT2k68pl  Lecture 4: What are transformers? https://youtu.be/NLn4eetGmf8?si=GVBrKVjGa5Y7ivVY  Lecture 5: How does GPT-3 really work? https://youtu.be/xbaYCf2FHSY?si=owbZqQTJQYm5VzDx  Lecture 6: Stages of building an LLM from Scratch https://youtu.be/z9fgKz1Drlc?si=dzAqz-iLKaxUH-lZ  Lecture 7: Code an LLM Tokenizer from Scratch in Python https://youtu.be/rsy5Ragmso8?si=MJr-miJKm7AHwhu9  Lecture 8: The GPT Tokenizer: Byte Pair Encoding https://youtu.be/fKd8s29e-l4?si=aZzzV4qT\nbQ1lzk  Lecture 9: Creating Input-Target data pairs using Python DataLoader https://youtu.be/iQZFH8dr2yI?si=lH6sdboTXzOzZXP9  Lecture 10: What are token embeddings? https://youtu.be/ghCSGRgVB\o?si=PM2FLDl91ENNPJbd  Lecture 11: The importance of Positional Embeddings https://youtu.be/ufrPLpKnapU?si=cstZgif13kyYo0Rc  Lecture 12: The entire Data Preprocessing Pipeline of Large Language Models (LLMs) https://youtu.be/mk-6cFebjis?si=G4Wqn64OszI9ID0b  Lecture 13: Introduction to the Attention Mechanism in Large Language Models (LLMs) https://youtu.be/XN7sevVxyUM?si=aJy7Nplz69jAzDnC  Lecture 14: Simplified Attention Mechanism - Coded from scratch in Python | No trainable weights https://youtu.be/eSRhpYLerw4?si=1eiOOXa3V5LY-H8c  Lecture 15: Coding the self attention mechanism with key, query and value matrices https://youtu.be/UjdRN80c6p8?si=LlJkFvrC4i3J0ERj  Lecture 16: Causal Self Attention Mechanism | Coded from scratch in Python https://youtu.be/h94TQOK7NRA?si=14DzdgSx9XkAJ9Pp  Lecture 17: Multi Head Attention Part 1 - Basics and Python code https://youtu.be/cPaBCoNdCtE?si=eF3GW7lTqGPdsS6y  Lecture 18: Multi Head Attention Part 2 - Entire mathematics explained https://youtu.be/K5u9eEaoxFg?si=JkUATWM9Ah4IBRy2  Lecture 19: Birds Eye View of the LLM Architecture https://youtu.be/4i23dYoXp-A?si=GjoIoJWlMloLDedg  Lecture 20: Layer Normalization in the LLM Architecture https://youtu.be/G3W-LT79LSI?si=ezsIvNcW4dTVa29i  Lecture 21: GELU Activation Function in the LLM Architecture https://youtu.be/d\PiwZe8UF4?si=IOMD06wo1MzElY9J  Lecture 22: Shortcut connections in the LLM Architecture https://youtu.be/2r0QahNdwMw?si=i4KX0nmBTDiPmNcJ  Lecture 23: Coding the entire LLM Transformer Block https://youtu.be/dvH6lFGhFrs?si=e90uX0TfyVRasvel  Lecture 24: Coding the 124 million parameter GPT-2 model https://youtu.be/G3-JgHckzjw?si=peLE6thVj6bds4M0  Lecture 25: Coding GPT-2 to predict the next token https://youtu.be/F1Sm7z2R96w?si=TAN33aOXAeXJm5Ro  Lecture 26: Measuring the LLM loss function https://youtu.be/7TKCrt--bWI?si=rvjeapyoD6c-SQm3  Lecture 27: Evaluating LLM performance on real dataset | Hands on project | Book data https://youtu.be/zuj\NJNouAA?si=Y\vuf-KzY3Dt1d1r  Lecture 28: Coding the entire LLM Pre-training Loop https://youtu.be/Zxf-34voZss?si=AxYVGwQwBubZ3-Y9  Lecture 29: Temperature Scaling in Large Language Models (LLMs) https://youtu.be/oG1FPVnY0pI?si=S4N0wSoy4KYV5hbv  Lecture 30: Top-k sampling in Large Language Models https://youtu.be/EhU32O7DkA4?si=GKHqUCPqG-XvCMFG

Just completed a new type of language learning website - read popular stories scaled to different reading levels
reddit
LLM Vibe Score0
Human Vibe Score1
creedaaronThis week

Just completed a new type of language learning website - read popular stories scaled to different reading levels

As a language learner and software developer, I bootstrapped my project superlang.com over the past year working on the side. There is a mobile friendly web app now, and iOS/Android apps coming in a few months. A year ago I discovered the concept of "comprehensible input" as a way to help me learn German. Even if it's not a silver bullet, it sounded pretty great. Rather than drilling vocab or looking at grammar charts, I could "just read" and acquire the language. I picked up some fairy tales in German, and stories like Alice in Wonderland. Unfortunately, I couldn't really read them. I had to stop every sentence to look up words and try and decipher sentence constructions. Then I turned to some purpose built simple stories for German beginners. But there was a different problem... these were not really stories with any real plot. I could only read so many "Hans goes to the market" type stories before losing interest. My idea was to try to get the best of both worlds somehow. What if I could take a real story, say Alice in Wonderland (or even War and Peace), and dial the difficulty down to my level without losing the plotline. That way, beginners can start right away with something basically comprehensible. Then, you could also re-read the same story at increasing difficulty levels as you gain confidence. As a cherry on top, more illustrations would help with comprehension so each page could have a picture. Is it revolutionary? Maybe, maybe not. I am building off a well established idea of "graded readers" which are simplified stories meant for learning languages. And there are somewhat similar ideas out there now that AI is good at simplifying text, but none that really take this idea where it needs to be with many preloaded stories, multiple difficulty levels, high quality human verified text, and all the bells and whistles. I spent a year building Superlang and it is ready to put out there. Some quick notes: There are 3 languages so far, intended for native English speakers: German, French, and Spanish There are 3 difficulty levels you can set on each story: beginner (roughly A1-A2), intermediate (roughly A2-B1), and advanced (the same level as the original story, but typically B2+) There is premium version as producing the content was somewhat expensive. You can still do a lot of reading on the free version. I have done no marketing yet, except for this post :) The implementation is a combination of AI, and human proofreading and reviewing. In particular, the simplification of stories is very heavily AI driven. The illustrations for each page are AI as well. For translation, as many of you may be aware new LLM models are typically better than Google translate, but still far from perfect. I am very much a proponent of keeping real people in the loop, and so I have real people proofread the translations. That's why there are only about 700 pages of content so far and not tens of thousands. Let me know what you think, and if you find it helpful! Alice in Wonderland - beginner level German Romeo and Juliet - beginner level Spanish

My experience trying to scrape google maps with no code
reddit
LLM Vibe Score0
Human Vibe Score1
youngkilogThis week

My experience trying to scrape google maps with no code

A few months back I was working on a project to help founders that sell to SMBs get better quality leads (Current solutions like Zoominfo and Apollo don’t do very well for the SMB market). Of course, I wanted to do this as quickly as possible with as little code as possible.  We found that people were manually going through Google Maps to find SMBs. They would use the search and manually type in the businesses they were looking for. For example, they would type “restaurants” and manually call/email them. What we decided to do was gather the Google Maps data autonomously and surface that to our customers so they could take all of it. The problem was that we would need a bunch of data from Google Maps to pull it off. We would need to grab all the SMBs across the United States which is a huge undertaking.  Initially, I tried no-code AI web scraping solutions and they worked horribly. For some reason, I couldn’t even get them to scroll down on the page. I was also able to reverse engineer their open-source code and discover that they were taking the entire web page and passing it into GPT to extract data. That just burned my Openai bill.  I then tried the semi-code approach (sorry no-code subreddit) where I would use something like Apify or Google Places API to scrape the businesses. This worked better but still, there was an issue of price at the scale we wanted. Eventually, we ended up writing our scraper for the task.  This experience was so horrible I ended up creating potarix.com. Firstly, we provide scraping as a service in conjunction with AI. We all know AI is shit and keeping this human in the loop allows the AI to do 90% of the work and then for us to tweak the script to 100% completion. Also since we use AI to create the scraper instead of using AI to scrape, we can run it for large scale tasks at a low cost.

[P] [R] sANNd: A New Neural Network Framework Using Trainable Iterators
reddit
LLM Vibe Score0
Human Vibe Score1
JackRipperVAThis week

[P] [R] sANNd: A New Neural Network Framework Using Trainable Iterators

sANNd sANNd is a lightweight, modular neural network library designed as a sandbox for experimenting with new ideas in artificial intelligence. The Mould Class: A Pythonic Building Block The Mould class is a core component of sANNd. It provides a Pythonic way to apply functions to data that’s bundled inside objects: Encapsulated Variables: Each Mould object holds a set of variables (for example, weights or parameters) inside it. This means related data is kept together in one place (the object), making the code organized and intuitive. Static Functions: A Mould class defines its operation as a static method – essentially a function that isn’t tied to a specific instance. This static function takes in inputs (and possibly other Mould objects’ variables) and produces an output. In simple terms, the Mould’s static method describes how to transform input data using the Mould’s internal variables. Pythonic Usage: Using static methods in this way is a clean, Pythonic design. You call the Mould’s function through the class, but it applies to the data in the object. This approach lets you clearly separate what the operation is (the logic in the static function) from which data it uses (the variables inside the Mould instance). Example: Imagine a Mould class called LinearMould that has a static function to compute a linear transformation (like y = W*x + b). An instance of LinearMould would hold specific W and b values, and you’d use the static method to apply that linear formula to an input. This gives you the convenience of object-oriented design (encapsulating W and b) with the clarity of a standalone function defining the math. Chaining Moulds for Complex Computations Moulds become even more powerful when you chain them together. You can connect multiple Moulds so that the output of one becomes the input of the next: Sequential Operations: Just like stacking layers in a neural network, you can place Moulds in sequence. For example, you might take the output from LinearMouldA and feed it into LinearMouldB. In code, this might look as simple as using the output of one call as the argument to the next. The design of sANNd makes this straightforward – the static function of each Mould knows how to handle the data coming in. Building Pipelines: By chaining Moulds, you create a pipeline of transformations. Each Mould handles one step of computation, and together they produce a final result. This could represent a multi-layer neural network, a data processing pipeline, or any custom sequence of operations you need. There’s no strict limit to how you can chain them; you have the freedom to combine Moulds in any order that makes sense for your experiment. Clarity and Modularity: Because each Mould is a self-contained piece (with its variables and function), chaining them doesn’t turn your code into a black box. You can inspect or modify any part of the chain easily. This modular design means you can insert, remove, or replace Moulds to see how it affects the overall computation, which is great for experimentation. Implicit Backward Path (Automatic Backpropagation) One major benefit of using chained Moulds is that they implicitly define the backward path for training with gradient descent (backpropagation): Automatic Gradient Flow: When you connect Moulds in a sequence for a forward pass (input → Mould A → Mould B → output), you’ve essentially defined a computation graph. sANNd uses this graph to handle the reverse computation automatically. In other words, if you calculate an error or loss based on the final output, sANNd can propagate that error backwards through each Mould in the chain. No Manual Backprop: You do not need to manually code how gradients flow through each Mould. The way you set up the Moulds’ static functions already determines how outputs depend on inputs and internal variables. sANNd leverages that to perform backpropagation. This is similar in spirit to how libraries like PyTorch/TF do “autograd,” but here it’s a natural result of the Mould chain architecture. Gradient Descent Ready: Because the backward path is established by the forward connections, you can apply gradient descent optimizations out of the box. For instance, you can adjust the weights inside each Mould based on the computed gradients to minimize your loss. The design ensures that each Mould’s contribution to the final error is tracked, so all parts of your model learn appropriately during training. In short, defining your model with Moulds means you get training capability for free. You focus on describing the forward computations, and sANNd handles the math behind learning from errors. Comparing sANNd to Traditional Frameworks sANNd’s approach is quite different from traditional Python-based neural network frameworks. Here’s how it stacks up against frameworks like TensorFlow, PyTorch, or Keras in terms of approach, flexibility, and intended use: Design Approach: Traditional frameworks use predefined layer classes and often build a computation graph behind the scenes. For example, Keras might have a Dense layer class, and TensorFlow might construct a static graph (in TF1) or use eager execution (in TF2). sANNd takes a simpler approach – it uses plain Python classes and static functions (Moulds) to define computations. There’s no need to learn a new graph syntax or decorators; if you know Python functions and classes, you can read and write sANNd models. This makes the internal workings more transparent and easier to follow. Flexibility: While frameworks like PyTorch and TensorFlow are very powerful, they can introduce a lot of boilerplate and assume you’re building typical architectures. sANNd is extremely modular and flexible. You aren’t limited to the layers someone else defined – you can create any operation you want as a Mould. Want to experiment with a novel activation function or a custom recurrent connection? Just define it in a Mould. There’s less magic and abstraction obscuring your code, so unconventional model structures are easier to implement. (Of course, major frameworks can also be extended, but sANNd makes this feel more natural by staying within standard Python paradigms.) Intended Use: sANNd is intended for experimentation and research. It’s like a toolkit for tinkering. You get fine-grained control over every part of the network, which is ideal for trying out bold new ideas that don’t fit the mold of common deep learning models. In contrast, TensorFlow/PyTorch shine in production environments and large-scale training – they are optimized (GPU support, highly efficient tensor operations) and come with many utilities for things like data loading, distributed training, etc. sANNd doesn’t aim to replace them for those heavy-lifting tasks. Instead, it’s meant for when you need a lighter, more interpretable setup to prototype concepts. You might use sANNd to prove out a concept or test a hypothesis in AI research, and later switch to a bigger framework if you need to scale it up. Simplicity vs. Complexity: By design, sANNd keeps things simple. The trade-off is that it might not have the raw performance optimizations of the large frameworks. However, this simplicity is a feature – it means the code is easier to understand and modify. For many research scenarios, being able to quickly tweak an idea is more important than squeezing out maximum speed. Traditional frameworks, with their complexity, can sometimes be harder to adapt for radically different ideas (you might find yourself fighting the framework). With sANNd, the framework gets out of your way as much as possible. Modular and Experimental by Nature One of the driving philosophies of sANNd is to be modular and experimental, to further ML research: Modularity: sANNd is built from small, composable pieces. The Mould class is one such piece, and you can imagine building additional components in a similar spirit. This modular design means you can re-use components, mix and match them, or replace one implementation with another without affecting the rest of your system. It’s like having a box of building blocks for neural networks – you can assemble them in standard ways or in completely novel configurations. Experimentation Friendly: Because it avoids heavy abstraction, sANNd lets you directly see and control what’s happening at each step. This is great for research, where you might need to observe intermediate results, inject custom behavior, or adjust the learning process on the fly. sANNd’s straightforward structure (Python objects and functions) makes such interventions possible. You’re not constrained to a fixed training loop or forced to use certain layer types. True Intelligence Research: Achieving “True Intelligence” (often related to artificial general intelligence or other forms of broader AI) may require going beyond the usual neural network designs. sANNd aims to be a playground for these ideas. Its flexibility allows researchers to integrate unconventional elements — be it new memory structures, dynamic connection patterns, or hybrid models that combine symbolic and neural approaches. You can use sANNd to prototype these offbeat ideas quickly. In essence, it’s easier to test “what if we try this?” scenarios with sANNd than with more rigid frameworks. In summary, sANNd’s unique Mould class and design philosophy offer a fresh take on building neural networks. It emphasizes clarity, composability, and flexibility, allowing you to focus on creativity and understanding. Whether you’re stacking simple Moulds into a deep model, or inventing a completely new form of network, sANNd provides a friendly foundation. It’s not here to dethrone TensorFlow or PyTorch in industry applications – instead, it’s here to give researchers and enthusiasts a more malleable tool for exploring the frontiers of AI. Enjoy using sANNd as your neural network sandbox, and happy experimenting!

[N] Yoshua Bengio's latest letter addressing arguments against taking AI safety seriously
reddit
LLM Vibe Score0
Human Vibe Score1
qtangsThis week

[N] Yoshua Bengio's latest letter addressing arguments against taking AI safety seriously

https://yoshuabengio.org/2024/07/09/reasoning-through-arguments-against-taking-ai-safety-seriously/ Summary by GPT-4o: "Reasoning through arguments against taking AI safety seriously" by Yoshua Bengio: Summary Introduction Bengio reflects on his year of advocating for AI safety, learning through debates, and synthesizing global expert views in the International Scientific Report on AI safety. He revisits arguments against AI safety concerns and shares his evolved perspective on the potential catastrophic risks of AGI and ASI. Headings and Summary The Importance of AI Safety Despite differing views, there is a consensus on the need to address risks associated with AGI and ASI. The main concern is the unknown moral and behavioral control over such entities. Arguments Dismissing AGI/ASI Risks Skeptics argue AGI/ASI is either impossible or too far in the future to worry about now. Bengio refutes this, stating we cannot be certain about the timeline and need to prepare regulatory frameworks proactively. For those who think AGI and ASI are impossible or far in the future He challenges the idea that current AI capabilities are far from human-level intelligence, citing historical underestimations of AI advancements. The trend of AI capabilities suggests we might reach AGI/ASI sooner than expected. For those who think AGI is possible but only in many decades Regulatory and safety measures need time to develop, necessitating action now despite uncertainties about AGI’s timeline. For those who think that we may reach AGI but not ASI Bengio argues that even AGI presents significant risks and could quickly lead to ASI, making it crucial to address these dangers. For those who think that AGI and ASI will be kind to us He counters the optimism that AGI/ASI will align with human goals, emphasizing the need for robust control mechanisms to prevent AI from pursuing harmful objectives. For those who think that corporations will only design well-behaving AIs and existing laws are sufficient Profit motives often conflict with safety, and existing laws may not adequately address AI-specific risks and loopholes. For those who think that we should accelerate AI capabilities research and not delay benefits of AGI Bengio warns against prioritizing short-term benefits over long-term risks, advocating for a balanced approach that includes safety research. For those concerned that talking about catastrophic risks will hurt efforts to mitigate short-term human-rights issues with AI Addressing both short-term and long-term AI risks can be complementary, and ignoring catastrophic risks would be irresponsible given their potential impact. For those concerned with the US-China cold war AI development should consider global risks and seek collaborative safety research to prevent catastrophic mistakes that transcend national borders. For those who think that international treaties will not work While challenging, international treaties on AI safety are essential and feasible, especially with mechanisms like hardware-enabled governance. For those who think the genie is out of the bottle and we should just let go and avoid regulation Despite AI's unstoppable progress, regulation and safety measures are still critical to steer AI development towards positive outcomes. For those who think that open-source AGI code and weights are the solution Open-sourcing AI has benefits but also significant risks, requiring careful consideration and governance to prevent misuse and loss of control. For those who think worrying about AGI is falling for Pascal’s wager Bengio argues that AI risks are substantial and non-negligible, warranting serious attention and proactive mitigation efforts. Conclusion Bengio emphasizes the need for a collective, cautious approach to AI development, balancing the pursuit of benefits with rigorous safety measures to prevent catastrophic outcomes.

[D] AI Agents: too early, too expensive, too unreliable
reddit
LLM Vibe Score0
Human Vibe Score1
madredditscientistThis week

[D] AI Agents: too early, too expensive, too unreliable

Reference: Full blog post There has been a lot of hype about the promise of autonomous agent-based LLM workflows. By now, all major LLMs are capable of interacting with external tools and functions, letting the LLM perform sequences of tasks automatically. But reality is proving more challenging than anticipated. The WebArena leaderboard, which benchmarks LLMs agents against real-world tasks, shows that even the best-performing models have a success rate of only 35.8%. Challenges in Practice After seeing many attempts to AI agents, I believe it's too early, too expensive, too slow, too unreliable. It feels like many AI agent startups are waiting for a model breakthrough that will start the race to productize agents. Reliability: As we all know, LLMs are prone to hallucinations and inconsistencies. Chaining multiple AI steps compounds these issues, especially for tasks requiring exact outputs. Performance and costs: GPT-4o, Gemini-1.5, and Claude Opus are working quite well with tool usage/function calling, but they are still slow and expensive, particularly if you need to do loops and automatic retries. Legal concerns: Companies may be held liable for the mistakes of their agents. A recent example is Air Canada being ordered to pay a customer who was misled by the airline's chatbot. User trust: The "black box" nature of AI agents and stories like the above makes it hard for users to understand and trust their outputs. Gaining user trust for sensitive tasks involving payments or personal information will be hard (paying bills, shopping, etc.). Real-World Attempts Several startups are tackling the AI agent space, but most are still experimental or invite-only: adept.ai - $350M funding, but access is still very limited MultiOn - funding unknown, their API-first approach seems promising HypeWrite - $2.8M funding, started with an AI writing assistant and expanded into the agent space minion.ai - created some initial buzz but has gone quiet now, waitlist only Only MultiOn seems to be pursuing the "give it instructions and watch it go" approach, which is more in line with the promise of AI agents. All others are going down the record-and-replay RPA route, which may be necessary for reliability at this stage. Large players are also bringing AI capabilities to desktops and browsers, and it looks like we'll get native AI integrations on a system level: OpenAI announced their Mac desktop app that can interact with the OS screen. At Google I/O, Google demonstrated Gemini automatically processing a shopping return. Microsoft announced Copilot Studio, which will let developers build AI agent bots. Screenshot Screenshot These tech demos are impressive, but we'll see how well these agent capabilities will work when released publicly and tested against real-world scenarios instead of hand-picked demo cases. The Path Forward AI agents overhyped and it's too early. However, the underlying models continue to advance quickly, and we can expect to see more successful real-world applications. Instead of trying to have one large general purpose agent that is hard to control and test, we can use many smaller agents that basically just pick the right strategy for a specific sub-task in our workflows. These "agents" can be thought of as medium-sized LLM prompts with a) context and b) a set of functions available to call. The most promising path forward likely looks like this: Narrowly scoped, well testable automations that use AI as an augmentation tool rather than pursuing full autonomy Human-in-the-loop approaches that keep humans involved for oversight and handling edge cases Setting realistic expectations about current capabilities and limitations By combining tightly constrained agents, good evaluation data, human-in-the-loop oversight, and traditional engineering methods, we can achieve reliably good results for automating medium-complex tasks. Will AI agents automate tedious repetitive work, such as web scraping, form filling, and data entry? Yes, absolutely. Will AI agents autonomously book your vacation without your intervention? Unlikely, at least in the near future.

[P] An elegant and strong PyTorch Trainer
reddit
LLM Vibe Score0
Human Vibe Score1
serend1p1ty-leeThis week

[P] An elegant and strong PyTorch Trainer

For lightweight use, pytorch-lightning is too heavy, and its source code will be very difficult for beginners to read, at least for me. As we know, for a deep learning engineer, a powerful trainer is a sharp weapon. When reproducing the SOTA papers, you don't have to write a lot of template code every time and can pay more attention to the model implementation itself. I opened source some works (AAAI 21 SeqNet, ICCV 21 MAED, etc) and earned more than 500 stars. After referring to some popular projects (detectron2, pytorch-image-models, and mmcv), based on my personal development experience, I developed a SIMPLE enough, GENERIC enough, and STRONG enough PyTorch Trainer: core-pytorch-utils, also named CPU. CPU covers most details in the process of training a deep neural network, including: Auto logging to console and tensorboard. Auto checkpointing. Argument parser which can load a YAML configuration file. Make ALL PyTorch LR scheduler supporting warmup. Support distributed training. Support Automatically Mixed Precision (AMP) training. I try to keep the project code as simple and readable as possible. So the code comments are very detailed and everyone can understand them. What's more, a good document is also available: CPU document For deep learning green hands, you can learn how to: write a standard and clean training loop. use AMP to speed up your training. save checkpoint, and resume from it. perform more smooth, and readable logging. use the popular visualization library: tensorboard. For old hands, we can talk about whether the structure of CPU is elegant and reasonable. I have thought a lot about this framework, combining the advantages of several popular frameworks and discarding their shortcomings. Welcome to use it!

[D] Here are 17 ways of making PyTorch training faster – what did I miss?
reddit
LLM Vibe Score0
Human Vibe Score1
lorenzkuhnThis week

[D] Here are 17 ways of making PyTorch training faster – what did I miss?

I've been collecting methods to accelerate training in PyTorch – here's what I've found so far. What did I miss? What did I get wrong? The methods – roughly sorted from largest to smallest expected speed-up – are: Consider using a different learning rate schedule. Use multiple workers and pinned memory in DataLoader. Max out the batch size. Use Automatic Mixed Precision (AMP). Consider using a different optimizer. Turn on cudNN benchmarking. Beware of frequently transferring data between CPUs and GPUs. Use gradient/activation checkpointing. Use gradient accumulation. Use DistributedDataParallel for multi-GPU training. Set gradients to None rather than 0. Use .as\_tensor rather than .tensor() Turn off debugging APIs if not needed. Use gradient clipping. Turn off bias before BatchNorm. Turn off gradient computation during validation. Use input and batch normalization. Consider using another learning rate schedule The learning rate (schedule) you choose has a large impact on the speed of convergence as well as the generalization performance of your model. Cyclical Learning Rates and the 1Cycle learning rate schedule are both methods introduced by Leslie N. Smith (here and here), and then popularised by fast.ai's Jeremy Howard and Sylvain Gugger (here and here). Essentially, the 1Cycle learning rate schedule looks something like this: ​ https://preview.redd.it/sc37u5knmxa61.png?width=476&format=png&auto=webp&s=09b309b4dbd67eedb4ab5f86e03e0e83d7b072d1 Sylvain writes: \[1cycle consists of\]  two steps of equal lengths, one going from a lower learning rate to a higher one than go back to the minimum. The maximum should be the value picked with the Learning Rate Finder, and the lower one can be ten times lower. Then, the length of this cycle should be slightly less than the total number of epochs, and, in the last part of training, we should allow the learning rate to decrease more than the minimum, by several orders of magnitude. In the best case this schedule achieves a massive speed-up – what Smith calls Superconvergence – as compared to conventional learning rate schedules. Using the 1Cycle policy he needs \~10x fewer training iterations of a ResNet-56 on ImageNet to match the performance of the original paper, for instance). The schedule seems to perform robustly well across common architectures and optimizers. PyTorch implements both of these methods torch.optim.lrscheduler.CyclicLR and torch.optim.lrscheduler.OneCycleLR, see the documentation. One drawback of these schedulers is that they introduce a number of additional hyperparameters. This post and this repo, offer a nice overview and implementation of how good hyper-parameters can be found including the Learning Rate Finder mentioned above. Why does this work? It doesn't seem entirely clear but one possible explanation might be that regularly increasing the learning rate helps to traverse saddle points in the loss landscape more quickly. Use multiple workers and pinned memory in DataLoader When using torch.utils.data.DataLoader, set numworkers > 0, rather than the default value of 0, and pinmemory=True, rather than the default value of False. Details of this are explained here. Szymon Micacz achieves a 2x speed-up for a single training epoch by using four workers and pinned memory. A rule of thumb that people are using to choose the number of workers is to set it to four times the number of available GPUs with both a larger and smaller number of workers leading to a slow down. Note that increasing num\_workerswill increase your CPU memory consumption. Max out the batch size This is a somewhat contentious point. Generally, however, it seems like using the largest batch size your GPU memory permits will accelerate your training (see NVIDIA's Szymon Migacz, for instance). Note that you will also have to adjust other hyperparameters, such as the learning rate, if you modify the batch size. A rule of thumb here is to double the learning rate as you double the batch size. OpenAI has a nice empirical paper on the number of convergence steps needed for different batch sizes. Daniel Huynh runs some experiments with different batch sizes (also using the 1Cycle policy discussed above) where he achieves a 4x speed-up by going from batch size 64 to 512. One of the downsides of using large batch sizes, however, is that they might lead to solutions that generalize worse than those trained with smaller batches. Use Automatic Mixed Precision (AMP) The release of PyTorch 1.6 included a native implementation of Automatic Mixed Precision training to PyTorch. The main idea here is that certain operations can be run faster and without a loss of accuracy at semi-precision (FP16) rather than in the single-precision (FP32) used elsewhere. AMP, then, automatically decide which operation should be executed in which format. This allows both for faster training and a smaller memory footprint. In the best case, the usage of AMP would look something like this: import torch Creates once at the beginning of training scaler = torch.cuda.amp.GradScaler() for data, label in data_iter: optimizer.zero_grad() Casts operations to mixed precision with torch.cuda.amp.autocast(): loss = model(data) Scales the loss, and calls backward() to create scaled gradients scaler.scale(loss).backward() Unscales gradients and calls or skips optimizer.step() scaler.step(optimizer) Updates the scale for next iteration scaler.update() Benchmarking a number of common language and vision models on NVIDIA V100 GPUs, Huang and colleagues find that using AMP over regular FP32 training yields roughly 2x – but upto 5.5x – training speed-ups. Currently, only CUDA ops can be autocast in this way. See the documentation here for more details on this and other limitations. u/SVPERBlA points out that you can squeeze out some additional performance (\~ 20%) from AMP on NVIDIA Tensor Core GPUs if you convert your tensors to the Channels Last memory format. Refer to this section in the NVIDIA docs for an explanation of the speedup and more about NCHW versus NHWC tensor formats. Consider using another optimizer AdamW is Adam with weight decay (rather than L2-regularization) which was popularized by fast.ai and is now available natively in PyTorch as torch.optim.AdamW. AdamW seems to consistently outperform Adam in terms of both the error achieved and the training time. See this excellent blog post on why using weight decay instead of L2-regularization makes a difference for Adam. Both Adam and AdamW work well with the 1Cycle policy described above. There are also a few not-yet-native optimizers that have received a lot of attention recently, most notably LARS (pip installable implementation) and LAMB. NVIDA's APEX implements fused versions of a number of common optimizers such as Adam. This implementation avoid a number of passes to and from GPU memory as compared to the PyTorch implementation of Adam, yielding speed-ups in the range of 5%. Turn on cudNN benchmarking If your model architecture remains fixed and your input size stays constant, setting torch.backends.cudnn.benchmark = True might be beneficial (docs). This enables the cudNN autotuner which will benchmark a number of different ways of computing convolutions in cudNN and then use the fastest method from then on. For a rough reference on the type of speed-up you can expect from this, Szymon Migacz achieves a speed-up of 70% on a forward pass for a convolution and a 27% speed-up for a forward + backward pass of the same convolution. One caveat here is that this autotuning might become very slow if you max out the batch size as mentioned above. Beware of frequently transferring data between CPUs and GPUs Beware of frequently transferring tensors from a GPU to a CPU using tensor.cpu() and vice versa using tensor.cuda() as these are relatively expensive. The same applies for .item() and .numpy() – use .detach() instead. If you are creating a new tensor, you can also directly assign it to your GPU using the keyword argument device=torch.device('cuda:0'). If you do need to transfer data, using .to(non_blocking=True), might be useful as long as you don't have any synchronization points after the transfer. If you really have to, you might want to give Santosh Gupta's SpeedTorch a try, although it doesn't seem entirely clear when this actually does/doesn't provide speed-ups. Use gradient/activation checkpointing Quoting directly from the documentation: Checkpointing works by trading compute for memory. Rather than storing all intermediate activations of the entire computation graph for computing backward, the checkpointed part does not save intermediate activations, and instead recomputes them in backward pass. It can be applied on any part of a model. Specifically, in the forward pass, function will run in torch.no\grad() manner, i.e., not storing the intermediate activations. Instead, the forward pass saves the inputs tuple and the functionparameter. In the backwards pass, the saved inputs and function is retrieved, and the forward pass is computed on function again, now tracking the intermediate activations, and then the gradients are calculated using these activation values. So while this will might slightly increase your run time for a given batch size, you'll significantly reduce your memory footprint. This in turn will allow you to further increase the batch size you're using allowing for better GPU utilization. While checkpointing is implemented natively as torch.utils.checkpoint(docs), it does seem to take some thought and effort to implement properly. Priya Goyal has a good tutorial demonstrating some of the key aspects of checkpointing. Use gradient accumulation Another approach to increasing the batch size is to accumulate gradients across multiple .backward() passes before calling optimizer.step(). Following a post by Hugging Face's Thomas Wolf, gradient accumulation can be implemented as follows: model.zero_grad() Reset gradients tensors for i, (inputs, labels) in enumerate(training_set): predictions = model(inputs) Forward pass loss = loss_function(predictions, labels) Compute loss function loss = loss / accumulation_steps Normalize our loss (if averaged) loss.backward() Backward pass if (i+1) % accumulation_steps == 0: Wait for several backward steps optimizer.step() Now we can do an optimizer step model.zero_grad() Reset gradients tensors if (i+1) % evaluation_steps == 0: Evaluate the model when we... evaluate_model() ...have no gradients accumulate This method was developed mainly to circumvent GPU memory limitations and I'm not entirely clear on the trade-off between having additional .backward() loops. This discussion on the fastai forum seems to suggest that it can in fact accelerate training, so it's probably worth a try. Use Distributed Data Parallel for multi-GPU training Methods to accelerate distributed training probably warrant their own post but one simple one is to use torch.nn.DistributedDataParallel rather than torch.nn.DataParallel. By doing so, each GPU will be driven by a dedicated CPU core avoiding the GIL issues of DataParallel. In general, I can strongly recommend reading the documentation on distributed training. Set gradients to None rather than 0 Use .zerograd(settonone=True) rather than .zerograd(). Doing so will let the memory allocator handle the gradients rather than actively setting them to 0. This will lead to yield a modest speed-up as they say in the documentation, so don't expect any miracles. Watch out, doing this is not side-effect free! Check the docs for the details on this. Use .as_tensor() rather than .tensor() torch.tensor() always copies data. If you have a numpy array that you want to convert, use torch.astensor() or torch.fromnumpy() to avoid copying the data. Turn on debugging tools only when actually needed PyTorch offers a number of useful debugging tools like the autograd.profiler, autograd.grad\check, and autograd.anomaly\detection. Make sure to use them to better understand when needed but to also turn them off when you don't need them as they will slow down your training. Use gradient clipping Originally used to avoid exploding gradients in RNNs, there is both some empirical evidence as well as some theoretical support that clipping gradients (roughly speaking: gradient = min(gradient, threshold)) accelerates convergence. Hugging Face's Transformer implementation is a really clean example of how to use gradient clipping as well as some of the other methods such as AMP mentioned in this post. In PyTorch this can be done using torch.nn.utils.clipgradnorm(documentation). It's not entirely clear to me which models benefit how much from gradient clipping but it seems to be robustly useful for RNNs, Transformer-based and ResNets architectures and a range of different optimizers. Turn off bias before BatchNorm This is a very simple one: turn off the bias of layers before BatchNormalization layers. For a 2-D convolutional layer, this can be done by setting the bias keyword to False: torch.nn.Conv2d(..., bias=False, ...).  (Here's a reminder why this makes sense.) You will save some parameters, I would however expect the speed-up of this to be relatively small as compared to some of the other methods mentioned here. Turn off gradient computation during validation This one is straightforward: set torch.no_grad() during validation. Use input and batch normalization You're probably already doing this but you might want to double-check: Are you normalizing your input? Are you using batch-normalization? And here's a reminder of why you probably should. Bonus tip from the comments: Use JIT to fuse point-wise operations. If you have adjacent point-wise operations you can use PyTorch JIT to combine them into one FusionGroup which can then be launched on a single kernel rather than multiple kernels as would have been done per default. You'll also save some memory reads and writes. Szymon Migacz shows how you can use the @torch.jit.script decorator to fuse the operations in a GELU, for instance: @torch.jit.script def fused_gelu(x): return x 0.5 (1.0 + torch.erf(x / 1.41421)) In this case, fusing the operations leads to a 5x speed-up for the execution of fused_gelu as compared to the unfused version. See also this post for an example of how Torchscript can be used to accelerate an RNN. Hat tip to u/Patient_Atmosphere45 for the suggestion. Sources and additional resources Many of the tips listed above come from Szymon Migacz' talk and post in the PyTorch docs. PyTorch Lightning's William Falcon has two interesting posts with tips to speed-up training. PyTorch Lightning does already take care of some of the points above per-default. Thomas Wolf at Hugging Face has a number of interesting articles on accelerating deep learning – with a particular focus on language models. The same goes for Sylvain Gugger and Jeremy Howard: they have many interesting posts in particular on learning rates and AdamW. Thanks to Ben Hahn, Kevin Klein and Robin Vaaler for their feedback on a draft of this post! I've also put all of the above into this blog post.

[D] The current and future state of AI/ML is shockingly demoralizing with little hope of redemption
reddit
LLM Vibe Score0
Human Vibe Score1
Flaky_Suit_8665This week

[D] The current and future state of AI/ML is shockingly demoralizing with little hope of redemption

I recently encountered the PaLM (Scaling Language Modeling with Pathways) paper from Google Research and it opened up a can of worms of ideas I’ve felt I’ve intuitively had for a while, but have been unable to express – and I know I can’t be the only one. Sometimes I wonder what the original pioneers of AI – Turing, Neumann, McCarthy, etc. – would think if they could see the state of AI that we’ve gotten ourselves into. 67 authors, 83 pages, 540B parameters in a model, the internals of which no one can say they comprehend with a straight face, 6144 TPUs in a commercial lab that no one has access to, on a rig that no one can afford, trained on a volume of data that a human couldn’t process in a lifetime, 1 page on ethics with the same ideas that have been rehashed over and over elsewhere with no attempt at a solution – bias, racism, malicious use, etc. – for purposes that who asked for? When I started my career as an AI/ML research engineer 2016, I was most interested in two types of tasks – 1.) those that most humans could do but that would universally be considered tedious and non-scalable. I’m talking image classification, sentiment analysis, even document summarization, etc. 2.) tasks that humans lack the capacity to perform as well as computers for various reasons – forecasting, risk analysis, game playing, and so forth. I still love my career, and I try to only work on projects in these areas, but it’s getting harder and harder. This is because, somewhere along the way, it became popular and unquestionably acceptable to push AI into domains that were originally uniquely human, those areas that sit at the top of Maslows’s hierarchy of needs in terms of self-actualization – art, music, writing, singing, programming, and so forth. These areas of endeavor have negative logarithmic ability curves – the vast majority of people cannot do them well at all, about 10% can do them decently, and 1% or less can do them extraordinarily. The little discussed problem with AI-generation is that, without extreme deterrence, we will sacrifice human achievement at the top percentile in the name of lowering the bar for a larger volume of people, until the AI ability range is the norm. This is because relative to humans, AI is cheap, fast, and infinite, to the extent that investments in human achievement will be watered down at the societal, educational, and individual level with each passing year. And unlike AI gameplay which superseded humans decades ago, we won’t be able to just disqualify the machines and continue to play as if they didn’t exist. Almost everywhere I go, even this forum, I encounter almost universal deference given to current SOTA AI generation systems like GPT-3, CODEX, DALL-E, etc., with almost no one extending their implications to its logical conclusion, which is long-term convergence to the mean, to mediocrity, in the fields they claim to address or even enhance. If you’re an artist or writer and you’re using DALL-E or GPT-3 to “enhance” your work, or if you’re a programmer saying, “GitHub Co-Pilot makes me a better programmer?”, then how could you possibly know? You’ve disrupted and bypassed your own creative process, which is thoughts -> (optionally words) -> actions -> feedback -> repeat, and instead seeded your canvas with ideas from a machine, the provenance of which you can’t understand, nor can the machine reliably explain. And the more you do this, the more you make your creative processes dependent on said machine, until you must question whether or not you could work at the same level without it. When I was a college student, I often dabbled with weed, LSD, and mushrooms, and for a while, I thought the ideas I was having while under the influence were revolutionary and groundbreaking – that is until took it upon myself to actually start writing down those ideas and then reviewing them while sober, when I realized they weren’t that special at all. What I eventually determined is that, under the influence, it was impossible for me to accurately evaluate the drug-induced ideas I was having because the influencing agent the generates the ideas themselves was disrupting the same frame of reference that is responsible evaluating said ideas. This is the same principle of – if you took a pill and it made you stupider, would even know it? I believe that, especially over the long-term timeframe that crosses generations, there’s significant risk that current AI-generation developments produces a similar effect on humanity, and we mostly won’t even realize it has happened, much like a frog in boiling water. If you have children like I do, how can you be aware of the the current SOTA in these areas, project that 20 to 30 years, and then and tell them with a straight face that it is worth them pursuing their talent in art, writing, or music? How can you be honest and still say that widespread implementation of auto-correction hasn’t made you and others worse and worse at spelling over the years (a task that even I believe most would agree is tedious and worth automating). Furthermore, I’ve yet to set anyone discuss the train – generate – train - generate feedback loop that long-term application of AI-generation systems imply. The first generations of these models were trained on wide swaths of web data generated by humans, but if these systems are permitted to continually spit out content without restriction or verification, especially to the extent that it reduces or eliminates development and investment in human talent over the long term, then what happens to the 4th or 5th generation of models? Eventually we encounter this situation where the AI is being trained almost exclusively on AI-generated content, and therefore with each generation, it settles more and more into the mean and mediocrity with no way out using current methods. By the time that happens, what will we have lost in terms of the creative capacity of people, and will we be able to get it back? By relentlessly pursuing this direction so enthusiastically, I’m convinced that we as AI/ML developers, companies, and nations are past the point of no return, and it mostly comes down the investments in time and money that we’ve made, as well as a prisoner’s dilemma with our competitors. As a society though, this direction we’ve chosen for short-term gains will almost certainly make humanity worse off, mostly for those who are powerless to do anything about it – our children, our grandchildren, and generations to come. If you’re an AI researcher or a data scientist like myself, how do you turn things back for yourself when you’ve spent years on years building your career in this direction? You’re likely making near or north of $200k annually TC and have a family to support, and so it’s too late, no matter how you feel about the direction the field has gone. If you’re a company, how do you standby and let your competitors aggressively push their AutoML solutions into more and more markets without putting out your own? Moreover, if you’re a manager or thought leader in this field like Jeff Dean how do you justify to your own boss and your shareholders your team’s billions of dollars in AI investment while simultaneously balancing ethical concerns? You can’t – the only answer is bigger and bigger models, more and more applications, more and more data, and more and more automation, and then automating that even further. If you’re a country like the US, how do responsibly develop AI while your competitors like China single-mindedly push full steam ahead without an iota of ethical concern to replace you in numerous areas in global power dynamics? Once again, failing to compete would be pre-emptively admitting defeat. Even assuming that none of what I’ve described here happens to such an extent, how are so few people not taking this seriously and discounting this possibility? If everything I’m saying is fear-mongering and non-sense, then I’d be interested in hearing what you think human-AI co-existence looks like in 20 to 30 years and why it isn’t as demoralizing as I’ve made it out to be. ​ EDIT: Day after posting this -- this post took off way more than I expected. Even if I received 20 - 25 comments, I would have considered that a success, but this went much further. Thank you to each one of you that has read this post, even more so if you left a comment, and triply so for those who gave awards! I've read almost every comment that has come in (even the troll ones), and am truly grateful for each one, including those in sharp disagreement. I've learned much more from this discussion with the sub than I could have imagined on this topic, from so many perspectives. While I will try to reply as many comments as I can, the sheer comment volume combined with limited free time between work and family unfortunately means that there are many that I likely won't be able to get to. That will invariably include some that I would love respond to under the assumption of infinite time, but I will do my best, even if the latency stretches into days. Thank you all once again!

[N] Ethan Caballero: Broken Neural Scaling Laws | New Podcast Episode
reddit
LLM Vibe Score0
Human Vibe Score0
evc123This week

[N] Ethan Caballero: Broken Neural Scaling Laws | New Podcast Episode

video: https://www.youtube.com/watch?v=SV87S38M1J4 OUTLINE: 00:00 Introduction 00:50 The "Scale Is All You Need" Movement 01:07 A Functional Form Predicting Every Scaling Behavior 01:40 A Break Between Two Straight Lines On A Log Log Plot 02:32 The Broken Neural Scaling Laws Equation 04:04 Extrapolating A Ton Of Large Scale Vision And Language Tasks 04:49 Upstream And Downstream Have Different Breaks 05:22 Extrapolating Four Digit Addition Performance 06:11 On The Feasability Of Running Enough Training Runs 06:31 Predicting Sharp Left Turns 07:51 Modeling Double Descent 08:41 Forecasting Interpretability And Controllability 09:33 How Deception Might Happen In Practice 10:24 Sinister Stumbles And Treacherous Turns 11:18 Recursive Self Improvement Precedes Sinister Stumbles 11:51 Humans In The Loop For The Very First Deception 12:32 The Hardware Stuff Is Going To Come After The Software Stuff 12:57 Distributing Your Training By Copy-Pasting Yourself Into Different Servers 13:42 Automating The Entire Hardware Pipeline 14:47 Having Text AGI Spit Out New Robotics Design 16:33 The Case For Existential Risk From AI 18:32 Git Re-basin 18:54 Is Chain-Of-Thoughts Enough For Complex Reasoning In LMs? 19:52 Why Diffusion Models Outperform Other Generative Models 21:13 Using Whisper To Train GPT4 22:33 Text To Video Was Only Slightly Impressive 23:29 The e=mc\^2 of AGI transcript: https://theinsideview.ai/ethan2

[D] AI Agents: too early, too expensive, too unreliable
reddit
LLM Vibe Score0
Human Vibe Score1
madredditscientistThis week

[D] AI Agents: too early, too expensive, too unreliable

Reference: Full blog post There has been a lot of hype about the promise of autonomous agent-based LLM workflows. By now, all major LLMs are capable of interacting with external tools and functions, letting the LLM perform sequences of tasks automatically. But reality is proving more challenging than anticipated. The WebArena leaderboard, which benchmarks LLMs agents against real-world tasks, shows that even the best-performing models have a success rate of only 35.8%. Challenges in Practice After seeing many attempts to AI agents, I believe it's too early, too expensive, too slow, too unreliable. It feels like many AI agent startups are waiting for a model breakthrough that will start the race to productize agents. Reliability: As we all know, LLMs are prone to hallucinations and inconsistencies. Chaining multiple AI steps compounds these issues, especially for tasks requiring exact outputs. Performance and costs: GPT-4o, Gemini-1.5, and Claude Opus are working quite well with tool usage/function calling, but they are still slow and expensive, particularly if you need to do loops and automatic retries. Legal concerns: Companies may be held liable for the mistakes of their agents. A recent example is Air Canada being ordered to pay a customer who was misled by the airline's chatbot. User trust: The "black box" nature of AI agents and stories like the above makes it hard for users to understand and trust their outputs. Gaining user trust for sensitive tasks involving payments or personal information will be hard (paying bills, shopping, etc.). Real-World Attempts Several startups are tackling the AI agent space, but most are still experimental or invite-only: adept.ai - $350M funding, but access is still very limited MultiOn - funding unknown, their API-first approach seems promising HypeWrite - $2.8M funding, started with an AI writing assistant and expanded into the agent space minion.ai - created some initial buzz but has gone quiet now, waitlist only Only MultiOn seems to be pursuing the "give it instructions and watch it go" approach, which is more in line with the promise of AI agents. All others are going down the record-and-replay RPA route, which may be necessary for reliability at this stage. Large players are also bringing AI capabilities to desktops and browsers, and it looks like we'll get native AI integrations on a system level: OpenAI announced their Mac desktop app that can interact with the OS screen. At Google I/O, Google demonstrated Gemini automatically processing a shopping return. Microsoft announced Copilot Studio, which will let developers build AI agent bots. Screenshot Screenshot These tech demos are impressive, but we'll see how well these agent capabilities will work when released publicly and tested against real-world scenarios instead of hand-picked demo cases. The Path Forward AI agents overhyped and it's too early. However, the underlying models continue to advance quickly, and we can expect to see more successful real-world applications. Instead of trying to have one large general purpose agent that is hard to control and test, we can use many smaller agents that basically just pick the right strategy for a specific sub-task in our workflows. These "agents" can be thought of as medium-sized LLM prompts with a) context and b) a set of functions available to call. The most promising path forward likely looks like this: Narrowly scoped, well testable automations that use AI as an augmentation tool rather than pursuing full autonomy Human-in-the-loop approaches that keep humans involved for oversight and handling edge cases Setting realistic expectations about current capabilities and limitations By combining tightly constrained agents, good evaluation data, human-in-the-loop oversight, and traditional engineering methods, we can achieve reliably good results for automating medium-complex tasks. Will AI agents automate tedious repetitive work, such as web scraping, form filling, and data entry? Yes, absolutely. Will AI agents autonomously book your vacation without your intervention? Unlikely, at least in the near future.

[D] Is this close enough to be usable? Need your inputs: Automated RAG testing tool. AI Data Pipelines for Real-World Production (Part 3)
reddit
LLM Vibe Score0
Human Vibe Score1
Snoo-bedoooThis week

[D] Is this close enough to be usable? Need your inputs: Automated RAG testing tool. AI Data Pipelines for Real-World Production (Part 3)

Hey there, Redditors! I'm back with the latest installment on creating dependable AI data pipelines for real-world production. If you've been following along, you know I'm on a mission to move beyond the "thin OpenAI wrapper" trend and tackle the challenges of building robust data pipelines. With 18 months of hands-on experience and many user interviews, I realized that with the probabilistic nature of systems, we need better\_testing.gpt: As you build you should test The world of AI is a fast-moving one, and we've realized that just working on systems is not an optimal design choice. By the time your product ships, it might already be using outdated technology. So, what's the lesson here? Embrace change, test along, but be prepared to switch pace. No Best Practices Yet for RAGs In this rapidly evolving landscape, there are no established best practices. You'll need to make educated bets on tools and processes, knowing that things will change. With the RAG testing tool, I tried allowing for testing many potential parameter combinations automatically Testing Frameworks If your generative AI product doesn't have users giving feedback, then you are building in isolation. I used Deepeval to generate test sets, and they will soon support synthetic test set generation Infographics only go so far AI researchers and data scientists, while brilliant, end up in a loop of pursuing Twitter promotional content. New ways are promoted via new content pieces, but ideally, we need something above simple tracing but less than full-fledged analytics. To do this, I stored test outputs in Postgres and created a Superset instance to visualize the results Bridging the Gap between VectorDBs There's a noticeable number of Vector DBs. To ensure smooth product development, we need to be able to switch to best best-performing one, especially since user interviews signal that they might start deteriorating after loading 50 million rows ​ Github repo is here Next steps: I have questions for you: What variables do you change when building RAGs? What is the set of strategies I should add to the solution? (parent-son etc.) How can I improve it in general? Is anyone interested in a leaderboard for best parameter configs? Check out the blog post: Link to part 3 Remember to give this post an upvote if you found it insightful! And also star our Github repo

I tested hundreds of marketing tools in the last three years and these 50 made it to the list. I'll sum up my top 50 marketing tools with one or two sentences + give you pricings.
reddit
LLM Vibe Score0
Human Vibe Score1
SpicyCopyThis week

I tested hundreds of marketing tools in the last three years and these 50 made it to the list. I'll sum up my top 50 marketing tools with one or two sentences + give you pricings.

Hey guys, I'm working in a growth marketing agency. Marketing tools are 30% of what we do, so we use them a lot and experiment with the new ones as much as possible. There are thousands of tools and it's easy to get lost, so I wanted to share the tools we use most on a daily basis. And divide the list into 14 categories. I thought this could be handy for Entrepreneurs subreddit. Why adopt tools? I see marketing tools as tireless colleagues. If you can't hire an employee, choosing the right tool can solve your problems, because they Are super cheap. Work 7/24 for you. Don’t make mistakes. Don’t need management. (or needless management) Help you to automate the majority of your lead gen process. Onwards to the list. (With the pricings post ended up quite long, you can find a link in the end if you want to check the prices) Email marketing tools #1 ActiveCampaign is armed with the most complicated email automation features and has the most intuitive user experience. It feels like you already know how to use it. \#2 Autopilot is visual marketing automation and customer journey tool that helps you acquire, nurture based on behaviors, interest etc. #3 Mailjet: This is the tool we use to send out bulky email campaigns such as newsletters. It doesn't have sexy features like others but does its job for a cheap price. Email address finders #4 Skrapp finds email of your contacts by name and company. It also works with LinkedIn Sales Navigator and can extract thousands of emails in bulk + have a browser add-on. #5 Hunter: Similar to Skrapp but doesn't work with LinkedIn Sales Navigator directly. In addition, there are email templates and you can set up email campaigns. Prospecting and outreach tools #6 Prospect combines the personal emails, follow-up calls, other social touches and helps you create multichannel campaigns.  #7 Reply is a more intuitive version of Prospect. It is easy to learn and use; their UX makes you feel good and sufficient.  CRM tools #8 Salesflare helps you to stop managing your data and start managing your customers. Not yet popular as Hubspot and etc but the best solution for smaller B2B businesses. (we're fans) \#9 Hubspot: The most popular CRM for good reason and has a broader product range you can adopt in your next steps. Try this if you have a bulky list of customers because it is free. #10 Pardot: Pardot is by Salesforce, it's armed with features that can close the gap between marketing and sales. Sales Tools #11 Salesforce is the best sales automation and lead management software. It helps you to create complicated segmentations and run, track, analyze campaigns from the same dashboard. #12 LinkedIn Sales Navigator gives you full access to LinkedIn's user database. You can even find a kidnapped CEO if you know how to use it with other marketing automation tools like Skrapp. #13 Pipedrive is a simple tool and excels in one thing. It tracks your leads and tells you when to take the next action. It makes sales easier. #14 Qwilr creates great-looking docs, at speed. You can design perfect proposals, quotes, client updates, and more in a flash. We use it a lot to close deals, it's effective. #15 Crystalknows is an add-on that tells you anyone’s personality on LinkedIn and gives you a detailed approach specific to that person. It's eerily accurate. #16 Leadfeeder shows you the companies that visited your website. Tells how they found you and what they’re interested in. It has a free version. Communication Tools #17 Intercom is a sweet and smart host that welcomes your visitors when you’re not home. It’s one of the best chatbot tools in the market. #18 Drift is famous for its conversational marketing features and more sales-focused than Intercom. #19 Manychat is a chatbot that helps you create high converting Facebook campaigns. #20 Plann3r helps you create your personalized meeting page. You can schedule meetings witch clients, candidates, and prospects. #21 Loom is a video messaging tool, it helps you to be more expressive and create closer relationships. #22 Callpage collects your visitors’ phone number and connects you with them in seconds. No matter where you are. Landing page tools #23 Instapage is the best overall landing page builder. It has a broad range of features and even squirrel can build a compelling landing page with templates. No coding needed. #24 Unbounce can do everything that Instapage does and lets you build a great landing page without a developer. But it's less intuitive. Lead generation / marketing automation tools #25 Phantombuster is by far the most used lead generation software in our tool kit. It extracts data, emails, sends requests, customized messages, and does many things on autopilot in any platform. You can check this, this and this if you want to see it in action. #26 Duxsoup is a Google Chrome add-on and can also automate some of LinkedIn lead generation efforts like Phantombuster. But not works in the cloud. #27 Zapier is a glue that holds all the lead generation tools together. With Zapier, You can connect different marketing tools and no coding required. Conversion rate optimization tools #28 Hotjar tracks what people are doing on your website by recording sessions and capturing mouse movements. Then it gives you a heatmap. #29 UsabilityHub shows your page to a digital crowd and measures the first impressions and helps you to validate your ideas. #30 Optinmonster is a top tier conversion optimization tool. It helps you to capture leads and enables you to increase conversions rates with many features. #31 Notifia is one mega tool of widgets that arms your website with the wildest social proof and lead capturing tactics. #32 Sumo is a much simpler version of Notifia. But Sumo has everything to help you capture leads and build your email lists. Web scrapers #33 Data Miner is a Google Chrome browser extension that helps you scrape data from web pages and into a CSV file or Excel spreadsheet. #34 Webscraper does the same thing as Data Miner; however, it is capable of handling more complex tasks. SEO and Content #35 Grammarly: Your English could be your first language and your grammar could be better than Shakespeare. Grammarly still can make your writing better. #36 Hemingwayapp is a copywriting optimization tool that gives you feedback about your copy and improves your readability score, makes your writing bolder and punchier. Free. #37 Ahrefs is an all-rounder search engine optimization tool that helps you with off-page, on-page or technical SEO. #38 SurferSEO makes things easier for your on-page SEO efforts. It’s a tool that analyzes top Google results for specific keywords and gives you a content brief based on that data. Video editing and design tools #39 Canva is a graphic design platform that makes everything easy. It has thousands of templates for anything from Facebook ads, stylish presentations to business cards.  #40 Kapwing is our go-to platform for quick video edits. It works on the browser and can help you to create stylish videos, add subtitles, resize videos, create memes, or remove backgrounds. #41 Animoto can turn your photos and video clip into beautiful video slideshows. It comes handy when you want to create an advertising material but don’t have a budget. Advertising tools #42 AdEspresso lets you create and test multiple ads with few clicks. You can optimize your FB, IG, and Google ads from this tool and measure your ads with in-depth analytics. #43 AdRoll is an AI-driven platform that connects and coordinates marketing efforts across ads, email, and online stores. Other tools #44 Replug helps you to shorten, track, optimize your links with call-to-actions, branded links, and retargeting pixels #45 Draw.io = Mindmaps, schemes, and charts. With Draw.io, you can put your brain in a digital paper in an organized way. #46 Built With is a tool that finds out what websites are built with. So you can see what tools they're using and so on. #47 Typeform can turn data collection into an experience with Typeform. This tool helps you to engage your audience with conversational forms or surveys and help you to collect more data. #48 Livestorm helped us a lot, especially in COVID-19 tiles. It’s a webinar software that works on your browser, mobile, and desktop. #49 Teachable \- If you have an online course idea but hesitating because of the production process, Teachable can help you. It's easy to configure and customizable for your needs. #50 Viral Loops provides a revolutionary referral marketing solution for modern marketers. You can create and run referral campaigns in a few clicks with templates. Remember, most of these tools have a free trial or free version. Going over them one by one can teach you a lot and help you grow your business with less work power in the early stages of your business. I hope you enjoyed the read and can find some tools to make things easier! Let me know about your favorite tools in the comments, so I can try them out. \------ If you want to check the prices and see a broader explanation about the tools, you can go here.

tools I use to not have to hire anyone
reddit
LLM Vibe Score0
Human Vibe Score1
Pio_SceThis week

tools I use to not have to hire anyone

I’ve spent unreasonable amount of time with AI tools and here’s curated list of ones I recommend for productivity (honestly, some of them can replace an employee): General assistants ChatGPT \- You probably know it. It’s a great tool for ideating, brainstorming, document summarization and quick question-answer work. There’s a desktop app available so you can quickly pop it up by pressing control + space, which makes it even better for productivity. Claude \- Another chat interface, similar to ChatGPT. It’s a different model provider so the answers and behavior might be different. From my experience, Claude 3.5 Sonnet is performing better than GPT-4o (but not o1) in tasks that focus on reasoning, code writing and copywriting. There’s also a desktop app available. Gemini \- Honestly, I’m not even sure where to put it. It’s Google’s model, one of the most powerful in terms of multimodal capabilities (text, image, audio). And it’s tailored for your Google Workspace. Email, docs, spreadsheets, meets, presentation. Anything. Research Perplexity \- Perplexity is an AI search engine that provides answers to questions with up-to-date information. So, forget Google. Use Perplexity to get answers to questions and dive down the rabbit hole. Exa AI \- Exa is another advanced search engine that combines AI-driven neural search with traditional keyword search. It understands the semantic meaning of queries and documents. And you can also choose what you want to search: academic articles, news, reports, tweets etc. Meetings, calendar and email Granola \- Great AI notepad for meetings. It’s a desktop app, so there’s no bot joining your meetings. It automatically transcribes and enhances meeting notes, helping organize and summarize key takeaways and generates action items, follow-up emails, etc. It also allows you to ask questions about the transcript and get answers. Reclaim \- AI-powered calendar that optimizes for productivity. Essentially, it automates meetings, tracks tasks, and protects deep work time. Cool thing is that it syncs with Google Calendar and Slack. Cora \- Batch processing emails is one of the main productivity tactics. Cora enables that. You only see emails that you need to respond to. And it generates automatic replies for you. All other emails are summarized twice a day. Knowledge summarization Particle News \- Short summaries of the daily news. Pretty straightforward. Notebook LM \- Notebook LM helps process and summarize various types of content, such as PDFs, websites, videos, and more. The cool thing is that it provides insights and connections between topics, cites sources and offers audio summaries. I use it when the content to read is too long and I’m on the go. Napkin \- For creating visuals from text. You can easily generate and customize infographics, diagrams etc. So, if you’re brainstorming, writing or preparing for a presentation, Napkin will work well. Writing and brainstorming Grammarly \- Well known grammar checker. It helps improve writing by focusing on clarity and tone. Sometimes the Grammarly icon popping up is annoying though. Flow \- Flow helps you write and edit notes by speaking. And it integrates across all the apps you use, adapts to your tone and style. Cool tool for just yapping! Automations Gumloop \- Think AI-first Zapier, but 100x more powerful. It's is a platform for automating complex work using AI via a no-code drag and drop interface. It’s very easy to automate work without needing engineers. And they have loads of templates. Wordware \- A platform for building AI agents with natural language. Honestly, for folks who are a bit more technical. You simply prompt LLM to perform a task for you. And you can build any integration you want. If you’re a builder, you can later on connect the agent via API. I strongly believe that technology is leverage. And with AI we can be in top 0.1% of people. If you want bit deeper dive into the topic, I shared that on my substack (available via link in my profile) Any other recommendations for apps I could use? What works if you want to keep the team super lean in early days?

[CASE STUDY] From 217/m to $2,836/m in 9 months - Sold for $59,000; I grow and monetise web traffic of 5, 6, 7 figures USD valued passive income content sites [AMA]
reddit
LLM Vibe Score0
Human Vibe Score1
jamesackerman1234This week

[CASE STUDY] From 217/m to $2,836/m in 9 months - Sold for $59,000; I grow and monetise web traffic of 5, 6, 7 figures USD valued passive income content sites [AMA]

Hello Everyone (VERY LONG CASE STUDY AHEAD) - 355% return in 9 months Note: I own a 7-figures USD valued portfolio of 41+ content sites that generates 5-6 figures USD a month in passive income. This is my first time posting in this sub and my goal is to NOT share generic advice but precise numbers, data and highly refined processes so you can also get started with this business yourself or if you already have an existing business, drive huge traffic to it and scale it substantially (get more customers). I will use a case study to explain the whole process. As most of us are entrepreneurs here, explaining an actual project would be more meaningful. In this case study I used AI assisted content to grow an existing site from $217/m to $2,836/m in 9 months (NO BACKLINKS) and sold it for $59,000. ROI of 3 months: 355% Previous case studies (before I give an overview of the model) Amazon Affiliate Content Site: $371/m to $19,263/m in 14 MONTHS - $900K CASE STUDY \[AMA\] Affiliate Website from $267/m to $21,853/m in 19 months (CASE STUDY - Amazon?) \[AMA\] Amazon Affiliate Website from $0 to $7,786/month in 11 months Amazon Affiliate Site from $118/m to $3,103/m in 8 MONTHS (SOLD it for $62,000+) Note: You can check pinned posts on my profile. Do go through the comments as well as a lot of questions are answered in those. However, if you still have any questions, feel free to reach out. This is an \[AMA\]. Quick Overview of the Model Approach: High traffic, niche specific, informative content websites that monetise its traffic through highly automated methods like display ads and affiliate. The same model can be applied to existing businesses to drive traffic and get customers. Main idea: Make passive income in a highly automated way Easy to understand analogy You have real estate (here you have digital asset like a website) You get rental income (here you get ads and affiliate income with no physical hassle, in case you have a business like service, product etc. then you can get customers for that too but if not, it's alright) Real estate has value (this digital asset also has value that can be appreciated with less effort) Real estate can be sold (this can be sold too but faster) IMPORTANT NOTE: Search traffic is the BEST way to reach HUGE target audience and it's important when it comes to scaling. This essentially means that you can either monetise that via affiliate, display etc. or if you have a business then you can reach a bigger audience to scale. Overview of this website's valuation (then and now: Oct. 2022 and June 2023) Oct 2022: $217/m Valuation: $5,750.5 (26.5x) - set it the same as the multiple it was sold for June 2023: $2,836/m Traffic and revenue trend: growing fast Last 3 months avg: $2,223 Valuation now: $59,000 (26.5x) Description: The domain was registered in 2016, it grew and then the project was left unattended. I decided to grow it again using properly planned AI assisted content. Backlink profile: 500+ Referring domains (Ahrefs). Backlinks mean the sites linking back to you. This is important when it comes to ranking. Summary of Results of This Website - Before and After Note: If the terms seem technical, do not worry. I will explain them in detail later. Still if you have any questions. Feel free to comment or reach out. |Metric|Oct 2022|June 2023|Difference|Comments| |:-|:-|:-|:-|:-| |Articles|314|804|\+490|AI Assisted content published in 3 months| |Traffic|9,394|31,972|\+22,578|Organic| |Revenue|$217|$2,836|\+$2,619|Multiple sources| |RPM (revenue/1000 web traffic)|23.09|$88.7|\+$65.61|Result of Conversion rate optimisation (CRO). You make changes to the site for better conversions| |EEAT (expertise, experience, authority and trust of website)|2 main authors|8 authors|6|Tables, video ads and 11 other fixations| |CRO|Nothing|Tables, video ads |Tables, video ads and 11 other fixations || ​ Month by Month Growth |Month|Revenue|Steps| |:-|:-|:-| |Sept 2022|NA|Content Plan| |Oct 2022|$217|Content Production| |Nov 2022|$243|Content production + EEAT authors| |Dec 2022|$320|Content production + EEAT authors| |Jan 2023|$400|Monitoring| |Feb 2023|$223|Content production + EEAT authors| |Mar 2023|$2,128|CRO & Fixations| |April 2023|$1,609|CRO & Fixations| |May 2023|$2,223|Content production + EEAT authors| |June 2023|$2,836|CRO and Fixations| |Total|$10,199|| ​ What will I share Content plan and Website structure Content Writing Content Uploading, formatting and onsite SEO Faster indexing Conversion rate optimisation Guest Posting EEAT (Experience, Expertise, Authority, Trust) Costing ROI The plans moving forward with these sites ​ Website Structure and Content Plan This is probably the most important important part of the whole process. The team spends around a month just to get this right. It's like defining the direction of the project. Description: Complete blueprint of the site's structure in terms of organisation of categories, subcategories and sorting of articles in each one of them. It also includes the essential pages. The sorted articles target main keyword, relevant entities and similar keywords. This has to be highly data driven and we look at over 100 variables just to get it right. It's like beating Google's algorithm to ensure you have a blueprint for a site that will rank. It needs to be done right. If there is a mistake, then even if you do everything right - it's not going to work out and after 8-16 months you will realise that everything went to waste. Process For this project, we had a niche selected already so we didn't need to do a lot of research pertaining to that. We also knew the topic since the website was already getting good traffic on that. We just validated from Ahrefs, SEMRUSH and manual analysis if it would be worth it to move forward with that topic. ​ Find entities related to the topic: We used Ahrefs and InLinks to get an idea about the related entities (topics) to create a proper topical relevance. In order to be certain and have a better idea, we used ChatGPT to find relevant entities as well \> Ahrefs (tool): Enter main keyword in keywords explorer. Check the left pain for popular topics \> Inlinks (tool): Enter the main keyword, check the entity maps \> ChatGPT (tool): Ask it to list down the most important and relevant entities in order of their priority Based on this info, you can map out the most relevant topics that are semantically associated to your main topic Sorting the entities in topics (categories) and subtopics (subcategories): Based on the information above, cluster them properly. The most relevant ones must be grouped together. Each group must be sorted into its relevant category. \> Example: Site about cycling. \> Categories/entities: bicycles, gear and equipment, techniques, safety, routes etc. \> The subcategories/subentities for let's say "techniques" would be: Bike handling, pedaling, drafting etc. Extract keywords for each subcategory/subentity: You can do this using Ahrefs or Semrush. Each keyword would be an article. Ensure that you target the similar keywords in one article. For example: how to ride a bicycle and how can I ride a bicycle will be targeted by one article. Make the more important keyword in terms of volume and difficulty as the main keyword and the other one(s) as secondary Define main focus vs secondary focus: Out of all these categories/entities - there will be one that you would want to dominate in every way. So, focus on just that in the start. This will be your main focus. Try to answer ALL the questions pertaining to that. You can extract the questions using Ahrefs. \> Ahrefs > keywords explorer \> enter keyword \> Questions \> Download the list and cluster the similar ones. This will populate your main focus category/entity and will drive most of the traffic. Now, you need to write in other categories/subentities as well. This is not just important, but crucial to complete the topical map loop. In simple words, if you do this Google sees you as a comprehensive source on the topic - otherwise, it ignores you and you don't get ranked Define the URLs End result: List of all the entities and sub-entities about the main site topic in the form of categories and subcategories respectively. A complete list of ALL the questions about the main focus and at around 10 questions for each one of the subcategories/subentities that are the secondary focus Content Writing So, now that there's a plan. Content needs to be produced. Pick out a keyword (which is going to be a question) and... Answer the question Write about 5 relevant entities Answer 10 relevant questions Write a conclusion Keep the format the same for all the articles. Content Uploading, formatting and onsite SEO Ensure the following is taken care of: H1 Permalink H2s H3s Lists Tables Meta description Socials description Featured image 2 images in text \\Schema Relevant YouTube video (if there is) Note: There are other pointers link internal linking in a semantically relevant way but this should be good to start with. Faster Indexing Indexing means Google has read your page. Ranking only after this step has been done. Otherwise, you can't rank if Google hasn't read the page. Naturally, this is a slow process. But, we expedite it in multiple ways. You can use RankMath to quickly index the content. Since, there are a lot of bulk pages you need a reliable method. Now, this method isn't perfect. But, it's better than most. Use Google Indexing API and developers tools to get indexed. Rank Math plugin is used. I don't want to bore you and write the process here. But, a simple Google search can help you set everything up. Additionally, whenever you post something - there will be an option to INDEX NOW. Just press that and it would be indexed quite fast. Conversion rate optimisation Once you get traffic, try adding tables right after the introduction of an article. These tables would feature a relevant product on Amazon. This step alone increased our earnings significantly. Even though the content is informational and NOT review. This still worked like a charm. Try checking out the top pages every single day in Google analytics and add the table to each one of them. Moreover, we used EZOIC video ads as well. That increased the RPM significantly as well. Both of these steps are highly recommended. Overall, we implemented over 11 fixations but these two contribute the most towards increasing the RPM so I would suggest you stick to these two in the start. Guest Posting We made additional income by selling links on the site as well. However, we were VERY careful about who we offered a backlink to. We didn't entertain any objectionable links. Moreover, we didn't actively reach out to anyone. We had a professional email clearly stated on the website and a particularly designated page for "editorial guidelines" A lot of people reached out to us because of that. As a matter of fact, the guy who bought the website is in the link selling business and plans to use the site primarily for selling links. According to him, he can easily make $4000+ from that alone. Just by replying to the prospects who reached out to us. We didn't allow a lot of people to be published on the site due to strict quality control. However, the new owner is willing to be lenient and cash it out. EEAT (Experience, Expertise, Authority, Trust) This is an important ranking factor. You need to prove on the site that your site has authors that are experienced, have expertise, authority and trust. A lot of people were reaching out to publish on our site and among them were a few established authors as well. We let them publish on our site for free, added them on our official team, connected their socials and shared them on all our socials. In return, we wanted them to write 3 articles each for us and share everything on all the social profiles. You can refer to the tables I shared above to check out the months it was implemented. We added a total of 6 writers (credible authors). Their articles were featured on the homepage and so were their profiles. Costing Well, we already had the site and the backlinks on it. Referring domains (backlinks) were already 500+. We just needed to focus on smart content and content. Here is the summary of the costs involved. Articles: 490 Avg word count per article: 1500 Total words: 735,000 (approximately) Cost per word: 2 cents (includes research, entities, production, quality assurance, uploading, formatting, adding images, featured image, alt texts, onsite SEO, publishing/scheduling etc.) Total: $14,700 ROI (Return on investment) Earning: Oct 22 - June 23 Earnings: $10,199 Sold for: $59,000 Total: $69,199 Expenses: Content: $14,700 Misc (hosting and others): $500 Total: $15,200 ROI over a 9 months period: 355.25% The plans moving forward This website was a part of a research and development experiment we did. With AI, we wanted to test new waters and transition more towards automation. Ideally, we want to use ChatGPT or some other API to produce these articles and bulk publish on the site. The costs with this approach are going to be much lower and the ROI is much more impressive. It's not the the 7-figures projects I created earlier (as you may have checked the older case studies on my profile), but it's highly scalable. We plan to refine this model even further, test more and automate everything completely to bring down our costs significantly. Once we have a model, we are going to scale it to 100s of sites. The process of my existing 7-figures websites portfolio was quite similar. I tested out a few sites, refined the model and scaled it to over 41 sites. Now, the fundamentals are the same however, we are using AI in a smarter way to do the same but at a lower cost, with a smaller team and much better returns. The best thing in my opinion is to run numerous experiments now. Our experimentation was slowed down a lot in the past since we couldn't write using AI but now it's much faster. The costs are 3-6 times lower so when it used to take $50-100k to start, grow and sell a site. Now you can pump 3-6 more sites for the same budget. This is a good news for existing business owners as well who want to grow their brand. Anyway, I am excited to see the results of more sites. In the meantime, if you have any questions - feel free to let me know. Best of luck for everything. Feel free to ask questions. I'd be happy to help. This is an AMA.

[Ultimate List] A list of Marketing Tools That I’ve tested over the years and found helpful to do better marketing with less work. More than 50 Tools To Help you with Marketing, Copywriting & Sales!
reddit
LLM Vibe Score0
Human Vibe Score0.973
lazymentorsThis week

[Ultimate List] A list of Marketing Tools That I’ve tested over the years and found helpful to do better marketing with less work. More than 50 Tools To Help you with Marketing, Copywriting & Sales!

Starting to focus on marketing for your business, You will come across the same tools mentioned over and over by marketers. I would like to mention here tools that you might haven’t seen going viral in the community but actually will help you grow faster and efficiently. Starting off with My favourite Marketing Channel! #Email Marketing For SMBs Convertkit / Mailerlite / Mailchimp - These 3 Platforms are the best options for SMBs and entrepreneurs just starting out with email marketing. All 3 have free plans up to 1,000 subscribers. Scribe - Email Signature Tool, Create Great Email signatures for your emails. Liramail - Most Email marketing platforms don’t offer great email templates. This tool will help you build great email templates with drag and drop. Quick mail Auto-Warmer - Most Businesses at the beginning don’t know what to do when open rate drops. You need to use an email warmer like this to keep it up. #Email Marketing For Big Businesses SendGrid - Overall Email Marketing Tools, this tool is best for brands that have huge email lists and email marketing is the key marketing channel. Braze - This tool is leading in email marketing for large Email senders. When I was working for agencies, this was one of the best email marketing tools I had used. NeoCertified - Protect your emails for spammers and threats. To keep your email list healthy, this is a must have! Sparkloop - Referral Marketing For Email Campaigns. Email can generate great huge amount of referrals for you and Sparkloop makes it easier. #Cold Emails & Lead Generation Hunter - A Great Tool to scrape emails from domain names. The tool comes with a green free plan but Pro plan is worth the amount of features it provides. Icyleads - It’s better than Hunter as it’s heavily focused on the sales and prospecting to help you derive great results from your campaigns. Mailshake - Beginner Friend Cold Email Tool with Great features like email list warming. #Communication Tools Twilio - One do the best customer engagement platform used by Companies like Stripe and mine too. Chatlio - Use Live chat feature on your website with slack integration. My favourite easier to catch up on conversations through slack integration. Intercom - Used by Most Marketers, Industry Leading customer communication platform. Great for beginners! Chatwoot - Another Amazing Communication Tool but the best part is they have a great free plan useful for new businesses. Loom - Communicate with your audience through Videos. Loom is great for SaaS and to show human interaction to close new visitors effectively. #CRM Outseta - This tool provides great CRM and their billing system is better than other tools out their which makes it stands out! Hubspot - I don’t think this tool needs an introduction because Hubspot’s CRM is the best in industry. Salesflare - This CRM is a great alternative to hubspot as it’s beginner friendly and helpful for SMBs. #SEO Tools Ahrefs - One of the best SEO tool in the industry. They also just launched a bunch of free tools to help SEO beginners. Screaming frog - The only website crawler I have used since I bought my first domain. It’s the best! Ubersuggest- The Tool by Neil Patel is the best SEO tool for you. (I’m Joking, it’s the worst) Contentking - This tool is good at Real-time SEO Auditing, they do a lot of Marketing work through Newsletters. If you are subscribed to any SEO newsletter. You may have seen this tool. SEOquake & Semrush - SEOquake is a great tool to conduct on-page analysis, SERP, and much more. Great tool but it’s owned by Semrush. You should go for Semrush because that tool will cover all SEO aspects for you. #Content Marketing Buzzsumo - This tool is great for content research and but you may find the regular emails pretty annoying sometimes. Contentrow - Analyse Your Content and find it’s strength. Highly recommended who are weak at content structuring like me. Grammarly - If you are not a native English speaker like me, you might think you need it or not. You need it for sure for grammar corrections. #Graphic Design Tools Visme - At agencies, Infographics can be more effective than usual postscript. Visme is a graphic design tool focused on infographics and designs related to B2B and B2C. It’s great for agencies! Glorify - A Graphic Design Tool focused on E-commerce, filled with Designs useful for E-commerce store owners. Canva - All-in-one Industry leading Graphic Design Tool that everyone knows and every template is overused now. Adobe Creative Cloud ( previously Sparkpost) - It’s a great alternative to Canva filled with Amazing Stock images to use in your visuals but the only backlash is the exports in this tool are not high quality. Snaps - A Canva Alternative that might not have overused templates for your Social Accounts. #Advertising Tools Plai - It’s a great PPC tool to create Ads for Instagram and Tiktok. Wordstream - It’s an industry leading PPC Tool, great for Ad Grading and auditing. AdEspresso - This Is a tool by Hootsuite. They have a lot of Data sourced at the backend, which helps in Ad optimisation through this tool. That’s the reason I recommend this tool. #Video Editing Tools Veed Studio - I have been using Veed from last year. It’s one of the best Video Marketing Tool Optimized for Instagram & Tiktok. Synthesia - It’s a new AI video generation platform. From last few months, if you have seen marketing agencies including Videos in Emails. The chances are that’s not a Agency member taking but AI generated Human. Motionbox - It’s also a great video editing tool focused on video editing for Digital Marketers. Jitter Video - It’s a great motion design tool. Comes with great templates, the only place where other tools I mentioned lacks. It’s great and beginner friendly. #Copywriting Jasper AI - Google’s John Mueller says AI generated content is banned on Search but I think with Jasper AI you can generate SEO optimised Content but you have to put in some efforts like at least give 30 minutes for editing the Copy by yourself. Copy AI - Another AI tool to help you write better copy. This one is more focused on helping you write copy suitable for Ads and Social media campaigns. Hemingway App - To help you write more clearly and Bold. This tool is better than Grammarly if you look for writing perspective and it’s free. #Social Media Management App I’ve used a Lot of SMM Tools and that’s why going to mention all of them with a short review. Sprout social - The Best with deep insights coverage. Hootsuite - Great Scheduling tool just under sprout social. Later - Heavily Focused on Instagram from beginning and Now Tiktok too. SkedSocial - It’s like a Later alternative with great addition features like link-in-bio. Facebook’s Business Manager- Great but sometimes bugs can make a huge issue for you and customer support is like dead. Tweet Hunter & Hypefury- Both are Twitter Scheduling tools growing very fast on platform and are great for growth. Buffer - It’s a great tool but I haven’t seen any new updates to help with management. Zoho Social - It’s a great SMM tool and if you use other marketing solutions from Zoho. It’s a must have! #Market Research Tool • SparkToro - That’s the only one I have ever used. It’s great for audience research and comes with great customer service. Founded by Rand Fishkin, it’s one of the best research tool. #Influencer Marketing & UGC InfluenceGrid - A free search engine To find Tiktok & Instagram Influencers for your campaigns. Tiktok Creative Center- TikTok’s in-built tool called “Creative Center” is the best to find content trends, audience demographics and much more. Archive - Find Instagram Stories and Posts mentioning Your brands and use them as Ads for your business Marketing. #Landing Page Builders Leadpages - Its a great landing page builder because the integration and drag-and-drop features makes it easier to work with! Cardd co - A Great Landing page builder with easy step up but it lacks the copywriting and tracking features. Instapage - It’s one of the best out and I think the overall product is effective enough to help you stand out with your landing page. Unbounce - It’s a great alternative to Instapage due its well polished landing page templates that might be helpful for you. #Community Building Mighty Networks - A Great Community building platform, and you can also sell courses within the platform. Circle so - A great alternative to Mighty networks focused on Communities specifically. We are currently using for small community Of ours. #Sales Tools Drift - You can get much more out of Drift than just sales tools but The Sales solutions provided in Drift are one of the best. Salesforce - It’s the industry Sales solution provider. A go-to and have various pricing plans making it suitable for majority of SMBs. #Social Proof Tools People don’t have enough time to search across internet to decide to trust you after seeing your Ad first time. That’s what you might be facing too. Here are two tools I absolutely love for social proof! Use Proof - Show Recent Activities occurring on your website and build the trust of your visitors. Testimonial to - Gather Testimonials across Social Media platforms related to your business with this tool. Capture tweets and comments mentioning your brands and mention them. #Analytics Tools Plausible Analytics- A privacy friendly Analytics alternative to Google Analytics if you hate Analytics 4 like me. Mixpanel - Product Analytics and funnel reports better than Google Analytics. #Reddit Marketing Gummysearch- This tool will help To find your target audience on Reddit and interact with them with its help and close your new customers. Howitzer- It’s another pretty similar tool to Gummysearch focused on Reddit cold outreach to get clients and new customers. Both are great but Gummysearch provides better customer support while Howtizer is helpful on a large scale Reddit Marketing. #Text Marketing Klaviyo - It’s an email + SMS marketing tool, it’s taking up space in marketing industry very quickly as an industry leader due to its great integrations but you need to learn the platform usage to maximise the outcome. Cartloop - This tool provides great text marketing solutions with integration with Spotify and other e-commerce marketing tools. Attentive Mobile - This is my favourite Text marketing tool due to the interactive dashboard + they have a library of Text marketing examples to help you out with your campaigns. #Other Tools I have used throughout my journey! Triple Whale - It’s a great E-commerce marketing tools with Triple pixel to help you track your campaigns more efficiently. Fastory - To create well optimized Instagram & Tiktok Stories for your business. Jotform - Online Form Builder with integrations with leading marketing tools. Gated - As an entrepreneur and marketer, you may receive a bunch of unwanted emails. Use Gated to get rid of them and receive useful mails only! ClickUp- The main Tool for Project Management, one of the best and highly recommended. Riverside - Forget Zoom or Google Meet, For your Podcast Interviews and Marketing conferences. You need riverside with great video quality and recording features. Manychat- Automate your Instagram DMs and interact with your followers more efficiently + sell out your products/ services when you are offline. Calendy - To schedule meetings with your ideal clients. ServiceProviderPro - It’s a client portal for SEO & Growing Agencies, very helpful in scaling agencies. SendCheckit - Compare your Email Subject Lines with 100,000+ others in the database for free. Otter AI - Using AI track your meetings more effectively, you can easily edit, annotate and share notes from the meetings. Ryte - Optimise your website User experience with this tool focused on UX aspects + SEO too. PhantomBuster - Scrape LinkedIn Profile and Data from Facebook/LinkedIn groups. I clearly love this tool! #Honourable Mentions Zapier - The Only tool you need to integrate your favourite tool with a new effective tool. Elementor - That’s what I use for web design and it’s great! Marketer Hire - To hire world class marketers to work with you. InShot & Capcut - I create Instagram Reels and TikTok’s and life without these tools isn’t possible. Nira - It’s a great tool to Manage your workspace and this tool has launched many marketing templates in-built helpful for marketers and also entrepreneurs. X - The tool you love that wasn’t mentioned here is valuable and I honour that tool and share that if you would like to! I mean thanks for reading what I have curated all over my life as a marketer. I share 5 Marketing Tools, 5 Marketing Resources and 1 Free Resourceevery week in my newsletter, you can subscribe here to receive that for free. Also, You can read an expanded list of email marketing tools in this Reddit post!

Building and launching an AI-powered Product Strategy tool, or; a story of nights and weekends
reddit
LLM Vibe Score0
Human Vibe Score1
_raZeThis week

Building and launching an AI-powered Product Strategy tool, or; a story of nights and weekends

Speaking to peers in the software development sphere I learned of one constant that we had all personally experienced throughout our careers: a bloated product development process that feels like work for the sake of work, centred around the highest-paid person's opinion instead of its customers. We didn't like how current tools assume AI will provide the perfect answer on the first run. Instead, we wanted a tool that allows for manual refining and editing AI suggestions, keeping all previous ideas in context. This way, we can develop a solution step by step, instead of trying to get it perfect on the first try. An approach more similar to how you'd typically approach product discovery as a human. AI is then used to help save time and reduce admin, instead of replace the expert So, we got together and asked over 100 Product Managers questions about it, brought all that feedback goodness together, and started building Squad. We think we've created something really cool and hope you think so too. The ELI5 on what Squad does: 1) Creates alignment that empowers bottom up software development whilst keeping executive in the loop 2) Increases confidence that what you're building is what people actually want - data driven by default 2) Speeds up the time from idea --> execution by ideating with you on an experimentation approach 3) Helps gives PMs time back to focus on strategy (currently stats show they spend 75% of their time on admin, 25% on strategy) The team hustled hard on this as a passion project while working day jobs, and today have launched on Product Hunt. Check it out and see if the mission resonates with you, we'd appreciate the love! https://www.producthunt.com/posts/squad-8b75e29c-d767-4a8f-a60a-fd162e141a72 ​

Started a content marketing agency 8 years ago - $0 to $7,863,052 (2025 update)
reddit
LLM Vibe Score0
Human Vibe Score0.882
mr_t_forhireThis week

Started a content marketing agency 8 years ago - $0 to $7,863,052 (2025 update)

Hey friends, My name is Tyler and for the past 8 years, I’ve been documenting my experience building a content marketing agency called Optimist. Year 1 — 0 to $500k ARR Year 2 — $500k to $1MM ARR Year 3 — $1MM ARR to $1.5MM(ish) ARR Year 4 — $3,333,686 Revenue Year 5 — $4,539,659 Revenue Year 6 — $5,974,324 Revenue Year 7 - $6,815,503 Revenue (Edit: Seems like links are banned now. You can check my post history for all of my previous updates with lessons and learnings.) How Optimist Works First, an overview/recap of the Optimist business model: We operate as a “collective” of full time/professional freelancers Everyone aside from me is a contractor Entirely remote/distributed team We pay freelancers a flat fee for most work, working out to roughly $65-100/hour. Clients pay us a flat monthly fee for full-service content marketing (research, strategy, writing, editing, design/photography, reporting and analytics, targeted linkbuilding, and more)\ Packages range in price from \~$10-20k/mo \This is something we are revisiting now* The Financials In 2024, we posted $1,032,035.34 in revenue. This brings our lifetime revenue to $7,863,052. Here’s our monthly revenue from January 2017 to December of 2024. (Edit: Seems like I'm not allowed to link to the chart.) The good news: Revenue is up 23% YoY. EBITDA in Q4 trending up 1-2 points. We hosted our first retreat in 4 years, going to Ireland with about half the team. The bad news: Our revenue is still historically low. At $1MM for the year, we’re down about 33% from our previous years over $1.5MM. Revenue has been rocky. It doesn’t feel like we’ve really “recovered” from the bumps last year. The trend doesn’t really look great. Even though, anecdotally, it feels like we are moving in a good direction. EBITDA is still hovering at around 7%. Would love to get that closer to 20%. (For those who may ask: I’m calculating EBITDA after paying taxes and W2 portion of my income.) — Almost every year, my update starts the same way: This has been a year of growth and change. Both for my business—and me personally. 2024 was no different. I guess that tells you something about entrepreneurship. It’s a lot more like sailing a ship than driving a car. You’re constantly adapting, tides are shifting, and any blip of calm is usually just a moment before the next storm. As with past years, there’s a lot to unpack from the last 12 months. Here we go again. Everything is Burning In the last 2 years, everything has turned upside down in the world of content and SEO. Back in 2020, we made a big decision to re-position the agency. (See post history) We decided to narrow our focus to our most successful, profitable, and consistent segment of clients and re-work our entire operation to focus on serving them. We defined our ICP as: \~Series A ($10mm+ funding) with 6-12 months runway to scale organic as a channel Product-led company with “simple” sales cycle involving fewer stakeholders Demonstrable opportunity to use SEO to drive business growth Our services: Content focused on growing organic search (SEO) Full-service engagements that included research, planning, writing, design, reporting And our engagement structure: Engaged directly with an executive; ownership over strategy and day-to-day execution 1-2 points of contact or stakeholders Strategic partner that drives business growth (not a service vendor who makes content) Most importantly, we decided that we were no longer going to offer a broader range of content that we used to sell. That included everything from thought leadership content to case studies and ebooks. We doubled-down on “SEO content” for product-led SaaS companies. And this worked phenomenally for us. We started bringing on more clients than ever. We developed a lot of internal system and processes that helped us scale and take on more work than we’ve ever had and drive great outcomes for our ideal clients. But in 2023 and 2024, things started going awry. One big change, of course, was the rise of AI. Many companies and executives (and writers) feel that AI can write content just as well as an agency like ours. That made it a lot harder to sell a $10,000 per month engagement when they feel like the bulk of the work could be “done for free.” (Lots of thoughts on this if you want my opinions.) But it wasn’t just that. Google also started tinkering with their algorithm, introducing new features like AI Overviews, and generally changing the rules of the game. This created 3 big shifts in our world: The perceived value of content (especially “SEO content”) dropped dramatically in many people’s minds because of AI’s writing capabilities SEO became less predictable as a source of traffic and revenue It’s harder than ever for startups and smaller companies to rank for valuable keywords (let alone generate any meaningful traffic or revenue from them) The effect? The middle of the content market has hollowed out. People—like us—providing good, human-crafted content aimed on driving SEO growth saw a dramatic decline in demand. We felt it all year. Fewer and fewer leads. The leads we did see usually scoffed at our prices. They were indexing us against the cost of content mills and mass-produced AI articles. It was a time of soul-searching and looking for a way forward. I spent the first half of the year convinced that the only way to survive was to run toward the fire. We have to build our own AI workflows. We have to cut our rates internally. We have to get faster and cheaper to stay competitive with the agencies offering the same number of deliverables for a fraction of our rates. It’s the only way forward. But then I asked myself a question… Is this the game I actually want to play? As an entrepreneur, do I want to run a business where I’m competing mostly on price and efficiency rather than quality and value? Do I want to hop into a race toward cheaper and cheaper content? Do I want to help people chase a dwindling amount of organic traffic that’s shrinking in value? No. That’s not the game I want to play. That’s not a business I want to run. I don’t want to be in the content mill business. So I decided to turn the wheel—again. Repositioning Part II: Electric Boogaloo What do you do when the whole world shifts around you and the things that used to work aren’t working anymore? You pivot. You re-position the business and move in another direction. So that’s what we decided to do. Again. There was only one problem: I honestly wasn’t sure what opportunities existed in the content marketing industry outside of what we were already doing. We lived in a little echo chamber of startups and SEO. It felt like the whole market was on fire and I had fight through the smoke to find an escape hatch. So I started making calls. Good ol’ fashioned market research. I reached out to a few dozen marketing and content leaders at a bunch of different companies. I got on the phone and just asked lots of questions about their content programs, their goals, and their pain points. I wanted to understand what was happening in the market and how we could be valuable. And, luckily, this process really paid off. I learned a lot about the fragmentation happening across content and how views were shifting. I noticed key trends and how our old target market really wasn’t buying what we were selling. Startups and small companies are no longer willing to invest in an agency like ours. If they were doing content and SEO at all, they were focused entirely on using AI to scale output and minimize costs. VC money is still scarce and venture-backed companies are more focused on profitability than pure growth and raising another round. Larger companies (\~500+ employees) are doing more content than ever and drowning in content production. They want to focus on strategy but can barely tread water keeping up with content requests from sales, demand gen, the CEO, and everyone else. Many of the companies still investing in content are looking at channels and formats outside of SEO. Things like thought leadership, data reports, interview-driven content, and more. They see it as a way to stand out from the crowd of “bland SEO content.” Content needs are constantly in flux. They range from data reports and blog posts to product one-pagers. The idea of a fixed-scope retainer is a total mismatch for the needs of most companies. All of this led to the logical conclusion: We were talking to the wrong people about the wrong things\.\ Many companies came to one of two logical conclusions: SEO is a risky bet, so it’s gotta be a moonshot—super-low cost with a possibility for a big upside (i.e., use AI to crank out lots of content. If it works, great. If it doesn’t, then at least we aren’t out much money.) SEO is a risky bet, so we should diversify into other strategies and channels to drive growth (i.e., shift our budget from SEO and keyword-focused content to video, podcasts, thought leadership, social, etc) Unless we were going to lean into AI and dramatically cut our costs and rates, our old buyers weren’t interested. And the segment of the market that needs our help most are looking primarily for production support across a big range of content types. They’re not looking for a team to run a full-blown program focused entirely on SEO. So we had to go back to the drawing board. I’ve written before about our basic approach to repositioning the business. But, ultimately it comes down to identifying our unique strengths as a team and then connecting them to needs in the market. After reviewing the insights from my discussions and taking another hard look at our business and our strengths, I decided on a new direction: Move upmarket: Serve mid-size to enterprise businesses with \~500-5,000 employees instead of startups Focus on content that supports a broader range of business goals instead of solely on SEO and organic growth (e.g., sales, demand gen, brand, etc) Shift back to our broader playbook of content deliverables, including thought leadership, data studies, and more Focus on content execution and production to support an internally-directed content strategy across multiple functions In a way, it’s sort of a reverse-niche move. Rather than zooming in specifically on driving organic growth for startups, we want to be more of an end-to-end content production partner that solves issues of execution and operations for all kinds of content teams. It’s early days, but the response here has been promising. We’ve seen an uptick in leads through Q4. And more companies in our pipeline fit the new ICP. They’re bigger, often have more budget. (But they move more slowly). We should know by the end of the quarter if this maneuver is truly paying off. Hopefully, this will work out. Hopefully our research and strategy are right and we’ll find a soft landing serving a different type of client. If it doesn’t? Then it will be time to make some harder decisions. As I already mentioned, I’m not interested in the race to the bottom of AI content. And if that’s the only game left in town, then it might be time to think hard about a much bigger change. — To be done: Build new content playbooks for expanded deliverables Build new showcase page for expanded deliverables Retooling the Operation It’s easy to say we’re doing something new. It’s a lot harder to actually do it—and do it well. Beyond just changing our positioning, we have to do open-heart surgery on the entire content operation behind the scenes. We need to create new systems that work for a broader range of content types, formats, and goals. Here’s the first rub: All of our workflows are tooled specifically for SEO-focused content. Every template, worksheet, and process that we’ve built and scaled in the last 5 years assumes that the primary goal of every piece of content is SEO. Even something as simple as requiring a target keyword is a blocker in a world where we’re not entirely focused on SEO. This is relatively easy to fix, but it requires several key changes: Update content calendars to make keywords optional Update workflows to determine whether we need an optimization report for each deliverable Next, we need to break down the deliverables into parts rather than a single line item. In our old system, we would plan content as a single row in a Content Calendar spreadsheet. It was a really wide sheet with lots of fields where we’d define the dimensions of each individual article. This was very efficient and simple to follow. But every article had the same overall scope when it came to the workflow. In Asana (our project management tool), all of the steps in the creation were strung together in a single task. We would create a few basic templates for each client, and then each piece would flow through the same steps: Briefing Writing Editing Design etc. If we had anything that didn’t fit into the “standard” workflow, we’d just tag it in the calendar with an unofficial notation \[USING BRACKETS\]. It worked. But it wasn’t ideal. Now we need the steps to be more modular. Imagine, for example, a client asks us to create a mix of deliverables: 1 article with writing + design 1 content brief 1 long-form ebook with an interview + writing + design Each of these would require its own steps and its own workflow. We need to break down the work to accommodate for a wider variety of workflows and variables. This means we need to update the fields and structure of our calendar to accommodate for the new dimensions—while also keeping the planning process simple and manageable. This leads to the next challenge: The number of “products” that we’re offering could be almost infinite. Just looking at the example scope above, you can mix and match all of these different building blocks to create a huge variety of different types of work, each requiring its own workflow. This is part of the reason we pivoted away from this model to focus on a productized, SEO-focused content service back in 2020. Take something as simple as a case study. On the surface, it seems like one deliverable that can be easily scoped and priced, right? Well, unpack what goes into a case study: Is there already source material from the customer or do we need to conduct an interview? How long is it? Is it a short overview case study or a long-form narrative? Does it need images and graphics? How many? Each of these variables opens up 2-3 possibilities. And when you combine them, we end up with something like 10 possible permutations for this single type of deliverable. It gets a bit messy. But not only do we have to figure out how to scope and price all for all of these variables, we also have to figure out how to account for these variables in the execution. We have to specify—for every deliverable—what type it is, how long, which steps are involved and not involved, the timeline for delivery, and all of the other factors. We’re approaching infinite complexity, here. We have to figure out a system that allows for a high level of flexibility to serve the diverse needs of our clients but is also productized enough that we can build workflows, process, and templates to deliver the work. I’ve spent the last few months designing that system. Failed Attempt #1: Ultra-Productization In my first pass, I tried to make it as straight forward as possible. Just sit down, make a list of all of the possible deliverables we could provide and then assign them specific scopes and services. Want a case study? Okay that’ll include an interview, up to 2,000 words of content, and 5 custom graphics. It costs $X. But this solution quickly fell apart when we started testing it against real-world scenarios. What if the client provided the brief instead of us creating one? What if they didn’t want graphics? What if this particular case study really needs to be 3,000 words but all of the others should be 2,000? In order for this system to work, we’d need to individual scope and price all of these permutations of each productized service. Then we’d need to somehow keep track of all of these and make sure that we accurately scope, price, and deliver them across dozens of clients. It’s sort of like a restaurant handling food allergies by creating separate versions of every single dish to account for every individual type of allergy. Most restaurants have figured out that it makes way more sense to have a “standard” and an “allergy-free” version. Then you only need 2 options to cover 100% of the cases. Onto the next option. Failed Attempt #2: Deliverable-Agnostic Services Next, I sat down with my head of Ops, Katy, to try to map it out. We took a big step back and said: Why does the deliverable itself even matter? At the end of the day, what we’re selling is just a few types of work (research, writing, editing, design, etc) that can be packaged up in an infinite number of ways. Rather than try to define deliverables, shouldn’t we leave it open ended for maximum flexibility? From there, we decided to break down everything into ultra-modular building blocks. We started working on this super complex system of modular deliverables where we would have services like writing, design, editing, etc—plus a sliding scale for different scopes like the length of writing or the number of images. In theory, it would allow us to mix and match any combination of services to create custom deliverables for the client. In fact, we wanted the work to be deliverable-agnostic. That way we could mold it to fit any client’s needs and deliver any type of content, regardless of the format or goal. Want a 5,000-word case study with 15 custom graphics? That’ll be $X. Want a 2,000-word blog post with an interview and no visuals? $Y. Just want us to create 10 briefs, you handle the writing, and we do design? It’s $Z. Again, this feels like a reasonable solution. But it quickly spiraled out of amuck. (That’s an Office reference.) For this to work, we need to have incredibly precise scoping process for every single deliverable. Before we can begin work (or even quote a price), we need to know pretty much the exact word count of the final article, for example. In the real world? This almost never happens. The content is as long as the content needs to be. Clients rarely know if the blog post should be 2,000 words or 3,000 words. They just want good content. We have a general ballpark, but we can rarely dial it in within just 1,000 words until we’ve done enough research to create the brief. Plus, from a packaging and pricing perspective, it introduces all kind of weird scenarios where clients will owe exactly $10,321 for this ultra-specific combination of services. We were building an open system that could accommodate any and all types of potential deliverables. On the face that seems great because it makes us incredibly flexible. In reality, the ambiguity actually works against us. It makes it harder for us to communicate to clients clearly about what they’ll get, how much it will cost, and how long it will take. That, of course, also means that it hurts our client relationships. (This actually kind of goes back to my personal learnings, which I’ll mention in a bit. I tend to be a “let’s leave things vague so we don’t have to limit our options” kind of person. But I’m working on fixing this to be more precise, specific, and clear in everything that we do.) Dialing It In: Building a Closed System We were trying to build an open system. We need to build a closed system. We need to force clarity and get specific about what we do, what we don’t do, and how much it all costs. Then we need a system to expand on that closed system—add new types of deliverables, new content playbooks, and new workflows if and when the need arises. With that in mind, we can start by mapping out the key dimensions of any type of deliverable that we would ever want to deliver. These are the universal dimensions that determine the scope, workflow, and price of any deliverable—regardless of the specific type output. Dimensions are: Brief scope Writing + editing scope Design scope Interview scope Revision (rounds) Scope, essentially, just tells us how many words, graphics, interviews, etc are required for the content we’re creating. In our first crack at the system, we got super granular with these scopes. But to help force a more manageable system, we realized that we didn’t need tiny increments for most of this work. Instead, we just need boundaries—you pay $X for up to Y words. We still need some variability around the scope of these articles. Obviously, most clients won’t be willing to pay the same price for a 1,000-word article as a 10,000-word article. But we can be smarter about the realistic break points. We boiled it down to the most common ranges: (Up to) 250 words 1,000 words 3,000 words 6,000 words 10,000 words This gives us a much more manageable number of variables. But we still haven’t exactly closed the system. We need one final dimension: Deliverable type. This tells us what we’re actually building with these building blocks. This is how we’ll put a cap on the potentially infinite number of combinations we could offer. The deliverable type will define what the final product should look like (e.g., blog post, case study, ebook, etc). And it will also give us a way to put standards and expectations around different types of deliverables that we want to offer. Then we can expand on this list of deliverables to offer new services. In the mean time, only the deliverables that we have already defined are, “on the menu,” so to speak. If a client comes to us and asks for something like a podcast summary article (which we don’t currently offer), we’ll have to either say we can’t provide that work or create a new deliverable type and define the dimensions of that specific piece. But here’s the kicker: No matter the deliverable type, it has to still fit within the scopes we’ve already defined. And the pricing will be the same. This means that if you’re looking for our team to write up to 1,000 words of content, it costs the same amount—whether it’s a blog post, an ebook, a LinkedIn post, or anything else. Rather than trying to retool our entire system to offer this new podcast summary article deliverable, we’ll just create the new deliverable type, add it to the list of options, and it’s ready to sell with the pre-defined dimensions we’ve already identified. To do: Update onboarding workflow Update contracts and scope documents Dial in new briefing process Know Thyself For the last year, I’ve been going through personal therapy. (Huge shout out to my wife, Laura, for her support and encouragement throughout the process.) It’s taught me a lot about myself and my tendencies. It’s helped me find some of my weaknesses and think about how I can improve as a person, as a partner, and as an entrepreneur. And it’s forced me to face a lot of hard truths. For example, consider some of the critical decisions I’ve made for my business: Unconventional freelance “collective” model No formal management structure Open-ended retainers with near-infinite flexibility General contracts without defined scope “Take it or leave it” approach to sales and marketing Over the years, I’ve talked about almost everything on this list as a huge advantage. I saw these things as a reflection of how I wanted to do things differently and better than other companies. But now, I see them more as a reflection of my fears and insecurities. Why did I design my business like this? Why do I want so much “flexibility” and why do I want things left open-ended rather than clearly defined? One reason that could clearly explain it: I’m avoidant. If you’re not steeped in the world of therapy, this basically means that my fight or flight response gets turned all the way to “flight.” If I’m unhappy or uncomfortable, my gut reaction is usually to withdraw from the situation. I see commitment and specificity as a prelude to future conflict. And I avoid conflict whenever possible. So I built my business to minimize it. If I don’t have a specific schedule of work that I’m accountable for delivering, then we can fudge the numbers a bit and hope they even out in the end. If I don’t set a specific standard for the length of an article, then I don’t have to let the client know when their request exceeds that limit. Conflict….avoided? Now, that’s not to say that everything I’ve built was wrong or bad. There is a lot of value in having flexibility in your business. For example, I would say that our flexible retainers are, overall, an advantage. Clients have changing needs. Having flexibility to quickly adapt to those needs can be a huge value add. And not everything can be clearly defined upfront (at least not without a massive amount of time and work just to decide how long to write an article). Overly-rigid structures and processes can be just as problematic as loosey-goosey ones. But, on the whole, I realized that my avoidant tendencies and laissez faire approach to management have left a vacuum in many areas. The places where I avoided specificity were often the places where there was the most confusion, uncertainty, and frustration from the team and from clients. People simply didn’t know what to expect or what was expected of them. Ironically, this often creates the conflict I’m trying to avoid. For example, if I don’t give feedback to people on my team, then they feel uneasy about their work. Or they make assumptions about expectations that don’t match what I’m actually expecting. Then the client might get upset, I might get upset, and our team members may be upset. Conflict definitely not avoided. This happens on the client side, too. If we don’t define a specific timeline when something will be delivered, the client might expect it sooner than we can deliver—creating frustration when we don’t meet their expectation. This conflict actually would have been avoided if we set clearer expectations upfront. But we didn’t do that. I didn’t do that. So it’s time to step up and close the gaps. Stepping Up and Closing the Gaps If I’m going to address these gaps and create more clarity and stability, I have to step up. Both personally and professionally. I have to actually face the fear and uncertainty that drives me to be avoidant. And then apply that to my business in meaningful ways that aren’t cop-out ways of kinda-sorta providing structure without really doing it. I’ve gotta be all in. This means: Fill the gaps where I rely on other people to do things that aren’t really their job but I haven’t put someone in place to do it Set and maintain expectations about our internal work processes, policies, and standards Define clear boundaries on things like roles, timelines, budgets, and scopes Now, this isn’t going to happen overnight. And just because I say that I need to step up to close these gaps doesn’t mean that I need to be the one who’s responsible for them (at least not forever). It just means that, as the business leader, I need to make sure the gaps get filled—by me or by someone else who has been specifically charged with owning that part of the operation. So, this is probably my #1 focus over the coming quarter. And it starts by identifying the gaps that exist. Then, step into those gaps myself, pay someone else to fill that role, or figure out how to eliminate the gap another way. This means going all the way back to the most basic decisions in our business. One of the foundational things about Optimist is being a “different kind” of agency. I always wanted to build something that solved for the bureaucracy, hierarchy, and siloed structure of agencies. If a client has feedback, they should be able to talk directly to the person doing the work rather than going through 3 layers of account management and creative directors. So I tried to be clever. I tried to design all kinds of systems and processes that eliminated these middle rungs. (In retrospect, what I was actually doing was designing a system that played into my avoidant tendencies and made it easy to abdicate responsibility for lots of things.) Since we didn’t want to create hierarchy, we never implemented things like Junior and Senior roles. We never hired someone to manage or direct the individual creatives. We didn’t have Directors or VPs. (Hell, we barely had a project manager for the first several years of existence.) This aversion to hierarchy aligned with our values around elevating ownership and collective contribution. I still believe in the value a flat structure. But a flat structure doesn’t eliminate the complexity of a growing business. No one to review writers and give them 1:1 feedback? I guess I’ll just have to do that….when I have some spare time. No Content Director? Okay, well someone needs to manage our content playbooks and roll out new ones. Just add it to my task list. Our flat structure didn’t eliminate the need for these roles. It just eliminated the people to do them. All of those unfilled roles ultimately fell back on me or our ops person, Katy. Of course, this isn’t the first time we’ve recognized this. We’ve known there were growing holes in our business as it’s gotten bigger and more complex. Over the years, we’ve experimented with different ways to solve for it. The Old Solution: Distributed Ops One system we designed was a “distributed ops” framework. Basically, we had one person who was the head of ops (at the time, we considered anything that was non-client-facing to be “ops”). They’d plan and organize all of the various things that needed to happen around Optimist. Then they’d assign out the work to whoever was able to help. We had a whole system for tying this into the our profit share and even gave people “Partner” status based on their contributions to ops. It worked—kinda. One big downfall is that all of the tasks and projects were ad hoc. People would pick up jobs, but they didn’t have much context or expertise to apply. So the output often varied. Since we were trying to maintain a flat structure, there was minimal oversight or management of the work. In other words, we didn’t always get the best results. But, more importantly, we still didn’t close all of the gaps entirely. Because everything was an ad-hoc list of tasks and projects, we never really had the “big picture” view of everything that needed to be done across the business. This also meant we rarely had clarity on what was important, what was trivial, and what was critical. We need a better system. Stop Reinventing the Wheel (And Create a Damn Org Chart) It’s time to get serious about filling the gaps in our business. It can’t be a half-fix or an ad hoc set of projects and tasks. We need clarity on the roles that need to be filled and then fill them. The first step here is to create an org chart. A real one. Map out all of the jobs that need to be done for Optimist to be successful besides just writers and designers. Roles like: Content director Design director SEO manager Reporting Finance Account management Business development Sales Marketing Project management It feels a bit laughable listing all of these roles. Because most are either empty or have my name attached to them. And that’s the problem. I can’t do everything. And all of the empty roles are gaps in our structure—places where people aren’t getting the direction, feedback, or guidance they need to do their best work. Or where things just aren’t being done consistently. Content director, for example, should be responsible for steering the output of our content strategists, writers, and editors. They’re not micromanaging every deliverable. But they give feedback, set overall policy, and help our team identify opportunities to get better. Right now we don’t have anyone in that role. Which means it’s my job—when I have time. Looking at the org chart (a real org chart that I actually built to help with this), it’s plain as day how many roles look like this. Even if we aren’t going to implement a traditional agency structure and a strict hierarchy, we still need to address these gaps. And the only way for that to happen is face the reality and then create a plan to close the gaps. Now that we have a list of theoretical roles, we need to clearly define the responsibilities and boundaries of those roles to make sure they cover everything that actually needs to happen. Then we can begin the process of delegating, assigning, hiring, and otherwise addressing each one. So that’s what I need to do. To be done: Create job descriptions for all of the roles we need to fill Hire Biz Dev role Hire Account Lead role(s) Hire Head of Content Playing Offense As we move into Q1 of 2025 and I reflect on the tumultuous few years we’ve had, one thought keeps running through my head. We need to play offense. Most of the last 1-2 years was reacting to changes that were happening around us. Trying to make sense and chart a new path forward. Reeling. But what I really want—as a person and as an entrepreneur—is to be proactive. I want to think and plan ahead. Figure out where we want to go before we’re forced to change course by something that’s out of our control. So my overarching focus for Q1 is playing offense. Thinking longer term. Getting ahead of the daily deluge and creating space to be more proactive, innovative, and forward thinking. To do: Pilot new content formats Audit and update our own content strategy Improve feedback workflows Build out long-term roadmap for 1-2 years for Optimist Final Note on Follow-Through and Cadence In my reflection this year, one of the things I’ve realized is how helpful these posts are for me. I process by writing. So I actually end up making a lot of decisions and seeing things more clearly each time I sit down to reflect and write my yearly recap. It also gives me a space to hold myself accountable for the things I said I would do. So, I’m doing two things a bit differently from here on out. First: I’m identifying clear action items that I’m holding myself accountable for getting done in the next 3 months (listed in the above sections). In each future update, I’ll do an accounting of what I got done and what wasn’t finished (and why). Second: I’m going to start writing shorter quarterly updates. This will gives me more chances each year to reflect, process, and make decisions. Plus it gives me a shorter feedback loop for the action items that I identified above. (See—playing offense.) — Okay friends, enemies, and frenemies. This is my first update for 2025. Glad to share with y’all. And thanks to everyone who’s read, commented, reached out, and shared their own experiences over the years. We are all the accumulation of our connections and our experiences. As always, I will pop in to respond to comments and answer questions. Feel free to share your thoughts, questions, and general disdain down below. Cheers, Tyler

 I just sold my startup for $200,000 after 11 months. AMA
reddit
LLM Vibe Score0
Human Vibe Score0
jeannenThis week

I just sold my startup for $200,000 after 11 months. AMA

Last August, I was looking for a startup idea I could grow and made a MVP in a week then launched it. I received the $200,000 wire from the buyer a couple of days ago I found tons of useful info online for free, so I hope this can be my way of giving back :) Here is some background: Idea I got the idea when trying to write a tweet using Google Doc's transcription tool, which was terrible. I was pretty sure I wasn't the only one too lazy to type, I made my own solution using AI to transcribe and reformat voice notes into any kind of content. I called it Talknotes, mainly because it was the only domain available lol Validation: My rule is to only reinvest what the project generates. After listing on startup directories and posting on Twitter, I generated $700 in 10 days. It wasn't much, but enough to show interest and keep me motivated. I added user-requested features, but the launch effect wore off, and daily revenues dropped to $0 after a few weeks. I almost gave up, but friends encouraged me to continue. In October, I launched on ProductHunt and it blew up. It became Product of the Day and reached $1500 MRR thanks to media coverage. I initially built everything using vanilla JS/CSS/HTML + Node for backend. But it's pretty limited for apps with lots of interactivity so, I rebuilt the app using Nuxt.js to make it easier to ship new features. Then, I launched ads on Facebook and I implemented a feedback loop: Get new users Learn about them through onboarding Make more ads based on onboarding data This doubled MRR in about 2 months. Burnout and Sale: In May, I had a bad burnout after emergency bug fixes. This made it hard to work on the app after. At this point MRR was around $7000 and total revenues around $70,0000 I listed it on Acquire.com for $200,000, a very good price for the buyer considering revenues and growth. I could've gotten $300,000 with buyer financing or earn-outs, but I wanted cash, $200,000 today is better than $300,000 in a year. Everything was smooth until we tried using Escrow, which almost fucked up the deal (details here). Long story short, had to threaten them to make a sponsored post on Twitter explaining what they did + legal action. They sent the refund the very next day, and we completed the transfer directly. Now, this isn't an overnight success. It's the result of 7 years of grind. I launched over 40 projects since I started, and most of them failed. I often worked 100 hours per week, and I rarely go out or meet many people. It's not for everyone, but I'm fine with it With the profit from the app + sale, and other projects, I have close to 1/3 of a million dollar. I could retire in Asia if I wanted Just mind blowing to think I wrote funny characters in a code editor and sold it for the price of a house lol Edit 1: A few people got confused. I said it's 7 years of grind and most of my projects failed, not that I was not making money. I also said I OFTEN worked 100h/week, not every week :) Since I learned to code 2 years ago I've made close to $400k from my app's profit + exit (this one + another one for $65k last year). And before that I was making money as a marketing freelancer. Also, I dropped after high-school, so, I had to learn everything from scratch, it takes time! Edit 2: Lots of people asked how/where I learned to code in 2 months. I wrote a blog/journal about it back then with links to resources, you can find it here if you're interested

Unmasking Fake Testimonials on a YC backed company
reddit
LLM Vibe Score0
Human Vibe Score0.75
Far-Amphibian3043This week

Unmasking Fake Testimonials on a YC backed company

As developers, marketeers and builders, we often rely on trusted platforms to guide us in finding tools that meet our unique needs. Recently, I stumbled upon Overlap, a site marketed as a haven for collaboration tools. Its sleek interface and glowing testimonials initially convinced me I had found a gem. But as I dug deeper, I uncovered a jaw-dropping reality: their testimonials featured stock images, all of which were easily identified through a quick reverse image search. Even more shocking was the realization that Overlap is a Y Combinator-backed company—an organization renowned for nurturing some of the most innovative startups in the world. With significant funding at their disposal, the decision to cut corners with fake testimonials felt like a slap in the face to their user base. They could easily afford a robust testimonial platform, yet chose a path that undermined their credibility. As developers, marketeers and builders, we often rely on trusted platforms to guide us in finding tools that meet our unique needs. Recently, I stumbled upon Overlap, a site marketed as a haven for video AI tools. Its sleek interface and glowing testimonials initially convinced me I had found a gem. But as I dug deeper, I uncovered a jaw-dropping reality: their testimonials featured stock images, all of which were easily identified through a quick reverse image search. Even more shocking was the realization that Overlap is a Y Combinator-backed company—an organization renowned for nurturing some of the most innovative startups in the world. With significant funding at their disposal, the decision to cut corners with fake testimonials felt like a slap in the face to their user base. They could easily afford a robust testimonial platform, yet chose a path that undermined their credibility. A screenshot of Overlap's landing page https://preview.redd.it/zosmdl0v01ce1.png?width=1000&format=png&auto=webp&s=83ced4af92ca284486281f00b020f1f0114b4fcd This discovery was nothing short of a wake-up call. For a developer-focused website—an audience that prizes authenticity and technical precision above all else—faking testimonials with stock photos isn’t just misleading, it’s a catastrophic betrayal of trust. It left me questioning the integrity of their entire operation and serves as a stark reminder for businesses everywhere: your audience notices when you’re not authentic, and they won’t forgive it easily. Position of Fake Testimonials One of the stock images https://preview.redd.it/a7ugasrw01ce1.png?width=341&format=png&auto=webp&s=5261df741f1198a92e537f1e61640e7d6ec60a7f Lessons for Startup Founders and Developers This experience offers several critical lessons for startup founders and developers alike: Authenticity is Non-Negotiable: In a competitive market, trust and transparency can make or break your brand. Fake testimonials might provide a short-term boost, but the long-term damage to credibility far outweighs any temporary gains. Invest in Genuine Solutions: If you have the resources, like a Y Combinator-backed company, prioritize tools and practices that enhance authenticity. Platforms like RapidFeedback allow businesses to dynamically update reviews and manage feedback efficiently. Leverage Real User Feedback: Authentic testimonials not only build trust but also provide actionable insights into your product’s strengths and weaknesses. This feedback loop can be invaluable for refining and growing your business. Understand Your Audience: Developers value precision, integrity, and honesty. Catering to this audience requires a commitment to these principles in every aspect of your business. Let’s ensure that the tools we build and the businesses we run prioritize authenticity. In the long run, a commitment to transparency and user trust will always yield greater rewards than any shortcut could provide. Why Fake Testimonials Are a Problem Fake testimonials damage your brand in more ways than one: Loss of Credibility: Developers are a discerning audience. Trust is everything, and losing it can be catastrophic for your reputation. Hurt User Experience: Knowing a platform misrepresents itself makes users skeptical about its features and promises. Missed Opportunities: Genuine feedback can provide valuable insights for growth and improvement, which fake testimonials completely overlook. A Smarter Way: Authentic Testimonials with RapidFeedback This experience reminded me of why tools like RapidFeedback are invaluable. RapidFeedback helps businesses maintain authenticity by dynamically updating reviews and images in real time. Here’s why it stands out: Real-Time Updates: Reviews are fetched and displayed dynamically, ensuring they’re always up-to-date. Dashboard Management: Businesses can monitor and manage good vs. bad reviews from a centralized dashboard, enabling them to address concerns promptly. Authenticity Guaranteed: Dynamic updates ensure that testimonials reflect real users and their experiences, which builds trust and credibility. Lessons for Developers and Businesses If there’s one takeaway from my Overlap experience, it’s this: authenticity isn’t optional. Whether you’re building tools for developers or selling consumer products, your audience values transparency. Using tools like RapidFeedback ensures your business maintains trust while gaining actionable insights to grow. Let’s commit to prioritizing honesty in our work. Because in the end, authentic relationships with users are what truly drive success.

[CASE STUDY] From 217/m to $2,836/m in 9 months - Sold for $59,000; I grow and monetise web traffic of 5, 6, 7 figures USD valued passive income content sites [AMA]
reddit
LLM Vibe Score0
Human Vibe Score1
jamesackerman1234This week

[CASE STUDY] From 217/m to $2,836/m in 9 months - Sold for $59,000; I grow and monetise web traffic of 5, 6, 7 figures USD valued passive income content sites [AMA]

Hello Everyone (VERY LONG CASE STUDY AHEAD) - 355% return in 9 months Note: I own a 7-figures USD valued portfolio of 41+ content sites that generates 5-6 figures USD a month in passive income. This is my first time posting in this sub and my goal is to NOT share generic advice but precise numbers, data and highly refined processes so you can also get started with this business yourself or if you already have an existing business, drive huge traffic to it and scale it substantially (get more customers). I will use a case study to explain the whole process. As most of us are entrepreneurs here, explaining an actual project would be more meaningful. In this case study I used AI assisted content to grow an existing site from $217/m to $2,836/m in 9 months (NO BACKLINKS) and sold it for $59,000. ROI of 3 months: 355% Previous case studies (before I give an overview of the model) Amazon Affiliate Content Site: $371/m to $19,263/m in 14 MONTHS - $900K CASE STUDY \[AMA\] Affiliate Website from $267/m to $21,853/m in 19 months (CASE STUDY - Amazon?) \[AMA\] Amazon Affiliate Website from $0 to $7,786/month in 11 months Amazon Affiliate Site from $118/m to $3,103/m in 8 MONTHS (SOLD it for $62,000+) Note: You can check pinned posts on my profile. Do go through the comments as well as a lot of questions are answered in those. However, if you still have any questions, feel free to reach out. This is an \[AMA\]. Quick Overview of the Model Approach: High traffic, niche specific, informative content websites that monetise its traffic through highly automated methods like display ads and affiliate. The same model can be applied to existing businesses to drive traffic and get customers. Main idea: Make passive income in a highly automated way Easy to understand analogy You have real estate (here you have digital asset like a website) You get rental income (here you get ads and affiliate income with no physical hassle, in case you have a business like service, product etc. then you can get customers for that too but if not, it's alright) Real estate has value (this digital asset also has value that can be appreciated with less effort) Real estate can be sold (this can be sold too but faster) IMPORTANT NOTE: Search traffic is the BEST way to reach HUGE target audience and it's important when it comes to scaling. This essentially means that you can either monetise that via affiliate, display etc. or if you have a business then you can reach a bigger audience to scale. Overview of this website's valuation (then and now: Oct. 2022 and June 2023) Oct 2022: $217/m Valuation: $5,750.5 (26.5x) - set it the same as the multiple it was sold for June 2023: $2,836/m Traffic and revenue trend: growing fast Last 3 months avg: $2,223 Valuation now: $59,000 (26.5x) Description: The domain was registered in 2016, it grew and then the project was left unattended. I decided to grow it again using properly planned AI assisted content. Backlink profile: 500+ Referring domains (Ahrefs). Backlinks mean the sites linking back to you. This is important when it comes to ranking. Summary of Results of This Website - Before and After Note: If the terms seem technical, do not worry. I will explain them in detail later. Still if you have any questions. Feel free to comment or reach out. |Metric|Oct 2022|June 2023|Difference|Comments| |:-|:-|:-|:-|:-| |Articles|314|804|\+490|AI Assisted content published in 3 months| |Traffic|9,394|31,972|\+22,578|Organic| |Revenue|$217|$2,836|\+$2,619|Multiple sources| |RPM (revenue/1000 web traffic)|23.09|$88.7|\+$65.61|Result of Conversion rate optimisation (CRO). You make changes to the site for better conversions| |EEAT (expertise, experience, authority and trust of website)|2 main authors|8 authors|6|Tables, video ads and 11 other fixations| |CRO|Nothing|Tables, video ads |Tables, video ads and 11 other fixations || ​ Month by Month Growth |Month|Revenue|Steps| |:-|:-|:-| |Sept 2022|NA|Content Plan| |Oct 2022|$217|Content Production| |Nov 2022|$243|Content production + EEAT authors| |Dec 2022|$320|Content production + EEAT authors| |Jan 2023|$400|Monitoring| |Feb 2023|$223|Content production + EEAT authors| |Mar 2023|$2,128|CRO & Fixations| |April 2023|$1,609|CRO & Fixations| |May 2023|$2,223|Content production + EEAT authors| |June 2023|$2,836|CRO and Fixations| |Total|$10,199|| ​ What will I share Content plan and Website structure Content Writing Content Uploading, formatting and onsite SEO Faster indexing Conversion rate optimisation Guest Posting EEAT (Experience, Expertise, Authority, Trust) Costing ROI The plans moving forward with these sites ​ Website Structure and Content Plan This is probably the most important important part of the whole process. The team spends around a month just to get this right. It's like defining the direction of the project. Description: Complete blueprint of the site's structure in terms of organisation of categories, subcategories and sorting of articles in each one of them. It also includes the essential pages. The sorted articles target main keyword, relevant entities and similar keywords. This has to be highly data driven and we look at over 100 variables just to get it right. It's like beating Google's algorithm to ensure you have a blueprint for a site that will rank. It needs to be done right. If there is a mistake, then even if you do everything right - it's not going to work out and after 8-16 months you will realise that everything went to waste. Process For this project, we had a niche selected already so we didn't need to do a lot of research pertaining to that. We also knew the topic since the website was already getting good traffic on that. We just validated from Ahrefs, SEMRUSH and manual analysis if it would be worth it to move forward with that topic. ​ Find entities related to the topic: We used Ahrefs and InLinks to get an idea about the related entities (topics) to create a proper topical relevance. In order to be certain and have a better idea, we used ChatGPT to find relevant entities as well \> Ahrefs (tool): Enter main keyword in keywords explorer. Check the left pain for popular topics \> Inlinks (tool): Enter the main keyword, check the entity maps \> ChatGPT (tool): Ask it to list down the most important and relevant entities in order of their priority Based on this info, you can map out the most relevant topics that are semantically associated to your main topic Sorting the entities in topics (categories) and subtopics (subcategories): Based on the information above, cluster them properly. The most relevant ones must be grouped together. Each group must be sorted into its relevant category. \> Example: Site about cycling. \> Categories/entities: bicycles, gear and equipment, techniques, safety, routes etc. \> The subcategories/subentities for let's say "techniques" would be: Bike handling, pedaling, drafting etc. Extract keywords for each subcategory/subentity: You can do this using Ahrefs or Semrush. Each keyword would be an article. Ensure that you target the similar keywords in one article. For example: how to ride a bicycle and how can I ride a bicycle will be targeted by one article. Make the more important keyword in terms of volume and difficulty as the main keyword and the other one(s) as secondary Define main focus vs secondary focus: Out of all these categories/entities - there will be one that you would want to dominate in every way. So, focus on just that in the start. This will be your main focus. Try to answer ALL the questions pertaining to that. You can extract the questions using Ahrefs. \> Ahrefs > keywords explorer \> enter keyword \> Questions \> Download the list and cluster the similar ones. This will populate your main focus category/entity and will drive most of the traffic. Now, you need to write in other categories/subentities as well. This is not just important, but crucial to complete the topical map loop. In simple words, if you do this Google sees you as a comprehensive source on the topic - otherwise, it ignores you and you don't get ranked Define the URLs End result: List of all the entities and sub-entities about the main site topic in the form of categories and subcategories respectively. A complete list of ALL the questions about the main focus and at around 10 questions for each one of the subcategories/subentities that are the secondary focus Content Writing So, now that there's a plan. Content needs to be produced. Pick out a keyword (which is going to be a question) and... Answer the question Write about 5 relevant entities Answer 10 relevant questions Write a conclusion Keep the format the same for all the articles. Content Uploading, formatting and onsite SEO Ensure the following is taken care of: H1 Permalink H2s H3s Lists Tables Meta description Socials description Featured image 2 images in text \\Schema Relevant YouTube video (if there is) Note: There are other pointers link internal linking in a semantically relevant way but this should be good to start with. Faster Indexing Indexing means Google has read your page. Ranking only after this step has been done. Otherwise, you can't rank if Google hasn't read the page. Naturally, this is a slow process. But, we expedite it in multiple ways. You can use RankMath to quickly index the content. Since, there are a lot of bulk pages you need a reliable method. Now, this method isn't perfect. But, it's better than most. Use Google Indexing API and developers tools to get indexed. Rank Math plugin is used. I don't want to bore you and write the process here. But, a simple Google search can help you set everything up. Additionally, whenever you post something - there will be an option to INDEX NOW. Just press that and it would be indexed quite fast. Conversion rate optimisation Once you get traffic, try adding tables right after the introduction of an article. These tables would feature a relevant product on Amazon. This step alone increased our earnings significantly. Even though the content is informational and NOT review. This still worked like a charm. Try checking out the top pages every single day in Google analytics and add the table to each one of them. Moreover, we used EZOIC video ads as well. That increased the RPM significantly as well. Both of these steps are highly recommended. Overall, we implemented over 11 fixations but these two contribute the most towards increasing the RPM so I would suggest you stick to these two in the start. Guest Posting We made additional income by selling links on the site as well. However, we were VERY careful about who we offered a backlink to. We didn't entertain any objectionable links. Moreover, we didn't actively reach out to anyone. We had a professional email clearly stated on the website and a particularly designated page for "editorial guidelines" A lot of people reached out to us because of that. As a matter of fact, the guy who bought the website is in the link selling business and plans to use the site primarily for selling links. According to him, he can easily make $4000+ from that alone. Just by replying to the prospects who reached out to us. We didn't allow a lot of people to be published on the site due to strict quality control. However, the new owner is willing to be lenient and cash it out. EEAT (Experience, Expertise, Authority, Trust) This is an important ranking factor. You need to prove on the site that your site has authors that are experienced, have expertise, authority and trust. A lot of people were reaching out to publish on our site and among them were a few established authors as well. We let them publish on our site for free, added them on our official team, connected their socials and shared them on all our socials. In return, we wanted them to write 3 articles each for us and share everything on all the social profiles. You can refer to the tables I shared above to check out the months it was implemented. We added a total of 6 writers (credible authors). Their articles were featured on the homepage and so were their profiles. Costing Well, we already had the site and the backlinks on it. Referring domains (backlinks) were already 500+. We just needed to focus on smart content and content. Here is the summary of the costs involved. Articles: 490 Avg word count per article: 1500 Total words: 735,000 (approximately) Cost per word: 2 cents (includes research, entities, production, quality assurance, uploading, formatting, adding images, featured image, alt texts, onsite SEO, publishing/scheduling etc.) Total: $14,700 ROI (Return on investment) Earning: Oct 22 - June 23 Earnings: $10,199 Sold for: $59,000 Total: $69,199 Expenses: Content: $14,700 Misc (hosting and others): $500 Total: $15,200 ROI over a 9 months period: 355.25% The plans moving forward This website was a part of a research and development experiment we did. With AI, we wanted to test new waters and transition more towards automation. Ideally, we want to use ChatGPT or some other API to produce these articles and bulk publish on the site. The costs with this approach are going to be much lower and the ROI is much more impressive. It's not the the 7-figures projects I created earlier (as you may have checked the older case studies on my profile), but it's highly scalable. We plan to refine this model even further, test more and automate everything completely to bring down our costs significantly. Once we have a model, we are going to scale it to 100s of sites. The process of my existing 7-figures websites portfolio was quite similar. I tested out a few sites, refined the model and scaled it to over 41 sites. Now, the fundamentals are the same however, we are using AI in a smarter way to do the same but at a lower cost, with a smaller team and much better returns. The best thing in my opinion is to run numerous experiments now. Our experimentation was slowed down a lot in the past since we couldn't write using AI but now it's much faster. The costs are 3-6 times lower so when it used to take $50-100k to start, grow and sell a site. Now you can pump 3-6 more sites for the same budget. This is a good news for existing business owners as well who want to grow their brand. Anyway, I am excited to see the results of more sites. In the meantime, if you have any questions - feel free to let me know. Best of luck for everything. Feel free to ask questions. I'd be happy to help. This is an AMA.

[Ultimate List] A list of Marketing Tools That I’ve tested over the years and found helpful to do better marketing with less work. More than 50 Tools To Help you with Marketing, Copywriting & Sales!
reddit
LLM Vibe Score0
Human Vibe Score0.973
lazymentorsThis week

[Ultimate List] A list of Marketing Tools That I’ve tested over the years and found helpful to do better marketing with less work. More than 50 Tools To Help you with Marketing, Copywriting & Sales!

Starting to focus on marketing for your business, You will come across the same tools mentioned over and over by marketers. I would like to mention here tools that you might haven’t seen going viral in the community but actually will help you grow faster and efficiently. Starting off with My favourite Marketing Channel! #Email Marketing For SMBs Convertkit / Mailerlite / Mailchimp - These 3 Platforms are the best options for SMBs and entrepreneurs just starting out with email marketing. All 3 have free plans up to 1,000 subscribers. Scribe - Email Signature Tool, Create Great Email signatures for your emails. Liramail - Most Email marketing platforms don’t offer great email templates. This tool will help you build great email templates with drag and drop. Quick mail Auto-Warmer - Most Businesses at the beginning don’t know what to do when open rate drops. You need to use an email warmer like this to keep it up. #Email Marketing For Big Businesses SendGrid - Overall Email Marketing Tools, this tool is best for brands that have huge email lists and email marketing is the key marketing channel. Braze - This tool is leading in email marketing for large Email senders. When I was working for agencies, this was one of the best email marketing tools I had used. NeoCertified - Protect your emails for spammers and threats. To keep your email list healthy, this is a must have! Sparkloop - Referral Marketing For Email Campaigns. Email can generate great huge amount of referrals for you and Sparkloop makes it easier. #Cold Emails & Lead Generation Hunter - A Great Tool to scrape emails from domain names. The tool comes with a green free plan but Pro plan is worth the amount of features it provides. Icyleads - It’s better than Hunter as it’s heavily focused on the sales and prospecting to help you derive great results from your campaigns. Mailshake - Beginner Friend Cold Email Tool with Great features like email list warming. #Communication Tools Twilio - One do the best customer engagement platform used by Companies like Stripe and mine too. Chatlio - Use Live chat feature on your website with slack integration. My favourite easier to catch up on conversations through slack integration. Intercom - Used by Most Marketers, Industry Leading customer communication platform. Great for beginners! Chatwoot - Another Amazing Communication Tool but the best part is they have a great free plan useful for new businesses. Loom - Communicate with your audience through Videos. Loom is great for SaaS and to show human interaction to close new visitors effectively. #CRM Outseta - This tool provides great CRM and their billing system is better than other tools out their which makes it stands out! Hubspot - I don’t think this tool needs an introduction because Hubspot’s CRM is the best in industry. Salesflare - This CRM is a great alternative to hubspot as it’s beginner friendly and helpful for SMBs. #SEO Tools Ahrefs - One of the best SEO tool in the industry. They also just launched a bunch of free tools to help SEO beginners. Screaming frog - The only website crawler I have used since I bought my first domain. It’s the best! Ubersuggest- The Tool by Neil Patel is the best SEO tool for you. (I’m Joking, it’s the worst) Contentking - This tool is good at Real-time SEO Auditing, they do a lot of Marketing work through Newsletters. If you are subscribed to any SEO newsletter. You may have seen this tool. SEOquake & Semrush - SEOquake is a great tool to conduct on-page analysis, SERP, and much more. Great tool but it’s owned by Semrush. You should go for Semrush because that tool will cover all SEO aspects for you. #Content Marketing Buzzsumo - This tool is great for content research and but you may find the regular emails pretty annoying sometimes. Contentrow - Analyse Your Content and find it’s strength. Highly recommended who are weak at content structuring like me. Grammarly - If you are not a native English speaker like me, you might think you need it or not. You need it for sure for grammar corrections. #Graphic Design Tools Visme - At agencies, Infographics can be more effective than usual postscript. Visme is a graphic design tool focused on infographics and designs related to B2B and B2C. It’s great for agencies! Glorify - A Graphic Design Tool focused on E-commerce, filled with Designs useful for E-commerce store owners. Canva - All-in-one Industry leading Graphic Design Tool that everyone knows and every template is overused now. Adobe Creative Cloud ( previously Sparkpost) - It’s a great alternative to Canva filled with Amazing Stock images to use in your visuals but the only backlash is the exports in this tool are not high quality. Snaps - A Canva Alternative that might not have overused templates for your Social Accounts. #Advertising Tools Plai - It’s a great PPC tool to create Ads for Instagram and Tiktok. Wordstream - It’s an industry leading PPC Tool, great for Ad Grading and auditing. AdEspresso - This Is a tool by Hootsuite. They have a lot of Data sourced at the backend, which helps in Ad optimisation through this tool. That’s the reason I recommend this tool. #Video Editing Tools Veed Studio - I have been using Veed from last year. It’s one of the best Video Marketing Tool Optimized for Instagram & Tiktok. Synthesia - It’s a new AI video generation platform. From last few months, if you have seen marketing agencies including Videos in Emails. The chances are that’s not a Agency member taking but AI generated Human. Motionbox - It’s also a great video editing tool focused on video editing for Digital Marketers. Jitter Video - It’s a great motion design tool. Comes with great templates, the only place where other tools I mentioned lacks. It’s great and beginner friendly. #Copywriting Jasper AI - Google’s John Mueller says AI generated content is banned on Search but I think with Jasper AI you can generate SEO optimised Content but you have to put in some efforts like at least give 30 minutes for editing the Copy by yourself. Copy AI - Another AI tool to help you write better copy. This one is more focused on helping you write copy suitable for Ads and Social media campaigns. Hemingway App - To help you write more clearly and Bold. This tool is better than Grammarly if you look for writing perspective and it’s free. #Social Media Management App I’ve used a Lot of SMM Tools and that’s why going to mention all of them with a short review. Sprout social - The Best with deep insights coverage. Hootsuite - Great Scheduling tool just under sprout social. Later - Heavily Focused on Instagram from beginning and Now Tiktok too. SkedSocial - It’s like a Later alternative with great addition features like link-in-bio. Facebook’s Business Manager- Great but sometimes bugs can make a huge issue for you and customer support is like dead. Tweet Hunter & Hypefury- Both are Twitter Scheduling tools growing very fast on platform and are great for growth. Buffer - It’s a great tool but I haven’t seen any new updates to help with management. Zoho Social - It’s a great SMM tool and if you use other marketing solutions from Zoho. It’s a must have! #Market Research Tool • SparkToro - That’s the only one I have ever used. It’s great for audience research and comes with great customer service. Founded by Rand Fishkin, it’s one of the best research tool. #Influencer Marketing & UGC InfluenceGrid - A free search engine To find Tiktok & Instagram Influencers for your campaigns. Tiktok Creative Center- TikTok’s in-built tool called “Creative Center” is the best to find content trends, audience demographics and much more. Archive - Find Instagram Stories and Posts mentioning Your brands and use them as Ads for your business Marketing. #Landing Page Builders Leadpages - Its a great landing page builder because the integration and drag-and-drop features makes it easier to work with! Cardd co - A Great Landing page builder with easy step up but it lacks the copywriting and tracking features. Instapage - It’s one of the best out and I think the overall product is effective enough to help you stand out with your landing page. Unbounce - It’s a great alternative to Instapage due its well polished landing page templates that might be helpful for you. #Community Building Mighty Networks - A Great Community building platform, and you can also sell courses within the platform. Circle so - A great alternative to Mighty networks focused on Communities specifically. We are currently using for small community Of ours. #Sales Tools Drift - You can get much more out of Drift than just sales tools but The Sales solutions provided in Drift are one of the best. Salesforce - It’s the industry Sales solution provider. A go-to and have various pricing plans making it suitable for majority of SMBs. #Social Proof Tools People don’t have enough time to search across internet to decide to trust you after seeing your Ad first time. That’s what you might be facing too. Here are two tools I absolutely love for social proof! Use Proof - Show Recent Activities occurring on your website and build the trust of your visitors. Testimonial to - Gather Testimonials across Social Media platforms related to your business with this tool. Capture tweets and comments mentioning your brands and mention them. #Analytics Tools Plausible Analytics- A privacy friendly Analytics alternative to Google Analytics if you hate Analytics 4 like me. Mixpanel - Product Analytics and funnel reports better than Google Analytics. #Reddit Marketing Gummysearch- This tool will help To find your target audience on Reddit and interact with them with its help and close your new customers. Howitzer- It’s another pretty similar tool to Gummysearch focused on Reddit cold outreach to get clients and new customers. Both are great but Gummysearch provides better customer support while Howtizer is helpful on a large scale Reddit Marketing. #Text Marketing Klaviyo - It’s an email + SMS marketing tool, it’s taking up space in marketing industry very quickly as an industry leader due to its great integrations but you need to learn the platform usage to maximise the outcome. Cartloop - This tool provides great text marketing solutions with integration with Spotify and other e-commerce marketing tools. Attentive Mobile - This is my favourite Text marketing tool due to the interactive dashboard + they have a library of Text marketing examples to help you out with your campaigns. #Other Tools I have used throughout my journey! Triple Whale - It’s a great E-commerce marketing tools with Triple pixel to help you track your campaigns more efficiently. Fastory - To create well optimized Instagram & Tiktok Stories for your business. Jotform - Online Form Builder with integrations with leading marketing tools. Gated - As an entrepreneur and marketer, you may receive a bunch of unwanted emails. Use Gated to get rid of them and receive useful mails only! ClickUp- The main Tool for Project Management, one of the best and highly recommended. Riverside - Forget Zoom or Google Meet, For your Podcast Interviews and Marketing conferences. You need riverside with great video quality and recording features. Manychat- Automate your Instagram DMs and interact with your followers more efficiently + sell out your products/ services when you are offline. Calendy - To schedule meetings with your ideal clients. ServiceProviderPro - It’s a client portal for SEO & Growing Agencies, very helpful in scaling agencies. SendCheckit - Compare your Email Subject Lines with 100,000+ others in the database for free. Otter AI - Using AI track your meetings more effectively, you can easily edit, annotate and share notes from the meetings. Ryte - Optimise your website User experience with this tool focused on UX aspects + SEO too. PhantomBuster - Scrape LinkedIn Profile and Data from Facebook/LinkedIn groups. I clearly love this tool! #Honourable Mentions Zapier - The Only tool you need to integrate your favourite tool with a new effective tool. Elementor - That’s what I use for web design and it’s great! Marketer Hire - To hire world class marketers to work with you. InShot & Capcut - I create Instagram Reels and TikTok’s and life without these tools isn’t possible. Nira - It’s a great tool to Manage your workspace and this tool has launched many marketing templates in-built helpful for marketers and also entrepreneurs. X - The tool you love that wasn’t mentioned here is valuable and I honour that tool and share that if you would like to! I mean thanks for reading what I have curated all over my life as a marketer. I share 5 Marketing Tools, 5 Marketing Resources and 1 Free Resourceevery week in my newsletter, you can subscribe here to receive that for free. Also, You can read an expanded list of email marketing tools in this Reddit post!

I tested hundreds of marketing tools in the last three years and these 50 made it to the list. I'll sum up my top 50 marketing tools with one or two sentences + give you pricings.
reddit
LLM Vibe Score0
Human Vibe Score1
SpicyCopyThis week

I tested hundreds of marketing tools in the last three years and these 50 made it to the list. I'll sum up my top 50 marketing tools with one or two sentences + give you pricings.

Hey guys, I'm working in a growth marketing agency. Marketing tools are 30% of what we do, so we use them a lot and experiment with the new ones as much as possible. There are thousands of tools and it's easy to get lost, so I wanted to share the tools we use most on a daily basis. And divide the list into 14 categories. I thought this could be handy for Entrepreneurs subreddit. Why adopt tools? I see marketing tools as tireless colleagues. If you can't hire an employee, choosing the right tool can solve your problems, because they Are super cheap. Work 7/24 for you. Don’t make mistakes. Don’t need management. (or needless management) Help you to automate the majority of your lead gen process. Onwards to the list. (With the pricings post ended up quite long, you can find a link in the end if you want to check the prices) Email marketing tools #1 ActiveCampaign is armed with the most complicated email automation features and has the most intuitive user experience. It feels like you already know how to use it. \#2 Autopilot is visual marketing automation and customer journey tool that helps you acquire, nurture based on behaviors, interest etc. #3 Mailjet: This is the tool we use to send out bulky email campaigns such as newsletters. It doesn't have sexy features like others but does its job for a cheap price. Email address finders #4 Skrapp finds email of your contacts by name and company. It also works with LinkedIn Sales Navigator and can extract thousands of emails in bulk + have a browser add-on. #5 Hunter: Similar to Skrapp but doesn't work with LinkedIn Sales Navigator directly. In addition, there are email templates and you can set up email campaigns. Prospecting and outreach tools #6 Prospect combines the personal emails, follow-up calls, other social touches and helps you create multichannel campaigns.  #7 Reply is a more intuitive version of Prospect. It is easy to learn and use; their UX makes you feel good and sufficient.  CRM tools #8 Salesflare helps you to stop managing your data and start managing your customers. Not yet popular as Hubspot and etc but the best solution for smaller B2B businesses. (we're fans) \#9 Hubspot: The most popular CRM for good reason and has a broader product range you can adopt in your next steps. Try this if you have a bulky list of customers because it is free. #10 Pardot: Pardot is by Salesforce, it's armed with features that can close the gap between marketing and sales. Sales Tools #11 Salesforce is the best sales automation and lead management software. It helps you to create complicated segmentations and run, track, analyze campaigns from the same dashboard. #12 LinkedIn Sales Navigator gives you full access to LinkedIn's user database. You can even find a kidnapped CEO if you know how to use it with other marketing automation tools like Skrapp. #13 Pipedrive is a simple tool and excels in one thing. It tracks your leads and tells you when to take the next action. It makes sales easier. #14 Qwilr creates great-looking docs, at speed. You can design perfect proposals, quotes, client updates, and more in a flash. We use it a lot to close deals, it's effective. #15 Crystalknows is an add-on that tells you anyone’s personality on LinkedIn and gives you a detailed approach specific to that person. It's eerily accurate. #16 Leadfeeder shows you the companies that visited your website. Tells how they found you and what they’re interested in. It has a free version. Communication Tools #17 Intercom is a sweet and smart host that welcomes your visitors when you’re not home. It’s one of the best chatbot tools in the market. #18 Drift is famous for its conversational marketing features and more sales-focused than Intercom. #19 Manychat is a chatbot that helps you create high converting Facebook campaigns. #20 Plann3r helps you create your personalized meeting page. You can schedule meetings witch clients, candidates, and prospects. #21 Loom is a video messaging tool, it helps you to be more expressive and create closer relationships. #22 Callpage collects your visitors’ phone number and connects you with them in seconds. No matter where you are. Landing page tools #23 Instapage is the best overall landing page builder. It has a broad range of features and even squirrel can build a compelling landing page with templates. No coding needed. #24 Unbounce can do everything that Instapage does and lets you build a great landing page without a developer. But it's less intuitive. Lead generation / marketing automation tools #25 Phantombuster is by far the most used lead generation software in our tool kit. It extracts data, emails, sends requests, customized messages, and does many things on autopilot in any platform. You can check this, this and this if you want to see it in action. #26 Duxsoup is a Google Chrome add-on and can also automate some of LinkedIn lead generation efforts like Phantombuster. But not works in the cloud. #27 Zapier is a glue that holds all the lead generation tools together. With Zapier, You can connect different marketing tools and no coding required. Conversion rate optimization tools #28 Hotjar tracks what people are doing on your website by recording sessions and capturing mouse movements. Then it gives you a heatmap. #29 UsabilityHub shows your page to a digital crowd and measures the first impressions and helps you to validate your ideas. #30 Optinmonster is a top tier conversion optimization tool. It helps you to capture leads and enables you to increase conversions rates with many features. #31 Notifia is one mega tool of widgets that arms your website with the wildest social proof and lead capturing tactics. #32 Sumo is a much simpler version of Notifia. But Sumo has everything to help you capture leads and build your email lists. Web scrapers #33 Data Miner is a Google Chrome browser extension that helps you scrape data from web pages and into a CSV file or Excel spreadsheet. #34 Webscraper does the same thing as Data Miner; however, it is capable of handling more complex tasks. SEO and Content #35 Grammarly: Your English could be your first language and your grammar could be better than Shakespeare. Grammarly still can make your writing better. #36 Hemingwayapp is a copywriting optimization tool that gives you feedback about your copy and improves your readability score, makes your writing bolder and punchier. Free. #37 Ahrefs is an all-rounder search engine optimization tool that helps you with off-page, on-page or technical SEO. #38 SurferSEO makes things easier for your on-page SEO efforts. It’s a tool that analyzes top Google results for specific keywords and gives you a content brief based on that data. Video editing and design tools #39 Canva is a graphic design platform that makes everything easy. It has thousands of templates for anything from Facebook ads, stylish presentations to business cards.  #40 Kapwing is our go-to platform for quick video edits. It works on the browser and can help you to create stylish videos, add subtitles, resize videos, create memes, or remove backgrounds. #41 Animoto can turn your photos and video clip into beautiful video slideshows. It comes handy when you want to create an advertising material but don’t have a budget. Advertising tools #42 AdEspresso lets you create and test multiple ads with few clicks. You can optimize your FB, IG, and Google ads from this tool and measure your ads with in-depth analytics. #43 AdRoll is an AI-driven platform that connects and coordinates marketing efforts across ads, email, and online stores. Other tools #44 Replug helps you to shorten, track, optimize your links with call-to-actions, branded links, and retargeting pixels #45 Draw.io = Mindmaps, schemes, and charts. With Draw.io, you can put your brain in a digital paper in an organized way. #46 Built With is a tool that finds out what websites are built with. So you can see what tools they're using and so on. #47 Typeform can turn data collection into an experience with Typeform. This tool helps you to engage your audience with conversational forms or surveys and help you to collect more data. #48 Livestorm helped us a lot, especially in COVID-19 tiles. It’s a webinar software that works on your browser, mobile, and desktop. #49 Teachable \- If you have an online course idea but hesitating because of the production process, Teachable can help you. It's easy to configure and customizable for your needs. #50 Viral Loops provides a revolutionary referral marketing solution for modern marketers. You can create and run referral campaigns in a few clicks with templates. Remember, most of these tools have a free trial or free version. Going over them one by one can teach you a lot and help you grow your business with less work power in the early stages of your business. I hope you enjoyed the read and can find some tools to make things easier! Let me know about your favorite tools in the comments, so I can try them out. \------ If you want to check the prices and see a broader explanation about the tools, you can go here.

I spent 18 hours every week tracking marketing trends and latest news. Here are my predictions for 2024
reddit
LLM Vibe Score0
Human Vibe Score0.778
lazymentorsThis week

I spent 18 hours every week tracking marketing trends and latest news. Here are my predictions for 2024

1/ Securing Digital Footprint becomes #1 Priority For Chronically Online Users, Protecting their digital footprint will become one of the main things. We saw influencers getting cancelled over Old Content and Brands used Old Travis Kelce Tweets, we saw what could happen without digital footprint protection. Online Engagement Precautions will be taken again with Twitter & IG showing your usernames above ‘Algorithm Suggested Content’. What you like is more visible to other people in UI Design of these apps, another reason behind why Digital Footprint preservation will matter a lot in 2024. This will impact likes to viewership ratio on your organic and paid content. ​ 2/  TikTok wants Long Videos with Storytelling As I was writing this report, TikTok also released their What’s Next 2024 Report. It focuses heavily on how the audiences on the app demand better storytelling and from the examples in the report, you can judge what TikTok wants. They also rolled out a 30-minute video upload limit. Engaging Content over 1-Minute Mark to keep the audiences longer on the app. I highlighted in the first trend, every social media platform wants the same thing, more time spent. 3/ Use of Shop the Look While Streaming Netflix or Amazon Prime. This year’s one of the most successful TV series, The Bear caused Men to go mad for the T-Shirt worn by Jeremy Allen White in the show. Showing us how TV Shows influence or encourage us to dress in a particular way. It’s nothing new, TV Shows like Friends & Gossip Girl influenced all demographics when they came out. But now, Streamings Services such as Roku & Amazon enable consumers to shop the look while watching the TV Shows. Many Brands will jump on these opportunities in upcoming months. 4/ Brands in Comments & Memes are the new norm By Summer 2024, Most Online Users & Creators will no longer feel too excited or answered when they see your brand in the comments. Why? It’s becoming too common for Brands to show in comments under viral content about them. Or Brands being funny with Internet Culture Trends is known to most users. The Saturation of Every Brand being funny and being present leads to increased competition of levitating the content quality. ​ 5/ Marketers decrease their focus on Traffic & Views With AI recommendations taking over, The Structure of content distributing on social media is changing, the same goes for SEO. Conversational AIs are changing how web traffic is distributed to publishers. An Increased focus on managing the conversion rate and landing page relevancy will be the main focus. 6/ OOH is kind of making a comeback. First, US OOH Ads Industry grew 1.1% in Q3 2023. Second, Outfront Media reported slight revenue increase in Q3 as Billboard Ad Revenue grew in Q3. Many Brands in UK are also aligning more toward traditional media Channels. With Burger King in UK focusing on only OOH for Christmas this year and Fashion Brands like SSENSE launching Billboards as Branding Play. 7/ Rise of Curation Continues This Year, we witnessed success of Pinterest Shuffles App, Gen-Z loved it. Similar Success with formats like IG photo dump & TikTok ‘My Fav Finds’ Carousels being the center of Gen-Z Content. Just look at this recent trend and tell me Curation isn’t personal to Online Teens. Spotify won with their idea of curating Songs with Astrology-type signs. The Fashion Products with Curated Emojis and Stickers on them, that scrappy curated approach is predicted to grow in 2024, data from Pinterest. 8/ Use of AI to Trace Consumers in the wild This year we saw a huge trend of people using Image/ face recognition tools to find or dig dirt about famous people. The biggest example was Dillion Dannis exposing Multiple images of Logan Paul’s girlfriend using AI tools. (Which was Obviously bad) But next year, I believe with better rules, big brands like Adidas or Nike will be able to find worldwide micro-influencers & Online Consumers seen wearing adidas. And partnering with them on a large scale through automated outreach. 9/ More Cartoons than Influencer-Brand Products. All the Cartoon shows are seeing huge rise on IG and TikTok, Shaun the sheep is viral, Snoopy was big this year, Sesame Street’s TikTok is working. Aussie Show Bluey is making a huge spark in the US. More Brand collaborations are on the road. Why? Cartoons have built a very consistent identity and they have social channels. I know many see Cartoons as Kids Content but on social, looking at TikTok Account of Sesame Street & Snoopy. Last month, Powerpuff Girls launched a collaboration with Nike. ​ 10/ The Best Trend to get people off social media ​ Try to get people off the social media apps, build your own loops. You can’t rely on social and you clearly shouldn’t burn out trying to win on social and streaming with Paid Ads or without them. This matters a lot because data shares most of your customers buy from you once or twice a year. And then they interact with your content, how bad will you feel if the only thing they remember as your content is being on TikTok. Nothing about your brand. 11/ The Internet Aesthetic will Die for Cafes & Restaurants When I wrote my post about Instagram Marketing, I mentioned this issue of Every Account looking the same. In reality, It isn’t limited to IG Feeds, This Creator points out the same Problem, mentioning the aesthetic Standards from Internet are changing how new businesses approach their whole business. More Content from Cafes & Restaurants need to be around their people and neighbourhood. 12/ Echo Chambers & Sonic Influence All Podcasts are Echo Chambers because if people wanted a new perspective in form of value. We would have chosen debates, but we chose Podcasts to find new value while being in comfort. People are now looking for more value in comfort than ever, Podcasts will continue to rise. 13/ Clever AI Integration to Better Customer Journeys in B2B & B2C Marketing Agencies can provide clever solutions to B2B Companies, and help them overcome the tag of Boring Ads only. How? Ogilvy India created an AI Ad Campaign for Cadbury, allowing SMBs to have the Bollywood Actor endorse them. They used the AI voice generation allowing businesses to alter the voice and have Shah Rukh Khan endorse their shop. A similar approach was taken by IPG India, An AI Ad with Shah Rukh Khan allowing everyone to add their face in the Branded Content. ​ If I sounded like an Old head in this report or I missed on some elements like Programmatic Advertising and PPC. I will try to include better analysis and new content about future trends. You can find the post shared with examples & research, linked here.

TASVerify
reddit
LLM Vibe Score0
Human Vibe Score0
doubleHelixSpiralThis week

TASVerify

The Opportunity: $10,000 to Launch the Future of Information Verification TrueAlphaSpiral (TAS) is seeking $10,000 in seed funding to develop a working prototype of our revolutionary verification system that will transform how businesses validate the accuracy and trustworthiness of information. The Problem In today's digital landscape: 76% of businesses report significant costs from inaccurate information AI systems frequently produce plausible but factually incorrect content ("hallucinations") Traditional verification tools use outdated binary (true/false) assessments that miss critical nuance Our Solution TrueAlphaSpiral is a next-generation verification system that: Analyzes content across multiple dimensions (factual, ethical, logical, experiential) Self-improves through innovative cybernetic feedback loops Provides specialized verification for high-value industries (healthcare, finance, media) Why $10,000 Now? Your seed investment will directly fund: Prototype Development ($6,000): Build a working demonstration of our core verification technology Technical Documentation ($2,000): Create essential materials for future development partners Initial Testing ($2,000): Validate our approach with pilot users in medical information verification 90-Day Roadmap With your funding, we will deliver: | Month | Milestone | Deliverable | |-------|-----------|-------------| | 1 | Core Algorithm Implementation | Functioning verification algorithm | | 2 | Basic API & Documentation | Developer documentation & test API | | 3 | Medical Verification Prototype | Demonstration with medical test cases | Market & Growth Potential Immediate Market: Medical content verification ($2.8B annual market) Expansion Markets: Financial services, media, and AI governance Total Addressable Market: $47.5B by 2028 Return on Investment Your $10,000 seed investment will: Secure 1.5% equity in TrueAlphaSpiral Position you for priority participation in our $500K pre-seed round (Q4 2025) Provide preferential terms in our $3M seed round (Q2 2026) Why Us, Why Now? Founding Team: Expertise in AI verification, cybernetics, and domain-specific knowledge Timing: Critical market need as AI content proliferates across industries Proven Concept: Preliminary results show 37% better accuracy than existing solutions Next Steps Initial $10,000 funding transfer to begin development Weekly progress updates and milestone reviews Demo day in 90 days to showcase working prototype

What do you think of SaaS 2.0: Service-as-a-Software?
reddit
LLM Vibe Score0
Human Vibe Score1
FrenzyOfLifeThis week

What do you think of SaaS 2.0: Service-as-a-Software?

A new term has recently emerged in the business world: Service-as-a-Software a.k.a. SaaS 2.0 In general, some authors of articles promoting this term assume that the new and rapidly growing possibilities offered by AI and automation mean that problems that were previously too individual or support-intensive can now be tackled. The focus is on (human) service on the customer side and the background processes in the company are fully AI-supported and automated. Unlike traditional SaaS, no software is primarily offered here as self-use. In other words: "Service as a Software" (SaaS 2.0) is a new type of business model that mixes software automation with real human support. Unlike traditional SaaS, which provides self-service tools for users to solve problems on their own, SaaS 2.0 focuses on delivering results by combining technology with human expertise. In this model, software handles repetitive tasks like data processing, scheduling, or matching, while humans step in to provide guidance, handle exceptions, or solve complex issues. This approach is often called Human-in-the-Loop because humans are actively involved in key parts of the process, ensuring a personalized and empathetic experience for the customer. SaaS 2.0 is especially useful in industries like healthcare, education, or elderly care placement, where trust and personalization are critical. For example, a traditional SaaS might offer a tool to search for care homes, while a SaaS 2.0 solution would also provide a care consultant to help families make the best choice. In this case no traditional marketplace is needed where the supply and demand side used to be scaled simultaneously. Instead, an AI can now search for the best match for a place in a retirement home and a human in the loop can be the external face for the customer and the retirement homes and thus act as an agent. By automating routine tasks and using humans for high-value touchpoints, SaaS 2.0 delivers better outcomes, builds stronger relationships with customers, and stands out from traditional software that relies only on automation. What do you think about the potential of this concept?

What do you think of SaaS 2.0: Service-as-a-Software?
reddit
LLM Vibe Score0
Human Vibe Score1
FrenzyOfLifeThis week

What do you think of SaaS 2.0: Service-as-a-Software?

A new term has recently emerged in the business world: Service-as-a-Software a.k.a. SaaS 2.0 In general, some authors of articles promoting this term assume that the new and rapidly growing possibilities offered by AI and automation mean that problems that were previously too individual or support-intensive can now be tackled. The focus is on (human) service on the customer side and the background processes in the company are fully AI-supported and automated. Unlike traditional SaaS, no software is primarily offered here as self-use. In other words: "Service as a Software" (SaaS 2.0) is a new type of business model that mixes software automation with real human support. Unlike traditional SaaS, which provides self-service tools for users to solve problems on their own, SaaS 2.0 focuses on delivering results by combining technology with human expertise. In this model, software handles repetitive tasks like data processing, scheduling, or matching, while humans step in to provide guidance, handle exceptions, or solve complex issues. This approach is often called Human-in-the-Loop because humans are actively involved in key parts of the process, ensuring a personalized and empathetic experience for the customer. SaaS 2.0 is especially useful in industries like healthcare, education, or elderly care placement, where trust and personalization are critical. For example, a traditional SaaS might offer a tool to search for care homes, while a SaaS 2.0 solution would also provide a care consultant to help families make the best choice. In this case no traditional marketplace is needed where the supply and demand side used to be scaled simultaneously. Instead, an AI can now search for the best match for a place in a retirement home and a human in the loop can be the external face for the customer and the retirement homes and thus act as an agent. By automating routine tasks and using humans for high-value touchpoints, SaaS 2.0 delivers better outcomes, builds stronger relationships with customers, and stands out from traditional software that relies only on automation. What do you think about the potential of this concept?

activepieces
github
LLM Vibe Score0.66
Human Vibe Score1
activepiecesMar 28, 2025

activepieces

An open source replacement for Zapier Documentation 🌪️ Create a Piece 🖉 Deploy 🔥 Join Discord 🤯 Welcome to Activepieces Your friendliest open source all-in-one automation tool, designed to be extensible through a type-safe pieces framework written in Typescript. 🔥 Why Activepieces is Different: 💖 Loved by Everyone: Intuitive interface and great experience for both technical and non-technical users with a quick learning curve. 🌐 Open Ecosystem: All pieces are open source and available on npmjs.com, 60% of the pieces are contributed by the community. 🛠️ Pieces are written in Typescript: Pieces are npm packages in TypeScript, offering full customization with the best developer experience, including hot reloading for local piece development on your machine. 😎 🤖 AI-Ready: Native AI pieces let you experiment with various providers, or create your own agents using our AI SDK, and there is Copilot to help you build flows inside the builder. 🏢 Enterprise-Ready: Developers set up the tools, and anyone in the organization can use the no-code builder. Full customization from branding to control. 🔒 Secure by Design: Self-hosted and network-gapped for maximum security and control over your data. 🧠 Human in the Loop: Delay execution for a period of time or require approval. These are just pieces built on top of the piece framework, and you can build many pieces like that. 🎨 💻 Human Input Interfaces: Built-in support for human input triggers like "Chat Interface" 💬 and "Form Interface" 📝 🛠️ Builder Features: [x] Loops [x] Branches [x] Auto Retries [x] HTTP [x] Code with NPM [x] ASK AI in Code Piece (Non technical user can clean data without knowing to code) [x] Flows are fully versioned. [x] Languages Translations [x] Customizable Templates [X] 200+ Pieces, check https://www.activepieces.com/pieces We release updates frequently. Check the product changelog for the latest features. 🔌 Create Your Own Piece Activepieces supports integrations with Google Sheets, OpenAI, Discord, RSS, and over 200 other services. Check out the full list of supported integrations, which is constantly expanding thanks to our community's contributions. As an open ecosystem, all integration source code is accessible in our repository. These integrations are versioned and published directly to npmjs.com upon contribution. You can easily create your own integration using our TypeScript framework. For detailed instructions, please refer to our Contributor's Guide. License Activepieces' Community Edition is released as open source under the MIT license and enterprise features are released under Commercial License Read more about the feature comparison here https://www.activepieces.com/docs/about/editions 💭 Join Our Community 🌐 Contributions We welcome contributions big or small and in different directions. The best way to do this is to check this document and we are always up to talk on our Discord Server. 📚 Translations Not into coding but still interested in contributing? Come join our Discord and visit https://www.activepieces.com/docs/about/i18n for more information. !fr translation].data.translationProgress&url=https%3A%2F%2Fbadges.awesome-crowdin.com%2Fstats-16093902-626364-update.json) !it translation].data.translationProgress&url=https%3A%2F%2Fbadges.awesome-crowdin.com%2Fstats-16093902-626364-update.json) !de translation].data.translationProgress&url=https%3A%2F%2Fbadges.awesome-crowdin.com%2Fstats-16093902-626364-update.json) !ja translation].data.translationProgress&url=https%3A%2F%2Fbadges.awesome-crowdin.com%2Fstats-16093902-626364-update.json) !pt-BR translation].data.translationProgress&url=https%3A%2F%2Fbadges.awesome-crowdin.com%2Fstats-16093902-626364-update.json) 🦫 Contributors ShahedAlMashni🔌 AbdulTheActivePiecer🚧 Khaled Mashaly🚧 Mohammed Abu Aboud🚧 Abdulrahman Zeineddin🔌 ahmad jaber🔌 ashrafsamhouri🔌 Mohammad Abu Musa📆 Mukewa Wekalao🔌 Osama Abdallah Essa Haikal🔌 Arman🛡️ Oskar Krämer📖 Thibaut Patel🤔 🔌 Applesaucesomer🤔 crazyTweek🤔 Muhammad Tabaza🔌 Shay Punter📖 🔌 abaza738🔌 Jona Boeddinghaus🔌 fomojola💻 Alexander Storozhevsky💻 J0LGER🛡️ Patrick Veverka🐛 Berk Sümbül📖 Willian Guedes🔌 Abdullah Ranginwala💻 Dennis Tychsen🔌 MyWay🔌 Bibhuti Bhusan Panda🔌 Tarun Samanta🐛 Herman Kudria🔌 [NULL] Dev🔌 Jan Bebendorf🔌 Nilesh🔌 Vraj Gohil🔌 BastienMe🔌 Stephen Foskett📖 Nathan📖 Marcin Natanek🔌 Mark van Bellen🔌 Olivier Guzzi🔌 Osama Zakarneh🔌 phestvik🤔 Rajdeep Pal📖 Camilo Usuga🔌 Kishan Parmar📖 🔌 BBND🔌 Haseeb Rehman🔌 Rita Gorokhod🔌 Fábio Ferreira🔌 Florin Buffet📖 Drew Lewis🔌 Benjamin André-Micolon🔌 Denis Gurskij🔌 Nefer Lopez📖 fardeenpanjwani-codeglo📖 Landon Moir🔌 Diego Nijboer🔌 Tân Một Nắng🔌 Gavin Foley📖 Dennis Trautwein🐛 Andrew Rosenblatt🐛 rika🔌 Cyril Selasi🔌 Franck Nijimbere🔌 Aleksandr Denisov🔌 Reuben Swartz📖 joselupianez🔌 Awais Manzoor🐛 💻 Andrei🐛 derbbre📖 Maor Rozenfeld💻 Michael Huynh📖 Filip Dunđer💻 Don Thorp📖 Joe Workman🔌 Aykut Akgün💻 Yann Petitjean🔌 🐛 pfernandez98🔌 Daniel O.🔌 Meng-Yuan Huang📖 Leyla🐛 i-nithin🔌 la3rence🔌 Dennis Rongo🐛 🔌 Kartik Mehta📖 💻 Zakher Masri📖 💻 AbdullahBitar🔌 Mario Meyer🔌 Karim Khaleel🔌 CPonchet🐛 Olivier Sambourg🔌 Ahmad(Ed)🔌 leenmashni🔌 M Abdul Rauf📖 Vincent Barrier🔌 John💻 🔌 Joost de Valk🔌 MJ🔌 ShravanShenoy💻 Jon Kristian📖 cr0fters🐛 Bibek Timsina🐛 Viktor Szépe💻 Rendy Tan📖 🔌 Islam Abdelfattah🐛 Yoonjae Choi💻 Javier HM🔌 Mohamed Hassan🐛 Christian Schab🔌 Pratik Kinage🔌 Abdelrahman Mostafa 🔌 Hamza Zagha🐛 Lasse Schuirmann🔌 Cyril Duchon-Doris🔌 Javiink🔌 Harshit Harchani🔌 MrAkber📖 marek-slavicek🔌 hugh-codes🔌 Alex Lewis🐛 Yuanlin Lin📖 Ala Shiban📖 hamsh💻 Anne Mariel Catapang🔌 Carlo Gino Catapang🔌 Aditya Rathore🔌 coderbob2🔌 Ramy Gamal🔌 Alexandru-Dan Pop💻 Frank Micheal 🔌 Emmanuel Ferdman📖 Sany A🔌 Niels Swimberghe🐛 lostinbug🔌 gushkool🔌 Omar Sayed🔌 rSnapkoOpenOps🐛 ahronshor🔌 Cezar🐛 Shawn Lim🔌 Shawn Lim🔌 pavloDeshko🐛 abc💻 manoj kumar d🔌 Feli🔌 Miguel🔌 Instasent DEV🔌 Matthieu Lombard🔌 beyondlevi🔌 Rafal Zawadzki🔌 Simon Courtois🔌 alegria-solutions🔌 D-Rowe-FS🔌 张晟杰🔌 Ashot🔌 Amr Abu Aza🔌 John Goodliff🔌 Diwash Dev🔌 André🔌 Lou | Digital Marketing🔌 Maarten Coppens🔌 Mahmoud Hamed🔌 Theo Dammaretz🔌 s31w4n📖 Abdul Rahman🔌 Kent Smith🔌 Arvind Ramesh💻 valentin-mourtialon🔌 psgpsg16🔌 Mariia Shyn🔌 Joshua Heslin🔌 Ahmad🔌 you💻 Daniel Poon💻 Kévin Yu🔌 노영은🔌 reemayoush🔌 Brice🛡️ Mg Wunna🔌 This project follows the all-contributors specification. Contributions of any kind are welcome!

mentals-ai
github
LLM Vibe Score0.476
Human Vibe Score0.004852164397547106
turing-machinesMar 28, 2025

mentals-ai

Mentals AI is a tool designed for creating and operating agents that feature loops, memory, and various tools, all through straightforward markdown files with a .gen extension. Think of an agent file as an executable file. You focus entirely on the logic of the agent, eliminating the necessity to write scaffolding code in Python or any other language. Essentially, it redefines the foundational frameworks for future AI applications 🍓 [!NOTE] [work in progress] A local vector database to store your chats with the agents as well as your private information. See memory branch. [work in progress] Web UI with agents, tools, and vector storage Getting Started Differences from Other Frameworks Key Concepts Instruction (prompt) Working Memory (context) Short-Term Memory (experimental) Control flow: From strings to algorithms Roadmap The Idea 📌 Examples Word chain game in a self-loop controlled by LLM: !Word Chain game in a loop NLOP — Natural Language Operation Or more complex use cases: | 🔄 Any multi-agent interactions | 👾 Space Invaders generator agent | 🍄 2D platformer generator agent | |--------------------|-----------|--------------| |!react | !spaceinvaders.gen | !mario.gen | Or help with the content: Collect YouTube videos on a given topic and save them to a .csv file with the videos, views, channel name, and link; Get the transcription from the video and create a table of contents; Take top news from Hacker News, choose a topic and write an article on the topic with the participation of the critic, and save to a file. All of the above examples are located in the agents folder. [!NOTE] Llama3 support is available for providers using a compatible OpenAI API. 🚀 Getting Started Begin by securing an OpenAI API key through the creation of an OpenAI account. If you already have an API key, skip this step. 🏗️ Build and Run Prerequisites Before building the project, ensure the following dependencies are installed: libcurl: Used for making HTTP requests libfmt: Provides an API for formatting pgvector: Vector operations with PostgreSQL poppler: Required for PDF processing Depending on your operating system, you can install these using the following commands: Linux macOS Windows For Windows, it's recommended to use vcpkg or a similar package manager: pgvector installation [!NOTE] In the main branch you can skip this step Build from sources Docker, Homebrew, PGXN, APT, etc. Clone the repository Configuration Place your API key in the config.toml file: Build the project Run 🆚 Differences from Other Frameworks Mentals AI distinguishes itself from other frameworks in three significant ways: The Agent Executor 🧠 operates through a recursive loop. The LLM determines the next steps: selecting instructions (prompts) and managing data based on previous loops. This recursive decision-making process is integral to our system, outlined in mentalssystem.prompt Agents of any complexity can be created using Markdown, eliminating the need for traditional programming languages. However, Python can be integrated directly into the agent's Markdown script if necessary. Unlike platforms that include preset reasoning frameworks, Mentals AI serves as a blank canvas. It enables the creation and integration of your own reasoning frameworks, including existing ones: Tree of Thoughts, ReAct, Self-Discovery, Auto-CoT, and others. One can also link these frameworks together into more complex sequences, even creating a network of various reasoning frameworks. 🗝️ Key Concepts The agent file is a textual description of the agent instructions with a .gen extension. 📖 Instruction (prompt) Instruction is the basic component of an agent in Mentals. An agent can consist of one or more instructions, which can refer to each other. Instructions can be written in free form, but they always have a name that starts with the # symbol. The use: directive is used to specify a reference to other instructions. Multiple references are listed separated by commas. Below is an example with two instructions root and meme_explain with a reference: In this example, the root instruction calls the memeexplain instruction. The response from memeexplain is then returned to the instruction from which it was called, namely the root. An instruction can take an input parameter, which is automatically generated based on the context when the instruction is called. To specify the input data more precisely, you can use a free-form prompt in the input: directive, such as a JSON object or null. Using a document for input: Using a JSON object as input: [!NOTE] Instruction calls are implemented independently from function or tool calls at OpenAI, enabling the operation of agents with models like Llama3. The implementation of instruction calls is transparent and included in the mentals_system.prompt file. 🛠️ Tool Tool is a kind of instruction. Mentals has a set of native tools to handle message output, user input, file handling, Python interpreter, Bash commands, and Short-term memory. Ask user example: File handling example: The full list of native tools is listed in the file native_tools.toml. 🧠 Working Memory (context) Each instruction has its own working memory — context. When exiting an instruction and re-entering it, the context is kept by default. To clear the context when exiting an instruction, you can use the keep_context: false directive: By default, the size of the instruction context is not limited. To limit the context, there is a directive max_context: number which specifies that only the number of the most recent messages should be stored. Older messages will be pushed out of the context. This feature is useful when you want to keep the most recent data in context so that older data does not affect the chain of reasoning. ⏳ Short-Term Memory (experimental) Short-term memory allows for the storage of intermediate results from an agent's activities, which can then be used for further reasoning. The contents of this memory are accessible across all instruction contexts. The memory tool is used to store data. When data is stored, a keyword and a description of the content are generated. In the example below, the meme_recall instruction is aware of the meme because it was previously stored in memory. ⚙️ Control flow: From strings to algorithms The control flow, which includes conditions, instruction calls, and loops (such as ReAct, Auto-CoT, etc.), is fully expressed in natural language. This method enables the creation of semantic conditions that direct data stream branching. For instance, you can request an agent to autonomously play a word chain game in a loop or establish an ambiguous exit condition: exit the loop if you are satisfied with the result. Here, the language model and its context determine whether to continue or stop. All this is achieved without needing to define flow logic in Python or any other programming language. ⚖️ Reason Action (ReAct) example 🌳 Tree of Thoughts (ToT) example The idea behind ToT is to generate multiple ideas to solve a problem and then evaluate their value. Valuable ideas are kept and developed, other ideas are discarded. Let's take the example of the 24 game. The 24 puzzle is an arithmetical puzzle in which the objective is to find a way to manipulate four integers so that the end result is 24. First, we define the instruction that creates and manipulates the tree data structure. The model knows what a tree is and can represent it in any format, from plain text to XML/JSON or any custom format. In this example, we will use the plain text format: Next, we need to initialize the tree with initial data, let's start with the root instruction: Calling the root instruction will suggest 8 possible next steps to calculate with the first 2 numbers and store these steps as tree nodes. Further work by the agent results in the construction of a tree that is convenient for the model to understand and infer the final answer. A complete example is contained in the agents/treestructure.gen 🗺️ Roadmap [ ] Web UI -- WIP [ ] Vector database tools -- WIP [ ] Agent's experience (experimental) [ ] Tools: Image generation, Browser ✨ The Idea The concept originated from studies on psychoanalysis Executive functions, Exploring Central Executive, Alan Baddeley, 1996. He described a system that orchestrates cognitive processes and working memory, facilitating retrievals from long-term memory. The LLM functions as System 1, processing queries and executing instructions without inherent motivation or goal-setting. So, what then is System 2? Drawing from historical insights now reconsidered through a scientific lens: The central executive, or executive functions, is crucial for controlled processing in working memory. It manages tasks including directing attention, maintaining task objectives, decision-making, and memory retrieval. This sparks an intriguing possibility: constructing more sophisticated agents by integrating System 1 and System 2. The LLM, as the cognitive executor System 1, works in tandem with the Central Executive System 2, which governs and controls the LLM. This partnership forms the dual relationship foundational to Mentals AI.

GenAI_Agents
github
LLM Vibe Score0.563
Human Vibe Score0.24210481455988786
NirDiamantMar 28, 2025

GenAI_Agents

🌟 Support This Project: Your sponsorship fuels innovation in GenAI agent development. Become a sponsor to help maintain and expand this valuable resource! GenAI Agents: Comprehensive Repository for Development and Implementation 🚀 Welcome to one of the most extensive and dynamic collections of Generative AI (GenAI) agent tutorials and implementations available today. This repository serves as a comprehensive resource for learning, building, and sharing GenAI agents, ranging from simple conversational bots to complex, multi-agent systems. 📫 Stay Updated! 🚀Cutting-edgeUpdates 💡ExpertInsights 🎯Top 0.1%Content Join over 15,000 of AI enthusiasts getting unique cutting-edge insights and free tutorials! Plus, subscribers get exclusive early access and special 33% discounts to my book and the upcoming RAG Techniques course! Introduction Generative AI agents are at the forefront of artificial intelligence, revolutionizing the way we interact with and leverage AI technologies. This repository is designed to guide you through the development journey, from basic agent implementations to advanced, cutting-edge systems. 📚 Learn to Build Your First AI Agent Your First AI Agent: Simpler Than You Think This detailed blog post complements the repository by providing a complete A-Z walkthrough with in-depth explanations of core concepts, step-by-step implementation, and the theory behind AI agents. It's designed to be incredibly simple to follow while covering everything you need to know to build your first working agent from scratch. 💡 Plus: Subscribe to the newsletter for exclusive early access to tutorials and special discounts on upcoming courses and books! Our goal is to provide a valuable resource for everyone - from beginners taking their first steps in AI to seasoned practitioners pushing the boundaries of what's possible. By offering a range of examples from foundational to complex, we aim to facilitate learning, experimentation, and innovation in the rapidly evolving field of GenAI agents. Furthermore, this repository serves as a platform for showcasing innovative agent creations. Whether you've developed a novel agent architecture or found an innovative application for existing techniques, we encourage you to share your work with the community. Related Projects 📚 Dive into my comprehensive guide on RAG techniques to learn about integrating external knowledge into AI systems, enhancing their capabilities with up-to-date and relevant information retrieval. 🖋️ Explore my Prompt Engineering Techniques guide for an extensive collection of prompting strategies, from fundamental concepts to advanced methods, improving your ability to communicate effectively with AI language models. A Community-Driven Knowledge Hub This repository grows stronger with your contributions! Join our vibrant Discord community — the central hub for shaping and advancing this project together 🤝 GenAI Agents Discord Community Whether you're a novice eager to learn or an expert ready to share your knowledge, your insights can shape the future of GenAI agents. Join us to propose ideas, get feedback, and collaborate on innovative implementations. For contribution guidelines, please refer to our CONTRIBUTING.md file. Let's advance GenAI agent technology together! 🔗 For discussions on GenAI, agents, or to explore knowledge-sharing opportunities, feel free to connect on LinkedIn. Key Features 🎓 Learn to build GenAI agents from beginner to advanced levels 🧠 Explore a wide range of agent architectures and applications 📚 Step-by-step tutorials and comprehensive documentation 🛠️ Practical, ready-to-use agent implementations 🌟 Regular updates with the latest advancements in GenAI 🤝 Share your own agent creations with the community GenAI Agent Implementations Explore our extensive list of GenAI agent implementations, sorted by categories: 🌱 Beginner-Friendly Agents Simple Conversational Agent LangChain PydanticAI Overview 🔎 A context-aware conversational AI maintains information across interactions, enabling more natural dialogues. Implementation 🛠️ Integrates a language model, prompt template, and history manager to generate contextual responses and track conversation sessions. Simple Question Answering Agent Overview 🔎 Answering (QA) agent using LangChain and OpenAI's language model understands user queries and provides relevant, concise answers. Implementation 🛠️ Combines OpenAI's GPT model, a prompt template, and an LLMChain to process user questions and generate AI-driven responses in a streamlined manner. Simple Data Analysis Agent LangChain PydanticAI Overview 🔎 An AI-powered data analysis agent interprets and answers questions about datasets using natural language, combining language models with data manipulation tools for intuitive data exploration. Implementation 🛠️ Integrates a language model, data manipulation framework, and agent framework to process natural language queries and perform data analysis on a synthetic dataset, enabling accessible insights for non-technical users. 🔧 Framework Tutorial: LangGraph Introduction to LangGraph: Building Modular AI Workflows Overview 🔎 This tutorial introduces LangGraph, a powerful framework for creating modular, graph-based AI workflows. Learn how to leverage LangGraph to build more complex and flexible AI agents that can handle multi-step processes efficiently. Implementation 🛠️ Step-by-step guide on using LangGraph to create a StateGraph workflow. The tutorial covers key concepts such as state management, node creation, and graph compilation. It demonstrates these principles by constructing a simple text analysis pipeline, serving as a foundation for more advanced agent architectures. Additional Resources 📚 Blog Post 🎓 Educational and Research Agents ATLAS: Academic Task and Learning Agent System Overview 🔎 ATLAS demonstrates how to build an intelligent multi-agent system that transforms academic support through AI-powered assistance. The system leverages LangGraph's workflow framework to coordinate multiple specialized agents that provide personalized academic planning, note-taking, and advisory support. Implementation 🛠️ Implements a state-managed multi-agent architecture using four specialized agents (Coordinator, Planner, Notewriter, and Advisor) working in concert through LangGraph's workflow framework. The system features sophisticated workflows for profile analysis and academic support, with continuous adaptation based on student performance and feedback. Additional Resources 📚 YouTube Explanation Blog Post Scientific Paper Agent - Literature Review Overview 🔎 An intelligent research assistant that helps users navigate, understand, and analyze scientific literature through an orchestrated workflow. The system combines academic APIs with sophisticated paper processing techniques to automate literature review tasks, enabling researchers to efficiently extract insights from academic papers while maintaining research rigor and quality control. Implementation 🛠️ Leverages LangGraph to create a five-node workflow system including decision making, planning, tool execution, and quality validation nodes. The system integrates the CORE API for paper access, PDFplumber for document processing, and advanced language models for analysis. Key features include a retry mechanism for robust paper downloads, structured data handling through Pydantic models, and quality-focused improvement cycles with human-in-the-loop validation options. Additional Resources 📚 YouTube Explanation Blog Post Chiron - A Feynman-Enhanced Learning Agent Overview 🔎 An adaptive learning agent that guides users through educational content using a structured checkpoint system and Feynman-style teaching. The system processes learning materials (either user-provided or web-retrieved), verifies understanding through interactive checkpoints, and provides simplified explanations when needed, creating a personalized learning experience that mimics one-on-one tutoring. Implementation 🛠️ Uses LangGraph to orchestrate a learning workflow that includes checkpoint definition, context building, understanding verification, and Feynman teaching nodes. The system integrates web search for dynamic content retrieval, employs semantic chunking for context processing, and manages embeddings for relevant information retrieval. Key features include a 70% understanding threshold for progression, interactive human-in-the-loop validation, and structured output through Pydantic models for consistent data handling. Additional Resources 📚 YouTube Explanation 💼 Business and Professional Agents Customer Support Agent (LangGraph) Overview 🔎 An intelligent customer support agent using LangGraph categorizes queries, analyzes sentiment, and provides appropriate responses or escalates issues. Implementation 🛠️ Utilizes LangGraph to create a workflow combining state management, query categorization, sentiment analysis, and response generation. Essay Grading Agent (LangGraph) Overview 🔎 An automated essay grading system using LangGraph and an LLM model evaluates essays based on relevance, grammar, structure, and depth of analysis. Implementation 🛠️ Utilizes a state graph to define the grading workflow, incorporating separate grading functions for each criterion. Travel Planning Agent (LangGraph) Overview 🔎 A Travel Planner using LangGraph demonstrates how to build a stateful, multi-step conversational AI application that collects user input and generates personalized travel itineraries. Implementation 🛠️ Utilizes StateGraph to define the application flow, incorporates custom PlannerState for process management. GenAI Career Assistant Agent Overview 🔎 The GenAI Career Assistant demonstrates how to create a multi-agent system that provides personalized guidance for careers in Generative AI. Using LangGraph and Gemini LLM, the system delivers customized learning paths, resume assistance, interview preparation, and job search support. Implementation 🛠️ Leverages a multi-agent architecture using LangGraph to coordinate specialized agents (Learning, Resume, Interview, Job Search) through TypedDict-based state management. The system employs sophisticated query categorization and routing while integrating with external tools like DuckDuckGo for job searches and dynamic content generation. Additional Resources 📚 YouTube Explanation Project Manager Assistant Agent Overview 🔎 An AI agent designed to assist in project management tasks by automating the process of creating actionable tasks from project descriptions, identifying dependencies, scheduling work, and assigning tasks to team members based on expertise. The system includes risk assessment and self-reflection capabilities to optimize project plans through multiple iterations, aiming to minimize overall project risk. Implementation 🛠️ Leverages LangGraph to orchestrate a workflow of specialized nodes including task generation, dependency mapping, scheduling, allocation, and risk assessment. Each node uses GPT-4o-mini for structured outputs following Pydantic models. The system implements a feedback loop for self-improvement, where risk scores trigger reflection cycles that generate insights to optimize the project plan. Visualization tools display Gantt charts of the generated schedules across iterations. Additional Resources 📚 YouTube Explanation Contract Analysis Assistant (ClauseAI) Overview 🔎 ClauseAI demonstrates how to build an AI-powered contract analysis system using a multi-agent approach. The system employs specialized AI agents for different aspects of contract review, from clause analysis to compliance checking, and leverages LangGraph for workflow orchestration and Pinecone for efficient clause retrieval and comparison. Implementation 🛠️ Implements a sophisticated state-based workflow using LangGraph to coordinate multiple AI agents through contract analysis stages. The system features Pydantic models for data validation, vector storage with Pinecone for clause comparison, and LLM-based analysis for generating comprehensive contract reports. The implementation includes parallel processing capabilities and customizable report generation based on user requirements. Additional Resources 📚 YouTube Explanation E2E Testing Agent Overview 🔎 The E2E Testing Agent demonstrates how to build an AI-powered system that converts natural language test instructions into executable end-to-end web tests. Using LangGraph for workflow orchestration and Playwright for browser automation, the system enables users to specify test cases in plain English while handling the complexity of test generation and execution. Implementation 🛠️ Implements a structured workflow using LangGraph to coordinate test generation, validation, and execution. The system features TypedDict state management, integration with Playwright for browser automation, and LLM-based code generation for converting natural language instructions into executable test scripts. The implementation includes DOM state analysis, error handling, and comprehensive test reporting. Additional Resources 📚 YouTube Explanation 🎨 Creative and Content Generation Agents GIF Animation Generator Agent (LangGraph) Overview 🔎 A GIF animation generator that integrates LangGraph for workflow management, GPT-4 for text generation, and DALL-E for image creation, producing custom animations from user prompts. Implementation 🛠️ Utilizes LangGraph to orchestrate a workflow that generates character descriptions, plots, and image prompts using GPT-4, creates images with DALL-E 3, and assembles them into GIFs using PIL. Employs asynchronous programming for efficient parallel processing. TTS Poem Generator Agent (LangGraph) Overview 🔎 An advanced text-to-speech (TTS) agent using LangGraph and OpenAI's APIs classifies input text, processes it based on content type, and generates corresponding speech output. Implementation 🛠️ Utilizes LangGraph to orchestrate a workflow that classifies input text using GPT models, applies content-specific processing, and converts the processed text to speech using OpenAI's TTS API. The system adapts its output based on the identified content type (general, poem, news, or joke). Music Compositor Agent (LangGraph) Overview 🔎 An AI Music Compositor using LangGraph and OpenAI's language models generates custom musical compositions based on user input. The system processes the input through specialized components, each contributing to the final musical piece, which is then converted to a playable MIDI file. Implementation 🛠️ LangGraph orchestrates a workflow that transforms user input into a musical composition, using ChatOpenAI (GPT-4) to generate melody, harmony, and rhythm, which are then style-adapted. The final AI-generated composition is converted to a MIDI file using music21 and can be played back using pygame. Content Intelligence: Multi-Platform Content Generation Agent Overview 🔎 Content Intelligence demonstrates how to build an advanced content generation system that transforms input text into platform-optimized content across multiple social media channels. The system employs LangGraph for workflow orchestration to analyze content, conduct research, and generate tailored content while maintaining brand consistency across different platforms. Implementation 🛠️ Implements a sophisticated workflow using LangGraph to coordinate multiple specialized nodes (Summary, Research, Platform-Specific) through the content generation process. The system features TypedDict and Pydantic models for state management, integration with Tavily Search for research enhancement, and platform-specific content generation using GPT-4. The implementation includes parallel processing for multiple platforms and customizable content templates. Additional Resources 📚 YouTube Explanation Business Meme Generator Using LangGraph and Memegen.link Overview 🔎 The Business Meme Generator demonstrates how to create an AI-powered system that generates contextually relevant memes based on company website analysis. Using LangGraph for workflow orchestration, the system combines Groq's Llama model for text analysis and the Memegen.link API to automatically produce brand-aligned memes for digital marketing. Implementation 🛠️ Implements a state-managed workflow using LangGraph to coordinate website content analysis, meme concept generation, and image creation. The system features Pydantic models for data validation, asynchronous processing with aiohttp, and integration with external APIs (Groq, Memegen.link) to create a complete meme generation pipeline with customizable templates. Additional Resources 📚 YouTube Explanation Murder Mystery Game with LLM Agents Overview 🔎 A text-based detective game that utilizes autonomous LLM agents as interactive characters in a procedurally generated murder mystery. Drawing inspiration from the UNBOUNDED paper, the system creates unique scenarios each time, with players taking on the role of Sherlock Holmes to solve the case through character interviews and deductive reasoning. Implementation 🛠️ Leverages two LangGraph workflows - a main game loop for story/character generation and game progression, and a conversation sub-graph for character interactions. The system uses a combination of LLM-powered narrative generation, character AI, and structured game mechanics to create an immersive investigative experience with replayable storylines. Additional Resources 📚 YouTube Explanation 📊 Analysis and Information Processing Agents Memory-Enhanced Conversational Agent Overview 🔎 A memory-enhanced conversational AI agent incorporates short-term and long-term memory systems to maintain context within conversations and across multiple sessions, improving interaction quality and personalization. Implementation 🛠️ Integrates a language model with separate short-term and long-term memory stores, utilizes a prompt template incorporating both memory types, and employs a memory manager for storage and retrieval. The system includes an interaction loop that updates and utilizes memories for each response. Multi-Agent Collaboration System Overview 🔎 A multi-agent collaboration system combining historical research with data analysis, leveraging large language models to simulate specialized agents working together to answer complex historical questions. Implementation 🛠️ Utilizes a base Agent class to create specialized HistoryResearchAgent and DataAnalysisAgent, orchestrated by a HistoryDataCollaborationSystem. The system follows a five-step process: historical context provision, data needs identification, historical data provision, data analysis, and final synthesis. Self-Improving Agent Overview 🔎 A Self-Improving Agent using LangChain engages in conversations, learns from interactions, and continuously improves its performance over time through reflection and adaptation. Implementation 🛠️ Integrates a language model with chat history management, response generation, and a reflection mechanism. The system employs a learning system that incorporates insights from reflection to enhance future performance, creating a continuous improvement loop. Task-Oriented Agent Overview 🔎 A language model application using LangChain that summarizes text and translates the summary to Spanish, combining custom functions, structured tools, and an agent for efficient text processing. Implementation 🛠️ Utilizes custom functions for summarization and translation, wrapped as structured tools. Employs a prompt template to guide the agent, which orchestrates the use of tools. An agent executor manages the process, taking input text and producing both an English summary and its Spanish translation. Internet Search and Summarize Agent Overview 🔎 An intelligent web research assistant that combines web search capabilities with AI-powered summarization, automating the process of gathering information from the internet and distilling it into concise, relevant summaries. Implementation 🛠️ Integrates a web search module using DuckDuckGo's API, a result parser, and a text summarization engine leveraging OpenAI's language models. The system performs site-specific or general searches, extracts relevant content, generates concise summaries, and compiles attributed results for efficient information retrieval and synthesis. Multi agent research team - Autogen Overview 🔎 This technique explores a multi-agent system for collaborative research using the AutoGen library. It employs agents to solve tasks collaboratively, focusing on efficient execution and quality assurance. The system enhances research by distributing tasks among specialized agents. Implementation 🛠️ Agents are configured with specific roles using the GPT-4 model, including admin, developer, planner, executor, and quality assurance. Interaction management ensures orderly communication with defined transitions. Task execution involves collaborative planning, coding, execution, and quality checking, demonstrating a scalable framework for various domains. Additional Resources 📚 comprehensive solution with UI Blogpost Sales Call Analyzer Overview 🔎 An intelligent system that automates the analysis of sales call recordings by combining audio transcription with advanced natural language processing. The analyzer transcribes audio using OpenAI's Whisper, processes the text using NLP techniques, and generates comprehensive reports including sentiment analysis, key phrases, pain points, and actionable recommendations to improve sales performance. Implementation 🛠️ Utilizes multiple components in a structured workflow: OpenAI Whisper for audio transcription, CrewAI for task automation and agent management, and LangChain for orchestrating the analysis pipeline. The system processes audio through a series of steps from transcription to detailed analysis, leveraging custom agents and tasks to generate structured JSON reports containing insights about customer sentiment, sales opportunities, and recommended improvements. Additional Resources 📚 YouTube Explanation Weather Emergency & Response System Overview 🔎 A comprehensive system demonstrating two agent graph implementations for weather emergency response: a real-time graph processing live weather data, and a hybrid graph combining real and simulated data for testing high-severity scenarios. The system handles complete workflow from data gathering through emergency plan generation, with automated notifications and human verification steps. Implementation 🛠️ Utilizes LangGraph for orchestrating complex workflows with state management, integrating OpenWeatherMap API for real-time data, and Gemini for analysis and response generation. The system incorporates email notifications, social media monitoring simulation, and severity-based routing with configurable human verification for low/medium severity events. Additional Resources 📚 YouTube Explanation Self-Healing Codebase System Overview 🔎 An intelligent system that automatically detects, diagnoses, and fixes runtime code errors using LangGraph workflow orchestration and ChromaDB vector storage. The system maintains a memory of encountered bugs and their fixes through vector embeddings, enabling pattern recognition for similar errors across the codebase. Implementation 🛠️ Utilizes a state-based graph workflow that processes function definitions and runtime arguments through specialized nodes for error detection, code analysis, and fix generation. Incorporates ChromaDB for vector-based storage of bug patterns and fixes, with automated search and retrieval capabilities for similar error patterns, while maintaining code execution safety through structured validation steps. Additional Resources 📚 YouTube Explanation DataScribe: AI-Powered Schema Explorer Overview 🔎 An intelligent agent system that enables intuitive exploration and querying of relational databases through natural language interactions. The system utilizes a fleet of specialized agents, coordinated by a stateful Supervisor, to handle schema discovery, query planning, and data analysis tasks while maintaining contextual understanding through vector-based relationship graphs. Implementation 🛠️ Leverages LangGraph for orchestrating a multi-agent workflow including discovery, inference, and planning agents, with NetworkX for relationship graph visualization and management. The system incorporates dynamic state management through TypedDict classes, maintains database context between sessions using a db_graph attribute, and includes safety measures to prevent unauthorized database modifications. Memory-Enhanced Email Agent (LangGraph & LangMem) Overview 🔎 An intelligent email assistant that combines three types of memory (semantic, episodic, and procedural) to create a system that improves over time. The agent can triage incoming emails, draft contextually appropriate responses using stored knowledge, and enhance its performance based on user feedback. Implementation 🛠️ Leverages LangGraph for workflow orchestration and LangMem for sophisticated memory management across multiple memory types. The system implements a triage workflow with memory-enhanced decision making, specialized tools for email composition and calendar management, and a self-improvement mechanism that updates its own prompts based on feedback and past performance. Additional Resources 📚 Blog Post 📰 News and Information Agents News TL;DR using LangGraph Overview 🔎 A news summarization system that generates concise TL;DR summaries of current events based on user queries. The system leverages large language models for decision making and summarization while integrating with news APIs to access up-to-date content, allowing users to quickly catch up on topics of interest through generated bullet-point summaries. Implementation 🛠️ Utilizes LangGraph to orchestrate a workflow combining multiple components: GPT-4o-mini for generating search terms and article summaries, NewsAPI for retrieving article metadata, BeautifulSoup for web scraping article content, and Asyncio for concurrent processing. The system follows a structured pipeline from query processing through article selection and summarization, managing the flow between components to produce relevant TL;DRs of current news articles. Additional Resources 📚 YouTube Explanation Blog Post AInsight: AI/ML Weekly News Reporter Overview 🔎 AInsight demonstrates how to build an intelligent news aggregation and summarization system using a multi-agent architecture. The system employs three specialized agents (NewsSearcher, Summarizer, Publisher) to automatically collect, process and summarize AI/ML news for general audiences through LangGraph-based workflow orchestration. Implementation 🛠️ Implements a state-managed multi-agent system using LangGraph to coordinate the news collection (Tavily API), technical content summarization (GPT-4), and report generation processes. The system features modular architecture with TypedDict-based state management, external API integration, and markdown report generation with customizable templates. Additional Resources 📚 YouTube Explanation Journalism-Focused AI Assistant Overview 🔎 A specialized AI assistant that helps journalists tackle modern journalistic challenges like misinformation, bias, and information overload. The system integrates fact-checking, tone analysis, summarization, and grammar review tools to enhance the accuracy and efficiency of journalistic work while maintaining ethical reporting standards. Implementation 🛠️ Leverages LangGraph to orchestrate a workflow of specialized components including language models for analysis and generation, web search integration via DuckDuckGo's API, document parsing tools like PyMuPDFLoader and WebBaseLoader, text splitting with RecursiveCharacterTextSplitter, and structured JSON outputs. Each component works together through a unified workflow to analyze content, verify facts, detect bias, extract quotes, and generate comprehensive reports. Blog Writer (Open AI Swarm) Overview 🔎 A multi-agent system for collaborative blog post creation using OpenAI's Swarm package. It leverages specialized agents to perform research, planning, writing, and editing tasks efficiently. Implementation 🛠️ Utilizes OpenAI's Swarm Package to manage agent interactions. Includes an admin, researcher, planner, writer, and editor, each with specific roles. The system follows a structured workflow: topic setting, outlining, research, drafting, and editing. This approach enhances content creation through task distribution, specialization, and collaborative problem-solving. Additional Resources 📚 Swarm Repo Podcast Internet Search and Generate Agent 🎙️ Overview 🔎 A two step agent that first searches the internet for a given topic and then generates a podcast on the topic found. The search step uses a search agent and search function to find the most relevant information. The second step uses a podcast generation agent and generation function to create a podcast on the topic found. Implementation 🛠️ Utilizes LangGraph to orchestrate a two-step workflow. The first step involves a search agent and function to gather information from the internet. The second step uses a podcast generation agent and function to create a podcast based on the gathered information. 🛍️ Shopping and Product Analysis Agents ShopGenie - Redefining Online Shopping Customer Experience Overview 🔎 An AI-powered shopping assistant that helps customers make informed purchasing decisions even without domain expertise. The system analyzes product information from multiple sources, compares specifications and reviews, identifies the best option based on user needs, and delivers recommendations through email with supporting video reviews, creating a comprehensive shopping experience. Implementation 🛠️ Uses LangGraph to orchestrate a workflow combining Tavily for web search, Llama-3.1-70B for structured data analysis and product comparison, and YouTube API for review video retrieval. The system processes search results through multiple nodes including schema mapping, product comparison, review identification, and email generation. Key features include structured Pydantic models for consistent data handling, retry mechanisms for robust API interactions, and email delivery through SMTP for sharing recommendations. Additional Resources 📚 YouTube Explanation Car Buyer AI Agent Overview 🔎 The Smart Product Buyer AI Agent demonstrates how to build an intelligent system that assists users in making informed purchasing decisions. Using LangGraph and LLM-based intelligence, the system processes user requirements, scrapes product listings from websites like AutoTrader, and provides detailed analysis and recommendations for car purchases. Implementation 🛠️ Implements a state-based workflow using LangGraph to coordinate user interaction, web scraping, and decision support. The system features TypedDict state management, async web scraping with Playwright, and integrates with external APIs for comprehensive product analysis. The implementation includes a Gradio interface for real-time chat interaction and modular scraper architecture for easy extension to additional product categories. Additional Resources 📚 YouTube Explanation 🎯 Task Management and Productivity Agents Taskifier - Intelligent Task Allocation & Management Overview 🔎 An intelligent task management system that analyzes user work styles and creates personalized task breakdown strategies, born from the observation that procrastination often stems from task ambiguity among students and early-career professionals. The system evaluates historical work patterns, gathers relevant task information through web search, and generates customized step-by-step approaches to optimize productivity and reduce workflow paralysis. Implementation 🛠️ Leverages LangGraph for orchestrating a multi-step workflow including work style analysis, information gathering via Tavily API, and customized plan generation. The system maintains state through the process, integrating historical work pattern data with fresh task research to output detailed, personalized task execution plans aligned with the user's natural working style. Additional Resources 📚 YouTube Explanation Grocery Management Agents System Overview 🔎 A multi-agent system built with CrewAI that automates grocery management tasks including receipt interpretation, expiration date tracking, inventory management, and recipe recommendations. The system uses specialized agents to extract data from receipts, estimate product shelf life, track consumption, and suggest recipes to minimize food waste. Implementation 🛠️ Implements four specialized agents using CrewAI - a Receipt Interpreter that extracts item details from receipts, an Expiration Date Estimator that determines shelf life using online sources, a Grocery Tracker that maintains inventory based on consumption, and a Recipe Recommender that suggests meals using available ingredients. Each agent has specific tools and tasks orchestrated through a crew workflow. Additional Resources 📚 YouTube Explanation 🔍 Quality Assurance and Testing Agents LangGraph-Based Systems Inspector Overview 🔎 A comprehensive testing and validation tool for LangGraph-based applications that automatically analyzes system architecture, generates test cases, and identifies potential vulnerabilities through multi-agent inspection. The inspector employs specialized AI testers to evaluate different aspects of the system, from basic functionality to security concerns and edge cases. Implementation 🛠️ Integrates LangGraph for workflow orchestration, multiple LLM-powered testing agents, and a structured evaluation pipeline that includes static analysis, test case generation, and results verification. The system uses Pydantic for data validation, NetworkX for graph representation, and implements a modular architecture that allows for parallel test execution and comprehensive result analysis. Additional Resources 📚 YouTube Explanation Blog Post EU Green Deal FAQ Bot Overview 🔎 The EU Green Deal FAQ Bot demonstrates how to build a RAG-based AI agent that helps businesses understand EU green deal policies. The system processes complex regulatory documents into manageable chunks and provides instant, accurate answers to common questions about environmental compliance, emissions reporting, and waste management requirements. Implementation 🛠️ Implements a sophisticated RAG pipeline using FAISS vectorstore for document storage, semantic chunking for preprocessing, and multiple specialized agents (Retriever, Summarizer, Evaluator) for query processing. The system features query rephrasing for improved accuracy, cross-reference with gold Q&A datasets for answer validation, and comprehensive evaluation metrics to ensure response quality and relevance. Additional Resources 📚 YouTube Explanation Systematic Review Automation System + Paper Draft Creation Overview 🔎 A comprehensive system for automating academic systematic reviews using a directed graph architecture and LangChain components. The system generates complete, publication-ready systematic review papers, automatically processing everything from literature search through final draft generation with multiple revision cycles. Implementation 🛠️ Utilizes a state-based graph workflow that handles paper search and selection (up to 3 papers), PDF processing, and generates a complete academic paper with all standard sections (abstract, introduction, methods, results, conclusions, references). The system incorporates multiple revision cycles with automated critique and improvement phases, all orchestrated through LangGraph state management. Additional Resources 📚 YouTube Explanation 🌟 Special Advanced Technique 🌟 Sophisticated Controllable Agent for Complex RAG Tasks 🤖 Overview 🔎 An advanced RAG solution designed to tackle complex questions that simple semantic similarity-based retrieval cannot solve. This approach uses a sophisticated deterministic graph as the "brain" 🧠 of a highly controllable autonomous agent, capable of answering non-trivial questions from your own data. Implementation 🛠️ • Implement a multi-step process involving question anonymization, high-level planning, task breakdown, adaptive information retrieval and question answering, continuous re-planning, and rigorous answer verification to ensure grounded and accurate responses. Getting Started To begin exploring and building GenAI agents: Clone this repository: Navigate to the technique you're interested in: Follow the detailed implementation guide in each technique's notebook. Contributing We welcome contributions from the community! If you have a new technique or improvement to suggest: Fork the repository Create your feature branch: git checkout -b feature/AmazingFeature Commit your changes: git commit -m 'Add some AmazingFeature' Push to the branch: git push origin feature/AmazingFeature Open a pull request Contributors License This project is licensed under a custom non-commercial license - see the LICENSE file for details. ⭐️ If you find this repository helpful, please consider giving it a star! Keywords: GenAI, Generative AI, Agents, NLP, AI, Machine Learning, Natural Language Processing, LLM, Conversational AI, Task-Oriented AI

ARENA_2.0
github
LLM Vibe Score0.544
Human Vibe Score0.08491210825084358
callummcdougallMar 28, 2025

ARENA_2.0

This GitHub repo hosts the exercises and Streamlit pages for the ARENA 2.0 program. You can find a summary of each of the chapters below. For more detailed information (including the different ways you can access the exercises), click on the links in the chapter headings. Additionally, see this Notion page for a guide to the virtual study materials available. Chapter 0: Fundamentals The material on this page covers the first five days of the curriculum. It can be seen as a grounding in all the fundamentals necessary to complete the more advanced sections of this course (such as RL, transformers, mechanistic interpretability, and generative models). Some highlights from this chapter include: Building your own 1D and 2D convolution functions Building and loading weights into a Residual Neural Network, and finetuning it on a classification task Working with weights and biases to optimise hyperparameters Implementing your own backpropagation mechanism Chapter 1: Transformers & Mech Interp The material on this page covers the next 8 days of the curriculum. It will cover transformers (what they are, how they are trained, how they are used to generate output) as well as mechanistic interpretability (what it is, what are some of the most important results in the field so far, why it might be important for alignment). Some highlights from this chapter include: Building your own transformer from scratch, and using it to sample autoregressive output Using the TransformerLens library developed by Neel Nanda to locate induction heads in a 2-layer model Finding a circuit for indirect object identification in GPT-2 small Intepreting model trained on toy tasks, e.g. classification of bracket strings, or modular arithmetic Replicating Anthropic's results on superposition Unlike the first chapter (where all the material was compulsory), this chapter has 4 days of compulsory content and 4 days of bonus content. During the compulsory days you will build and train transformers, and get a basic understanding of mechanistic interpretability of transformer models which includes induction heads & use of TransformerLens. The next 4 days, you have the option to continue with whatever material interests you out of the remaining sets of exercises. There will also be bonus material if you want to leave the beaten track of exercises all together! Chapter 2: Reinforcement Learning Reinforcement learning is an important field of machine learning. It works by teaching agents to take actions in an environment to maximise their accumulated reward. In this chapter, you will be learning about some of the fundamentals of RL, and working with OpenAI’s Gym environment to run your own experiments. Some highlights from this chapter include: Building your own agent to play the multi-armed bandit problem, implementing methods from Sutton & Bardo Implementing a Deep Q-Network (DQN) and Proximal Policy Optimization (PPO) to play the CartPole game Applying RLHF to autoregressive transformers like the ones you built in the previous chapter Chapter 3: Training at Scale With the advent of large language models, training at scale has become a necessity to create highly competent models. In this chapter we will go through the basics of GPUs and distributed training, along with introductions to libraries that make training at scale easier. Some highlights from this chapter include: Quantizing your model to INT8 for blazing fast inference Implementing distributed training loops using torch.dist Getting hands on with Huggingface Accelerate and Microsoft DeepsSpeed

LLMs-from-scratch
github
LLM Vibe Score0.62
Human Vibe Score1
rasbtMar 28, 2025

LLMs-from-scratch

Build a Large Language Model (From Scratch) This repository contains the code for developing, pretraining, and finetuning a GPT-like LLM and is the official code repository for the book Build a Large Language Model (From Scratch). In Build a Large Language Model (From Scratch), you'll learn and understand how large language models (LLMs) work from the inside out by coding them from the ground up, step by step. In this book, I'll guide you through creating your own LLM, explaining each stage with clear text, diagrams, and examples. The method described in this book for training and developing your own small-but-functional model for educational purposes mirrors the approach used in creating large-scale foundational models such as those behind ChatGPT. In addition, this book includes code for loading the weights of larger pretrained models for finetuning. Link to the official source code repository Link to the book at Manning (the publisher's website) Link to the book page on Amazon.com ISBN 9781633437166 To download a copy of this repository, click on the Download ZIP button or execute the following command in your terminal: (If you downloaded the code bundle from the Manning website, please consider visiting the official code repository on GitHub at https://github.com/rasbt/LLMs-from-scratch for the latest updates.) Table of Contents Please note that this README.md file is a Markdown (.md) file. If you have downloaded this code bundle from the Manning website and are viewing it on your local computer, I recommend using a Markdown editor or previewer for proper viewing. If you haven't installed a Markdown editor yet, MarkText is a good free option. You can alternatively view this and other files on GitHub at https://github.com/rasbt/LLMs-from-scratch in your browser, which renders Markdown automatically. Tip: If you're seeking guidance on installing Python and Python packages and setting up your code environment, I suggest reading the README.md file located in the setup directory. | Chapter Title | Main Code (for Quick Access) | All Code + Supplementary | |------------------------------------------------------------|---------------------------------------------------------------------------------------------------------------------------------|-------------------------------| | Setup recommendations | - | - | | Ch 1: Understanding Large Language Models | No code | - | | Ch 2: Working with Text Data | - ch02.ipynb- dataloader.ipynb (summary)- exercise-solutions.ipynb | ./ch02 | | Ch 3: Coding Attention Mechanisms | - ch03.ipynb- multihead-attention.ipynb (summary) - exercise-solutions.ipynb| ./ch03 | | Ch 4: Implementing a GPT Model from Scratch | - ch04.ipynb- gpt.py (summary)- exercise-solutions.ipynb | ./ch04 | | Ch 5: Pretraining on Unlabeled Data | - ch05.ipynb- gpttrain.py (summary) - gptgenerate.py (summary) - exercise-solutions.ipynb | ./ch05 | | Ch 6: Finetuning for Text Classification | - ch06.ipynb - gptclassfinetune.py - exercise-solutions.ipynb | ./ch06 | | Ch 7: Finetuning to Follow Instructions | - ch07.ipynb- gptinstructionfinetuning.py (summary)- ollamaevaluate.py (summary)- exercise-solutions.ipynb | ./ch07 | | Appendix A: Introduction to PyTorch | - code-part1.ipynb- code-part2.ipynb- DDP-script.py- exercise-solutions.ipynb | ./appendix-A | | Appendix B: References and Further Reading | No code | - | | Appendix C: Exercise Solutions | No code | - | | Appendix D: Adding Bells and Whistles to the Training Loop | - appendix-D.ipynb | ./appendix-D | | Appendix E: Parameter-efficient Finetuning with LoRA | - appendix-E.ipynb | ./appendix-E | The mental model below summarizes the contents covered in this book. Hardware Requirements The code in the main chapters of this book is designed to run on conventional laptops within a reasonable timeframe and does not require specialized hardware. This approach ensures that a wide audience can engage with the material. Additionally, the code automatically utilizes GPUs if they are available. (Please see the setup doc for additional recommendations.) Bonus Material Several folders contain optional materials as a bonus for interested readers: Setup Python Setup Tips Installing Python Packages and Libraries Used In This Book Docker Environment Setup Guide Chapter 2: Working with text data Byte Pair Encoding (BPE) Tokenizer From Scratch Comparing Various Byte Pair Encoding (BPE) Implementations Understanding the Difference Between Embedding Layers and Linear Layers Dataloader Intuition with Simple Numbers Chapter 3: Coding attention mechanisms Comparing Efficient Multi-Head Attention Implementations Understanding PyTorch Buffers Chapter 4: Implementing a GPT model from scratch FLOPS Analysis Chapter 5: Pretraining on unlabeled data: Alternative Weight Loading Methods Pretraining GPT on the Project Gutenberg Dataset Adding Bells and Whistles to the Training Loop Optimizing Hyperparameters for Pretraining Building a User Interface to Interact With the Pretrained LLM Converting GPT to Llama Llama 3.2 From Scratch Memory-efficient Model Weight Loading Extending the Tiktoken BPE Tokenizer with New Tokens PyTorch Performance Tips for Faster LLM Training Chapter 6: Finetuning for classification Additional experiments finetuning different layers and using larger models Finetuning different models on 50k IMDB movie review dataset Building a User Interface to Interact With the GPT-based Spam Classifier Chapter 7: Finetuning to follow instructions Dataset Utilities for Finding Near Duplicates and Creating Passive Voice Entries Evaluating Instruction Responses Using the OpenAI API and Ollama Generating a Dataset for Instruction Finetuning Improving a Dataset for Instruction Finetuning Generating a Preference Dataset with Llama 3.1 70B and Ollama Direct Preference Optimization (DPO) for LLM Alignment Building a User Interface to Interact With the Instruction Finetuned GPT Model Questions, Feedback, and Contributing to This Repository I welcome all sorts of feedback, best shared via the Manning Forum or GitHub Discussions. Likewise, if you have any questions or just want to bounce ideas off others, please don't hesitate to post these in the forum as well. Please note that since this repository contains the code corresponding to a print book, I currently cannot accept contributions that would extend the contents of the main chapter code, as it would introduce deviations from the physical book. Keeping it consistent helps ensure a smooth experience for everyone. Citation If you find this book or code useful for your research, please consider citing it. Chicago-style citation: Raschka, Sebastian. Build A Large Language Model (From Scratch). Manning, 2024. ISBN: 978-1633437166. BibTeX entry:

AITreasureBox
github
LLM Vibe Score0.447
Human Vibe Score0.1014145151561518
superiorluMar 28, 2025

AITreasureBox

AI TreasureBox English | 中文 Collect practical AI repos, tools, websites, papers and tutorials on AI. Translated from ChatGPT, picture from Midjourney. Catalog Repos Tools Websites Report&Paper Tutorials Repos updated repos and stars every 2 hours and re-ranking automatically. | No. | Repos | Description | | ----:|:-----------------------------------------|:------------------------------------------------------------------------------------------------------| | 1|🔥codecrafters-io/build-your-own-x !2025-03-28364681428|Master programming by recreating your favorite technologies from scratch.| | 2|sindresorhus/awesome !2025-03-28353614145|😎 Awesome lists about all kinds of interesting topics| | 3|public-apis/public-apis !2025-03-28334299125|A collective list of free APIs| | 4|kamranahmedse/developer-roadmap !2025-03-2831269540|Interactive roadmaps, guides and other educational content to help developers grow in their careers.| | 5|vinta/awesome-python !2025-03-28238581114|A curated list of awesome Python frameworks, libraries, software and resources| | 6|practical-tutorials/project-based-learning !2025-03-28222661124|Curated list of project-based tutorials| | 7|tensorflow/tensorflow !2025-03-281888714|An Open Source Machine Learning Framework for Everyone| | 8|Significant-Gravitas/AutoGPT !2025-03-2817391338|An experimental open-source attempt to make GPT-4 fully autonomous.| | 9|jackfrued/Python-100-Days !2025-03-2816305141|Python - 100天从新手到大师| | 10|AUTOMATIC1111/stable-diffusion-webui !2025-03-2815011553|Stable Diffusion web UI| | 11|huggingface/transformers !2025-03-2814207850|🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.| | 12|ollama/ollama !2025-03-28135166151|Get up and running with Llama 2, Mistral, Gemma, and other large language models.| | 13|f/awesome-chatgpt-prompts !2025-03-2812212738 |This repo includes ChatGPT prompt curation to use ChatGPT better.| | 14|justjavac/free-programming-books-zhCN !2025-03-2811316119|📚 免费的计算机编程类中文书籍,欢迎投稿| | 15|krahets/hello-algo !2025-03-2811107930|《Hello 算法》:动画图解、一键运行的数据结构与算法教程。支持 Python, Java, C++, C, C#, JS, Go, Swift, Rust, Ruby, Kotlin, TS, Dart 代码。简体版和繁体版同步更新,English version ongoing| | 16|yt-dlp/yt-dlp !2025-03-28105801114|A feature-rich command-line audio/video downloader| | 17|langchain-ai/langchain !2025-03-2810449479|⚡ Building applications with LLMs through composability ⚡| | 18|goldbergyoni/nodebestpractices !2025-03-281021629|✅ The Node.js best practices list (July 2024)| | 19|puppeteer/puppeteer !2025-03-289018212|JavaScript API for Chrome and Firefox| | 20|pytorch/pytorch !2025-03-288833938|Tensors and Dynamic neural networks in Python with strong GPU acceleration| | 21|neovim/neovim !2025-03-288781482|Vim-fork focused on extensibility and usability| | 22|🔥🔥langgenius/dify !2025-03-2887342639 |One API for plugins and datasets, one interface for prompt engineering and visual operation, all for creating powerful AI applications.| | 23|mtdvio/every-programmer-should-know !2025-03-28867069|A collection of (mostly) technical things every software developer should know about| | 24|open-webui/open-webui !2025-03-2886025159|User-friendly WebUI for LLMs (Formerly Ollama WebUI)| | 25|ChatGPTNextWeb/NextChat !2025-03-288231521|✨ Light and Fast AI Assistant. Support: Web | | 26|supabase/supabase !2025-03-287990956|The open source Firebase alternative.| | 27|openai/whisper !2025-03-287905542|Robust Speech Recognition via Large-Scale Weak Supervision| | 28|home-assistant/core !2025-03-287773219|🏡 Open source home automation that puts local control and privacy first.| | 29|tensorflow/models !2025-03-28774694|Models and examples built with TensorFlow| | 30| ggerganov/llama.cpp !2025-03-287731836 | Port of Facebook's LLaMA model in C/C++ | | 31|3b1b/manim !2025-03-287641918|Animation engine for explanatory math videos| | 32|microsoft/generative-ai-for-beginners !2025-03-287623860|12 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/| | 33|nomic-ai/gpt4all !2025-03-28729285 |gpt4all: an ecosystem of open-source chatbots trained on a massive collection of clean assistant data including code, stories and dialogue| | 34|comfyanonymous/ComfyUI !2025-03-2872635111|The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.| | 35|bregman-arie/devops-exercises !2025-03-2872225209|Linux, Jenkins, AWS, SRE, Prometheus, Docker, Python, Ansible, Git, Kubernetes, Terraform, OpenStack, SQL, NoSQL, Azure, GCP, DNS, Elastic, Network, Virtualization. DevOps Interview Questions| | 36|elastic/elasticsearch !2025-03-28721419|Free and Open, Distributed, RESTful Search Engine| | 37|🔥n8n-io/n8n !2025-03-2872093495|Free and source-available fair-code licensed workflow automation tool. Easily automate tasks across different services.| | 38|fighting41love/funNLP !2025-03-287200422|The Most Powerful NLP-Weapon Arsenal| | 39|hoppscotch/hoppscotch !2025-03-287060134|Open source API development ecosystem - https://hoppscotch.io (open-source alternative to Postman, Insomnia)| | 40|abi/screenshot-to-code !2025-03-286932817|Drop in a screenshot and convert it to clean HTML/Tailwind/JS code| | 41|binary-husky/gptacademic !2025-03-28680374|Academic Optimization of GPT| | 42|d2l-ai/d2l-zh !2025-03-286774142|Targeting Chinese readers, functional and open for discussion. The Chinese and English versions are used for teaching in over 400 universities across more than 60 countries| | 43|josephmisiti/awesome-machine-learning !2025-03-286739215|A curated list of awesome Machine Learning frameworks, libraries and software.| | 44|grafana/grafana !2025-03-286725414|The open and composable observability and data visualization platform. Visualize metrics, logs, and traces from multiple sources like Prometheus, Loki, Elasticsearch, InfluxDB, Postgres and many more.| | 45|python/cpython !2025-03-286602218|The Python programming language| | 46|apache/superset !2025-03-286519020|Apache Superset is a Data Visualization and Data Exploration Platform| | 47|xtekky/gpt4free !2025-03-28639391 |decentralizing the Ai Industry, free gpt-4/3.5 scripts through several reverse engineered API's ( poe.com, phind.com, chat.openai.com etc...)| | 48|sherlock-project/sherlock !2025-03-286332536|Hunt down social media accounts by username across social networks| | 49|twitter/the-algorithm !2025-03-28630586 |Source code for Twitter's Recommendation Algorithm| | 50|keras-team/keras !2025-03-28627835|Deep Learning for humans| | 51|openai/openai-cookbook !2025-03-28625136 |Examples and guides for using the OpenAI API| | 52|immich-app/immich !2025-03-286238670|High performance self-hosted photo and video management solution.| | 53|AppFlowy-IO/AppFlowy !2025-03-286173528|Bring projects, wikis, and teams together with AI. AppFlowy is an AI collaborative workspace where you achieve more without losing control of your data. The best open source alternative to Notion.| | 54|scikit-learn/scikit-learn !2025-03-286158212|scikit-learn: machine learning in Python| | 55|binhnguyennus/awesome-scalability !2025-03-286117021|The Patterns of Scalable, Reliable, and Performant Large-Scale Systems| | 56|labmlai/annotateddeeplearningpaperimplementations !2025-03-285951726|🧑‍🏫 59 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠| | 57|OpenInterpreter/open-interpreter !2025-03-285894710|A natural language interface for computers| | 58|lobehub/lobe-chat !2025-03-285832054|🤖 Lobe Chat - an open-source, extensible (Function Calling), high-performance chatbot framework. It supports one-click free deployment of your private ChatGPT/LLM web application.| | 59|meta-llama/llama !2025-03-28579536|Inference code for Llama models| | 60|nuxt/nuxt !2025-03-28566437|The Intuitive Vue Framework.| | 61|imartinez/privateGPT !2025-03-28555192|Interact with your documents using the power of GPT, 100% privately, no data leaks| | 62|Stirling-Tools/Stirling-PDF !2025-03-285500846|#1 Locally hosted web application that allows you to perform various operations on PDF files| | 63|PlexPt/awesome-chatgpt-prompts-zh !2025-03-285459720|ChatGPT Chinese Training Guide. Guidelines for various scenarios. Learn how to make it listen to you| | 64|dair-ai/Prompt-Engineering-Guide !2025-03-285451025 |🐙 Guides, papers, lecture, notebooks and resources for prompt engineering| | 65|ageitgey/facerecognition !2025-03-28544382|The world's simplest facial recognition api for Python and the command line| | 66|CorentinJ/Real-Time-Voice-Cloning !2025-03-285384814|Clone a voice in 5 seconds to generate arbitrary speech in real-time| | 67|geekan/MetaGPT !2025-03-285375376|The Multi-Agent Meta Programming Framework: Given one line Requirement, return PRD, Design, Tasks, Repo | | 68|gpt-engineer-org/gpt-engineer !2025-03-285367419|Specify what you want it to build, the AI asks for clarification, and then builds it.| | 69|lencx/ChatGPT !2025-03-2853653-3|🔮 ChatGPT Desktop Application (Mac, Windows and Linux)| | 70|deepfakes/faceswap !2025-03-28535672|Deepfakes Software For All| | 71|langflow-ai/langflow !2025-03-285319584|Langflow is a low-code app builder for RAG and multi-agent AI applications. It’s Python-based and agnostic to any model, API, or database.| | 72|commaai/openpilot !2025-03-28529759|openpilot is an operating system for robotics. Currently, it upgrades the driver assistance system on 275+ supported cars.| | 73|clash-verge-rev/clash-verge-rev !2025-03-2852848124|Continuation of Clash Verge - A Clash Meta GUI based on Tauri (Windows, MacOS, Linux)| | 74|All-Hands-AI/OpenHands !2025-03-285150675|🙌 OpenHands: Code Less, Make More| | 75|xai-org/grok-1 !2025-03-28502504|Grok open release| | 76|meilisearch/meilisearch !2025-03-284999122|A lightning-fast search API that fits effortlessly into your apps, websites, and workflow| | 77|🔥browser-use/browser-use !2025-03-2849910294|Make websites accessible for AI agents| | 78|jgthms/bulma !2025-03-28496783|Modern CSS framework based on Flexbox| | 79|facebookresearch/segment-anything !2025-03-284947116|The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.| |!green-up-arrow.svg 80|hacksider/Deep-Live-Cam !2025-03-2848612146|real time face swap and one-click video deepfake with only a single image (uncensored)| |!red-down-arrow 81|mlabonne/llm-course !2025-03-284860934|Course with a roadmap and notebooks to get into Large Language Models (LLMs).| | 82|PaddlePaddle/PaddleOCR !2025-03-284785530|Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)| | 83|alist-org/alist !2025-03-284732618|🗂️A file list/WebDAV program that supports multiple storages, powered by Gin and Solidjs. / 一个支持多存储的文件列表/WebDAV程序,使用 Gin 和 Solidjs。| | 84|infiniflow/ragflow !2025-03-2847027129|RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.| | 85|Avik-Jain/100-Days-Of-ML-Code !2025-03-284679312|100 Days of ML Coding| | 86|v2ray/v2ray-core !2025-03-28458706|A platform for building proxies to bypass network restrictions.| | 87|hiyouga/LLaMA-Factory !2025-03-284555881|Easy-to-use LLM fine-tuning framework (LLaMA, BLOOM, Mistral, Baichuan, Qwen, ChatGLM)| | 88|Asabeneh/30-Days-Of-Python !2025-03-284544930|30 days of Python programming challenge is a step-by-step guide to learn the Python programming language in 30 days. This challenge may take more than100 days, follow your own pace. These videos may help too: https://www.youtube.com/channel/UC7PNRuno1rzYPb1xLa4yktw| | 89|type-challenges/type-challenges !2025-03-284488511|Collection of TypeScript type challenges with online judge| | 90|lllyasviel/Fooocus !2025-03-284402716|Focus on prompting and generating| | 91|RVC-Boss/GPT-SoVITS !2025-03-284327738|1 min voice data can also be used to train a good TTS model! (few shot voice cloning)| | 92|rasbt/LLMs-from-scratch !2025-03-284320667|Implementing a ChatGPT-like LLM from scratch, step by step| | 93|oobabooga/text-generation-webui !2025-03-284302012 |A gradio web UI for running Large Language Models like LLaMA, llama.cpp, GPT-J, OPT, and GALACTICA.| | 94|vllm-project/vllm !2025-03-2842982102|A high-throughput and memory-efficient inference and serving engine for LLMs| | 95|dani-garcia/vaultwarden !2025-03-284297121|Unofficial Bitwarden compatible server written in Rust, formerly known as bitwarden_rs| | 96|microsoft/autogen !2025-03-284233049|Enable Next-Gen Large Language Model Applications. Join our Discord: https://discord.gg/pAbnFJrkgZ| | 97|jeecgboot/JeecgBoot !2025-03-284205920|🔥「企业级低代码平台」前后端分离架构SpringBoot 2.x/3.x,SpringCloud,Ant Design&Vue3,Mybatis,Shiro,JWT。强大的代码生成器让前后端代码一键生成,无需写任何代码! 引领新的开发模式OnlineCoding->代码生成->手工MERGE,帮助Java项目解决70%重复工作,让开发更关注业务,既能快速提高效率,帮助公司节省成本,同时又不失灵活性。| | 98|Mintplex-Labs/anything-llm !2025-03-284186955|A full-stack application that turns any documents into an intelligent chatbot with a sleek UI and easier way to manage your workspaces.| | 99|THUDM/ChatGLM-6B !2025-03-28410192 |ChatGLM-6B: An Open Bilingual Dialogue Language Model| | 100|hpcaitech/ColossalAI !2025-03-28406902|Making large AI models cheaper, faster and more accessible| | 101|Stability-AI/stablediffusion !2025-03-28406337|High-Resolution Image Synthesis with Latent Diffusion Models| | 102|mingrammer/diagrams !2025-03-28405063|🎨 Diagram as Code for prototyping cloud system architectures| | 103|Kong/kong !2025-03-28404616|🦍 The Cloud-Native API Gateway and AI Gateway.| | 104|getsentry/sentry !2025-03-284040913|Developer-first error tracking and performance monitoring| | 105| karpathy/nanoGPT !2025-03-284034613 |The simplest, fastest repository for training/finetuning medium-sized GPTs| | 106|fastlane/fastlane !2025-03-2840014-1|🚀 The easiest way to automate building and releasing your iOS and Android apps| | 107|psf/black !2025-03-28399765|The uncompromising Python code formatter| | 108|OpenBB-finance/OpenBBTerminal !2025-03-283972074 |Investment Research for Everyone, Anywhere.| | 109|2dust/v2rayNG !2025-03-283943415|A V2Ray client for Android, support Xray core and v2fly core| | 110|apache/airflow !2025-03-283937314|Apache Airflow - A platform to programmatically author, schedule, and monitor workflows| | 111|KRTirtho/spotube !2025-03-283902746|🎧 Open source Spotify client that doesn't require Premium nor uses Electron! Available for both desktop & mobile!| | 112|coqui-ai/TTS !2025-03-283889719 |🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production| | 113|ggerganov/whisper.cpp !2025-03-283882116|Port of OpenAI's Whisper model in C/C++| | 114|ultralytics/ultralytics !2025-03-283866951|NEW - YOLOv8 🚀 in PyTorch > ONNX > OpenVINO > CoreML > TFLite| | 115|typst/typst !2025-03-283863914|A new markup-based typesetting system that is powerful and easy to learn.| | 116|streamlit/streamlit !2025-03-283845828|Streamlit — A faster way to build and share data apps.| | 117|LC044/WeChatMsg !2025-03-283836931|提取微信聊天记录,将其导出成HTML、Word、Excel文档永久保存,对聊天记录进行分析生成年度聊天报告,用聊天数据训练专属于个人的AI聊天助手| | 118|lm-sys/FastChat !2025-03-283822112 |An open platform for training, serving, and evaluating large languages. Release repo for Vicuna and FastChat-T5.| | 119|NaiboWang/EasySpider !2025-03-283819013|A visual no-code/code-free web crawler/spider易采集:一个可视化浏览器自动化测试/数据采集/爬虫软件,可以无代码图形化的设计和执行爬虫任务。别名:ServiceWrapper面向Web应用的智能化服务封装系统。| | 120|microsoft/DeepSpeed !2025-03-283765816 |A deep learning optimization library that makes distributed training and inference easy, efficient, and effective| | 121|QuivrHQ/quivr !2025-03-28376067|Your GenAI Second Brain 🧠 A personal productivity assistant (RAG) ⚡️🤖 Chat with your docs (PDF, CSV, ...) & apps using Langchain, GPT 3.5 / 4 turbo, Private, Anthropic, VertexAI, Ollama, LLMs, that you can share with users ! Local & Private alternative to OpenAI GPTs & ChatGPT powered by retrieval-augmented generation.| | 122|freqtrade/freqtrade !2025-03-283757817 |Free, open source crypto trading bot| | 123|suno-ai/bark !2025-03-28373178 |🔊 Text-Prompted Generative Audio Model| | 124|🔥cline/cline !2025-03-2837307282|Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, and more with your permission every step of the way.| | 125|LAION-AI/Open-Assistant !2025-03-28372712 |OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.| | 126|penpot/penpot !2025-03-283716217|Penpot: The open-source design tool for design and code collaboration| | 127|gradio-app/gradio !2025-03-283713320|Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!| | 128|FlowiseAI/Flowise !2025-03-283667135 |Drag & drop UI to build your customized LLM flow using LangchainJS| | 129|SimplifyJobs/Summer2025-Internships !2025-03-28366506|Collection of Summer 2025 tech internships!| | 130|TencentARC/GFPGAN !2025-03-28365027 |GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.| | 131|ray-project/ray !2025-03-283626819|Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a toolkit of libraries (Ray AIR) for accelerating ML workloads.| | 132|babysor/MockingBird !2025-03-28360498|🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time| | 133|unslothai/unsloth !2025-03-283603691|5X faster 50% less memory LLM finetuning| | 134|zhayujie/chatgpt-on-wechat !2025-03-283600124 |Wechat robot based on ChatGPT, which uses OpenAI api and itchat library| | 135|upscayl/upscayl !2025-03-283599824|🆙 Upscayl - Free and Open Source AI Image Upscaler for Linux, MacOS and Windows built with Linux-First philosophy.| | 136|freeCodeCamp/devdocs !2025-03-28359738|API Documentation Browser| | 137|XingangPan/DragGAN !2025-03-28359043 |Code for DragGAN (SIGGRAPH 2023)| | 138|2noise/ChatTTS !2025-03-283543922|ChatTTS is a generative speech model for daily dialogue.| | 139|google-research/google-research !2025-03-28352207 |Google Research| | 140|karanpratapsingh/system-design !2025-03-28351003|Learn how to design systems at scale and prepare for system design interviews| | 141|lapce/lapce !2025-03-28350855|Lightning-fast and Powerful Code Editor written in Rust| | 142| microsoft/TaskMatrix !2025-03-2834500-3 | Talking, Drawing and Editing with Visual Foundation Models| | 143|chatchat-space/Langchain-Chatchat !2025-03-283442020|Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM) QA app with langchain| | 144|unclecode/crawl4ai !2025-03-283434163|🔥🕷️ Crawl4AI: Open-source LLM Friendly Web Crawler & Scrapper| | 145|Bin-Huang/chatbox !2025-03-283374733 |A desktop app for GPT-4 / GPT-3.5 (OpenAI API) that supports Windows, Mac & Linux| | 146|milvus-io/milvus !2025-03-283366525 |A cloud-native vector database, storage for next generation AI applications| | 147|mendableai/firecrawl !2025-03-2833297128|🔥 Turn entire websites into LLM-ready markdown| | 148|pola-rs/polars !2025-03-283269320|Fast multi-threaded, hybrid-out-of-core query engine focussing on DataFrame front-ends| | 149|Pythagora-io/gpt-pilot !2025-03-28325321|PoC for a scalable dev tool that writes entire apps from scratch while the developer oversees the implementation| | 150|hashicorp/vault !2025-03-28320797|A tool for secrets management, encryption as a service, and privileged access management| | 151|shardeum/shardeum !2025-03-28319580|Shardeum is an EVM based autoscaling blockchain| | 152|Chanzhaoyu/chatgpt-web !2025-03-28319242 |A demonstration website built with Express and Vue3 called ChatGPT| | 153|lllyasviel/ControlNet !2025-03-283186413 |Let us control diffusion models!| | 154|google/jax !2025-03-28317727|Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more| | 155|facebookresearch/detectron2 !2025-03-28315987|Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.| | 156|myshell-ai/OpenVoice !2025-03-28315233|Instant voice cloning by MyShell| | 157|TheAlgorithms/C-Plus-Plus !2025-03-283151411|Collection of various algorithms in mathematics, machine learning, computer science and physics implemented in C++ for educational purposes.| | 158|hiroi-sora/Umi-OCR !2025-03-283138129|OCR图片转文字识别软件,完全离线。截屏/批量导入图片,支持多国语言、合并段落、竖排文字。可排除水印区域,提取干净的文本。基于 PaddleOCR 。| | 159|mudler/LocalAI !2025-03-283127815|🤖 The free, Open Source OpenAI alternative. Self-hosted, community-driven and local-first. Drop-in replacement for OpenAI running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more models architectures. It allows to generate Text, Audio, Video, Images. Also with voice cloning capabilities.| | 160|facebookresearch/fairseq !2025-03-28312124 |Facebook AI Research Sequence-to-Sequence Toolkit written in Python.| | 161|alibaba/nacos !2025-03-28310559|an easy-to-use dynamic service discovery, configuration and service management platform for building cloud native applications.| | 162|yunjey/pytorch-tutorial !2025-03-28310326|PyTorch Tutorial for Deep Learning Researchers| | 163|v2fly/v2ray-core !2025-03-28307448|A platform for building proxies to bypass network restrictions.| | 164|mckaywrigley/chatbot-ui !2025-03-283067714|The open-source AI chat interface for everyone.| | 165|TabbyML/tabby !2025-03-28305949 |Self-hosted AI coding assistant| | 166|deepseek-ai/awesome-deepseek-integration !2025-03-283053193|| | 167|danielmiessler/fabric !2025-03-283028914|fabric is an open-source framework for augmenting humans using AI.| | 168|xinntao/Real-ESRGAN !2025-03-283026623 |Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.| | 169|paul-gauthier/aider !2025-03-283014642|aider is GPT powered coding in your terminal| | 170|tatsu-lab/stanfordalpaca !2025-03-28299022 |Code and documentation to train Stanford's Alpaca models, and generate the data.| | 171|DataTalksClub/data-engineering-zoomcamp !2025-03-282971817|Free Data Engineering course!| | 172|HeyPuter/puter !2025-03-282967014|🌐 The Internet OS! Free, Open-Source, and Self-Hostable.| | 173|mli/paper-reading !2025-03-282962314|Classic Deep Learning and In-Depth Reading of New Papers Paragraph by Paragraph| | 174|linexjlin/GPTs !2025-03-28295568|leaked prompts of GPTs| | 175|s0md3v/roop !2025-03-28295286 |one-click deepfake (face swap)| | 176|JushBJJ/Mr.-Ranedeer-AI-Tutor !2025-03-2829465-1 |A GPT-4 AI Tutor Prompt for customizable personalized learning experiences.| | 177|opendatalab/MinerU !2025-03-282927074|A one-stop, open-source, high-quality data extraction tool, supports PDF/webpage/e-book extraction.一站式开源高质量数据提取工具,支持PDF/网页/多格式电子书提取。| | 178|mouredev/Hello-Python !2025-03-282920720|Curso para aprender el lenguaje de programación Python desde cero y para principiantes. 75 clases, 37 horas en vídeo, código, proyectos y grupo de chat. Fundamentos, frontend, backend, testing, IA...| | 179|Lightning-AI/pytorch-lightning !2025-03-28292039|Pretrain, finetune and deploy AI models on multiple GPUs, TPUs with zero code changes.| | 180|crewAIInc/crewAI !2025-03-282919344|Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.| | 181|facebook/folly !2025-03-282916612|An open-source C++ library developed and used at Facebook.| | 182|google-ai-edge/mediapipe !2025-03-28291519|Cross-platform, customizable ML solutions for live and streaming media.| | 183| getcursor/cursor !2025-03-282892025 | An editor made for programming with AI| | 184|chatanywhere/GPTAPIfree !2025-03-282856424|Free ChatGPT API Key, Free ChatGPT API, supports GPT-4 API (free), ChatGPT offers a free domestic forwarding API that allows direct connections without the need for a proxy. It can be used in conjunction with software/plugins like ChatBox, significantly reducing interface usage costs. Enjoy unlimited and unrestricted chatting within China| | 185|meta-llama/llama3 !2025-03-28285552|The official Meta Llama 3 GitHub site| | 186|tinygrad/tinygrad !2025-03-282845811|You like pytorch? You like micrograd? You love tinygrad! ❤️| | 187|google-research/tuningplaybook !2025-03-282841514|A playbook for systematically maximizing the performance of deep learning models.| | 188|huggingface/diffusers !2025-03-282830222|🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.| | 189|tokio-rs/tokio !2025-03-28282408|A runtime for writing reliable asynchronous applications with Rust. Provides I/O, networking, scheduling, timers, ...| | 190|RVC-Project/Retrieval-based-Voice-Conversion-WebUI !2025-03-282823817|Voice data !2025-03-282822612|Jan is an open source alternative to ChatGPT that runs 100% offline on your computer| | 192|openai/CLIP !2025-03-282814720|CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image| | 193|🔥khoj-ai/khoj !2025-03-2828112313|Your AI second brain. A copilot to get answers to your questions, whether they be from your own notes or from the internet. Use powerful, online (e.g gpt4) or private, local (e.g mistral) LLMs. Self-host locally or use our web app. Access from Obsidian, Emacs, Desktop app, Web or Whatsapp.| | 194| acheong08/ChatGPT !2025-03-2828054-2 | Reverse engineered ChatGPT API | | 195|iperov/DeepFaceLive !2025-03-28279345 |Real-time face swap for PC streaming or video calls| | 196|eugeneyan/applied-ml !2025-03-28278471|📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.| | 197|XTLS/Xray-core !2025-03-282778213|Xray, Penetrates Everything. Also the best v2ray-core, with XTLS support. Fully compatible configuration.| | 198|feder-cr/JobsApplierAIAgent !2025-03-282776410|AutoJobsApplierAI_Agent aims to easy job hunt process by automating the job application process. Utilizing artificial intelligence, it enables users to apply for multiple jobs in an automated and personalized way.| | 199|mindsdb/mindsdb !2025-03-282750631|The platform for customizing AI from enterprise data| | 200|DataExpert-io/data-engineer-handbook !2025-03-282721611|This is a repo with links to everything you'd ever want to learn about data engineering| | 201|exo-explore/exo !2025-03-282721633|Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚| | 202|taichi-dev/taichi !2025-03-2826926-1|Productive, portable, and performant GPU programming in Python.| | 203|mem0ai/mem0 !2025-03-282689134|The memory layer for Personalized AI| | 204|svc-develop-team/so-vits-svc !2025-03-28268096 |SoftVC VITS Singing Voice Conversion| | 205|OpenBMB/ChatDev !2025-03-28265624|Create Customized Software using Natural Language Idea (through Multi-Agent Collaboration)| | 206|roboflow/supervision !2025-03-282632010|We write your reusable computer vision tools. 💜| | 207|drawdb-io/drawdb !2025-03-282626913|Free, simple, and intuitive online database design tool and SQL generator.| | 208|karpathy/llm.c !2025-03-28261633|LLM training in simple, raw C/CUDA| | 209|airbnb/lottie-ios !2025-03-28261431|An iOS library to natively render After Effects vector animations| | 210|openai/openai-python !2025-03-282607713|The OpenAI Python library provides convenient access to the OpenAI API from applications written in the Python language.| | 211|academic/awesome-datascience !2025-03-28259876|📝 An awesome Data Science repository to learn and apply for real world problems.| | 212|harry0703/MoneyPrinterTurbo !2025-03-282576618|Generate short videos with one click using a large model| | 213|gabime/spdlog !2025-03-282571511|Fast C++ logging library.| | 214|ocrmypdf/OCRmyPDF !2025-03-2825674217|OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched| | 215|Vision-CAIR/MiniGPT-4 !2025-03-28256170 |Enhancing Vision-language Understanding with Advanced Large Language Models| | 216|Stability-AI/generative-models !2025-03-28255936|Generative Models by Stability AI| | 217|DS4SD/docling !2025-03-282555662|Get your docs ready for gen AI| | 218|PostHog/posthog !2025-03-282533227|🦔 PostHog provides open-source product analytics, session recording, feature flagging and A/B testing that you can self-host.| | 219|nrwl/nx !2025-03-282509612|Smart Monorepos · Fast CI| | 220|continuedev/continue !2025-03-282500737|⏩ the open-source copilot chat for software development—bring the power of ChatGPT to VS Code| | 221|opentofu/opentofu !2025-03-28247968|OpenTofu lets you declaratively manage your cloud infrastructure.| | 222|invoke-ai/InvokeAI !2025-03-28247293|InvokeAI is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and create visual media using the latest AI-driven technologies. The solution offers an industry leading WebUI, supports terminal use through a CLI, and serves as the foundation for multiple commercial products.| | 223|deepinsight/insightface !2025-03-282471615 |State-of-the-art 2D and 3D Face Analysis Project| | 224|apache/flink !2025-03-28246865|Apache Flink| | 225|ComposioHQ/composio !2025-03-28246436|Composio equips agents with well-crafted tools empowering them to tackle complex tasks| | 226|Genesis-Embodied-AI/Genesis !2025-03-282458314|A generative world for general-purpose robotics & embodied AI learning.| | 227|stretchr/testify !2025-03-28243184|A toolkit with common assertions and mocks that plays nicely with the standard library| | 228| yetone/openai-translator !2025-03-28242921 | Browser extension and cross-platform desktop application for translation based on ChatGPT API | | 229|frappe/erpnext !2025-03-282425211|Free and Open Source Enterprise Resource Planning (ERP)| | 230|songquanpeng/one-api !2025-03-282410034|OpenAI 接口管理 & 分发系统,支持 Azure、Anthropic Claude、Google PaLM 2 & Gemini、智谱 ChatGLM、百度文心一言、讯飞星火认知、阿里通义千问、360 智脑以及腾讯混元,可用于二次分发管理 key,仅单可执行文件,已打包好 Docker 镜像,一键部署,开箱即用. OpenAI key management & redistribution system, using a single API for all LLMs, and features an English UI.| | 231| microsoft/JARVIS !2025-03-28240604 | a system to connect LLMs with ML community | | 232|google/flatbuffers !2025-03-28239965|FlatBuffers: Memory Efficient Serialization Library| | 233|microsoft/graphrag !2025-03-282398928|A modular graph-based Retrieval-Augmented Generation (RAG) system| | 234|rancher/rancher !2025-03-28239675|Complete container management platform| | 235|bazelbuild/bazel !2025-03-282384618|a fast, scalable, multi-language and extensible build system| | 236|modularml/mojo !2025-03-28238236 |The Mojo Programming Language| | 237|danny-avila/LibreChat !2025-03-282378753|Enhanced ChatGPT Clone: Features OpenAI, GPT-4 Vision, Bing, Anthropic, OpenRouter, Google Gemini, AI model switching, message search, langchain, DALL-E-3, ChatGPT Plugins, OpenAI Functions, Secure Multi-User System, Presets, completely open-source for self-hosting. More features in development| |!green-up-arrow.svg 238|🔥🔥🔥Shubhamsaboo/awesome-llm-apps !2025-03-28237391211|Collection of awesome LLM apps with RAG using OpenAI, Anthropic, Gemini and opensource models.| |!red-down-arrow 239|microsoft/semantic-kernel !2025-03-282373611|Integrate cutting-edge LLM technology quickly and easily into your apps| |!red-down-arrow 240|TheAlgorithms/Rust !2025-03-28236995|All Algorithms implemented in Rust| | 241|stanford-oval/storm !2025-03-28236326|An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.| | 242|openai/gpt-2 !2025-03-28232483|Code for the paper "Language Models are Unsupervised Multitask Learners"| | 243|labring/FastGPT !2025-03-282319445|A platform that uses the OpenAI API to quickly build an AI knowledge base, supporting many-to-many relationships.| | 244|pathwaycom/llm-app !2025-03-2822928-10|Ready-to-run cloud templates for RAG, AI pipelines, and enterprise search with live data. 🐳Docker-friendly.⚡Always in sync with Sharepoint, Google Drive, S3, Kafka, PostgreSQL, real-time data APIs, and more.| | 245|warpdotdev/Warp !2025-03-282286825|Warp is a modern, Rust-based terminal with AI built in so you and your team can build great software, faster.| | 246|🔥agno-agi/agno !2025-03-2822833298|Agno is a lightweight library for building Multimodal Agents. It exposes LLMs as a unified API and gives them superpowers like memory, knowledge, tools and reasoning.| | 247|qdrant/qdrant !2025-03-282275214 |Qdrant - Vector Database for the next generation of AI applications. Also available in the cloud https://cloud.qdrant.io/| | 248|ashishpatel26/500-AI-Machine-learning-Deep-learning-Computer-vision-NLP-Projects-with-code !2025-03-282271815|500 AI Machine learning Deep learning Computer vision NLP Projects with code| | 249|stanfordnlp/dspy !2025-03-282268321|Stanford DSPy: The framework for programming—not prompting—foundation models| | 250|PaddlePaddle/Paddle !2025-03-28226246|PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (『飞桨』核心框架,深度学习&机器学习高性能单机、分布式训练和跨平台部署)| | 251|zulip/zulip !2025-03-28225464|Zulip server and web application. Open-source team chat that helps teams stay productive and focused.| | 252|Hannibal046/Awesome-LLM !2025-03-282240721|Awesome-LLM: a curated list of Large Language Model| | 253|facefusion/facefusion !2025-03-282218812|Next generation face swapper and enhancer| | 254|Mozilla-Ocho/llamafile !2025-03-28220624|Distribute and run LLMs with a single file.| | 255|yuliskov/SmartTube !2025-03-282201614|SmartTube - an advanced player for set-top boxes and tvs running Android OS| | 256|haotian-liu/LLaVA !2025-03-282201316 |Large Language-and-Vision Assistant built towards multimodal GPT-4 level capabilities.| | 257|ashishps1/awesome-system-design-resources !2025-03-282189367|This repository contains System Design resources which are useful while preparing for interviews and learning Distributed Systems| | 258|Cinnamon/kotaemon !2025-03-28218248|An open-source RAG-based tool for chatting with your documents.| | 259|CodePhiliaX/Chat2DB !2025-03-282179757|🔥🔥🔥AI-driven database tool and SQL client, The hottest GUI client, supporting MySQL, Oracle, PostgreSQL, DB2, SQL Server, DB2, SQLite, H2, ClickHouse, and more.| | 260|blakeblackshear/frigate !2025-03-282177113|NVR with realtime local object detection for IP cameras| | 261|facebookresearch/audiocraft !2025-03-28217111|Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.| | 262|karpathy/minGPT !2025-03-28216567|A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training| | 263|grpc/grpc-go !2025-03-282159510|The Go language implementation of gRPC. HTTP/2 based RPC| | 264|HumanSignal/label-studio !2025-03-282137618|Label Studio is a multi-type data labeling and annotation tool with standardized output format| | 265|yoheinakajima/babyagi !2025-03-28212764 |uses OpenAI and Pinecone APIs to create, prioritize, and execute tasks, This is a pared-down version of the original Task-Driven Autonomous Agent| | 266|deepseek-ai/DeepSeek-Coder !2025-03-282118210|DeepSeek Coder: Let the Code Write Itself| | 267|BuilderIO/gpt-crawler !2025-03-282118010|Crawl a site to generate knowledge files to create your own custom GPT from a URL| | 268| openai/chatgpt-retrieval-plugin !2025-03-2821152-1 | Plugins are chat extensions designed specifically for language models like ChatGPT, enabling them to access up-to-date information, run computations, or interact with third-party services in response to a user's request.| | 269|microsoft/OmniParser !2025-03-282113123|A simple screen parsing tool towards pure vision based GUI agent| | 270|black-forest-labs/flux !2025-03-282107219|Official inference repo for FLUX.1 models| | 271|ItzCrazyKns/Perplexica !2025-03-282099154|Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI| | 272|microsoft/unilm !2025-03-28209876|Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities| | 273|Sanster/lama-cleaner !2025-03-282077614|Image inpainting tool powered by SOTA AI Model. Remove any unwanted object, defect, people from your pictures or erase and replace(powered by stable diffusion) any thing on your pictures.| | 274|assafelovic/gpt-researcher !2025-03-282057222|GPT based autonomous agent that does online comprehensive research on any given topic| | 275|PromtEngineer/localGPT !2025-03-28204230 |Chat with your documents on your local device using GPT models. No data leaves your device and 100% private.| | 276|elastic/kibana !2025-03-28203482|Your window into the Elastic Stack| | 277|fishaudio/fish-speech !2025-03-282033222|Brand new TTS solution| | 278|mlc-ai/mlc-llm !2025-03-282028110 |Enable everyone to develop, optimize and deploy AI models natively on everyone's devices.| | 279|deepset-ai/haystack !2025-03-282005320|🔍 Haystack is an open source NLP framework to interact with your data using Transformer models and LLMs (GPT-4, ChatGPT and alike). Haystack offers production-ready tools to quickly build complex question answering, semantic search, text generation applications, and more.| | 280|tree-sitter/tree-sitter !2025-03-28200487|An incremental parsing system for programming tools| | 281|Anjok07/ultimatevocalremovergui !2025-03-281999811|GUI for a Vocal Remover that uses Deep Neural Networks.| | 282|guidance-ai/guidance !2025-03-28199622|A guidance language for controlling large language models.| | 283|ml-explore/mlx !2025-03-28199619|MLX: An array framework for Apple silicon| | 284|mlflow/mlflow !2025-03-281995314|Open source platform for the machine learning lifecycle| | 285|ml-tooling/best-of-ml-python !2025-03-28198631|🏆 A ranked list of awesome machine learning Python libraries. Updated weekly.| | 286|BerriAI/litellm !2025-03-281981862|Call all LLM APIs using the OpenAI format. Use Bedrock, Azure, OpenAI, Cohere, Anthropic, Ollama, Sagemaker, HuggingFace, Replicate (100+ LLMs)| | 287|LazyVim/LazyVim !2025-03-281981320|Neovim config for the lazy| | 288|wez/wezterm !2025-03-281976018|A GPU-accelerated cross-platform terminal emulator and multiplexer written by @wez and implemented in Rust| | 289|valkey-io/valkey !2025-03-281970416|A flexible distributed key-value datastore that supports both caching and beyond caching workloads.| | 290|LiLittleCat/awesome-free-chatgpt !2025-03-28196185|🆓免费的 ChatGPT 镜像网站列表,持续更新。List of free ChatGPT mirror sites, continuously updated.| | 291|Byaidu/PDFMathTranslate !2025-03-281947645|PDF scientific paper translation with preserved formats - 基于 AI 完整保留排版的 PDF 文档全文双语翻译,支持 Google/DeepL/Ollama/OpenAI 等服务,提供 CLI/GUI/Docker| | 292|openai/swarm !2025-03-281947111|Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.| | 293|HqWu-HITCS/Awesome-Chinese-LLM !2025-03-281921423|Organizing smaller, cost-effective, privately deployable open-source Chinese language models, including related datasets and tutorials| | 294|stitionai/devika !2025-03-28190903|Devika is an Agentic AI Software Engineer that can understand high-level human instructions, break them down into steps, research relevant information, and write code to achieve the given objective. Devika aims to be a competitive open-source alternative to Devin by Cognition AI.| | 295|OpenBMB/MiniCPM-o !2025-03-28190887|MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone| | 296|samber/lo !2025-03-281904815|💥 A Lodash-style Go library based on Go 1.18+ Generics (map, filter, contains, find...)| | 297|chroma-core/chroma !2025-03-281895221 |the AI-native open-source embedding database| | 298|DarkFlippers/unleashed-firmware !2025-03-28189278|Flipper Zero Unleashed Firmware| | 299|brave/brave-browser !2025-03-281892710|Brave browser for Android, iOS, Linux, macOS, Windows.| | 300| tloen/alpaca-lora !2025-03-28188641 | Instruct-tune LLaMA on consumer hardware| | 301|VinciGit00/Scrapegraph-ai !2025-03-281884618|Python scraper based on AI| | 302|gitroomhq/postiz-app !2025-03-281879110|📨 Schedule social posts, measure them, exchange with other members and get a lot of help from AI 🚀| | 303|PrefectHQ/prefect !2025-03-281878715|Prefect is a workflow orchestration tool empowering developers to build, observe, and react to data pipelines| | 304|ymcui/Chinese-LLaMA-Alpaca !2025-03-28187723 |Chinese LLaMA & Alpaca LLMs| | 305|kenjihiranabe/The-Art-of-Linear-Algebra !2025-03-28187335|Graphic notes on Gilbert Strang's "Linear Algebra for Everyone"| | 306|joonspk-research/generativeagents !2025-03-28187288|Generative Agents: Interactive Simulacra of Human Behavior| | 307|renovatebot/renovate !2025-03-28186820|Universal dependency update tool that fits into your workflows.| | 308|gventuri/pandas-ai !2025-03-28186109 |Pandas AI is a Python library that integrates generative artificial intelligence capabilities into Pandas, making dataframes conversational| | 309|thingsboard/thingsboard !2025-03-28185184|Open-source IoT Platform - Device management, data collection, processing and visualization.| | 310|ente-io/ente !2025-03-28184722|Fully open source, End to End Encrypted alternative to Google Photos and Apple Photos| | 311|serengil/deepface !2025-03-281840113|A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python| | 312|Raphire/Win11Debloat !2025-03-281840132|A simple, easy to use PowerShell script to remove pre-installed apps from windows, disable telemetry, remove Bing from windows search as well as perform various other changes to declutter and improve your windows experience. This script works for both windows 10 and windows 11.| | 313|Avaiga/taipy !2025-03-28179235|Turns Data and AI algorithms into production-ready web applications in no time.| | 314|microsoft/qlib !2025-03-281784231|Qlib is an AI-oriented quantitative investment platform that aims to realize the potential, empower research, and create value using AI technologies in quantitative investment, from exploring ideas to implementing productions. Qlib supports diverse machine learning modeling paradigms. including supervised learning, market dynamics modeling, and RL.| | 315|CopilotKit/CopilotKit !2025-03-281778571|Build in-app AI chatbots 🤖, and AI-powered Textareas ✨, into react web apps.| | 316|QwenLM/Qwen-7B !2025-03-281766017|The official repo of Qwen-7B (通义千问-7B) chat & pretrained large language model proposed by Alibaba Cloud.| | 317|w-okada/voice-changer !2025-03-28176078 |リアルタイムボイスチェンジャー Realtime Voice Changer| | 318|rlabbe/Kalman-and-Bayesian-Filters-in-Python !2025-03-281756011|Kalman Filter book using Jupyter Notebook. Focuses on building intuition and experience, not formal proofs. Includes Kalman filters,extended Kalman filters, unscented Kalman filters, particle filters, and more. All exercises include solutions.| | 319|Mikubill/sd-webui-controlnet !2025-03-28174794 |WebUI extension for ControlNet| | 320|jingyaogong/minimind !2025-03-2817380116|「大模型」3小时完全从0训练26M的小参数GPT,个人显卡即可推理训练!| | 321|apify/crawlee !2025-03-28172696|Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Both headful and headless mode. With proxy rotation.| | 322|apple/ml-stable-diffusion !2025-03-28172395|Stable Diffusion with Core ML on Apple Silicon| | 323| transitive-bullshit/chatgpt-api !2025-03-28172095 | Node.js client for the official ChatGPT API. | | 324|teableio/teable !2025-03-281719222|✨ The Next Gen Airtable Alternative: No-Code Postgres| | 325| xx025/carrot !2025-03-28170900 | Free ChatGPT Site List | | 326|microsoft/LightGBM !2025-03-28170723|A fast, distributed, high-performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning tasks.| | 327|VikParuchuri/surya !2025-03-28169827|Accurate line-level text detection and recognition (OCR) in any language| | 328|deepseek-ai/Janus !2025-03-281692825|Janus-Series: Unified Multimodal Understanding and Generation Models| | 329|ardalis/CleanArchitecture !2025-03-28168823|Clean Architecture Solution Template: A starting point for Clean Architecture with ASP.NET Core| | 330|neondatabase/neon !2025-03-28166466|Neon: Serverless Postgres. We separated storage and compute to offer autoscaling, code-like database branching, and scale to zero.| | 331|kestra-io/kestra !2025-03-281661313|⚡ Workflow Automation Platform. Orchestrate & Schedule code in any language, run anywhere, 500+ plugins. Alternative to Zapier, Rundeck, Camunda, Airflow...| | 332|Dao-AILab/flash-attention !2025-03-281659720|Fast and memory-efficient exact attention| | 333|RPCS3/rpcs3 !2025-03-281655712|PS3 emulator/debugger| | 334|meta-llama/llama-recipes !2025-03-28165486|Scripts for fine-tuning Llama2 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization & question answering. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment.Demo apps to showcase Llama2 for WhatsApp & Messenger| | 335|emilwallner/Screenshot-to-code !2025-03-28165180|A neural network that transforms a design mock-up into a static website.| | 336|datawhalechina/llm-cookbook !2025-03-281650922|面向开发者的 LLM 入门教程,吴恩达大模型系列课程中文版| | 337|e2b-dev/awesome-ai-agents !2025-03-281643923|A list of AI autonomous agents| | 338|QwenLM/Qwen2.5 !2025-03-281641114|Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.| | 339|dair-ai/ML-YouTube-Courses !2025-03-28164114|📺 Discover the latest machine learning / AI courses on YouTube.| | 340|pybind/pybind11 !2025-03-28163620|Seamless operability between C++11 and Python| | 341|graphdeco-inria/gaussian-splatting !2025-03-281627116|Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"| | 342|meta-llama/codellama !2025-03-28162531|Inference code for CodeLlama models| | 343|TransformerOptimus/SuperAGI !2025-03-28161292 | SuperAGI - A dev-first open source autonomous AI agent framework. Enabling developers to build, manage & run useful autonomous agents quickly and reliably.| | 344|microsoft/onnxruntime !2025-03-28161169|ONNX Runtime: cross-platform, high-performance ML inferencing and training accelerator| | 345|IDEA-Research/Grounded-Segment-Anything !2025-03-281601411 |Marrying Grounding DINO with Segment Anything & Stable Diffusion & BLIP - Automatically Detect, Segment and Generate Anything with Image and Text Inputs| | 346|ddbourgin/numpy-ml !2025-03-28160054|Machine learning, in numpy| | 347|eosphoros-ai/DB-GPT !2025-03-281585225|Revolutionizing Database Interactions with Private LLM Technology| | 348|Stability-AI/StableLM !2025-03-28158310 |Stability AI Language Models| | 349|openai/evals !2025-03-28157935 |Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.| | 350|THUDM/ChatGLM2-6B !2025-03-28157500|ChatGLM2-6B: An Open Bilingual Chat LLM | | 351|sunner/ChatALL !2025-03-28156761 |Concurrently chat with ChatGPT, Bing Chat, Bard, Alpaca, Vincuna, Claude, ChatGLM, MOSS, iFlytek Spark, ERNIE and more, discover the best answers| | 352|abseil/abseil-cpp !2025-03-28156656|Abseil Common Libraries (C++)| | 353|NVIDIA/open-gpu-kernel-modules !2025-03-28156531|NVIDIA Linux open GPU kernel module source| | 354|letta-ai/letta !2025-03-281563718|Letta (formerly MemGPT) is a framework for creating LLM services with memory.| | 355|typescript-eslint/typescript-eslint !2025-03-28156211|✨ Monorepo for all the tooling which enables ESLint to support TypeScript| | 356|umijs/umi !2025-03-28156211|A framework in react community ✨| | 357|AI4Finance-Foundation/FinGPT !2025-03-281561215|Data-Centric FinGPT. Open-source for open finance! Revolutionize 🔥 We'll soon release the trained model.| | 358|amplication/amplication !2025-03-28156022|🔥🔥🔥 The Only Production-Ready AI-Powered Backend Code Generation| | 359|KindXiaoming/pykan !2025-03-28155477|Kolmogorov Arnold Networks| | 360|arc53/DocsGPT !2025-03-28154900|GPT-powered chat for documentation, chat with your documents| | 361|influxdata/telegraf !2025-03-28154502|Agent for collecting, processing, aggregating, and writing metrics, logs, and other arbitrary data.| | 362|microsoft/Bringing-Old-Photos-Back-to-Life !2025-03-28154084|Bringing Old Photo Back to Life (CVPR 2020 oral)| | 363|GaiZhenbiao/ChuanhuChatGPT !2025-03-2815394-2|GUI for ChatGPT API and many LLMs. Supports agents, file-based QA, GPT finetuning and query with web search. All with a neat UI.| | 364|Zeyi-Lin/HivisionIDPhotos !2025-03-281529710|⚡️HivisionIDPhotos: a lightweight and efficient AI ID photos tools. 一个轻量级的AI证件照制作算法。| | 365| mayooear/gpt4-pdf-chatbot-langchain !2025-03-281529518 | GPT4 & LangChain Chatbot for large PDF docs | | 366|1Panel-dev/MaxKB !2025-03-2815277148|? Based on LLM large language model knowledge base Q&A system. Ready to use out of the box, supports quick integration into third-party business systems. Officially produced by 1Panel| | 367|ai16z/eliza !2025-03-281526811|Conversational Agent for Twitter and Discord| | 368|apache/arrow !2025-03-28151684|Apache Arrow is a multi-language toolbox for accelerated data interchange and in-memory processing| | 369|princeton-nlp/SWE-agent !2025-03-281516119|SWE-agent: Agent Computer Interfaces Enable Software Engineering Language Models| | 370|mlc-ai/web-llm !2025-03-281509311 |Bringing large-language models and chat to web browsers. Everything runs inside the browser with no server support.| | 371|guillaumekln/faster-whisper !2025-03-281507117 |Faster Whisper transcription with CTranslate2| | 372|overleaf/overleaf !2025-03-28150316|A web-based collaborative LaTeX editor| | 373|triton-lang/triton !2025-03-28150169|Development repository for the Triton language and compiler| | 374|soxoj/maigret !2025-03-281500410|🕵️‍♂️ Collect a dossier on a person by username from thousands of sites| | 375|alibaba/lowcode-engine !2025-03-28149841|An enterprise-class low-code technology stack with scale-out design / 一套面向扩展设计的企业级低代码技术体系| | 376|espressif/esp-idf !2025-03-28148545|Espressif IoT Development Framework. Official development framework for Espressif SoCs.| | 377|pgvector/pgvector !2025-03-281484913|Open-source vector similarity search for Postgres| | 378|datawhalechina/leedl-tutorial !2025-03-28148246|《李宏毅深度学习教程》(李宏毅老师推荐👍),PDF下载地址:https://github.com/datawhalechina/leedl-tutorial/releases| | 379|xcanwin/KeepChatGPT !2025-03-28147972 |Using ChatGPT is more efficient and smoother, perfectly solving ChatGPT network errors. No longer do you need to frequently refresh the webpage, saving over 10 unnecessary steps| | 380|m-bain/whisperX !2025-03-281471313|WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)| | 381|HumanAIGC/AnimateAnyone !2025-03-2814706-1|Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation| |!green-up-arrow.svg 382|naklecha/llama3-from-scratch !2025-03-281469024|llama3 implementation one matrix multiplication at a time| |!red-down-arrow 383| fauxpilot/fauxpilot !2025-03-28146871 | An open-source GitHub Copilot server | | 384|LlamaFamily/Llama-Chinese !2025-03-28145111|Llama Chinese Community, the best Chinese Llama large model, fully open source and commercially available| | 385|BradyFU/Awesome-Multimodal-Large-Language-Models !2025-03-281450121|Latest Papers and Datasets on Multimodal Large Language Models| | 386|vanna-ai/vanna !2025-03-281449819|🤖 Chat with your SQL database 📊. Accurate Text-to-SQL Generation via LLMs using RAG 🔄.| | 387|bleedline/aimoneyhunter !2025-03-28144845|AI Side Hustle Money Mega Collection: Teaching You How to Utilize AI for Various Side Projects to Earn Extra Income.| | 388|stefan-jansen/machine-learning-for-trading !2025-03-28144629|Code for Machine Learning for Algorithmic Trading, 2nd edition.| | 389|state-spaces/mamba !2025-03-28144139|Mamba: Linear-Time Sequence Modeling with Selective State Spaces| | 390|vercel/ai-chatbot !2025-03-281434614|A full-featured, hackable Next.js AI chatbot built by Vercel| | 391|steven-tey/novel !2025-03-281428410|Notion-style WYSIWYG editor with AI-powered autocompletions| | 392|unifyai/ivy !2025-03-281409348|Unified AI| | 393|chidiwilliams/buzz !2025-03-281402411 |Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.| | 394|lukas-blecher/LaTeX-OCR !2025-03-28139769|pix2tex: Using a ViT to convert images of equations into LaTeX code.| | 395|openai/tiktoken !2025-03-28139599|tiktoken is a fast BPE tokeniser for use with OpenAI's models.| | 396|nocobase/nocobase !2025-03-281391522|NocoBase is a scalability-first, open-source no-code/low-code platform for building business applications and enterprise solutions.| | 397|neonbjb/tortoise-tts !2025-03-28139010 |A multi-voice TTS system trained with an emphasis on quality| | 398|yamadashy/repomix !2025-03-281382036|📦 Repomix (formerly Repopack) is a powerful tool that packs your entire repository into a single, AI-friendly file. Perfect for when you need to feed your codebase to Large Language Models (LLMs) or other AI tools like Claude, ChatGPT, and Gemini.| | 399|adobe/react-spectrum !2025-03-28136766|A collection of libraries and tools that help you build adaptive, accessible, and robust user experiences.| | 400|THUDM/ChatGLM3 !2025-03-28136684|ChatGLM3 series: Open Bilingual Chat LLMs | | 401|NVIDIA/NeMo !2025-03-28134837|A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)| | 402|BlinkDL/RWKV-LM !2025-03-28134346 |RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it combines the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding.| | 403| fuergaosi233/wechat-chatgpt !2025-03-28133330 | Use ChatGPT On Wechat via wechaty | | 404|udecode/plate !2025-03-28133325|A rich-text editor powered by AI| | 405|xenova/transformers.js !2025-03-281331219|State-of-the-art Machine Learning for the web. Run 🤗 Transformers directly in your browser, with no need for a server!| | 406|stas00/ml-engineering !2025-03-281325615|Machine Learning Engineering Guides and Tools| | 407| wong2/chatgpt-google-extension !2025-03-2813241-1 | A browser extension that enhances search engines with ChatGPT, this repos will not be updated from 2023-02-20| | 408|mrdbourke/pytorch-deep-learning !2025-03-281317520|Materials for the Learn PyTorch for Deep Learning: Zero to Mastery course.| | 409|Koenkk/zigbee2mqtt !2025-03-28131544|Zigbee 🐝 to MQTT bridge 🌉, get rid of your proprietary Zigbee bridges 🔨| | 410|vercel-labs/ai !2025-03-281298528|Build AI-powered applications with React, Svelte, and Vue| | 411|netease-youdao/QAnything !2025-03-28129318|Question and Answer based on Anything.| | 412|huggingface/trl !2025-03-281289622|Train transformer language models with reinforcement learning.| | 413|microsoft/BitNet !2025-03-28128503|Official inference framework for 1-bit LLMs| | 414|mediar-ai/screenpipe !2025-03-281283915|24/7 local AI screen & mic recording. Build AI apps that have the full context. Works with Ollama. Alternative to Rewind.ai. Open. Secure. You own your data. Rust.| | 415|Skyvern-AI/skyvern !2025-03-281277612|Automate browser-based workflows with LLMs and Computer Vision| | 416|pytube/pytube !2025-03-28126591|A lightweight, dependency-free Python library (and command-line utility) for downloading YouTube Videos.| | 417|official-stockfish/Stockfish !2025-03-28126574|UCI chess engine| | 418|sgl-project/sglang !2025-03-281260143|SGLang is a structured generation language designed for large language models (LLMs). It makes your interaction with LLMs faster and more controllable.| | 419|plasma-umass/scalene !2025-03-28125535|Scalene: a high-performance, high-precision CPU, GPU, and memory profiler for Python with AI-powered optimization proposals| | 420|danswer-ai/danswer !2025-03-28125503|Ask Questions in natural language and get Answers backed by private sources. Connects to tools like Slack, GitHub, Confluence, etc.| | 421|OpenTalker/SadTalker !2025-03-28125226|[CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation| | 422|facebookresearch/AnimatedDrawings !2025-03-28123693 |Code to accompany "A Method for Animating Children's Drawings of the Human Figure"| | 423|activepieces/activepieces !2025-03-28123609|Your friendliest open source all-in-one automation tool ✨ Workflow automation tool 100+ integration / Enterprise automation tool / Zapier Alternative| | 424|ggerganov/ggml !2025-03-28121992 |Tensor library for machine learning| | 425|bytebase/bytebase !2025-03-28121694|World's most advanced database DevOps and CI/CD for Developer, DBA and Platform Engineering teams. The GitLab/GitHub for database DevOps.| | 426| willwulfken/MidJourney-Styles-and-Keywords-Reference !2025-03-28120971 | A reference containing Styles and Keywords that you can use with MidJourney AI| | 427|Huanshere/VideoLingo !2025-03-281207013|Netflix-level subtitle cutting, translation, alignment, and even dubbing - one-click fully automated AI video subtitle team | | 428|OpenLMLab/MOSS !2025-03-28120330 |An open-source tool-augmented conversational language model from Fudan University| | 429|llmware-ai/llmware !2025-03-281200727|Providing enterprise-grade LLM-based development framework, tools, and fine-tuned models.| | 430|PKU-YuanGroup/Open-Sora-Plan !2025-03-28119362|This project aim to reproduce Sora (Open AI T2V model), but we only have limited resource. We deeply wish the all open source community can contribute to this project.| | 431|ShishirPatil/gorilla !2025-03-28119332 |Gorilla: An API store for LLMs| | 432|NVIDIA/Megatron-LM !2025-03-281192716|Ongoing research training transformer models at scale| | 433|illacloud/illa-builder !2025-03-28119192|Create AI-Driven Apps like Assembling Blocks| | 434|marimo-team/marimo !2025-03-281191521|A reactive notebook for Python — run reproducible experiments, execute as a script, deploy as an app, and version with git.| | 435|smol-ai/developer !2025-03-28119111 | With 100k context windows on the way, it's now feasible for every dev to have their own smol developer| | 436|Lightning-AI/litgpt !2025-03-28118878|Pretrain, finetune, deploy 20+ LLMs on your own data. Uses state-of-the-art techniques: flash attention, FSDP, 4-bit, LoRA, and more.| | 437|openai/shap-e !2025-03-28118474 |Generate 3D objects conditioned on text or images| | 438|eugeneyan/open-llms !2025-03-28118451 |A list of open LLMs available for commercial use.| | 439|andrewyng/aisuite !2025-03-28118124|Simple, unified interface to multiple Generative AI providers| | 440|hajimehoshi/ebiten !2025-03-28117816|Ebitengine - A dead simple 2D game engine for Go| | 441|kgrzybek/modular-monolith-with-ddd !2025-03-28117493|Full Modular Monolith application with Domain-Driven Design approach.| | 442|h2oai/h2ogpt !2025-03-2811736-1 |Come join the movement to make the world's best open source GPT led by H2O.ai - 100% private chat and document search, no data leaks, Apache 2.0| | 443|owainlewis/awesome-artificial-intelligence !2025-03-28117332|A curated list of Artificial Intelligence (AI) courses, books, video lectures and papers.| | 444|DataTalksClub/mlops-zoomcamp !2025-03-28116643|Free MLOps course from DataTalks.Club| | 445|Rudrabha/Wav2Lip !2025-03-281163410|This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020.| | 446|aishwaryanr/awesome-generative-ai-guide !2025-03-281152810|A one stop repository for generative AI research updates, interview resources, notebooks and much more!| | 447|karpathy/micrograd !2025-03-28115146|A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API| | 448|InstantID/InstantID !2025-03-28115111|InstantID : Zero-shot Identity-Preserving Generation in Seconds 🔥| | 449|facebookresearch/seamlesscommunication !2025-03-28114434|Foundational Models for State-of-the-Art Speech and Text Translation| | 450|anthropics/anthropic-cookbook !2025-03-281140112|A collection of notebooks/recipes showcasing some fun and effective ways of using Claude.| | 451|mastra-ai/mastra !2025-03-281139240|the TypeScript AI agent framework| | 452|NVIDIA/TensorRT !2025-03-28113864|NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.| | 453|plandex-ai/plandex !2025-03-28113645|An AI coding engine for complex tasks| | 454|RUCAIBox/LLMSurvey !2025-03-28112735 |A collection of papers and resources related to Large Language Models.| | 455|kubeshark/kubeshark !2025-03-28112711|The API traffic analyzer for Kubernetes providing real-time K8s protocol-level visibility, capturing and monitoring all traffic and payloads going in, out and across containers, pods, nodes and clusters. Inspired by Wireshark, purposely built for Kubernetes| | 456|electric-sql/pglite !2025-03-28112617|Lightweight Postgres packaged as WASM into a TypeScript library for the browser, Node.js, Bun and Deno from https://electric-sql.com| | 457|lightaime/camel !2025-03-281124441 |🐫 CAMEL: Communicative Agents for “Mind” Exploration of Large Scale Language Model Society| | 458|huggingface/lerobot !2025-03-281120184|🤗 LeRobot: State-of-the-art Machine Learning for Real-World Robotics in Pytorch| | 459|normal-computing/outlines !2025-03-28111657|Generative Model Programming| | 460|libretro/RetroArch !2025-03-28110701|Cross-platform, sophisticated frontend for the libretro API. Licensed GPLv3.| | 461|THUDM/CogVideo !2025-03-28110599|Text-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)| | 462|bentoml/OpenLLM !2025-03-28110495|An open platform for operating large language models (LLMs) in production. Fine-tune, serve, deploy, and monitor any LLMs with ease.| | 463|vosen/ZLUDA !2025-03-28110429|CUDA on AMD GPUs| | 464|dair-ai/ML-Papers-of-the-Week !2025-03-28110304 |🔥Highlighting the top ML papers every week.| | 465|WordPress/gutenberg !2025-03-28110212|The Block Editor project for WordPress and beyond. Plugin is available from the official repository.| | 466|microsoft/data-formulator !2025-03-281099827|🪄 Create rich visualizations with AI| | 467|LibreTranslate/LibreTranslate !2025-03-28109887|Free and Open Source Machine Translation API. Self-hosted, offline capable and easy to setup.| | 468|block/goose !2025-03-281097737|an open-source, extensible AI agent that goes beyond code suggestions - install, execute, edit, and test with any LLM| | 469|getumbrel/llama-gpt !2025-03-28109553|A self-hosted, offline, ChatGPT-like chatbot. Powered by Llama 2. 100% private, with no data leaving your device.| | 470|HigherOrderCO/HVM !2025-03-28109182|A massively parallel, optimal functional runtime in Rust| | 471|databrickslabs/dolly !2025-03-2810812-3 | A large language model trained on the Databricks Machine Learning Platform| | 472|srush/GPU-Puzzles !2025-03-28108014|Solve puzzles. Learn CUDA.| | 473|Z3Prover/z3 !2025-03-28107952|The Z3 Theorem Prover| | 474|UFund-Me/Qbot !2025-03-281079313 |Qbot is an AI-oriented quantitative investment platform, which aims to realize the potential, empower AI technologies in quantitative investment| | 475|langchain-ai/langgraph !2025-03-281077336|| | 476|lz4/lz4 !2025-03-28107647|Extremely Fast Compression algorithm| | 477|magic-research/magic-animate !2025-03-28107160|MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model| | 478|PaperMC/Paper !2025-03-281071410|The most widely used, high performance Minecraft server that aims to fix gameplay and mechanics inconsistencies| | 479|getomni-ai/zerox !2025-03-281071015|Zero shot pdf OCR with gpt-4o-mini| |!green-up-arrow.svg 480|🔥NirDiamant/GenAIAgents !2025-03-2810693318|This repository provides tutorials and implementations for various Generative AI Agent techniques, from basic to advanced. It serves as a comprehensive guide for building intelligent, interactive AI systems.| |!red-down-arrow 481|Unstructured-IO/unstructured !2025-03-28106889|Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.| | 482|apache/thrift !2025-03-28106610|Apache Thrift| | 483| TheR1D/shellgpt !2025-03-28106097 | A command-line productivity tool powered by ChatGPT, will help you accomplish your tasks faster and more efficiently | | 484|TheRamU/Fay !2025-03-281060312 |Fay is a complete open source project that includes Fay controller and numeral models, which can be used in different applications such as virtual hosts, live promotion, numeral human interaction and so on| | 485|zyronon/douyin !2025-03-28105566|Vue3 + Pinia + Vite5 仿抖音,Vue 在移动端的最佳实践 . Imitate TikTok ,Vue Best practices on Mobile| | 486|THU-MIG/yolov10 !2025-03-28105485|YOLOv10: Real-Time End-to-End Object Detection| | 487|idootop/mi-gpt !2025-03-281052522|? Transform XiaoAi speaker into a personal voice assistant with ChatGPT and DouBao integration.| | 488|SakanaAI/AI-Scientist !2025-03-281051310|The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬| | 489|szimek/sharedrop !2025-03-28105101|Easy P2P file transfer powered by WebRTC - inspired by Apple AirDrop| | 490|salesforce/LAVIS !2025-03-28103942 |LAVIS - A One-stop Library for Language-Vision Intelligence| | 491|aws/amazon-sagemaker-examples !2025-03-28103654|Example 📓 Jupyter notebooks that demonstrate how to build, train, and deploy machine learning models using 🧠 Amazon SageMaker.| | 492|artidoro/qlora !2025-03-28103402 |QLoRA: Efficient Finetuning of Quantized LLMs| | 493|lllyasviel/stable-diffusion-webui-forge !2025-03-281029314| a platform on top of Stable Diffusion WebUI (based on Gradio) to make development easier, optimize resource management, and speed up inference| | 494|NielsRogge/Transformers-Tutorials !2025-03-28102487|This repository contains demos I made with the Transformers library by HuggingFace.| | 495|kedro-org/kedro !2025-03-28102371|Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and data science pipelines that are reproducible, maintainable, and modular.| | 496| chathub-dev/chathub !2025-03-28102301 | All-in-one chatbot client | | 497|microsoft/promptflow !2025-03-28101612|Build high-quality LLM apps - from prototyping, testing to production deployment and monitoring.| | 498|mistralai/mistral-src !2025-03-28101372|Reference implementation of Mistral AI 7B v0.1 model.| | 499|burn-rs/burn !2025-03-28101183|Burn - A Flexible and Comprehensive Deep Learning Framework in Rust| | 500|AIGC-Audio/AudioGPT !2025-03-28101150 |AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head| | 501|facebookresearch/dinov2 !2025-03-281011210 |PyTorch code and models for the DINOv2 self-supervised learning method.| | 502|RockChinQ/LangBot !2025-03-281008455|😎丰富生态、🧩支持扩展、🦄多模态 - 大模型原生即时通信机器人平台 🤖 | | 503|78/xiaozhi-esp32 !2025-03-281008180|Build your own AI friend| | 504|cumulo-autumn/StreamDiffusion !2025-03-28100761|StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation| | 505|DataTalksClub/machine-learning-zoomcamp !2025-03-28100664|The code from the Machine Learning Bookcamp book and a free course based on the book| | 506|nerfstudio-project/nerfstudio !2025-03-28100343|A collaboration friendly studio for NeRFs| | 507|cupy/cupy !2025-03-28100344|NumPy & SciPy for GPU| | 508|NVIDIA/TensorRT-LLM !2025-03-281000823|TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.| | 509|wasp-lang/open-saas !2025-03-2899665|A free, open-source SaaS app starter for React & Node.js with superpowers. Production-ready. Community-driven.| | 510|huggingface/text-generation-inference !2025-03-2899383|Large Language Model Text Generation Inference| | 511|jxnl/instructor !2025-03-2899224|structured outputs for llms| | 512|GoogleCloudPlatform/generative-ai !2025-03-2899086|Sample code and notebooks for Generative AI on Google Cloud| | 513|manticoresoftware/manticoresearch !2025-03-2898799|Easy to use open source fast database for search | | 514|langfuse/langfuse !2025-03-28985134|🪢 Open source LLM engineering platform. Observability, metrics, evals, prompt management, testing, prompt playground, datasets, LLM evaluations -- 🍊YC W23 🤖 integrate via Typescript, Python / Decorators, OpenAI, Langchain, LlamaIndex, Litellm, Instructor, Mistral, Perplexity, Claude, Gemini, Vertex| | 515|keephq/keep !2025-03-2897949|The open-source alert management and AIOps platform| | 516|sashabaranov/go-openai !2025-03-2897843|OpenAI ChatGPT, GPT-3, GPT-4, DALL·E, Whisper API wrapper for Go| | 517|autowarefoundation/autoware !2025-03-2897766|Autoware - the world's leading open-source software project for autonomous driving| | 518|anthropics/courses !2025-03-2897269|Anthropic's educational courses| | 519|popcorn-official/popcorn-desktop !2025-03-2896853|Popcorn Time is a multi-platform, free software BitTorrent client that includes an integrated media player ( Windows / Mac / Linux ) A Butter-Project Fork| | 520|getmaxun/maxun !2025-03-28968515|🔥 Open-source no-code web data extraction platform. Turn websites to APIs and spreadsheets with no-code robots in minutes! [In Beta]| | 521|wandb/wandb !2025-03-2896763|🔥 A tool for visualizing and tracking your machine learning experiments. This repo contains the CLI and Python API.| | 522|karpathy/minbpe !2025-03-2895353|Minimal, clean, code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.| | 523|bigscience-workshop/petals !2025-03-2895142|🌸 Run large language models at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading| | 524|OthersideAI/self-operating-computer !2025-03-2894931|A framework to enable multimodal models to operate a computer.| | 525|mshumer/gpt-prompt-engineer !2025-03-2894911|| | 526| BloopAI/bloop !2025-03-2894710 | A fast code search engine written in Rust| | 527|BlinkDL/ChatRWKV !2025-03-289467-1 |ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source.| | 528|timlrx/tailwind-nextjs-starter-blog !2025-03-2894677|This is a Next.js, Tailwind CSS blogging starter template. Comes out of the box configured with the latest technologies to make technical writing a breeze. Easily configurable and customizable. Perfect as a replacement to existing Jekyll and Hugo individual blogs.| | 529|google/benchmark !2025-03-2893634|A microbenchmark support library| | 530|facebookresearch/nougat !2025-03-2893603|Implementation of Nougat Neural Optical Understanding for Academic Documents| | 531|modelscope/facechain !2025-03-2893536|FaceChain is a deep-learning toolchain for generating your Digital-Twin.| | 532|DrewThomasson/ebook2audiobook !2025-03-2893388|Convert ebooks to audiobooks with chapters and metadata using dynamic AI models and voice cloning. Supports 1,107+ languages!| | 533|RayTracing/raytracing.github.io !2025-03-2893035|Main Web Site (Online Books)| | 534|QwenLM/Qwen2.5-VL !2025-03-28930249|Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.| | 535|WongKinYiu/yolov9 !2025-03-2892201|Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information| | 536|alibaba-damo-academy/FunASR !2025-03-28920222|A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models.| | 537|Visualize-ML/Book4Power-of-Matrix !2025-03-2891931|Book4 'Power of Matrix' | | 538|dice2o/BingGPT !2025-03-289185-1 |Desktop application of new Bing's AI-powered chat (Windows, macOS and Linux)| | 539|browserbase/stagehand !2025-03-28917621|An AI web browsing framework focused on simplicity and extensibility.| | 540|FlagOpen/FlagEmbedding !2025-03-28914111|Dense Retrieval and Retrieval-augmented LLMs| | 541|Const-me/Whisper !2025-03-2890979|High-performance GPGPU inference of OpenAI's Whisper automatic speech recognition (ASR) model| | 542|lucidrains/denoising-diffusion-pytorch !2025-03-2890942|Implementation of Denoising Diffusion Probabilistic Model in Pytorch| | 543|Chainlit/chainlit !2025-03-28904422|Build Conversational AI in minutes ⚡️| | 544|togethercomputer/OpenChatKit !2025-03-2890160 |OpenChatKit provides a powerful, open-source base to create both specialized and general purpose chatbots for various applications| | 545|Stability-AI/StableStudio !2025-03-2889631 |Community interface for generative AI| | 546|voicepaw/so-vits-svc-fork !2025-03-2889482 |so-vits-svc fork with realtime support, improved interface and more features.| | 547|pymc-devs/pymc !2025-03-2889413|Bayesian Modeling and Probabilistic Programming in Python| | 548|espnet/espnet !2025-03-2889302|End-to-End Speech Processing Toolkit| | 549|kedacore/keda !2025-03-2888991|KEDA is a Kubernetes-based Event Driven Autoscaling component. It provides event driven scale for any container running in Kubernetes| | 550|open-mmlab/Amphion !2025-03-28886911|Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.| | 551|gorse-io/gorse !2025-03-2888451|Gorse open source recommender system engine| | 552|adams549659584/go-proxy-bingai !2025-03-288768-1 |A Microsoft New Bing demo site built with Vue3 and Go, providing a consistent UI experience, supporting ChatGPT prompts, and accessible within China| | 553|open-mmlab/mmsegmentation !2025-03-2887513|OpenMMLab Semantic Segmentation Toolbox and Benchmark.| | 554|bytedance/monolith !2025-03-2887223|ByteDance's Recommendation System| | 555|LouisShark/chatgptsystemprompt !2025-03-2887216|store all agent's system prompt| | 556|brexhq/prompt-engineering !2025-03-2887080 |Tips and tricks for working with Large Language Models like OpenAI's GPT-4.| | 557|erincatto/box2d !2025-03-2886841|Box2D is a 2D physics engine for games| | 558|🔥microsoft/ai-agents-for-beginners !2025-03-288669323|10 Lessons to Get Started Building AI Agents| | 559|nashsu/FreeAskInternet !2025-03-2886102|FreeAskInternet is a completely free, private and locally running search aggregator & answer generate using LLM, without GPU needed. The user can ask a question and the system will make a multi engine search and combine the search result to the ChatGPT3.5 LLM and generate the answer based on search results.| | 560|goldmansachs/gs-quant !2025-03-2885981|Python toolkit for quantitative finance| | 561|srbhr/Resume-Matcher !2025-03-2885800|Open Source Free ATS Tool to compare Resumes with Job Descriptions and create a score to rank them.| | 562|facebookresearch/ImageBind !2025-03-2885681 |ImageBind One Embedding Space to Bind Them All| | 563|ashawkey/stable-dreamfusion !2025-03-2885481 |A pytorch implementation of text-to-3D dreamfusion, powered by stable diffusion.| | 564|meetecho/janus-gateway !2025-03-2885232|Janus WebRTC Server| | 565|google/magika !2025-03-2885003|Detect file content types with deep learning| | 566|huggingface/chat-ui !2025-03-2884871 |Open source codebase powering the HuggingChat app| | 567|EleutherAI/lm-evaluation-harness !2025-03-28843012|A framework for few-shot evaluation of autoregressive language models.| | 568|jina-ai/reader !2025-03-2884089|Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/| | 569|microsoft/TypeChat !2025-03-288406-1|TypeChat is a library that makes it easy to build natural language interfaces using types.| | 570|thuml/Time-Series-Library !2025-03-28839715|A Library for Advanced Deep Time Series Models.| | 571|OptimalScale/LMFlow !2025-03-2883882|An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Model for All.| | 572|baptisteArno/typebot.io !2025-03-2883845|💬 Typebot is a powerful chatbot builder that you can self-host.| | 573|jzhang38/TinyLlama !2025-03-2883504|The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.| | 574|fishaudio/Bert-VITS2 !2025-03-2883472|vits2 backbone with multilingual-bert| | 575|OpenBMB/XAgent !2025-03-2882683|An Autonomous LLM Agent for Complex Task Solving| | 576|Acly/krita-ai-diffusion !2025-03-2882387|Streamlined interface for generating images with AI in Krita. Inpaint and outpaint with optional text prompt, no tweaking required.| | 577|jasonppy/VoiceCraft !2025-03-2882151|Zero-Shot Speech Editing and Text-to-Speech in the Wild| | 578|SJTU-IPADS/PowerInfer !2025-03-2881693|High-speed Large Language Model Serving on PCs with Consumer-grade GPUs| | 579|modelscope/DiffSynth-Studio !2025-03-28814713|Enjoy the magic of Diffusion models!| | 580|o3de/o3de !2025-03-2881443|Open 3D Engine (O3DE) is an Apache 2.0-licensed multi-platform 3D engine that enables developers and content creators to build AAA games, cinema-quality 3D worlds, and high-fidelity simulations without any fees or commercial obligations.| | 581|zmh-program/chatnio !2025-03-2881325|🚀 Next Generation AI One-Stop Internationalization Solution. 🚀 下一代 AI 一站式 B/C 端解决方案,支持 OpenAI,Midjourney,Claude,讯飞星火,Stable Diffusion,DALL·E,ChatGLM,通义千问,腾讯混元,360 智脑,百川 AI,火山方舟,新必应,Gemini,Moonshot 等模型,支持对话分享,自定义预设,云端同步,模型市场,支持弹性计费和订阅计划模式,支持图片解析,支持联网搜索,支持模型缓存,丰富美观的后台管理与仪表盘数据统计。| | 582|leptonai/searchwithlepton !2025-03-2880632|Building a quick conversation-based search demo with Lepton AI.| | 583|sebastianstarke/AI4Animation !2025-03-2880620|Bringing Characters to Life with Computer Brains in Unity| | 584|wangrongding/wechat-bot !2025-03-2880528|🤖一个基于 WeChaty 结合 DeepSeek / ChatGPT / Kimi / 讯飞等Ai服务实现的微信机器人 ,可以用来帮助你自动回复微信消息,或者管理微信群/好友,检测僵尸粉等...| | 585|openvinotoolkit/openvino !2025-03-2880528|OpenVINO™ is an open-source toolkit for optimizing and deploying AI inference| | 586|steven2358/awesome-generative-ai !2025-03-28802610|A curated list of modern Generative Artificial Intelligence projects and services| | 587|adam-maj/tiny-gpu !2025-03-2880234|A minimal GPU design in Verilog to learn how GPUs work from the ground up| | 588| anse-app/chatgpt-demo !2025-03-2880180 | A demo repo based on OpenAI API (gpt-3.5-turbo) | | 589| acheong08/EdgeGPT !2025-03-288015-1 |Reverse engineered API of Microsoft's Bing Chat | | 590|ai-collection/ai-collection !2025-03-2879994 |The Generative AI Landscape - A Collection of Awesome Generative AI Applications| | 591|GreyDGL/PentestGPT !2025-03-2879953 |A GPT-empowered penetration testing tool| | 592|delta-io/delta !2025-03-2879112|An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs| | 593|dataelement/bisheng !2025-03-2879085|Bisheng is an open LLM devops platform for next generation AI applications.| | 594|e2b-dev/e2b !2025-03-2878447 |Vercel for AI agents. We help developers to build, deploy, and monitor AI agents. Focusing on specialized AI agents that build software for you - your personal software developers.| | 595|01-ai/Yi !2025-03-2878311|A series of large language models trained from scratch by developers @01-ai| | 596|Plachtaa/VALL-E-X !2025-03-287830-1|An open source implementation of Microsoft's VALL-E X zero-shot TTS model. The demo is available at https://plachtaa.github.io| | 597|abhishekkrthakur/approachingalmost !2025-03-2878204|Approaching (Almost) Any Machine Learning Problem| | 598|pydantic/pydantic-ai !2025-03-28781041|Agent Framework / shim to use Pydantic with LLMs| | 599|rany2/edge-tts !2025-03-2877901|Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key| | 600|CASIA-IVA-Lab/FastSAM !2025-03-2877881|Fast Segment Anything| | 601|netease-youdao/EmotiVoice !2025-03-2877817|EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine| | 602|lllyasviel/IC-Light !2025-03-2877804|More relighting!| | 603|kroma-network/tachyon !2025-03-287774-1|Modular ZK(Zero Knowledge) backend accelerated by GPU| | 604|deep-floyd/IF !2025-03-2877731 |A novel state-of-the-art open-source text-to-image model with a high degree of photorealism and language understanding| | 605|oumi-ai/oumi !2025-03-2877705|Everything you need to build state-of-the-art foundation models, end-to-end.| | 606|reorproject/reor !2025-03-2877681|AI note-taking app that runs models locally.| | 607|lightpanda-io/browser !2025-03-28775813|Lightpanda: the headless browser designed for AI and automation| | 608|xiangsx/gpt4free-ts !2025-03-287755-1|Providing a free OpenAI GPT-4 API ! This is a replication project for the typescript version of xtekky/gpt4free| | 609|IDEA-Research/GroundingDINO !2025-03-28773311|Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"| | 610|bunkerity/bunkerweb !2025-03-2877326|🛡️ Make your web services secure by default !| | 611|vikhyat/moondream !2025-03-2877057|tiny vision language model| | 612|firmai/financial-machine-learning !2025-03-287703-1|A curated list of practical financial machine learning tools and applications.| | 613|n8n-io/self-hosted-ai-starter-kit !2025-03-28765121|The Self-hosted AI Starter Kit is an open-source template that quickly sets up a local AI environment. Curated by n8n, it provides essential tools for creating secure, self-hosted AI workflows.| | 614|intel-analytics/ipex-llm !2025-03-2876507|Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, Baichuan, Mixtral, Gemma, etc.) on Intel CPU and GPU (e.g., local PC with iGPU, discrete GPU such as Arc, Flex and Max). A PyTorch LLM library that seamlessly integrates with llama.cpp, HuggingFace, LangChain, LlamaIndex, DeepSpeed, vLLM, FastChat, ModelScope, etc.| | 615|jrouwe/JoltPhysics !2025-03-28764510|A multi core friendly rigid body physics and collision detection library. Written in C++. Suitable for games and VR applications. Used by Horizon Forbidden West.| | 616|THUDM/CodeGeeX2 !2025-03-2876270|CodeGeeX2: A More Powerful Multilingual Code Generation Model| | 617|meta-llama/llama-stack !2025-03-2875866|Composable building blocks to build Llama Apps| | 618|sweepai/sweep !2025-03-287530-1|Sweep is an AI junior developer| | 619|lllyasviel/Omost !2025-03-2875301|Your image is almost there!| | 620|ahmedbahaaeldin/From-0-to-Research-Scientist-resources-guide !2025-03-2875050|Detailed and tailored guide for undergraduate students or anybody want to dig deep into the field of AI with solid foundation.| | 621|dair-ai/ML-Papers-Explained !2025-03-2875050|Explanation to key concepts in ML| | 622|zaidmukaddam/scira !2025-03-28750110|Scira (Formerly MiniPerplx) is a minimalistic AI-powered search engine that helps you find information on the internet. Powered by Vercel AI SDK! Search with models like Grok 2.0.| | 623|Portkey-AI/gateway !2025-03-28749416|A Blazing Fast AI Gateway. Route to 100+ LLMs with 1 fast & friendly API.| | 624|web-infra-dev/midscene !2025-03-28748729|An AI-powered automation SDK can control the page, perform assertions, and extract data in JSON format using natural language.| | 625|zilliztech/GPTCache !2025-03-2874801 |GPTCache is a library for creating semantic cache to store responses from LLM queries.| | 626|niedev/RTranslator !2025-03-2874742|RTranslator is the world's first open source real-time translation app.| |!green-up-arrow.svg 627|roboflow/notebooks !2025-03-2874666|Examples and tutorials on using SOTA computer vision models and techniques. Learn everything from old-school ResNet, through YOLO and object-detection transformers like DETR, to the latest models like Grounding DINO and SAM.| |!red-down-arrow 628|openlm-research/openllama !2025-03-2874652|OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset| | 629|LiheYoung/Depth-Anything !2025-03-2874155|Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data| | 630|enso-org/enso !2025-03-2874040|Hybrid visual and textual functional programming.| | 631|bigcode-project/starcoder !2025-03-287401-1 |Home of StarCoder: fine-tuning & inference!| | 632|git-ecosystem/git-credential-manager !2025-03-2873975|Secure, cross-platform Git credential storage with authentication to GitHub, Azure Repos, and other popular Git hosting services.| | 633|OpenGVLab/InternVL !2025-03-2873634|[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4V. 接近GPT-4V表现的可商用开源模型| | 634|WooooDyy/LLM-Agent-Paper-List !2025-03-2873551|The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.| | 635|lencx/Noi !2025-03-2873157|🦄 AI + Tools + Plugins + Community| | 636|udlbook/udlbook !2025-03-2873075|Understanding Deep Learning - Simon J.D. Prince| | 637|OpenBMB/MiniCPM !2025-03-2872841|MiniCPM-2B: An end-side LLM outperforms Llama2-13B.| | 638|jaywalnut310/vits !2025-03-2872815 |VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech| | 639|xorbitsai/inference !2025-03-28727528|Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop.| | 640|PWhiddy/PokemonRedExperiments !2025-03-2872492|Playing Pokemon Red with Reinforcement Learning| | 641|Canner/WrenAI !2025-03-28723213|🤖 Open-source AI Agent that empowers data-driven teams to chat with their data to generate Text-to-SQL, charts, spreadsheets, reports, and BI. 📈📊📋🧑‍💻| | 642|miurla/morphic !2025-03-2872258|An AI-powered answer engine with a generative UI| | 643|ml-explore/mlx-examples !2025-03-2872168|Examples in the MLX framework| | 644|PKU-YuanGroup/ChatLaw !2025-03-2872010|Chinese Legal Large Model| | 645|NVIDIA/cutlass !2025-03-2871883|CUDA Templates for Linear Algebra Subroutines| | 646|FoundationVision/VAR !2025-03-28717444|[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction"| | 647|ymcui/Chinese-LLaMA-Alpaca-2 !2025-03-2871561|Chinese LLaMA-2 & Alpaca-2 LLMs| | 648|nadermx/backgroundremover !2025-03-2871514 |Background Remover lets you Remove Background from images and video using AI with a simple command line interface that is free and open source.| | 649|onuratakan/gpt-computer-assistant !2025-03-28714514|gpt-4o for windows, macos and ubuntu| | 650|graviraja/MLOps-Basics !2025-03-2871326|| | 651|Future-House/paper-qa !2025-03-287118-1|High accuracy RAG for answering questions from scientific documents with citations| | 652|open-mmlab/mmagic !2025-03-2871102 |OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox| | 653|bhaskatripathi/pdfGPT !2025-03-2870941 |PDF GPT allows you to chat with the contents of your PDF file by using GPT capabilities. The only open source solution to turn your pdf files in a chatbot!| | 654|ollama/ollama-python !2025-03-28709117|Ollama Python library| | 655|facebookresearch/DiT !2025-03-2870376|Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"| | 656|geekyutao/Inpaint-Anything !2025-03-2870262 |Inpaint anything using Segment Anything and inpainting models.| | 657|AbdullahAlfaraj/Auto-Photoshop-StableDiffusion-Plugin !2025-03-2870160 |A user-friendly plug-in that makes it easy to generate stable diffusion images inside Photoshop using Automatic1111-sd-webui as a backend.| | 658|apple/corenet !2025-03-2869990|CoreNet: A library for training deep neural networks| | 659|openstatusHQ/openstatus !2025-03-2869926|🏓 The open-source synthetic monitoring platform 🏓| | 660|weaviate/Verba !2025-03-2869772|Retrieval Augmented Generation (RAG) chatbot powered by Weaviate| | 661|meshery/meshery !2025-03-2869630|Meshery, the cloud native manager| | 662|OpenTalker/video-retalking !2025-03-2869530|[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild| | 663|digitalinnovationone/dio-lab-open-source !2025-03-28689013|Repositório do lab "Contribuindo em um Projeto Open Source no GitHub" da Digital Innovation One.| | 664|jianchang512/ChatTTS-ui !2025-03-2868842|一个简单的本地网页界面,直接使用ChatTTS将文字合成为语音,同时支持对外提供API接口。| | 665|patchy631/ai-engineering-hub !2025-03-28686434|In-depth tutorials on LLMs, RAGs and real-world AI agent applications.| | 666|gunnarmorling/1brc !2025-03-2868512|1️⃣🐝🏎️ The One Billion Row Challenge -- A fun exploration of how quickly 1B rows from a text file can be aggregated with Java| | 667|Azure-Samples/azure-search-openai-demo !2025-03-2868482 |A sample app for the Retrieval-Augmented Generation pattern running in Azure, using Azure Cognitive Search for retrieval and Azure OpenAI large language models to power ChatGPT-style and Q&A experiences.| | 668|mit-han-lab/streaming-llm !2025-03-2868382|Efficient Streaming Language Models with Attention Sinks| | 669|InternLM/InternLM !2025-03-2868352|InternLM has open-sourced a 7 billion parameter base model, a chat model tailored for practical scenarios and the training system.| | 670|dependency-check/DependencyCheck !2025-03-2868191|OWASP dependency-check is a software composition analysis utility that detects publicly disclosed vulnerabilities in application dependencies.| | 671|Soulter/AstrBot !2025-03-28678643|✨易上手的多平台 LLM 聊天机器人及开发框架✨。支持 QQ、QQ频道、Telegram、微信平台(Gewechat, 企业微信)、内置 Web Chat,OpenAI GPT、DeepSeek、Ollama、Llama、GLM、Gemini、OneAPI、LLMTuner,支持 LLM Agent 插件开发,可视化面板。一键部署。支持 Dify 工作流、代码执行器、Whisper 语音转文字。| | 672|react-native-webview/react-native-webview !2025-03-2867792|React Native Cross-Platform WebView| | 673|modelscope/agentscope !2025-03-28676916|Start building LLM-empowered multi-agent applications in an easier way.| | 674|mylxsw/aidea !2025-03-2867381|AIdea is a versatile app that supports GPT and domestic large language models,also supports "Stable Diffusion" text-to-image generation, image-to-image generation, SDXL 1.0, super-resolution, and image colorization| | 675|langchain-ai/ollama-deep-researcher !2025-03-28668635|Fully local web research and report writing assistant| | 676|threestudio-project/threestudio !2025-03-2866653|A unified framework for 3D content generation.| | 677|gaomingqi/Track-Anything !2025-03-2866631 |A flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.| | 678|spdustin/ChatGPT-AutoExpert !2025-03-2866570|🚀🧠💬 Supercharged Custom Instructions for ChatGPT (non-coding) and ChatGPT Advanced Data Analysis (coding).| | 679|HariSekhon/DevOps-Bash-tools !2025-03-2866463|1000+ DevOps Bash Scripts - AWS, GCP, Kubernetes, Docker, CI/CD, APIs, SQL, PostgreSQL, MySQL, Hive, Impala, Kafka, Hadoop, Jenkins, GitHub, GitLab, BitBucket, Azure DevOps, TeamCity, Spotify, MP3, LDAP, Code/Build Linting, pkg mgmt for Linux, Mac, Python, Perl, Ruby, NodeJS, Golang, Advanced dotfiles: .bashrc, .vimrc, .gitconfig, .screenrc, tmux..| | 680|modelscope/swift !2025-03-28661530|ms-swift: Use PEFT or Full-parameter to finetune 200+ LLMs or 15+ MLLMs| | 681|langchain-ai/opengpts !2025-03-2866080|This is an open source effort to create a similar experience to OpenAI's GPTs and Assistants API| | 682| yihong0618/xiaogpt !2025-03-2865131 | Play ChatGPT with xiaomi ai speaker | | 683| civitai/civitai !2025-03-2865111 | Build a platform where people can share their stable diffusion models | | 684|KoljaB/RealtimeSTT !2025-03-28649513|A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.| | 685|qunash/chatgpt-advanced !2025-03-2864910 | A browser extension that augments your ChatGPT prompts with web results.| | 686|Licoy/ChatGPT-Midjourney !2025-03-2864850|🎨 Own your own ChatGPT+Midjourney web service with one click| | 687|friuns2/BlackFriday-GPTs-Prompts !2025-03-2864744|List of free GPTs that doesn't require plus subscription| | 688|PixarAnimationStudios/OpenUSD !2025-03-2864700|Universal Scene Description| | 689|linyiLYi/street-fighter-ai !2025-03-2864630 |This is an AI agent for Street Fighter II Champion Edition.| | 690|run-llama/rags !2025-03-2864380|Build ChatGPT over your data, all with natural language| | 691|frdel/agent-zero !2025-03-2864154|Agent Zero AI framework| | 692|microsoft/DeepSpeedExamples !2025-03-2863911 |Example models using DeepSpeed| | 693|k8sgpt-ai/k8sgpt !2025-03-2863882|Giving Kubernetes Superpowers to everyone| | 694|open-metadata/OpenMetadata !2025-03-2863514|OpenMetadata is a unified platform for discovery, observability, and governance powered by a central metadata repository, in-depth lineage, and seamless team collaboration.| | 695|google/gemma.cpp !2025-03-2863163|lightweight, standalone C++ inference engine for Google's Gemma models.| | 696|RayVentura/ShortGPT !2025-03-286314-1|🚀🎬 ShortGPT - An experimental AI framework for automated short/video content creation. Enables creators to rapidly produce, manage, and deliver content using AI and automation.| | 697|openai/consistencymodels !2025-03-2862940 |Official repo for consistency models.| | 698|yangjianxin1/Firefly !2025-03-2862924|Firefly: Chinese conversational large language model (full-scale fine-tuning + QLoRA), supporting fine-tuning of Llma2, Llama, Baichuan, InternLM, Ziya, Bloom, and other large models| | 699|enricoros/big-AGI !2025-03-2862665|Generative AI suite powered by state-of-the-art models and providing advanced AI/AGI functions. It features AI personas, AGI functions, multi-model chats, text-to-image, voice, response streaming, code highlighting and execution, PDF import, presets for developers, much more. Deploy on-prem or in the cloud.| | 700|aptos-labs/aptos-core !2025-03-2862633|Aptos is a layer 1 blockchain built to support the widespread use of blockchain through better technology and user experience.| | 701|wenda-LLM/wenda !2025-03-286262-1 |Wenda: An LLM invocation platform. Its objective is to achieve efficient content generation tailored to specific environments while considering the limited computing resources of individuals and small businesses, as well as knowledge security and privacy concerns| | 702|Project-MONAI/MONAI !2025-03-2862603|AI Toolkit for Healthcare Imaging| | 703|HVision-NKU/StoryDiffusion !2025-03-2862470|Create Magic Story!| | 704|deepseek-ai/DeepSeek-LLM !2025-03-2862463|DeepSeek LLM: Let there be answers| | 705|Tohrusky/Final2x !2025-03-2862393|2^x Image Super-Resolution| | 706|OpenSPG/KAG !2025-03-28619611|KAG is a logical form-guided reasoning and retrieval framework based on OpenSPG engine and LLMs. It is used to build logical reasoning and factual Q&A solutions for professional domain knowledge bases. It can effectively overcome the shortcomings of the traditional RAG vector similarity calculation model.| | 707|Moonvy/OpenPromptStudio !2025-03-2861861 |AIGC Hint Word Visualization Editor| | 708|levihsu/OOTDiffusion !2025-03-2861761|Official implementation of OOTDiffusion| | 709|tmc/langchaingo !2025-03-2861729|LangChain for Go, the easiest way to write LLM-based programs in Go| | 710|vladmandic/automatic !2025-03-2861374|SD.Next: Advanced Implementation of Stable Diffusion and other Diffusion-based generative image models| | 711|clovaai/donut !2025-03-2861231 |Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022| | 712|Shaunwei/RealChar !2025-03-286121-1|🎙️🤖Create, Customize and Talk to your AI Character/Companion in Realtime(All in One Codebase!). Have a natural seamless conversation with AI everywhere(mobile, web and terminal) using LLM OpenAI GPT3.5/4, Anthropic Claude2, Chroma Vector DB, Whisper Speech2Text, ElevenLabs Text2Speech🎙️🤖| | 713|microsoft/TinyTroupe !2025-03-2861142|LLM-powered multiagent persona simulation for imagination enhancement and business insights.| | 714| rustformers/llm !2025-03-2861010 | Run inference for Large Language Models on CPU, with Rust| | 715|firebase/firebase-ios-sdk !2025-03-2860950|Firebase SDK for Apple App Development| | 716|vespa-engine/vespa !2025-03-2860824|The open big data serving engine. https://vespa.ai| | 717|n4ze3m/page-assist !2025-03-28607610|Use your locally running AI models to assist you in your web browsing| | 718|Dooy/chatgpt-web-midjourney-proxy !2025-03-2860646|chatgpt web, midjourney, gpts,tts, whisper 一套ui全搞定| | 719|ethereum-optimism/optimism !2025-03-2860213|Optimism is Ethereum, scaled.| | 720|sczhou/ProPainter !2025-03-2859971|[ICCV 2023] ProPainter: Improving Propagation and Transformer for Video Inpainting| | 721|MineDojo/Voyager !2025-03-2859951 |An Open-Ended Embodied Agent with Large Language Models| | 722|lavague-ai/LaVague !2025-03-2859800|Automate automation with Large Action Model framework| | 723|SevaSk/ecoute !2025-03-2859770 |Ecoute is a live transcription tool that provides real-time transcripts for both the user's microphone input (You) and the user's speakers output (Speaker) in a textbox. It also generates a suggested response using OpenAI's GPT-3.5 for the user to say based on the live transcription of the conversation.| | 724|google/mesop !2025-03-2859661|| | 725|pengxiao-song/LaWGPT !2025-03-2859542 |Repo for LaWGPT, Chinese-Llama tuned with Chinese Legal knowledge| | 726|fr0gger/Awesome-GPT-Agents !2025-03-2859434|A curated list of GPT agents for cybersecurity| | 727|google-deepmind/graphcast !2025-03-2859412|| | 728|comet-ml/opik !2025-03-28594126|Open-source end-to-end LLM Development Platform| | 729|SciPhi-AI/R2R !2025-03-28594033|A framework for rapid development and deployment of production-ready RAG systems| | 730|SkalskiP/courses !2025-03-2859272 |This repository is a curated collection of links to various courses and resources about Artificial Intelligence (AI)| | 731|QuivrHQ/MegaParse !2025-03-2859122|File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.| | 732|pytorch-labs/gpt-fast !2025-03-2858971|Simple and efficient pytorch-native transformer text generation in !2025-03-2858886|Curated list of chatgpt prompts from the top-rated GPTs in the GPTs Store. Prompt Engineering, prompt attack & prompt protect. Advanced Prompt Engineering papers.| | 734|nilsherzig/LLocalSearch !2025-03-2858852|LLocalSearch is a completely locally running search aggregator using LLM Agents. The user can ask a question and the system will use a chain of LLMs to find the answer. The user can see the progress of the agents and the final answer. No OpenAI or Google API keys are needed.| | 735|kuafuai/DevOpsGPT !2025-03-285874-2|Multi agent system for AI-driven software development. Convert natural language requirements into working software. Supports any development language and extends the existing base code.| | 736|myshell-ai/MeloTTS !2025-03-2858486|High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.| | 737|OpenGVLab/LLaMA-Adapter !2025-03-2858421 |Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters| | 738|volcengine/verl !2025-03-28582563|veRL: Volcano Engine Reinforcement Learning for LLM| | 739|a16z-infra/companion-app !2025-03-2858171|AI companions with memory: a lightweight stack to create and host your own AI companions| | 740|HumanAIGC/OutfitAnyone !2025-03-285816-1|Outfit Anyone: Ultra-high quality virtual try-on for Any Clothing and Any Person| | 741|josStorer/RWKV-Runner !2025-03-2857472|A RWKV management and startup tool, full automation, only 8MB. And provides an interface compatible with the OpenAI API. RWKV is a large language model that is fully open source and available for commercial use.| | 742|648540858/wvp-GB28181-pro !2025-03-2857414|WEB VIDEO PLATFORM是一个基于GB28181-2016标准实现的网络视频平台,支持NAT穿透,支持海康、大华、宇视等品牌的IPC、NVR、DVR接入。支持国标级联,支持rtsp/rtmp等视频流转发到国标平台,支持rtsp/rtmp等推流转发到国标平台。| | 743|ToonCrafter/ToonCrafter !2025-03-2857345|a research paper for generative cartoon interpolation| | 744|PawanOsman/ChatGPT !2025-03-2857191|OpenAI API Free Reverse Proxy| | 745|apache/hudi !2025-03-2857091|Upserts, Deletes And Incremental Processing on Big Data.| | 746| nsarrazin/serge !2025-03-2857081 | A web interface for chatting with Alpaca through llama.cpp. Fully dockerized, with an easy to use API| | 747|homanp/superagent !2025-03-2857021|🥷 Superagent - Build, deploy, and manage LLM-powered agents| | 748|ramonvc/freegpt-webui !2025-03-2856910|GPT 3.5/4 with a Chat Web UI. No API key is required.| | 749|baichuan-inc/baichuan-7B !2025-03-2856901|A large-scale 7B pretraining language model developed by BaiChuan-Inc.| | 750|Azure/azure-sdk-for-net !2025-03-2856792|This repository is for active development of the Azure SDK for .NET. For consumers of the SDK we recommend visiting our public developer docs at https://learn.microsoft.com/dotnet/azure/ or our versioned developer docs at https://azure.github.io/azure-sdk-for-net.| | 751|mnotgod96/AppAgent !2025-03-2856643|AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.| | 752|microsoft/TaskWeaver !2025-03-2856243|A code-first agent framework for seamlessly planning and executing data analytics tasks.| | 753| yetone/bob-plugin-openai-translator !2025-03-285600-1 | A Bob Plugin base ChatGPT API | | 754|PrefectHQ/marvin !2025-03-2855840 |A batteries-included library for building AI-powered software| | 755|microsoft/promptbase !2025-03-2855832|All things prompt engineering| | 756|fullstackhero/dotnet-starter-kit !2025-03-2855560|Production Grade Cloud-Ready .NET 8 Starter Kit (Web API + Blazor Client) with Multitenancy Support, and Clean/Modular Architecture that saves roughly 200+ Development Hours! All Batteries Included.| | 757|deepseek-ai/DeepSeek-Coder-V2 !2025-03-2855435|DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence| | 758|aiwaves-cn/agents !2025-03-2855391|An Open-source Framework for Autonomous Language Agents| | 759|microsoft/Mastering-GitHub-Copilot-for-Paired-Programming !2025-03-2855158|A 6 Lesson course teaching everything you need to know about harnessing GitHub Copilot and an AI Paired Programing resource.| | 760|allenai/OLMo !2025-03-2854506|Modeling, training, eval, and inference code for OLMo| | 761|apify/crawlee-python !2025-03-2854493|Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with BeautifulSoup, Playwright, and raw HTTP. Both headful and headless mode. With proxy rotation.| | 762|k2-fsa/sherpa-onnx !2025-03-28541520|Speech-to-text, text-to-speech, and speaker recongition using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go, NodeJS, Java, Swift| | 763|TEN-framework/TEN-Agent !2025-03-28541411|TEN Agent is a realtime conversational AI agent powered by TEN. It seamlessly integrates the OpenAI Realtime API, RTC capabilities, and advanced features like weather updates, web search, computer vision, and Retrieval-Augmented Generation (RAG).| | 764|google/gemmapytorch !2025-03-2854010|The official PyTorch implementation of Google's Gemma models| | 765|snakers4/silero-vad !2025-03-2853858|Silero VAD: pre-trained enterprise-grade Voice Activity Detector| | 766|livekit/agents !2025-03-2853836|Build real-time multimodal AI applications 🤖🎙️📹| | 767|pipecat-ai/pipecat !2025-03-28537811|Open Source framework for voice and multimodal conversational AI| | 768|EricLBuehler/mistral.rs !2025-03-28536324|Blazingly fast LLM inference.| | 769|asg017/sqlite-vec !2025-03-28535810|Work-in-progress vector search SQLite extension that runs anywhere.| | 770|albertan017/LLM4Decompile !2025-03-2853563|Reverse Engineering: Decompiling Binary Code with Large Language Models| | 771|Permify/permify !2025-03-2853235|An open-source authorization as a service inspired by Google Zanzibar, designed to build and manage fine-grained and scalable authorization systems for any application.| | 772|imoneoi/openchat !2025-03-2853171|OpenChat: Advancing Open-source Language Models with Imperfect Data| | 773|mosaicml/composer !2025-03-2853140|Train neural networks up to 7x faster| | 774|dsdanielpark/Bard-API !2025-03-285277-1 |The python package that returns a response of Google Bard through API.| | 775|lxfater/inpaint-web !2025-03-2852552|A free and open-source inpainting & image-upscaling tool powered by webgpu and wasm on the browser。| | 776|leanprover/lean4 !2025-03-2852441|Lean 4 programming language and theorem prover| | 777|AILab-CVC/YOLO-World !2025-03-2852415|Real-Time Open-Vocabulary Object Detection| | 778|openchatai/OpenChat !2025-03-2852260 |Run and create custom ChatGPT-like bots with OpenChat, embed and share these bots anywhere, the open-source chatbot console.| | 779|mufeedvh/code2prompt !2025-03-28519414|A CLI tool to convert your codebase into a single LLM prompt with source tree, prompt templating, and token counting.| | 780|biobootloader/wolverine !2025-03-2851700 |Automatically repair python scripts through GPT-4 to give them regenerative abilities.| | 781|huggingface/parler-tts !2025-03-2851671|Inference and training library for high-quality TTS models.| | 782|Akegarasu/lora-scripts !2025-03-2851308 |LoRA training scripts use kohya-ss's trainer, for diffusion model.| | 783|openchatai/OpenCopilot !2025-03-285128-3|🤖 🔥 Let your users chat with your product features and execute things by text - open source Shopify sidekick| | 784|e2b-dev/fragments !2025-03-2851228|Open-source Next.js template for building apps that are fully generated by AI. By E2B.| | 785|microsoft/SynapseML !2025-03-2851132|Simple and Distributed Machine Learning| | 786|aigc-apps/sd-webui-EasyPhoto !2025-03-285108-1|📷 EasyPhoto | | 787|ChaoningZhang/MobileSAM !2025-03-2850944|This is the official code for Faster Segment Anything (MobileSAM) project that makes SAM lightweight| | 788|huggingface/alignment-handbook !2025-03-2850932|Robust recipes for to align language models with human and AI preferences| | 789|alpkeskin/mosint !2025-03-2850920|An automated e-mail OSINT tool| | 790|TaskingAI/TaskingAI !2025-03-2850891|The open source platform for AI-native application development.| | 791|lipku/metahuman-stream !2025-03-28507615|Real time interactive streaming digital human| | 792|OpenInterpreter/01 !2025-03-2850530|The open-source language model computer| | 793|open-compass/opencompass !2025-03-28505111|OpenCompass is an LLM evaluation platform, supporting a wide range of models (InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.| | 794|xxlong0/Wonder3D !2025-03-2850491|A cross-domain diffusion model for 3D reconstruction from a single image| | 795|pytorch/torchtune !2025-03-2850342|A Native-PyTorch Library for LLM Fine-tuning| | 796|SuperDuperDB/superduperdb !2025-03-2850192|🔮 SuperDuperDB: Bring AI to your database: Integrate, train and manage any AI models and APIs directly with your database and your data.| | 797|WhiskeySockets/Baileys !2025-03-2850057|Lightweight full-featured typescript/javascript WhatsApp Web API| | 798| mpociot/chatgpt-vscode !2025-03-2849890 | A VSCode extension that allows you to use ChatGPT | | 799|OpenGVLab/DragGAN !2025-03-2849880|Unofficial Implementation of DragGAN - "Drag Your GAN: Interactive Point-based Manipulation on the Generative Image Manifold" (DragGAN 全功能实现,在线Demo,本地部署试用,代码、模型已全部开源,支持Windows, macOS, Linux)| | 800|microsoft/LLMLingua !2025-03-2849824|To speed up LLMs' inference and enhance LLM's perceive of key information, compress the prompt and KV-Cache, which achieves up to 20x compression with minimal performance loss.| | 801|Zipstack/unstract !2025-03-2849745|No-code LLM Platform to launch APIs and ETL Pipelines to structure unstructured documents| | 802|OpenBMB/ToolBench !2025-03-2849621|An open platform for training, serving, and evaluating large language model for tool learning.| | 803|Fanghua-Yu/SUPIR !2025-03-2849593|SUPIR aims at developing Practical Algorithms for Photo-Realistic Image Restoration In the Wild| | 804|GaiaNet-AI/gaianet-node !2025-03-2849360|Install and run your own AI agent service| | 805|qodo-ai/qodo-cover !2025-03-284922-1|Qodo-Cover: An AI-Powered Tool for Automated Test Generation and Code Coverage Enhancement! 💻🤖🧪🐞| | 806|Zejun-Yang/AniPortrait !2025-03-2849042|AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation| | 807|lvwzhen/law-cn-ai !2025-03-2848901 |⚖️ AI Legal Assistant| | 808|developersdigest/llm-answer-engine !2025-03-2848740|Build a Perplexity-Inspired Answer Engine Using Next.js, Groq, Mixtral, Langchain, OpenAI, Brave & Serper| | 809|Plachtaa/VITS-fast-fine-tuning !2025-03-2848640|This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion| | 810|espeak-ng/espeak-ng !2025-03-2848601|eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.| | 811|ant-research/CoDeF !2025-03-2848581|[CVPR'24 Highlight] Official PyTorch implementation of CoDeF: Content Deformation Fields for Temporally Consistent Video Processing| | 812|deepseek-ai/DeepSeek-V2 !2025-03-2848512|| | 813|XRPLF/rippled !2025-03-2848210|Decentralized cryptocurrency blockchain daemon implementing the XRP Ledger protocol in C++| | 814|AutoMQ/automq !2025-03-28478721|AutoMQ is a cloud-first alternative to Kafka by decoupling durability to S3 and EBS. 10x cost-effective. Autoscale in seconds. Single-digit ms latency.| | 815|AILab-CVC/VideoCrafter !2025-03-2847800|VideoCrafter1: Open Diffusion Models for High-Quality Video Generation| | 816|nautechsystems/nautilustrader !2025-03-2847702|A high-performance algorithmic trading platform and event-driven backtester| | 817|kyegomez/swarms !2025-03-2847563|The Enterprise-Grade Production-Ready Multi-Agent Orchestration Framework Join our Community: https://discord.com/servers/agora-999382051935506503| | 818|Deci-AI/super-gradients !2025-03-2847310 |Easily train or fine-tune SOTA computer vision models with one open source training library. The home of Yolo-NAS.| | 819|QwenLM/Qwen2.5-Coder !2025-03-2847236|Qwen2.5-Coder is the code version of Qwen2.5, the large language model series developed by Qwen team, Alibaba Cloud.| | 820|SCIR-HI/Huatuo-Llama-Med-Chinese !2025-03-2847191 |Repo for HuaTuo (华驼), Llama-7B tuned with Chinese medical knowledge| | 821|togethercomputer/RedPajama-Data !2025-03-2846841 |code for preparing large datasets for training large language models| | 822|mishushakov/llm-scraper !2025-03-2846704|Turn any webpage into structured data using LLMs| | 823|1rgs/jsonformer !2025-03-2846663 |A Bulletproof Way to Generate Structured JSON from Language Models| | 824|anti-work/shortest !2025-03-2846565|QA via natural language AI tests| | 825|dnhkng/GlaDOS !2025-03-2846510|This is the Personality Core for GLaDOS, the first steps towards a real-life implementation of the AI from the Portal series by Valve.| | 826|Nukem9/dlssg-to-fsr3 !2025-03-2846380|Adds AMD FSR3 Frame Generation to games by replacing Nvidia DLSS-G Frame Generation (nvngx_dlssg).| | 827|BuilderIO/ai-shell !2025-03-2846373 |A CLI that converts natural language to shell commands.| | 828|facebookincubator/AITemplate !2025-03-2846220 |AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.| | 829|terraform-aws-modules/terraform-aws-eks !2025-03-2846030|Terraform module to create AWS Elastic Kubernetes (EKS) resources 🇺🇦| | 830|timescale/pgai !2025-03-2845915|A suite of tools to develop RAG, semantic search, and other AI applications more easily with PostgreSQL| | 831|awslabs/multi-agent-orchestrator !2025-03-2845788|Flexible and powerful framework for managing multiple AI agents and handling complex conversations| | 832|sanchit-gandhi/whisper-jax !2025-03-2845771 |Optimised JAX code for OpenAI's Whisper Model, largely built on the Hugging Face Transformers Whisper implementation| | 833|NVIDIA/NeMo-Guardrails !2025-03-2845755|NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.| | 834|PathOfBuildingCommunity/PathOfBuilding !2025-03-2845480|Offline build planner for Path of Exile.| | 835|UX-Decoder/Segment-Everything-Everywhere-All-At-Once !2025-03-2845412 |Official implementation of the paper "Segment Everything Everywhere All at Once"| | 836|build-trust/ockam !2025-03-2845171|Orchestrate end-to-end encryption, cryptographic identities, mutual authentication, and authorization policies between distributed applications – at massive scale.| | 837|google-research/timesfm !2025-03-2845135|TimesFM (Time Series Foundation Model) is a pretrained time-series foundation model developed by Google Research for time-series forecasting.| | 838|luosiallen/latent-consistency-model !2025-03-2844842|Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference| | 839|NVlabs/neuralangelo !2025-03-2844740|Official implementation of "Neuralangelo: High-Fidelity Neural Surface Reconstruction" (CVPR 2023)| | 840|kyegomez/tree-of-thoughts !2025-03-2844720 |Plug in and Play Implementation of Tree of Thoughts: Deliberate Problem Solving with Large Language Models that Elevates Model Reasoning by atleast 70%| | 841|sjvasquez/handwriting-synthesis !2025-03-2844720 |Handwriting Synthesis with RNNs ✏️| | 842| madawei2699/myGPTReader !2025-03-2844420 | A slack bot that can read any webpage, ebook or document and summarize it with chatGPT | | 843|OpenBMB/AgentVerse !2025-03-2844413|🤖 AgentVerse 🪐 provides a flexible framework that simplifies the process of building custom multi-agent environments for large language models (LLMs).| | 844|argmaxinc/WhisperKit !2025-03-2844395|Swift native speech recognition on-device for iOS and macOS applications.| | 845|landing-ai/vision-agent !2025-03-2844346|Vision agent| | 846|InternLM/xtuner !2025-03-2844273|An efficient, flexible and full-featured toolkit for fine-tuning large models (InternLM, Llama, Baichuan, Qwen, ChatGLM)| | 847|google-deepmind/alphageometry !2025-03-284421-1|Solving Olympiad Geometry without Human Demonstrations| | 848|ostris/ai-toolkit !2025-03-2844093|Various AI scripts. Mostly Stable Diffusion stuff.| | 849|LLM-Red-Team/kimi-free-api !2025-03-2844004|🚀 KIMI AI 长文本大模型白嫖服务,支持高速流式输出、联网搜索、长文档解读、图像解析、多轮对话,零配置部署,多路token支持,自动清理会话痕迹。| | 850|argilla-io/argilla !2025-03-2843991|Argilla is a collaboration platform for AI engineers and domain experts that require high-quality outputs, full data ownership, and overall efficiency.| | 851|spring-projects/spring-ai !2025-03-28438419|An Application Framework for AI Engineering| | 852|alibaba-damo-academy/FunClip !2025-03-2843555|Open-source, accurate and easy-to-use video clipping tool, LLM based AI clipping intergrated | | 853|yisol/IDM-VTON !2025-03-2843541|IDM-VTON : Improving Diffusion Models for Authentic Virtual Try-on in the Wild| | 854|fchollet/ARC-AGI !2025-03-2843368|The Abstraction and Reasoning Corpus| | 855|MahmoudAshraf97/whisper-diarization !2025-03-2843064|Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper| | 856|Speykious/cve-rs !2025-03-2843047|Blazingly 🔥 fast 🚀 memory vulnerabilities, written in 100% safe Rust. 🦀| | 857|Blealtan/efficient-kan !2025-03-2842770|An efficient pure-PyTorch implementation of Kolmogorov-Arnold Network (KAN).| | 858|smol-ai/GodMode !2025-03-284249-1|AI Chat Browser: Fast, Full webapp access to ChatGPT / Claude / Bard / Bing / Llama2! I use this 20 times a day.| | 859|openai/plugins-quickstart !2025-03-284235-4 |Get a ChatGPT plugin up and running in under 5 minutes!| | 860|Doriandarko/maestro !2025-03-2842260|A framework for Claude Opus to intelligently orchestrate subagents.| | 861|philz1337x/clarity-upscaler !2025-03-2842204|Clarity-Upscaler: Reimagined image upscaling for everyone| | 862|facebookresearch/co-tracker !2025-03-2842142|CoTracker is a model for tracking any point (pixel) on a video.| | 863|xlang-ai/OpenAgents !2025-03-2842031|OpenAgents: An Open Platform for Language Agents in the Wild| | 864|alibaba/higress !2025-03-28419514|🤖 AI Gateway | | 865|ray-project/llm-numbers !2025-03-2841920 |Numbers every LLM developer should know| | 866|fudan-generative-vision/champ !2025-03-2841820|Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance| | 867|NVIDIA/garak !2025-03-2841795|the LLM vulnerability scanner| | 868|leetcode-mafia/cheetah !2025-03-2841740 |Whisper & GPT-based app for passing remote SWE interviews| | 869|ragapp/ragapp !2025-03-2841710|The easiest way to use Agentic RAG in any enterprise| | 870|collabora/WhisperSpeech !2025-03-2841692|An Open Source text-to-speech system built by inverting Whisper.| | 871|Facico/Chinese-Vicuna !2025-03-2841520 |Chinese-Vicuna: A Chinese Instruction-following LLaMA-based Model| | 872|openai/grok !2025-03-2841381|| | 873|CrazyBoyM/llama3-Chinese-chat !2025-03-2841361|Llama3 Chinese Repository with modified versions, and training and deployment resources| | 874|luban-agi/Awesome-AIGC-Tutorials !2025-03-2841301|Curated tutorials and resources for Large Language Models, AI Painting, and more.| | 875|damo-vilab/AnyDoor !2025-03-2841192|Official implementations for paper: Anydoor: zero-shot object-level image customization| | 876|raspberrypi/pico-sdk !2025-03-2841072|| | 877|mshumer/gpt-llm-trainer !2025-03-284097-1|| | 878|metavoiceio/metavoice-src !2025-03-284076-1|AI for human-level speech intelligence| | 879|intelowlproject/IntelOwl !2025-03-2840763|IntelOwl: manage your Threat Intelligence at scale| | 880|a16z-infra/ai-getting-started !2025-03-2840682|A Javascript AI getting started stack for weekend projects, including image/text models, vector stores, auth, and deployment configs| | 881|MarkFzp/mobile-aloha !2025-03-2840641|Mobile ALOHA: Learning Bimanual Mobile Manipulation with Low-Cost Whole-Body Teleoperation| | 882| keijiro/AICommand !2025-03-2840380 | ChatGPT integration with Unity Editor | | 883|Tencent/HunyuanDiT !2025-03-2840214|Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding| | 884|hengyoush/kyanos !2025-03-2840061|Visualize the time packets spend in the kernel, watch & analyze in command line.| | 885|agiresearch/AIOS !2025-03-2840045|AIOS: LLM Agent Operating System| | 886|truefoundry/cognita !2025-03-2839773|RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry| | 887|X-PLUG/MobileAgent !2025-03-2839557|Mobile-Agent: Autonomous Multi-Modal Mobile Device Agent with Visual Perception| | 888|jackMort/ChatGPT.nvim !2025-03-2839231|ChatGPT Neovim Plugin: Effortless Natural Language Generation with OpenAI's ChatGPT API| | 889|microsoft/RD-Agent !2025-03-28388422|Research and development (R&D) is crucial for the enhancement of industrial productivity, especially in the AI era, where the core aspects of R&D are mainly focused on data and models. We are committed to automate these high-value generic R&D processes through our open source R&D automation tool RD-Agent, which let AI drive data-driven AI.| | 890|Significant-Gravitas/Auto-GPT-Plugins !2025-03-283882-1 |Plugins for Auto-GPT| | 891|apple/ml-mgie !2025-03-2838770|| | 892|OpenDriveLab/UniAD !2025-03-2838727|[CVPR 2023 Best Paper] Planning-oriented Autonomous Driving| | 893|llSourcell/DoctorGPT !2025-03-2838640|DoctorGPT is an LLM that can pass the US Medical Licensing Exam. It works offline, it's cross-platform, & your health data stays private.| | 894|FlagAI-Open/FlagAI !2025-03-2838601|FlagAI (Fast LArge-scale General AI models) is a fast, easy-to-use and extensible toolkit for large-scale model.| | 895|krishnaik06/Roadmap-To-Learn-Generative-AI-In-2024 !2025-03-2838513|Roadmap To Learn Generative AI In 2024| | 896|SysCV/sam-hq !2025-03-2838491|Segment Anything in High Quality| | 897|google/security-research !2025-03-2838420|This project hosts security advisories and their accompanying proof-of-concepts related to research conducted at Google which impact non-Google owned code.| | 898|shroominic/codeinterpreter-api !2025-03-2838330|Open source implementation of the ChatGPT Code Interpreter 👾| | 899|Yonom/assistant-ui !2025-03-2838308|React Components for AI Chat 💬 🚀| | 900|nucleuscloud/neosync !2025-03-2838262|Open source data anonymization and synthetic data orchestration for developers. Create high fidelity synthetic data and sync it across your environments.| | 901|ravenscroftj/turbopilot !2025-03-2838230 |Turbopilot is an open source large-language-model based code completion engine that runs locally on CPU| | 902|NVlabs/Sana !2025-03-28380810|SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer| | 903|huggingface/distil-whisper !2025-03-2838061|Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.| | 904|Codium-ai/AlphaCodium !2025-03-2837971|code generation tool that surpasses most human competitors in CodeContests| | 905|fixie-ai/ultravox !2025-03-2837710|A fast multimodal LLM for real-time voice| | 906|unit-mesh/auto-dev !2025-03-28375715|🧙‍AutoDev: The AI-powered coding wizard with multilingual support 🌐, auto code generation 🏗️, and a helpful bug-slaying assistant 🐞! Customizable prompts 🎨 and a magic Auto Dev/Testing/Document/Agent feature 🧪 included! 🚀| | 907|Marker-Inc-Korea/AutoRAG !2025-03-2837432|AutoML tool for RAG| | 908|deepseek-ai/DeepSeek-VL !2025-03-283734-1|DeepSeek-VL: Towards Real-World Vision-Language Understanding| | 909|hiyouga/ChatGLM-Efficient-Tuning !2025-03-283692-1|Fine-tuning ChatGLM-6B with PEFT | | 910| Yue-Yang/ChatGPT-Siri !2025-03-2836921 | Shortcuts for Siri using ChatGPT API gpt-3.5-turbo model | | 911|0hq/WebGPT !2025-03-2836901 |Run GPT model on the browser with WebGPU. An implementation of GPT inference in less than ~2000 lines of vanilla Javascript.| | 912|cvg/LightGlue !2025-03-2836903|LightGlue: Local Feature Matching at Light Speed (ICCV 2023)| | 913|deanxv/coze-discord-proxy !2025-03-2836791|代理Discord-Bot对话Coze-Bot,实现API形式请求GPT4对话模型/微调模型| | 914|MervinPraison/PraisonAI !2025-03-2836764|PraisonAI application combines AutoGen and CrewAI or similar frameworks into a low-code solution for building and managing multi-agent LLM systems, focusing on simplicity, customisation, and efficient human-agent collaboration.| | 915|Ironclad/rivet !2025-03-2836345 |The open-source visual AI programming environment and TypeScript library| | 916|BasedHardware/OpenGlass !2025-03-2835851|Turn any glasses into AI-powered smart glasses| | 917|ricklamers/gpt-code-ui !2025-03-2835840 |An open source implementation of OpenAI's ChatGPT Code interpreter| | 918|whoiskatrin/chart-gpt !2025-03-2835830 |AI tool to build charts based on text input| | 919|github/CopilotForXcode !2025-03-2835788|Xcode extension for GitHub Copilot| | 920|hemansnation/God-Level-Data-Science-ML-Full-Stack !2025-03-2835570 |A collection of scientific methods, processes, algorithms, and systems to build stories & models. This roadmap contains 16 Chapters, whether you are a fresher in the field or an experienced professional who wants to transition into Data Science & AI| | 921|pytorch/torchchat !2025-03-2835461|Run PyTorch LLMs locally on servers, desktop and mobile| | 922| Kent0n-Li/ChatDoctor !2025-03-2835451 | A Medical Chat Model Fine-tuned on LLaMA Model using Medical Domain Knowledge | | 923|xtekky/chatgpt-clone !2025-03-283519-1 |ChatGPT interface with better UI| | 924|jupyterlab/jupyter-ai !2025-03-2835120|A generative AI extension for JupyterLab| | 925|pytorch/torchtitan !2025-03-2835064|A native PyTorch Library for large model training| | 926|minimaxir/simpleaichat !2025-03-2835031|Python package for easily interfacing with chat apps, with robust features and minimal code complexity.| | 927|srush/Tensor-Puzzles !2025-03-2834930|Solve puzzles. Improve your pytorch.| | 928|Helicone/helicone !2025-03-2834918|🧊 Open source LLM-Observability Platform for Developers. One-line integration for monitoring, metrics, evals, agent tracing, prompt management, playground, etc. Supports OpenAI SDK, Vercel AI SDK, Anthropic SDK, LiteLLM, LLamaIndex, LangChain, and more. 🍓 YC W23| | 929|run-llama/llama-hub !2025-03-2834740|A library of data loaders for LLMs made by the community -- to be used with LlamaIndex and/or LangChain| | 930|NExT-GPT/NExT-GPT !2025-03-2834700|Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model| | 931|souzatharsis/podcastfy !2025-03-2834661|An Open Source Python alternative to NotebookLM's podcast feature: Transforming Multimodal Content into Captivating Multilingual Audio Conversations with GenAI| | 932|Dataherald/dataherald !2025-03-2834450|Interact with your SQL database, Natural Language to SQL using LLMs| | 933|iryna-kondr/scikit-llm !2025-03-2834350 |Seamlessly integrate powerful language models like ChatGPT into scikit-learn for enhanced text analysis tasks.| | 934|Netflix/maestro !2025-03-2834230|Maestro: Netflix’s Workflow Orchestrator| | 935|CanadaHonk/porffor !2025-03-2833560|A from-scratch experimental AOT JS engine, written in JS| | 936|hustvl/Vim !2025-03-2833323|Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model| | 937|pashpashpash/vault-ai !2025-03-2833250 |OP Vault ChatGPT: Give ChatGPT long-term memory using the OP Stack (OpenAI + Pinecone Vector Database). Upload your own custom knowledge base files (PDF, txt, etc) using a simple React frontend.| | 938|tencentmusic/supersonic !2025-03-28330611|SuperSonic is the next-generation BI platform that integrates Chat BI (powered by LLM) and Headless BI (powered by semantic layer) paradigms.| | 939|billmei/every-chatgpt-gui !2025-03-2832981|Every front-end GUI client for ChatGPT| | 940|microsoft/torchgeo !2025-03-2832772|TorchGeo: datasets, samplers, transforms, and pre-trained models for geospatial data| | 941|LLMBook-zh/LLMBook-zh.github.io !2025-03-28326110|《大语言模型》作者:赵鑫,李军毅,周昆,唐天一,文继荣| | 942|dvlab-research/MiniGemini !2025-03-2832601|Official implementation for Mini-Gemini| | 943|rashadphz/farfalle !2025-03-2832460|🔍 AI search engine - self-host with local or cloud LLMs| | 944|Luodian/Otter !2025-03-2832450|🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.| | 945|AprilNEA/ChatGPT-Admin-Web !2025-03-2832370 | ChatGPT WebUI with user management and admin dashboard system| | 946|MarkFzp/act-plus-plus !2025-03-2832365|Imitation Learning algorithms with Co-traing for Mobile ALOHA: ACT, Diffusion Policy, VINN| | 947|ethen8181/machine-learning !2025-03-2832310|🌎 machine learning tutorials (mainly in Python3)| | 948|opengeos/segment-geospatial !2025-03-2832312 |A Python package for segmenting geospatial data with the Segment Anything Model (SAM)| | 949|iusztinpaul/hands-on-llms !2025-03-283225-2|🦖 𝗟𝗲𝗮𝗿𝗻 about 𝗟𝗟𝗠𝘀, 𝗟𝗟𝗠𝗢𝗽𝘀, and 𝘃𝗲𝗰𝘁𝗼𝗿 𝗗𝗕𝘀 for free by designing, training, and deploying a real-time financial advisor LLM system ~ 𝘴𝘰𝘶𝘳𝘤𝘦 𝘤𝘰𝘥𝘦 + 𝘷𝘪𝘥𝘦𝘰 & 𝘳𝘦𝘢𝘥𝘪𝘯𝘨 𝘮𝘢𝘵𝘦𝘳𝘪𝘢𝘭𝘴| | 950|ToTheBeginning/PuLID !2025-03-2832221|Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment| | 951|neo4j-labs/llm-graph-builder !2025-03-2832164|Neo4j graph construction from unstructured data using LLMs| | 952|OpenGVLab/InternGPT !2025-03-2832150 |InternGPT (iGPT) is an open source demo platform where you can easily showcase your AI models. Now it supports DragGAN, ChatGPT, ImageBind, multimodal chat like GPT-4, SAM, interactive image editing, etc. Try it at igpt.opengvlab.com (支持DragGAN、ChatGPT、ImageBind、SAM的在线Demo系统)| | 953|PKU-YuanGroup/Video-LLaVA !2025-03-2832060 |Video-LLaVA: Learning United Visual Representation by Alignment Before Projection| | 954|DataTalksClub/llm-zoomcamp !2025-03-2832030|LLM Zoomcamp - a free online course about building an AI bot that can answer questions about your knowledge base| | 955|gptscript-ai/gptscript !2025-03-2832010|Natural Language Programming| |!green-up-arrow.svg 956|isaac-sim/IsaacLab !2025-03-28320113|Unified framework for robot learning built on NVIDIA Isaac Sim| |!red-down-arrow 957|ai-boost/Awesome-GPTs !2025-03-2832003|Curated list of awesome GPTs 👍.| | 958|huggingface/safetensors !2025-03-2831901|Simple, safe way to store and distribute tensors| | 959|linyiLYi/bilibot !2025-03-2831771|A local chatbot fine-tuned by bilibili user comments.| | 960| project-baize/baize-chatbot !2025-03-283168-1 | Let ChatGPT teach your own chatbot in hours with a single GPU! | | 961|Azure-Samples/cognitive-services-speech-sdk !2025-03-2831280|Sample code for the Microsoft Cognitive Services Speech SDK| | 962|microsoft/Phi-3CookBook !2025-03-2831231|This is a Phi-3 book for getting started with Phi-3. Phi-3, a family of open AI models developed by Microsoft. Phi-3 models are the most capable and cost-effective small language models (SLMs) available, outperforming models of the same size and next size up across a variety of language, reasoning, coding, and math benchmarks.| | 963|neuralmagic/deepsparse !2025-03-2831180|Sparsity-aware deep learning inference runtime for CPUs| | 964|sugarforever/chat-ollama !2025-03-2831000|ChatOllama is an open source chatbot based on LLMs. It supports a wide range of language models, and knowledge base management.| | 965|amazon-science/chronos-forecasting !2025-03-2830974|Chronos: Pretrained (Language) Models for Probabilistic Time Series Forecasting| | 966|damo-vilab/i2vgen-xl !2025-03-2830902|Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models| | 967|google-deepmind/gemma !2025-03-2830733|Open weights LLM from Google DeepMind.| | 968|iree-org/iree !2025-03-2830733|A retargetable MLIR-based machine learning compiler and runtime toolkit.| | 969|NVlabs/VILA !2025-03-2830724|VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)| | 970|microsoft/torchscale !2025-03-2830661|Foundation Architecture for (M)LLMs| | 971|openai/openai-realtime-console !2025-03-2830656|React app for inspecting, building and debugging with the Realtime API| | 972|daveshap/OpenAIAgentSwarm !2025-03-2830610|HAAS = Hierarchical Autonomous Agent Swarm - "Resistance is futile!"| | 973|microsoft/PromptWizard !2025-03-2830555|Task-Aware Agent-driven Prompt Optimization Framework| | 974|CVI-SZU/Linly !2025-03-2830490 |Chinese-LLaMA basic model; ChatFlow Chinese conversation model; NLP pre-training/command fine-tuning dataset| | 975|cohere-ai/cohere-toolkit !2025-03-2830130|Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.| | 976|adamcohenhillel/ADeus !2025-03-2830131|An open source AI wearable device that captures what you say and hear in the real world and then transcribes and stores it on your own server. You can then chat with Adeus using the app, and it will have all the right context about what you want to talk about - a truly personalized, personal AI.| | 977|Lightning-AI/LitServe !2025-03-2830132|Lightning-fast serving engine for AI models. Flexible. Easy. Enterprise-scale.| | 978|potpie-ai/potpie !2025-03-2829973|Prompt-To-Agent : Create custom engineering agents for your codebase| | 979|ant-design/x !2025-03-28299529|Craft AI-driven interfaces effortlessly 🤖| | 980|meta-llama/PurpleLlama !2025-03-2829832|Set of tools to assess and improve LLM security.| | 981|williamyang1991/RerenderAVideo !2025-03-2829800|[SIGGRAPH Asia 2023] Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation| | 982|baichuan-inc/Baichuan-13B !2025-03-2829790|A 13B large language model developed by Baichuan Intelligent Technology| | 983|Stability-AI/stable-audio-tools !2025-03-2829761|Generative models for conditional audio generation| | 984|li-plus/chatglm.cpp !2025-03-2829720|C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & more LLMs| | 985|NVIDIA/GenerativeAIExamples !2025-03-2829546|Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.| | 986|Josh-XT/AGiXT !2025-03-2829521 |AGiXT is a dynamic AI Automation Platform that seamlessly orchestrates instruction management and complex task execution across diverse AI providers. Combining adaptive memory, smart features, and a versatile plugin system, AGiXT delivers efficient and comprehensive AI solutions.| | 987|MrForExample/ComfyUI-3D-Pack !2025-03-2829515|An extensive node suite that enables ComfyUI to process 3D inputs (Mesh & UV Texture, etc) using cutting edge algorithms (3DGS, NeRF, etc.)| | 988|olimorris/codecompanion.nvim !2025-03-28295111|✨ AI-powered coding, seamlessly in Neovim. Supports Anthropic, Copilot, Gemini, Ollama, OpenAI and xAI LLMs| | 989|salesforce/CodeT5 !2025-03-282940-1 |Home of CodeT5: Open Code LLMs for Code Understanding and Generation| | 990|facebookresearch/ijepa !2025-03-2829391|Official codebase for I-JEPA, the Image-based Joint-Embedding Predictive Architecture. First outlined in the CVPR paper, "Self-supervised learning from images with a joint-embedding predictive architecture."| | 991|eureka-research/Eureka !2025-03-2829351|Official Repository for "Eureka: Human-Level Reward Design via Coding Large Language Models"| | 992|NVIDIA/trt-llm-rag-windows !2025-03-282934-1|A developer reference project for creating Retrieval Augmented Generation (RAG) chatbots on Windows using TensorRT-LLM| | 993|gmpetrov/databerry !2025-03-282930-1|The no-code platform for building custom LLM Agents| | 994|AI4Finance-Foundation/FinRobot !2025-03-28291946|FinRobot: An Open-Source AI Agent Platform for Financial Applications using LLMs 🚀 🚀 🚀| | 995|nus-apr/auto-code-rover !2025-03-2829013|A project structure aware autonomous software engineer aiming for autonomous program improvement| | 996|deepseek-ai/DreamCraft3D !2025-03-2828921|[ICLR 2024] Official implementation of DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior| | 997|mlabonne/llm-datasets !2025-03-2828848|High-quality datasets, tools, and concepts for LLM fine-tuning.| | 998|facebookresearch/jepa !2025-03-2828712|PyTorch code and models for V-JEPA self-supervised learning from video.| | 999|facebookresearch/habitat-sim !2025-03-2828604|A flexible, high-performance 3D simulator for Embodied AI research.| | 1000|xenova/whisper-web !2025-03-2828581|ML-powered speech recognition directly in your browser| | 1001|cvlab-columbia/zero123 !2025-03-2828530|Zero-1-to-3: Zero-shot One Image to 3D Object: https://zero123.cs.columbia.edu/| | 1002|yuruotong1/autoMate !2025-03-28285121|Like Manus, Computer Use Agent(CUA) and Omniparser, we are computer-using agents.AI-driven local automation assistant that uses natural language to make computers work by themselves| | 1003|muellerberndt/mini-agi !2025-03-282845-1 |A minimal generic autonomous agent based on GPT3.5/4. Can analyze stock prices, perform network security tests, create art, and order pizza.| | 1004|allenai/open-instruct !2025-03-2828432|| | 1005|CodingChallengesFYI/SharedSolutions !2025-03-2828360|Publicly shared solutions to Coding Challenges| | 1006|hegelai/prompttools !2025-03-2828220|Open-source tools for prompt testing and experimentation, with support for both LLMs (e.g. OpenAI, LLaMA) and vector databases (e.g. Chroma, Weaviate).| | 1007|mazzzystar/Queryable !2025-03-2828222|Run CLIP on iPhone to Search Photos.| | 1008|Doubiiu/DynamiCrafter !2025-03-2828173|DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors| | 1009|SamurAIGPT/privateGPT !2025-03-282805-1 |An app to interact privately with your documents using the power of GPT, 100% privately, no data leaks| | 1010|facebookresearch/Pearl !2025-03-2827951|A Production-ready Reinforcement Learning AI Agent Library brought by the Applied Reinforcement Learning team at Meta.| | 1011|intuitem/ciso-assistant-community !2025-03-2827954|CISO Assistant is a one-stop-shop for GRC, covering Risk, AppSec and Audit Management and supporting +70 frameworks worldwide with auto-mapping: NIST CSF, ISO 27001, SOC2, CIS, PCI DSS, NIS2, CMMC, PSPF, GDPR, HIPAA, Essential Eight, NYDFS-500, DORA, NIST AI RMF, 800-53, 800-171, CyFun, CJIS, AirCyber, NCSC, ECC, SCF and so much more| | 1012|facebookresearch/audio2photoreal !2025-03-2827840|Code and dataset for photorealistic Codec Avatars driven from audio| | 1013|Azure/azure-rest-api-specs !2025-03-2827770|The source for REST API specifications for Microsoft Azure.| | 1014|SCUTlihaoyu/open-chat-video-editor !2025-03-2827690 |Open source short video automatic generation tool| | 1015|Alpha-VLLM/LLaMA2-Accessory !2025-03-2827642|An Open-source Toolkit for LLM Development| | 1016|johnma2006/mamba-minimal !2025-03-2827601|Simple, minimal implementation of the Mamba SSM in one file of PyTorch.| | 1017|nerfstudio-project/gsplat !2025-03-2827576|CUDA accelerated rasterization of gaussian splatting| | 1018|Physical-Intelligence/openpi !2025-03-28274617|| | 1019|leptonai/leptonai !2025-03-2827246|A Pythonic framework to simplify AI service building| |!green-up-arrow.svg 1020|joanrod/star-vector !2025-03-28271149|StarVector is a foundation model for SVG generation that transforms vectorization into a code generation task. Using a vision-language modeling architecture, StarVector processes both visual and textual inputs to produce high-quality SVG code with remarkable precision.| |!red-down-arrow 1021|jqnatividad/qsv !2025-03-2827092|CSVs sliced, diced & analyzed.| | 1022|FranxYao/chain-of-thought-hub !2025-03-2826991|Benchmarking large language models' complex reasoning ability with chain-of-thought prompting| | 1023|princeton-nlp/SWE-bench !2025-03-2826965|[ICLR 2024] SWE-Bench: Can Language Models Resolve Real-world Github Issues?| | 1024|elastic/otel-profiling-agent !2025-03-2826930|The production-scale datacenter profiler| | 1025|src-d/hercules !2025-03-2826900|Gaining advanced insights from Git repository history.| | 1026|lanqian528/chat2api !2025-03-2826695|A service that can convert ChatGPT on the web to OpenAI API format.| | 1027|ishan0102/vimGPT !2025-03-2826681|Browse the web with GPT-4V and Vimium| | 1028|TMElyralab/MuseV !2025-03-2826650|MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising| | 1029|georgia-tech-db/eva !2025-03-2826600 |AI-Relational Database System | | 1030|kubernetes-sigs/controller-runtime !2025-03-2826590|Repo for the controller-runtime subproject of kubebuilder (sig-apimachinery)| | 1031|gptlink/gptlink !2025-03-2826550 |Build your own free commercial ChatGPT environment in 10 minutes. The setup is simple and includes features such as user management, orders, tasks, and payments| | 1032|pytorch/executorch !2025-03-2826534|On-device AI across mobile, embedded and edge for PyTorch| | 1033|NVIDIA/nv-ingest !2025-03-2826290|NVIDIA Ingest is an early access set of microservices for parsing hundreds of thousands of complex, messy unstructured PDFs and other enterprise documents into metadata and text to embed into retrieval systems.| | 1034|SuperTux/supertux !2025-03-2826081|SuperTux source code| | 1035|abi/secret-llama !2025-03-2826050|Fully private LLM chatbot that runs entirely with a browser with no server needed. Supports Mistral and LLama 3.| | 1036|liou666/polyglot !2025-03-2825841 |Desktop AI Language Practice Application| | 1037|janhq/nitro !2025-03-2825821|A fast, lightweight, embeddable inference engine to supercharge your apps with local AI. OpenAI-compatible API| | 1038|deepseek-ai/DeepSeek-Math !2025-03-2825825|DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models| | 1039|anthropics/prompt-eng-interactive-tutorial !2025-03-2825781|Anthropic's Interactive Prompt Engineering Tutorial| | 1040|microsoft/promptbench !2025-03-2825741|A unified evaluation framework for large language models| | 1041|baaivision/Painter !2025-03-2825580 |Painter & SegGPT Series: Vision Foundation Models from BAAI| | 1042|OpenPipe/OpenPipe !2025-03-2825581|Turn expensive prompts into cheap fine-tuned models| | 1043|TracecatHQ/tracecat !2025-03-2825531|😼 The AI-native, open source alternative to Tines / Splunk SOAR.| | 1044|JoshuaC215/agent-service-toolkit !2025-03-2825528|Full toolkit for running an AI agent service built with LangGraph, FastAPI and Streamlit| | 1045|databricks/dbrx !2025-03-2825460|Code examples and resources for DBRX, a large language model developed by Databricks| | 1046|lamini-ai/lamini !2025-03-2825271 |Official repo for Lamini's data generator for generating instructions to train instruction-following LLMs| | 1047|mshumer/gpt-author !2025-03-282510-1|| | 1048|TMElyralab/MusePose !2025-03-2824971|MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation| | 1049|Kludex/fastapi-tips !2025-03-2824974|FastAPI Tips by The FastAPI Expert!| | 1050|openai/simple-evals !2025-03-2824813|| | 1051|iterative/datachain !2025-03-2824732|AI-data warehouse to enrich, transform and analyze data from cloud storages| | 1052|girafe-ai/ml-course !2025-03-2824703|Open Machine Learning course| | 1053|kevmo314/magic-copy !2025-03-2824620 |Magic Copy is a Chrome extension that uses Meta's Segment Anything Model to extract a foreground object from an image and copy it to the clipboard.| | 1054|Eladlev/AutoPrompt !2025-03-2824432|A framework for prompt tuning using Intent-based Prompt Calibration| | 1055|OpenBMB/CPM-Bee !2025-03-282434-1 |A bilingual large-scale model with trillions of parameters| | 1056|IDEA-Research/T-Rex !2025-03-2824310|T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy| | 1057|microsoft/genaiscript !2025-03-2824202|Automatable GenAI Scripting| | 1058|paulpierre/RasaGPT !2025-03-2824090 |💬 RasaGPT is the first headless LLM chatbot platform built on top of Rasa and Langchain. Built w/ Rasa, FastAPI, Langchain, LlamaIndex, SQLModel, pgvector, ngrok, telegram| | 1059|ashishpatel26/LLM-Finetuning !2025-03-2823911|LLM Finetuning with peft| | 1060|SoraWebui/SoraWebui !2025-03-2823570|SoraWebui is an open-source Sora web client, enabling users to easily create videos from text with OpenAI's Sora model.| | 1061|6drf21e/ChatTTScolab !2025-03-2823491|🚀 一键部署(含离线整合包)!基于 ChatTTS ,支持音色抽卡、长音频生成和分角色朗读。简单易用,无需复杂安装。| | 1062|Azure/PyRIT !2025-03-2823343|The Python Risk Identification Tool for generative AI (PyRIT) is an open access automation framework to empower security professionals and machine learning engineers to proactively find risks in their generative AI systems.| | 1063|tencent-ailab/V-Express !2025-03-2823201|V-Express aims to generate a talking head video under the control of a reference image, an audio, and a sequence of V-Kps images.| | 1064|THUDM/CogVLM2 !2025-03-2823170|GPT4V-level open-source multi-modal model based on Llama3-8B| | 1065|dvmazur/mixtral-offloading !2025-03-2823001|Run Mixtral-8x7B models in Colab or consumer desktops| | 1066|semanser/codel !2025-03-2822950|✨ Fully autonomous AI Agent that can perform complicated tasks and projects using terminal, browser, and editor.| | 1067|mshumer/gpt-investor !2025-03-2822590|| | 1068|aixcoder-plugin/aiXcoder-7B !2025-03-2822550|official repository of aiXcoder-7B Code Large Language Model| | 1069|Azure-Samples/graphrag-accelerator !2025-03-2822503|One-click deploy of a Knowledge Graph powered RAG (GraphRAG) in Azure| | 1070|emcf/engshell !2025-03-2821830 |An English-language shell for any OS, powered by LLMs| | 1071|hncboy/chatgpt-web-java !2025-03-2821771|ChatGPT project developed in Java, based on Spring Boot 3 and JDK 17, supports both AccessToken and ApiKey modes| | 1072|openai/consistencydecoder !2025-03-2821692|Consistency Distilled Diff VAE| | 1073|Alpha-VLLM/Lumina-T2X !2025-03-2821681|Lumina-T2X is a unified framework for Text to Any Modality Generation| | 1074|bghira/SimpleTuner !2025-03-2821612|A general fine-tuning kit geared toward Stable Diffusion 2.1, Stable Diffusion 3, DeepFloyd, and SDXL.| | 1075|JiauZhang/DragGAN !2025-03-2821530 |Implementation of DragGAN: Interactive Point-based Manipulation on the Generative Image Manifold| | 1076|cgpotts/cs224u !2025-03-2821390|Code for Stanford CS224u| | 1077|PKU-YuanGroup/MoE-LLaVA !2025-03-2821300|Mixture-of-Experts for Large Vision-Language Models| | 1078|darrenburns/elia !2025-03-2820831|A snappy, keyboard-centric terminal user interface for interacting with large language models. Chat with ChatGPT, Claude, Llama 3, Phi 3, Mistral, Gemma and more.| | 1079|ageerle/ruoyi-ai !2025-03-28207898|RuoYi AI 是一个全栈式 AI 开发平台,旨在帮助开发者快速构建和部署个性化的 AI 应用。| | 1080|NVIDIA/gpu-operator !2025-03-2820510|NVIDIA GPU Operator creates/configures/manages GPUs atop Kubernetes| | 1081|BAAI-Agents/Cradle !2025-03-2820481|The Cradle framework is a first attempt at General Computer Control (GCC). Cradle supports agents to ace any computer task by enabling strong reasoning abilities, self-improvment, and skill curation, in a standardized general environment with minimal requirements.| | 1082|microsoft/aici !2025-03-2820080|AICI: Prompts as (Wasm) Programs| | 1083|PRIS-CV/DemoFusion !2025-03-2820040|Let us democratise high-resolution generation! (arXiv 2023)| | 1084|apple/axlearn !2025-03-2820012|An Extensible Deep Learning Library| | 1085|naver/mast3r !2025-03-2819685|Grounding Image Matching in 3D with MASt3R| | 1086|liltom-eth/llama2-webui !2025-03-281958-1|Run Llama 2 locally with gradio UI on GPU or CPU from anywhere (Linux/Windows/Mac). Supporting Llama-2-7B/13B/70B with 8-bit, 4-bit. Supporting GPU inference (6 GB VRAM) and CPU inference.| | 1087|GaParmar/img2img-turbo !2025-03-2819582|One-step image-to-image with Stable Diffusion turbo: sketch2image, day2night, and more| | 1088|Niek/chatgpt-web !2025-03-2819560|ChatGPT web interface using the OpenAI API| | 1089|huggingface/cookbook !2025-03-2819421|Open-source AI cookbook| | 1090|pytorch/ao !2025-03-2819241|PyTorch native quantization and sparsity for training and inference| | 1091|emcie-co/parlant !2025-03-2819053|The behavior guidance framework for customer-facing LLM agents| | 1092|ymcui/Chinese-LLaMA-Alpaca-3 !2025-03-2818980|中文羊驼大模型三期项目 (Chinese Llama-3 LLMs) developed from Meta Llama 3| | 1093|Nutlope/notesGPT !2025-03-2818811|Record voice notes & transcribe, summarize, and get tasks| | 1094|InstantStyle/InstantStyle !2025-03-2818791|InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation 🔥| | 1095|idaholab/moose !2025-03-2818771|Multiphysics Object Oriented Simulation Environment| | 1096|The-OpenROAD-Project/OpenROAD !2025-03-2818351|OpenROAD's unified application implementing an RTL-to-GDS Flow. Documentation at https://openroad.readthedocs.io/en/latest/| | 1097|alibaba/spring-ai-alibaba !2025-03-281831121|Agentic AI Framework for Java Developers| | 1098|ytongbai/LVM !2025-03-2817990|Sequential Modeling Enables Scalable Learning for Large Vision Models| | 1099|microsoft/sample-app-aoai-chatGPT !2025-03-2817981|[PREVIEW] Sample code for a simple web chat experience targeting chatGPT through AOAI.| | 1100|AI-Citizen/SolidGPT !2025-03-2817830|Chat everything with your code repository, ask repository level code questions, and discuss your requirements. AI Scan and learning your code repository, provide you code repository level answer🧱 🧱| | 1101|YangLing0818/RPG-DiffusionMaster !2025-03-2817784|Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (PRG)| | 1102|kyegomez/BitNet !2025-03-2817710|Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" in pytorch| | 1103|eloialonso/diamond !2025-03-2817671|DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model.| | 1104|flowdriveai/flowpilot !2025-03-2817250|flow-pilot is an openpilot based driver assistance system that runs on linux, windows and android powered machines.| | 1105|xlang-ai/OSWorld !2025-03-2817200|OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments| | 1106|linyiLYi/snake-ai !2025-03-2817031|An AI agent that beats the classic game "Snake".| | 1107|baaivision/Emu !2025-03-2816991|Emu Series: Generative Multimodal Models from BAAI| | 1108|kevmo314/scuda !2025-03-2816870|SCUDA is a GPU over IP bridge allowing GPUs on remote machines to be attached to CPU-only machines.| | 1109|SharifiZarchi/IntroductiontoMachineLearning !2025-03-2816701|دوره‌ی مقدمه‌ای بر یادگیری ماشین، برای دانشجویان| | 1110|google/maxtext !2025-03-2816670|A simple, performant and scalable Jax LLM!| | 1111|ml-explore/mlx-swift-examples !2025-03-2816471|Examples using MLX Swift| | 1112|unitreerobotics/unitreerlgym !2025-03-2816256|| | 1113|collabora/WhisperFusion !2025-03-2815901|WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide a seamless conversations with an AI.| | 1114|lichao-sun/Mora !2025-03-2815520|Mora: More like Sora for Generalist Video Generation| | 1115|GoogleCloudPlatform/localllm !2025-03-2815370|Run LLMs locally on Cloud Workstations| | 1116|TencentARC/BrushNet !2025-03-2815330|The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"| | 1117|ai-christianson/RA.Aid !2025-03-2815288|Develop software autonomously.| | 1118|stephansturges/WALDO !2025-03-2815170|Whereabouts Ascertainment for Low-lying Detectable Objects. The SOTA in FOSS AI for drones!| | 1119|skills/copilot-codespaces-vscode !2025-03-2815112|Develop with AI-powered code suggestions using GitHub Copilot and VS Code| | 1120|andrewnguonly/Lumos !2025-03-2814920|A RAG LLM co-pilot for browsing the web, powered by local LLMs| | 1121|TeamNewPipe/NewPipeExtractor !2025-03-2814811|NewPipe's core library for extracting data from streaming sites| | 1122|mhamilton723/FeatUp !2025-03-2814770|Official code for "FeatUp: A Model-Agnostic Frameworkfor Features at Any Resolution" ICLR 2024| | 1123|AnswerDotAI/fsdpqlora !2025-03-2814671|Training LLMs with QLoRA + FSDP| | 1124|jgravelle/AutoGroq !2025-03-2814330|| | 1125|OpenGenerativeAI/llm-colosseum !2025-03-2814130|Benchmark LLMs by fighting in Street Fighter 3! The new way to evaluate the quality of an LLM| | 1126|microsoft/vscode-ai-toolkit !2025-03-2814000|| | 1127|McGill-NLP/webllama !2025-03-2813930|Llama-3 agents that can browse the web by following instructions and talking to you| | 1128|lucidrains/self-rewarding-lm-pytorch !2025-03-2813760|Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI| | 1129|ishaan1013/sandbox !2025-03-2813650|A cloud-based code editing environment with an AI copilot and real-time collaboration.| | 1130|goatcorp/Dalamud !2025-03-2813275|FFXIV plugin framework and API| | 1131|Lightning-AI/lightning-thunder !2025-03-2813151|Make PyTorch models Lightning fast! Thunder is a source to source compiler for PyTorch. It enables using different hardware executors at once.| | 1132|PKU-YuanGroup/MagicTime !2025-03-2813052|MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators| | 1133|SakanaAI/evolutionary-model-merge !2025-03-2813000|Official repository of Evolutionary Optimization of Model Merging Recipes| | 1134|a-real-ai/pywinassistant !2025-03-2812950|The first open source Large Action Model generalist Artificial Narrow Intelligence that controls completely human user interfaces by only using natural language. PyWinAssistant utilizes Visualization-of-Thought Elicits Spatial Reasoning in Large Language Models.| | 1135|TraceMachina/nativelink !2025-03-2812630|NativeLink is an open source high-performance build cache and remote execution server, compatible with Bazel, Buck2, Reclient, and other RBE-compatible build systems. It offers drastically faster builds, reduced test flakiness, and significant infrastructure cost savings.| | 1136|MLSysOps/MLE-agent !2025-03-2812500|🤖 MLE-Agent: Your intelligent companion for seamless AI engineering and research. 🔍 Integrate with arxiv and paper with code to provide better code/research plans 🧰 OpenAI, Ollama, etc supported. 🎆 Code RAG| | 1137|wpilibsuite/allwpilib !2025-03-2811610|Official Repository of WPILibJ and WPILibC| | 1138|elfvingralf/macOSpilot-ai-assistant !2025-03-2811470|Voice + Vision powered AI assistant that answers questions about any application, in context and in audio.| | 1139|langchain-ai/langchain-extract !2025-03-2811210|🦜⛏️ Did you say you like data?| | 1140|FoundationVision/GLEE !2025-03-2811120|【CVPR2024】GLEE: General Object Foundation Model for Images and Videos at Scale| | 1141|Profluent-AI/OpenCRISPR !2025-03-2810990|AI-generated gene editing systems| | 1142|zju3dv/EasyVolcap !2025-03-2810821|[SIGGRAPH Asia 2023 (Technical Communications)] EasyVolcap: Accelerating Neural Volumetric Video Research| | 1143|PaddlePaddle/PaddleHelix !2025-03-2810560|Bio-Computing Platform Featuring Large-Scale Representation Learning and Multi-Task Deep Learning “螺旋桨”生物计算工具集| | 1144|myshell-ai/JetMoE !2025-03-289800|Reaching LLaMA2 Performance with 0.1M Dollars| | 1145|likejazz/llama3.np !2025-03-289770|llama3.np is pure NumPy implementation for Llama 3 model.| | 1146|mustafaaljadery/gemma-2B-10M !2025-03-289500|Gemma 2B with 10M context length using Infini-attention.| | 1147|HITsz-TMG/FilmAgent !2025-03-289382|Resources of our paper "FilmAgent: A Multi-Agent Framework for End-to-End Film Automation in Virtual 3D Spaces". New versions in the making!| | 1148|aws-samples/amazon-bedrock-samples !2025-03-289362|This repository contains examples for customers to get started using the Amazon Bedrock Service. This contains examples for all available foundational models| | 1149|Akkudoktor-EOS/EOS !2025-03-2893154|This repository features an Energy Optimization System (EOS) that optimizes energy distribution, usage for batteries, heat pumps& household devices. It includes predictive models for electricity prices (planned), load forecasting& dynamic optimization to maximize energy efficiency & minimize costs. Founder Dr. Andreas Schmitz (YouTube @akkudoktor)| Tip: | symbol| rule | | :----| :---- | |🔥 | 256 1k| |!green-up-arrow.svg !red-down-arrow | ranking up / down| |⭐ | on trending page today| [Back to Top] Tools | No. | Tool | Description | | ----:|:----------------------------------------------- |:------------------------------------------------------------------------------------------- | | 1 | ChatGPT | A sibling model to InstructGPT, which is trained to follow instructions in a prompt and provide a detailed response | | 2 | DALL·E 2 | Create original, realistic images and art from a text description | | 3 | Murf AI | AI enabled, real people's voices| | 4 | Midjourney | An independent research lab that produces an artificial intelligence program under the same name that creates images from textual descriptions, used in Discord | 5 | Make-A-Video | Make-A-Video is a state-of-the-art AI system that generates videos from text | | 6 | Creative Reality™ Studio by D-ID| Use generative AI to create future-facing videos| | 7 | chat.D-ID| The First App Enabling Face-to-Face Conversations with ChatGPT| | 8 | Notion AI| Access the limitless power of AI, right inside Notion. Work faster. Write better. Think bigger. | | 9 | Runway| Text to Video with Gen-2 | | 10 | Resemble AI| Resemble’s AI voice generator lets you create human–like voice overs in seconds | | 11 | Cursor| Write, edit, and chat about your code with a powerful AI | | 12 | Hugging Face| Build, train and deploy state of the art models powered by the reference open source in machine learning | | 13 | Claude | A next-generation AI assistant for your tasks, no matter the scale | | 14 | Poe| Poe lets you ask questions, get instant answers, and have back-and-forth conversations with AI. Gives access to GPT-4, gpt-3.5-turbo, Claude from Anthropic, and a variety of other bots| [Back to Top] Websites | No. | WebSite |Description | | ----:|:------------------------------------------ |:---------------------------------------------------------------------------------------- | | 1 | OpenAI | An artificial intelligence research lab | | 2 | Bard | Base Google's LaMDA chatbots and pull from internet | | 3 | ERNIE Bot | Baidu’s new generation knowledge-enhanced large language model is a new member of the Wenxin large model family | | 4 | DALL·E 2 | An AI system that can create realistic images and art from a description in natural language | | 5 | Whisper | A general-purpose speech recognition model | | 6| CivitAI| A platform that makes it easy for people to share and discover resources for creating AI art| | 7|D-ID| D-ID’s Generative AI enables users to transform any picture or video into extraordinary experiences| | 8| Nvidia eDiff-I| Text-to-Image Diffusion Models with Ensemble of Expert Denoisers | | 9| Stability AI| The world's leading open source generative AI company which opened source Stable Diffusion | | 10| Meta AI| Whether it be research, product or infrastructure development, we’re driven to innovate responsibly with AI to benefit the world | | 11| ANTHROPIC| AI research and products that put safety at the frontier | [Back to Top] Reports&Papers | No. | Report&Paper | Description | |:---- |:-------------------------------------------------------------------------------------------------------------- |:---------------------------------------------------- | | 1 | GPT-4 Technical Report | GPT-4 Technical Report | | 2 | mli/paper-reading | Deep learning classics and new papers are read carefully paragraph by paragraph. | | 3 | labmlai/annotateddeeplearningpaperimplementations| A collection of simple PyTorch implementations of neural networks and related algorithms, which are documented with explanations | | 4 | Visual ChatGPT: Talking, Drawing and Editing with Visual Foundation Models | Talking, Drawing and Editing with Visual Foundation Models | | 5 | OpenAI Research | The latest research report and papers from OpenAI | | 6 | Make-A-Video: Text-to-Video Generation without Text-Video Data|Meta's Text-to-Video Generation| | 7 | eDiff-I: Text-to-Image Diffusion Models with Ensemble of Expert Denoisers| Nvidia eDiff-I - New generation of generative AI content creation tool | | 8 | Training an Assistant-style Chatbot with Large Scale Data Distillation from GPT-3.5-Turbo | 2023 GPT4All Technical Report | | 9 | Segment Anything| Meta Segment Anything | | 10 | LLaMA: Open and Efficient Foundation Language Models| LLaMA: a collection of foundation language models ranging from 7B to 65B parameters| | 11 | papers-we-love/papers-we-love |Papers from the computer science community to read and discuss| | 12 | CVPR 2023 papers |The most exciting and influential CVPR 2023 papers| [Back to Top] Tutorials | No. | Tutorial | Description| |:---- |:---------------------------------------------------------------- | --- | | 1 | Coursera - Machine Learning | The Machine Learning Specialization Course taught by Dr. Andrew Ng| | 2 | microsoft/ML-For-Beginners | 12 weeks, 26 lessons, 52 quizzes, classic Machine Learning for all| | 3 | ChatGPT Prompt Engineering for Developers | This short course taught by Isa Fulford (OpenAI) and Andrew Ng (DeepLearning.AI) will teach how to use a large language model (LLM) to quickly build new and powerful applications | | 4 | Dive into Deep Learning |Targeting Chinese readers, functional and open for discussion. The Chinese and English versions are used for teaching in over 400 universities across more than 60 countries | | 5 | AI Expert Roadmap | Roadmap to becoming an Artificial Intelligence Expert in 2022 | | 6 | Computer Science courses |List of Computer Science courses with video lectures| | 7 | Machine Learning with Python | Machine Learning with Python Certification on freeCodeCamp| | 8 | Building Systems with the ChatGPT API | This short course taught by Isa Fulford (OpenAI) and Andrew Ng (DeepLearning.AI), you will learn how to automate complex workflows using chain calls to a large language model| | 9 | LangChain for LLM Application Development | This short course taught by Harrison Chase (Co-Founder and CEO at LangChain) and Andrew Ng. you will gain essential skills in expanding the use cases and capabilities of language models in application development using the LangChain framework| | 10 | How Diffusion Models Work | This short course taught by Sharon Zhou (CEO, Co-founder, Lamini). you will gain a deep familiarity with the diffusion process and the models which carry it out. More than simply pulling in a pre-built model or using an API, this course will teach you to build a diffusion model from scratch| | 11 | Free Programming Books For AI |📚 Freely available programming books for AI | | 12 | microsoft/AI-For-Beginners |12 Weeks, 24 Lessons, AI for All!| | 13 | hemansnation/God-Level-Data-Science-ML-Full-Stack |A collection of scientific methods, processes, algorithms, and systems to build stories & models. This roadmap contains 16 Chapters, whether you are a fresher in the field or an experienced professional who wants to transition into Data Science & AI| | 14 | datawhalechina/prompt-engineering-for-developers |Chinese version of Andrew Ng's Big Model Series Courses, including "Prompt Engineering", "Building System", and "LangChain"| | 15 | ossu/computer-science |🎓 Path to a free self-taught education in Computer Science!| | 16 | microsoft/Data-Science-For-Beginners | 10 Weeks, 20 Lessons, Data Science for All! | |17 |jwasham/coding-interview-university !2023-09-29268215336 |A complete computer science study plan to become a software engineer.| [Back to Top] Thanks If this project has been helpful to you in any way, please give it a ⭐️ by clicking on the star.

aima-python
github
LLM Vibe Score0.575
Human Vibe Score0.33114909407186394
aimacodeMar 28, 2025

aima-python

aima-python Python code for the book Artificial Intelligence: A Modern Approach. You can use this in conjunction with a course on AI, or for study on your own. We're looking for solid contributors to help. Updates for 4th Edition The 4th edition of the book as out now in 2020, and thus we are updating the code. All code here will reflect the 4th edition. Changes include: Move from Python 3.5 to 3.7. More emphasis on Jupyter (Ipython) notebooks. More projects using external packages (tensorflow, etc.). Structure of the Project When complete, this project will have Python implementations for all the pseudocode algorithms in the book, as well as tests and examples of use. For each major topic, such as search, we provide the following files: search.ipynb and search.py: Implementations of all the pseudocode algorithms, and necessary support functions/classes/data. The .py file is generated automatically from the .ipynb file; the idea is that it is easier to read the documentation in the .ipynb file. search_XX.ipynb: Notebooks that show how to use the code, broken out into various topics (the XX). tests/test_search.py: A lightweight test suite, using assert statements, designed for use with py.test, but also usable on their own. Python 3.7 and up The code for the 3rd edition was in Python 3.5; the current 4th edition code is in Python 3.7. It should also run in later versions, but does not run in Python 2. You can install Python or use a browser-based Python interpreter such as repl.it. You can run the code in an IDE, or from the command line with python -i filename.py where the -i option puts you in an interactive loop where you can run Python functions. All notebooks are available in a binder environment. Alternatively, visit jupyter.org for instructions on setting up your own Jupyter notebook environment. Features from Python 3.6 and 3.7 that we will be using for this version of the code: f-strings: all string formatting should be done with f'var = {var}', not with 'var = {}'.format(var) nor 'var = %s' % var. typing module: declare functions with type hints: def successors(state) -> List[State]:; that is, give type declarations, but omit them when it is obvious. I don't need to say state: State, but in another context it would make sense to say s: State. Underscores in numerics: write a million as 1000000 not as 1000000. dataclasses module: replace namedtuple with dataclass. [//]: (There is a sibling [aima-docker]https://github.com/rajatjain1997/aima-docker project that shows you how to use docker containers to run more complex problems in more complex software environments.) Installation Guide To download the repository: git clone https://github.com/aimacode/aima-python.git Then you need to install the basic dependencies to run the project on your system: You also need to fetch the datasets from the aima-data repository: Wait for the datasets to download, it may take a while. Once they are downloaded, you need to install pytest, so that you can run the test suite: pip install pytest Then to run the tests: py.test And you are good to go! Index of Algorithms Here is a table of algorithms, the figure, name of the algorithm in the book and in the repository, and the file where they are implemented in the repository. This chart was made for the third edition of the book and is being updated for the upcoming fourth edition. Empty implementations are a good place for contributors to look for an issue. The aima-pseudocode project describes all the algorithms from the book. An asterisk next to the file name denotes the algorithm is not fully implemented. Another great place for contributors to start is by adding tests and writing on the notebooks. You can see which algorithms have tests and notebook sections below. If the algorithm you want to work on is covered, don't worry! You can still add more tests and provide some examples of use in the notebook! | Figure | Name (in 3rd edition) | Name (in repository) | File | Tests | Notebook |:-------|:----------------------------------|:------------------------------|:--------------------------------|:-----|:---------| | 2 | Random-Vacuum-Agent | RandomVacuumAgent | [agents.py][agents] | Done | Included | | 2 | Model-Based-Vacuum-Agent | ModelBasedVacuumAgent | [agents.py][agents] | Done | Included | | 2.1 | Environment | Environment | [agents.py][agents] | Done | Included | | 2.1 | Agent | Agent | [agents.py][agents] | Done | Included | | 2.3 | Table-Driven-Vacuum-Agent | TableDrivenVacuumAgent | [agents.py][agents] | Done | Included | | 2.7 | Table-Driven-Agent | TableDrivenAgent | [agents.py][agents] | Done | Included | | 2.8 | Reflex-Vacuum-Agent | ReflexVacuumAgent | [agents.py][agents] | Done | Included | | 2.10 | Simple-Reflex-Agent | SimpleReflexAgent | [agents.py][agents] | Done | Included | | 2.12 | Model-Based-Reflex-Agent | ReflexAgentWithState | [agents.py][agents] | Done | Included | | 3 | Problem | Problem | [search.py][search] | Done | Included | | 3 | Node | Node | [search.py][search] | Done | Included | | 3 | Queue | Queue | [utils.py][utils] | Done | No Need | | 3.1 | Simple-Problem-Solving-Agent | SimpleProblemSolvingAgent | [search.py][search] | Done | Included | | 3.2 | Romania | romania | [search.py][search] | Done | Included | | 3.7 | Tree-Search | depth/breadthfirsttree_search | [search.py][search] | Done | Included | | 3.7 | Graph-Search | depth/breadthfirstgraph_search | [search.py][search] | Done | Included | | 3.11 | Breadth-First-Search | breadthfirstgraph_search | [search.py][search] | Done | Included | | 3.14 | Uniform-Cost-Search | uniformcostsearch | [search.py][search] | Done | Included | | 3.17 | Depth-Limited-Search | depthlimitedsearch | [search.py][search] | Done | Included | | 3.18 | Iterative-Deepening-Search | iterativedeepeningsearch | [search.py][search] | Done | Included | | 3.22 | Best-First-Search | bestfirstgraph_search | [search.py][search] | Done | Included | | 3.24 | A\*-Search | astar_search | [search.py][search] | Done | Included | | 3.26 | Recursive-Best-First-Search | recursivebestfirst_search | [search.py][search] | Done | Included | | 4.2 | Hill-Climbing | hill_climbing | [search.py][search] | Done | Included | | 4.5 | Simulated-Annealing | simulated_annealing | [search.py][search] | Done | Included | | 4.8 | Genetic-Algorithm | genetic_algorithm | [search.py][search] | Done | Included | | 4.11 | And-Or-Graph-Search | andorgraph_search | [search.py][search] | Done | Included | | 4.21 | Online-DFS-Agent | onlinedfsagent | [search.py][search] | Done | Included | | 4.24 | LRTA\*-Agent | LRTAStarAgent | [search.py][search] | Done | Included | | 5.3 | Minimax-Decision | minimax_decision | [games.py][games] | Done | Included | | 5.7 | Alpha-Beta-Search | alphabeta_search | [games.py][games] | Done | Included | | 6 | CSP | CSP | [csp.py][csp] | Done | Included | | 6.3 | AC-3 | AC3 | [csp.py][csp] | Done | Included | | 6.5 | Backtracking-Search | backtracking_search | [csp.py][csp] | Done | Included | | 6.8 | Min-Conflicts | min_conflicts | [csp.py][csp] | Done | Included | | 6.11 | Tree-CSP-Solver | treecspsolver | [csp.py][csp] | Done | Included | | 7 | KB | KB | [logic.py][logic] | Done | Included | | 7.1 | KB-Agent | KB_AgentProgram | [logic.py][logic] | Done | Included | | 7.7 | Propositional Logic Sentence | Expr | [utils.py][utils] | Done | Included | | 7.10 | TT-Entails | tt_entails | [logic.py][logic] | Done | Included | | 7.12 | PL-Resolution | pl_resolution | [logic.py][logic] | Done | Included | | 7.14 | Convert to CNF | to_cnf | [logic.py][logic] | Done | Included | | 7.15 | PL-FC-Entails? | plfcentails | [logic.py][logic] | Done | Included | | 7.17 | DPLL-Satisfiable? | dpll_satisfiable | [logic.py][logic] | Done | Included | | 7.18 | WalkSAT | WalkSAT | [logic.py][logic] | Done | Included | | 7.20 | Hybrid-Wumpus-Agent | HybridWumpusAgent | | | | | 7.22 | SATPlan | SAT_plan | [logic.py][logic] | Done | Included | | 9 | Subst | subst | [logic.py][logic] | Done | Included | | 9.1 | Unify | unify | [logic.py][logic] | Done | Included | | 9.3 | FOL-FC-Ask | folfcask | [logic.py][logic] | Done | Included | | 9.6 | FOL-BC-Ask | folbcask | [logic.py][logic] | Done | Included | | 10.1 | Air-Cargo-problem | air_cargo | [planning.py][planning] | Done | Included | | 10.2 | Spare-Tire-Problem | spare_tire | [planning.py][planning] | Done | Included | | 10.3 | Three-Block-Tower | threeblocktower | [planning.py][planning] | Done | Included | | 10.7 | Cake-Problem | havecakeandeatcake_too | [planning.py][planning] | Done | Included | | 10.9 | Graphplan | GraphPlan | [planning.py][planning] | Done | Included | | 10.13 | Partial-Order-Planner | PartialOrderPlanner | [planning.py][planning] | Done | Included | | 11.1 | Job-Shop-Problem-With-Resources | jobshopproblem | [planning.py][planning] | Done | Included | | 11.5 | Hierarchical-Search | hierarchical_search | [planning.py][planning] | Done | Included | | 11.8 | Angelic-Search | angelic_search | [planning.py][planning] | Done | Included | | 11.10 | Doubles-tennis | doubletennisproblem | [planning.py][planning] | Done | Included | | 13 | Discrete Probability Distribution | ProbDist | [probability.py][probability] | Done | Included | | 13.1 | DT-Agent | DTAgent | [probability.py][probability] | Done | Included | | 14.9 | Enumeration-Ask | enumeration_ask | [probability.py][probability] | Done | Included | | 14.11 | Elimination-Ask | elimination_ask | [probability.py][probability] | Done | Included | | 14.13 | Prior-Sample | prior_sample | [probability.py][probability] | Done | Included | | 14.14 | Rejection-Sampling | rejection_sampling | [probability.py][probability] | Done | Included | | 14.15 | Likelihood-Weighting | likelihood_weighting | [probability.py][probability] | Done | Included | | 14.16 | Gibbs-Ask | gibbs_ask | [probability.py][probability] | Done | Included | | 15.4 | Forward-Backward | forward_backward | [probability.py][probability] | Done | Included | | 15.6 | Fixed-Lag-Smoothing | fixedlagsmoothing | [probability.py][probability] | Done | Included | | 15.17 | Particle-Filtering | particle_filtering | [probability.py][probability] | Done | Included | | 16.9 | Information-Gathering-Agent | InformationGatheringAgent | [probability.py][probability] | Done | Included | | 17.4 | Value-Iteration | value_iteration | [mdp.py][mdp] | Done | Included | | 17.7 | Policy-Iteration | policy_iteration | [mdp.py][mdp] | Done | Included | | 17.9 | POMDP-Value-Iteration | pomdpvalueiteration | [mdp.py][mdp] | Done | Included | | 18.5 | Decision-Tree-Learning | DecisionTreeLearner | [learning.py][learning] | Done | Included | | 18.8 | Cross-Validation | cross_validation | [learning.py][learning]\* | | | | 18.11 | Decision-List-Learning | DecisionListLearner | [learning.py][learning]\* | | | | 18.24 | Back-Prop-Learning | BackPropagationLearner | [learning.py][learning] | Done | Included | | 18.34 | AdaBoost | AdaBoost | [learning.py][learning] | Done | Included | | 19.2 | Current-Best-Learning | currentbestlearning | knowledge.py | Done | Included | | 19.3 | Version-Space-Learning | versionspacelearning | knowledge.py | Done | Included | | 19.8 | Minimal-Consistent-Det | minimalconsistentdet | knowledge.py | Done | Included | | 19.12 | FOIL | FOIL_container | knowledge.py | Done | Included | | 21.2 | Passive-ADP-Agent | PassiveADPAgent | [rl.py][rl] | Done | Included | | 21.4 | Passive-TD-Agent | PassiveTDAgent | [rl.py][rl] | Done | Included | | 21.8 | Q-Learning-Agent | QLearningAgent | [rl.py][rl] | Done | Included | | 22.1 | HITS | HITS | [nlp.py][nlp] | Done | Included | | 23 | Chart-Parse | Chart | [nlp.py][nlp] | Done | Included | | 23.5 | CYK-Parse | CYK_parse | [nlp.py][nlp] | Done | Included | | 25.9 | Monte-Carlo-Localization | montecarlolocalization | [probability.py][probability] | Done | Included | Index of data structures Here is a table of the implemented data structures, the figure, name of the implementation in the repository, and the file where they are implemented. | Figure | Name (in repository) | File | |:-------|:--------------------------------|:--------------------------| | 3.2 | romania_map | [search.py][search] | | 4.9 | vacumm_world | [search.py][search] | | 4.23 | onedimstate_space | [search.py][search] | | 6.1 | australia_map | [search.py][search] | | 7.13 | wumpusworldinference | [logic.py][logic] | | 7.16 | hornclausesKB | [logic.py][logic] | | 17.1 | sequentialdecisionenvironment | [mdp.py][mdp] | | 18.2 | waitingdecisiontree | [learning.py][learning] | Acknowledgements Many thanks for contributions over the years. I got bug reports, corrected code, and other support from Darius Bacon, Phil Ruggera, Peng Shao, Amit Patil, Ted Nienstedt, Jim Martin, Ben Catanzariti, and others. Now that the project is on GitHub, you can see the contributors who are doing a great job of actively improving the project. Many thanks to all contributors, especially @darius, @SnShine, @reachtarunhere, @antmarakis, @Chipe1, @ad71 and @MariannaSpyrakou. [agents]:../master/agents.py [csp]:../master/csp.py [games]:../master/games.py [grid]:../master/grid.py [knowledge]:../master/knowledge.py [learning]:../master/learning.py [logic]:../master/logic.py [mdp]:../master/mdp.py [nlp]:../master/nlp.py [planning]:../master/planning.py [probability]:../master/probability.py [rl]:../master/rl.py [search]:../master/search.py [utils]:../master/utils.py [text]:../master/text.py

RD-Agent
github
LLM Vibe Score0.548
Human Vibe Score0.27921589729164453
microsoftMar 28, 2025

RD-Agent

🖥️ Live Demo | 🎥 Demo Video ▶️YouTube | 📖 Documentation | 📃 Papers Data Science Agent Preview Check out our demo video showcasing the current progress of our Data Science Agent under development: https://github.com/user-attachments/assets/3eccbecb-34a4-4c81-bce4-d3f8862f7305 📰 News | 🗞️ News | 📝 Description | | -- | ------ | | Support LiteLLM Backend | We now fully support LiteLLM as a backend for integration with multiple LLM providers. | | More General Data Science Agent | 🚀Coming soon! | | Kaggle Scenario release | We release Kaggle Agent, try the new features! | | Official WeChat group release | We created a WeChat group, welcome to join! (🗪QR Code) | | Official Discord release | We launch our first chatting channel in Discord (🗪) | | First release | RDAgent is released on GitHub | 🌟 Introduction RDAgent aims to automate the most critical and valuable aspects of the industrial R&D process, and we begin with focusing on the data-driven scenarios to streamline the development of models and data. Methodologically, we have identified a framework with two key components: 'R' for proposing new ideas and 'D' for implementing them. We believe that the automatic evolution of R&D will lead to solutions of significant industrial value. R&D is a very general scenario. The advent of RDAgent can be your 💰 Automatic Quant Factory (🎥Demo Video|▶️YouTube) 🤖 Data Mining Agent: Iteratively proposing data & models (🎥Demo Video 1|▶️YouTube) (🎥Demo Video 2|▶️YouTube) and implementing them by gaining knowledge from data. 🦾 Research Copilot: Auto read research papers (🎥Demo Video|▶️YouTube) / financial reports (🎥Demo Video|▶️YouTube) and implement model structures or building datasets. 🤖 Kaggle Agent: Auto Model Tuning and Feature Engineering([🎥Demo Video Coming Soon...]()) and implementing them to achieve more in competitions. ... You can click the links above to view the demo. We're continuously adding more methods and scenarios to the project to enhance your R&D processes and boost productivity. Additionally, you can take a closer look at the examples in our 🖥️ Live Demo. ⚡ Quick start You can try above demos by running the following command: 🐳 Docker installation. Users must ensure Docker is installed before attempting most scenarios. Please refer to the official 🐳Docker page for installation instructions. Ensure the current user can run Docker commands without using sudo. You can verify this by executing docker run hello-world. 🐍 Create a Conda Environment Create a new conda environment with Python (3.10 and 3.11 are well-tested in our CI): Activate the environment: 🛠️ Install the RDAgent You can directly install the RDAgent package from PyPI: 💊 Health check rdagent provides a health check that currently checks two things. whether the docker installation was successful. whether the default port used by the rdagent ui is occupied. ⚙️ Configuration The demos requires following ability: ChatCompletion json_mode embedding query For example: If you are using the OpenAI API, you have to configure your GPT model in the .env file like this. However, not every API services support these features by default. For example: AZURE OpenAI, you have to configure your GPT model in the .env file like this. We now support LiteLLM as a backend for integration with multiple LLM providers. If you use LiteLLM Backend to use models, you can configure as follows: For more configuration information, please refer to the documentation. 🚀 Run the Application The 🖥️ Live Demo is implemented by the following commands(each item represents one demo, you can select the one you prefer): Run the Automated Quantitative Trading & Iterative Factors Evolution: Qlib self-loop factor proposal and implementation application Run the Automated Quantitative Trading & Iterative Model Evolution: Qlib self-loop model proposal and implementation application Run the Automated Medical Prediction Model Evolution: Medical self-loop model proposal and implementation application (1) Apply for an account at PhysioNet. (2) Request access to FIDDLE preprocessed data: FIDDLE Dataset. (3) Place your username and password in .env. Run the Automated Quantitative Trading & Factors Extraction from Financial Reports: Run the Qlib factor extraction and implementation application based on financial reports Run the Automated Model Research & Development Copilot: model extraction and implementation application Run the Automated Kaggle Model Tuning & Feature Engineering: self-loop model proposal and feature engineering implementation application Using sf-crime (San Francisco Crime Classification) as an example. Register and login on the Kaggle website. Configuring the Kaggle API. (1) Click on the avatar (usually in the top right corner of the page) -> Settings -> Create New Token, A file called kaggle.json will be downloaded. (2) Move kaggle.json to ~/.config/kaggle/ (3) Modify the permissions of the kaggle.json file. Reference command: chmod 600 ~/.config/kaggle/kaggle.json Join the competition: Click Join the competition -> I Understand and Accept at the bottom of the competition details page. Description of the above example: Kaggle competition data, contains two parts: competition description file (json file) and competition dataset (zip file). We prepare the competition description file for you, the competition dataset will be downloaded automatically when you run the program, as in the example. If you want to download the competition description file automatically, you need to install chromedriver, The instructions for installing chromedriver can be found in the documentation. The Competition List Available can be found here. 🖥️ Monitor the Application Results You can run the following command for our demo program to see the run logs. Note: Although port 19899 is not commonly used, but before you run this demo, you need to check if port 19899 is occupied. If it is, please change it to another port that is not occupied. You can check if a port is occupied by running the following command. 🏭 Scenarios We have applied RD-Agent to multiple valuable data-driven industrial scenarios. 🎯 Goal: Agent for Data-driven R&D In this project, we are aiming to build an Agent to automate Data-Driven R\&D that can 📄 Read real-world material (reports, papers, etc.) and extract key formulas, descriptions of interested features and models, which are the key components of data-driven R&D . 🛠️ Implement the extracted formulas (e.g., features, factors, and models) in runnable codes. Due to the limited ability of LLM in implementing at once, build an evolving process for the agent to improve performance by learning from feedback and knowledge. 💡 Propose new ideas based on current knowledge and observations. 📈 Scenarios/Demos In the two key areas of data-driven scenarios, model implementation and data building, our system aims to serve two main roles: 🦾Copilot and 🤖Agent. The 🦾Copilot follows human instructions to automate repetitive tasks. The 🤖Agent, being more autonomous, actively proposes ideas for better results in the future. The supported scenarios are listed below: | Scenario/Target | Model Implementation | Data Building | | -- | -- | -- | | 💹 Finance | 🤖 Iteratively Proposing Ideas & Evolving▶️YouTube | 🤖 Iteratively Proposing Ideas & Evolving ▶️YouTube 🦾 Auto reports reading & implementation▶️YouTube | | 🩺 Medical | 🤖 Iteratively Proposing Ideas & Evolving▶️YouTube | - | | 🏭 General | 🦾 Auto paper reading & implementation▶️YouTube 🤖 Auto Kaggle Model Tuning | 🤖Auto Kaggle feature Engineering | RoadMap: Currently, we are working hard to add new features to the Kaggle scenario. Different scenarios vary in entrance and configuration. Please check the detailed setup tutorial in the scenarios documents. Here is a gallery of successful explorations (5 traces showed in 🖥️ Live Demo). You can download and view the execution trace using this command from the documentation. Please refer to 📖readthedocs_scen for more details of the scenarios. ⚙️ Framework Automating the R&D process in data science is a highly valuable yet underexplored area in industry. We propose a framework to push the boundaries of this important research field. The research questions within this framework can be divided into three main categories: | Research Area | Paper/Work List | |--------------------|-----------------| | Benchmark the R&D abilities | Benchmark | | Idea proposal: Explore new ideas or refine existing ones | Research | | Ability to realize ideas: Implement and execute ideas | Development | We believe that the key to delivering high-quality solutions lies in the ability to evolve R&D capabilities. Agents should learn like human experts, continuously improving their R&D skills. More documents can be found in the 📖 readthedocs. 📃 Paper/Work list 📊 Benchmark Towards Data-Centric Automatic R&D !image 🔍 Research In a data mining expert's daily research and development process, they propose a hypothesis (e.g., a model structure like RNN can capture patterns in time-series data), design experiments (e.g., finance data contains time-series and we can verify the hypothesis in this scenario), implement the experiment as code (e.g., Pytorch model structure), and then execute the code to get feedback (e.g., metrics, loss curve, etc.). The experts learn from the feedback and improve in the next iteration. Based on the principles above, we have established a basic method framework that continuously proposes hypotheses, verifies them, and gets feedback from the real-world practice. This is the first scientific research automation framework that supports linking with real-world verification. For more detail, please refer to our 🖥️ Live Demo page. 🛠️ Development Collaborative Evolving Strategy for Automatic Data-Centric Development !image 🤝 Contributing We welcome contributions and suggestions to improve RD-Agent. Please refer to the Contributing Guide for more details on how to contribute. Before submitting a pull request, ensure that your code passes the automatic CI checks. 📝 Guidelines This project welcomes contributions and suggestions. Contributing to this project is straightforward and rewarding. Whether it's solving an issue, addressing a bug, enhancing documentation, or even correcting a typo, every contribution is valuable and helps improve RDAgent. To get started, you can explore the issues list, or search for TODO: comments in the codebase by running the command grep -r "TODO:". Before we released RD-Agent as an open-source project on GitHub, it was an internal project within our group. Unfortunately, the internal commit history was not preserved when we removed some confidential code. As a result, some contributions from our group members, including Haotian Chen, Wenjun Feng, Haoxue Wang, Zeqi Ye, Xinjie Shen, and Jinhui Li, were not included in the public commits. ⚖️ Legal disclaimer The RD-agent is provided “as is”, without warranty of any kind, express or implied, including but not limited to the warranties of merchantability, fitness for a particular purpose and noninfringement. The RD-agent is aimed to facilitate research and development process in the financial industry and not ready-to-use for any financial investment or advice. Users shall independently assess and test the risks of the RD-agent in a specific use scenario, ensure the responsible use of AI technology, including but not limited to developing and integrating risk mitigation measures, and comply with all applicable laws and regulations in all applicable jurisdictions. The RD-agent does not provide financial opinions or reflect the opinions of Microsoft, nor is it designed to replace the role of qualified financial professionals in formulating, assessing, and approving finance products. The inputs and outputs of the RD-agent belong to the users and users shall assume all liability under any theory of liability, whether in contract, torts, regulatory, negligence, products liability, or otherwise, associated with use of the RD-agent and any inputs and outputs thereof.

LLMStack
github
LLM Vibe Score0.535
Human Vibe Score0.022778788676674117
trypromptlyMar 28, 2025

LLMStack

LLMStack is a no-code platform for building generative AI agents, workflows and chatbots, connecting them to your data and business processes. Quickstart | Documentation | Promptly Overview Build tailor-made generative AI agents, applications and chatbots that cater to your unique needs by chaining multiple LLMs. Seamlessly integrate your own data, internal tools and GPT-powered models without any coding experience using LLMStack's no-code builder. Trigger your AI chains from Slack or Discord. Deploy to the cloud or on-premise. !llmstack-quickstart See full demo video here Getting Started Check out our Cloud offering at Promptly or follow the instructions below to deploy LLMStack on your own infrastructure. LLMStack deployment comes with a default admin account whose credentials are admin and promptly. Be sure to change the password from admin panel after logging in. Installation Prerequisites LLMStack depends on a background docker container to run jobs. Make sure you have Docker installed on your machine if want to use jobs. You can follow the instructions here to install Docker. Install LLMStack using pip If you are on windows, please use WSL2 (Windows Subsystem for Linux) to install LLMStack. You can follow the instructions here to install WSL2. Once you are in a WSL2 terminal, you can install LLMStack using the above command. Start LLMStack using the following command: Above commands will install and start LLMStack. It will create .llmstack in your home directory and places the database and config files in it when run for the first time. Once LLMStack is up and running, it should automatically open your browser and point it to localhost:3000. You can add your own keys to providers like OpenAI, Cohere, Stability etc., from Settings page. If you want to provide default keys for all the users of your LLMStack instance, you can add them to the ~/.llmstack/config file. LLMStack: Quickstart video Features 🤖 Agents: Build generative AI agents like AI SDRs, Research Analysts, RPA Automations etc., without writing any code. Connect agents to your internal or external tools, search the web or browse the internet with agents. 🔗 Chain multiple models: LLMStack allows you to chain multiple LLMs together to build complex generative AI applications. 📊 Use generative AI on your Data: Import your data into your accounts and use it in AI chains. LLMStack allows importing various types (CSV, TXT, PDF, DOCX, PPTX etc.,) of data from a variety of sources (gdrive, notion, websites, direct uploads etc.,). Platform will take care of preprocessing and vectorization of your data and store it in the vector database that is provided out of the box. 🛠️ No-code builder: LLMStack comes with a no-code builder that allows you to build AI chains without any coding experience. You can chain multiple LLMs together and connect them to your data and business processes. ☁️ Deploy to the cloud or on-premise: LLMStack can be deployed to the cloud or on-premise. You can deploy it to your own infrastructure or use our cloud offering at Promptly. 🚀 API access: Apps or chatbots built with LLMStack can be accessed via HTTP API. You can also trigger your AI chains from Slack or Discord. 🏢 Multi-tenant: LLMStack is multi-tenant. You can create multiple organizations and add users to them. Users can only access the data and AI chains that belong to their organization. What can you build with LLMStack? Using LLMStack you can build a variety of generative AI applications, chatbots and agents. Here are some examples: 👩🏻‍💼 AI SDRs: You can build AI SDRs (Sales Development Representatives) that can generate personalized emails, LinkedIn messages, cold calls, etc., for your sales team 👩🏻‍💻 Research Analysts: You can build AI Research Analysts that can generate research reports, investment thesis, etc., for your investment team 🤖 RPA Automations: You can build RPA automations that can automate your business processes by generating emails, filling forms, etc., 📝 Text generation: You can build apps that generate product descriptions, blog posts, news articles, tweets, emails, chat messages, etc., by using text generation models and optionally connecting your data. Check out this marketing content generator for example 🤖 Chatbots: You can build chatbots trained on your data powered by ChatGPT like Promptly Help that is embedded on Promptly website 🎨 Multimedia generation: Build complex applications that can generate text, images, videos, audio, etc. from a prompt. This story generator is an example 🗣️ Conversational AI: Build conversational AI systems that can have a conversation with a user. Check out this Harry Potter character chatbot 🔍 Search augmentation: Build search augmentation systems that can augment search results with additional information using APIs. Sharebird uses LLMStack to augment search results with AI generated answer from their content similar to Bing's chatbot 💬 Discord and Slack bots: Apps built on LLMStack can be triggered from Slack or Discord. You can easily connect your AI chains to Slack or Discord from LLMStack's no-code app editor. Check out our Discord server to interact with one such bot. Administration Login to http://localhost:3000/admin using the admin account. You can add users and assign them to organizations in the admin panel. Cloud Offering Check out our cloud offering at Promptly. You can sign up for a free account and start building your own generative AI applications. Documentation Check out our documentation at docs.trypromptly.com/llmstack to learn more about LLMStack. Development Check out our development guide at docs.trypromptly.com/llmstack/development to learn more about how to run and develop LLMStack. Contributing We welcome contributions to LLMStack. Please check out our contributing guide to learn more about how you can contribute to LLMStack.

instill-core
github
LLM Vibe Score0.515
Human Vibe Score0.023472450495103967
instill-aiMar 28, 2025

instill-core

🔮 Instill Core A complete unstructured data solution: ETL processing, AI-readiness, open-source LLM hosting, and RAG capabilities in one powerful platform. Quick start Follow the installation steps below or documentation for more details to build versatile AI applications locally. What is Instill Core? Instill Core is an end-to-end AI platform for data, pipeline and model orchestration. 🔮 Instill Core simplifies infrastructure hassle and encompasses these core features: 💧 Pipeline: Quickly build versatile AI-first APIs or automated workflows. ⚗️ Model: Deploy and monitor AI models without GPU infrastructure hassles. 💾 Artifact: Transform unstructured data (e.g., documents, images, audio, video) into AI-ready formats. ⚙️ Component: Connect essential building blocks to construct powerful pipelines. What can you build? 📖 Parsing PDF Files to Markdown: Cookbook 🧱 Generating Structured Outputs from LLMs: Cookbook & Tutorial 🕸️ Web scraping & Google Search with Structured Insights 🌱 Instance segmentation on microscopic plant stomata images: Cookbook See Examples for more! Installation Prerequisites | Operating System | Requirements and Instructions | | ---------------- | ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- | | macOS or Linux | Instill Core works natively | | Windows | • Use Windows Subsystem for Linux (WSL2)• Install latest yq from GitHub Repository• Install latest Docker Desktop and enable WSL2 integration (tutorial)• (Optional) Install cuda-toolkit on WSL2 (NVIDIA tutorial) | | All Systems | • Docker Engine v25 or later• Docker Compose v2 or later• Install latest stable Docker and Docker Compose | Steps Use stable release version Execute the following commands to pull pre-built images with all the dependencies to launch: [!NOTE] We have restructured our project repositories. If you need to access 🔮 Instill Core projects up to version v0.13.0-beta, please refer to the instill-ai/deprecated-core repository. Use the latest version for local development Execute the following commands to build images with all the dependencies to launch: [!IMPORTANT] Code in the main branch tracks under-development progress towards the next release and may not work as expected. If you are looking for a stable alpha version, please use latest release. 🚀 That's it! Once all the services are up with health status, the UI is ready to go at . Please find the default login credentials in the documentation. To shut down all running services: Deployment Visit the Deployment Overview for more details. Client Access 📺 Console ⌨️ CLI 📦 SDK: Python SDK TypeScript SDK Stay tuned, as more SDKs are on the way! Documentation Please visit our official documentation for more. Additional resources: API Reference Cookbooks Tutorials Examples Contributing We welcome contributions from our community! Checkout the methods below: Cookbooks: Help us create helpful pipelines and guides for the community. Visit our Cookbook repository to get started. Issues: Contribute to improvements by raising tickets using templates here or discuss in existing ones you think you can help with. Community Standards We are committed to maintaining a respectful and welcoming atmosphere for all contributors. Before contributing, please read: Contributing Guidelines Code of Conduct Support Get help by joining our Discord community where you can post any questions on our #ask-for-help channel. Contributors ✨ Thank you to all these wonderful people (emoji key): Vibhor Bhatt Miguel Ortiz Sajda Kabir Henry Chen Hari Bhandari Shiva Gaire Zubeen ShihChun-H Ikko Eltociear Ashimine Farookh Zaheer Siddiqui Brian Gallagher hairyputtar David Marx Deniz Parlak Po-Yu Chen Po Chun Chiu Sarthak HR Wu phelan Chang, Hui-Tang Xiaofei Du Ping-Lin Chang Tony Wang Pratik date Juan Vallés Naman Anand totuslink Praharsh Jain Utsav Paul CaCaBlocker Rafael Melo Jeremy Shih Romit Mohane ChunHao Amelia C 楊竣凱 andre.liang Zoodane George Strong Anni Mubeen Kodvavi RCKT Wojciech Bandzerewicz Gary Leo felixcorleone Zoe Daniel Manul Thanura Akash Jana Anish0203 Prathamesh Tugaonkar Shubham This project follows the all-contributors specification. Contributions of any kind welcome! License See the LICENSE file for licensing information.

TornadoVM
github
LLM Vibe Score0.539
Human Vibe Score0.20972324263626374
beehive-labMar 28, 2025

TornadoVM

TornadoVM !TornadoVM version TornadoVM is a plug-in to OpenJDK and GraalVM that allows programmers to automatically run Java programs on heterogeneous hardware. TornadoVM targets OpenCL, PTX and SPIR-V compatible devices which include multi-core CPUs, dedicated GPUs (Intel, NVIDIA, AMD), integrated GPUs (Intel HD Graphics and ARM Mali), and FPGAs (Intel and Xilinx). TornadoVM has three backends that generate OpenCL C, NVIDIA CUDA PTX assembly, and SPIR-V binary. Developers can choose which backends to install and run. Website: tornadovm.org Documentation: https://tornadovm.readthedocs.io/en/latest/ For a quick introduction please read the following FAQ. Latest Release: TornadoVM 1.0.10 - 31/01/2025 : See CHANGELOG. Installation In Linux and macOS, TornadoVM can be installed automatically with the installation script. For example: NOTE Select the desired backend: opencl: Enables the OpenCL backend (requires OpenCL drivers) ptx: Enables the PTX backend (requires NVIDIA CUDA drivers) spirv: Enables the SPIRV backend (requires Intel Level Zero drivers) Example of installation: Alternatively, TornadoVM can be installed either manually from source or by using Docker. If you are planning to use Docker with TornadoVM on GPUs, you can also follow these guidelines. You can also run TornadoVM on Amazon AWS CPUs, GPUs, and FPGAs following the instructions here. Usage Instructions TornadoVM is currently being used to accelerate machine learning and deep learning applications, computer vision, physics simulations, financial applications, computational photography, and signal processing. Featured use-cases: kfusion-tornadovm: Java application for accelerating a computer-vision application using the Tornado-APIs to run on discrete and integrated GPUs. Java Ray-Tracer: Java application accelerated with TornadoVM for real-time ray-tracing. We also have a set of examples that includes NBody, DFT, KMeans computation and matrix computations. Additional Information General Documentation Benchmarks How TornadoVM executes reductions Execution Flags FPGA execution Profiler Usage Programming Model TornadoVM exposes to the programmer task-level, data-level and pipeline-level parallelism via a light Application Programming Interface (API). In addition, TornadoVM uses single-source property, in which the code to be accelerated and the host code live in the same Java program. Compute-kernels in TornadoVM can be programmed using two different approaches (APIs): a) Loop Parallel API Compute kernels are written in a sequential form (tasks programmed for a single thread execution). To express parallelism, TornadoVM exposes two annotations that can be used in loops and parameters: a) @Parallel for annotating parallel loops; and b) @Reduce for annotating parameters used in reductions. The following code snippet shows a full example to accelerate Matrix-Multiplication using TornadoVM and the loop-parallel API: To run TornadoVM, you need to either install the TornadoVM extension for GraalVM/OpenJDK, or run with our Docker images. Additional Resources Here you can find videos, presentations, tech-articles and artefacts describing TornadoVM, and how to use it. Academic Publications If you are using TornadoVM >= 0.2 (which includes the Dynamic Reconfiguration, the initial FPGA support and CPU/GPU reductions), please use the following citation: If you are using Tornado 0.1 (Initial release), please use the following citation in your work. Selected publications can be found here. Acknowledgments This work is partially funded by Intel corporation. In addition, it has been supported by the following EU & UKRI grants (most recent first): EU Horizon Europe & UKRI AERO 101092850. EU Horizon Europe & UKRI INCODE 101093069. EU Horizon Europe & UKRI ENCRYPT 101070670. EU Horizon Europe & UKRI TANGO 101070052. EU Horizon 2020 ELEGANT 957286. EU Horizon 2020 E2Data 780245. EU Horizon 2020 ACTiCLOUD 732366. Furthermore, TornadoVM has been supported by the following EPSRC grants: PAMELA EP/K008730/1. AnyScale Apps EP/L000725/1. Contributions and Collaborations We welcome collaborations! Please see how to contribute to the project in the CONTRIBUTING page. Write your questions and proposals: Additionally, you can open new proposals on the GitHub discussions page. Alternatively, you can share a Google document with us. Collaborations: For Academic & Industry collaborations, please contact here. TornadoVM Team Visit our website to meet the team. Licenses Per Module To use TornadoVM, you can link the TornadoVM API to your application which is under Apache 2. Each Java TornadoVM module is licensed as follows: | Module | License | |--------------------------------|------------------------------------------------------------------------------------------------------------------------------------------------------------------| | Tornado-API | | | Tornado-Runtime | | | Tornado-Assembly | | | Tornado-Drivers | | | Tornado-Drivers-OpenCL-Headers | | | Tornado-scripts | | | Tornado-Annotation | | | Tornado-Unittests | | | Tornado-Benchmarks | | | Tornado-Examples | | | Tornado-Matrices | | | | |

oreilly-ai-agents
github
LLM Vibe Score0.437
Human Vibe Score0.07783740211883924
sinanuozdemirMar 28, 2025

oreilly-ai-agents

!oreilly-logo AI Agents A-Z This repository contains code for the O'Reilly Live Online Training for AI Agents A-Z This course provides a comprehensive guide to understanding, implementing, and managing AI agents both at the prototype stage and in production. Attendees will start with foundational concepts and progressively delve into more advanced topics, including various frameworks like CrewAI, LangChain, and AutoGen as well as building agents from scratch using powerful prompt engineering techniques. The course emphasizes practical application, guiding participants through hands-on exercises to implement and deploy AI agents, evaluate their performance, and iterate on their designs. We will go over key aspects like cost projections, open versus closed source options, and best practices are thoroughly covered to equip attendees with the knowledge to make informed decisions in their AI projects. Setup Instructions Using Python 3.11 Virtual Environment At the time of writing, we need a Python virtual environment with Python 3.11. Option 1: Python 3.11 is Already Installed Step 1: Verify Python 3.11 Installation Step 2: Create a Virtual Environment This creates a .venv folder in your current directory. Step 3: Activate the Virtual Environment macOS/Linux: Windows: You should see (.venv) in your terminal prompt. Step 4: Verify the Python Version Step 5: Install Packages Step 6: Deactivate the Virtual Environment Option 2: Install Python 3.11 If you don’t have Python 3.11, follow the steps below for your OS. macOS (Using Homebrew) Ubuntu/Debian Windows (Using Windows Installer) Go to Python Downloads. Download the installer for Python 3.11. Run the installer and ensure "Add Python 3.11 to PATH" is checked. Verify Installation Notebooks In the activated environment, run Using 3rd party agent frameworks Intro to CrewAI - An introductory notebook for CrewAI See the streamlit directory for an example of deploying crew on a streamlit app Intro to Autogen - An introductory notebook for Microsoft's Autogen Intro to OpenAI Swarm - An introductory notebook for OpenAI's Swarm Intro to LangGraph - An introductory notebook for LangGraph Agents playing Chess - An implementation of two ReAct Agents playing Chess with each other Evaluating Agents Evaluating Agent Output with Rubrics - Exploring a rubric prompt to evaluate generative output. This notebook also notes positional biases when choosing between agent responses. Advanced - Evaluating Alignment - A longer notebook doing a much more in depth analysis on how an LLM can judge agent's responses Evaluating Tool Selection - Calculating the accuracy of tool selection between different LLMs and quantifying the positional bias present in auto-regressive LLMs. See the additions here for V3 + DeepSeek Distilled Models and here for DeepSeek R1 Building our own agents First Steps with our own Agent - Working towards building our own agent framework See Squad Goals for a very simple example of my own agent framework Intro to Squad Goals - using my own framework to do some basic tasks Multimodal Agents - Incorporating Dalle-3 to allow our squad to generate images Modern Agent Paradigms Plan & Execute Agents - Plan & Execute Agents use a planner to create multi-step plans with an LLM and an executor to complete each step by invoking tools. Reflection Agents - Reflection Agents combine a generator to perform tasks and a reflector to provide feedback and guide improvements. Instructor Sinan Ozdemir is the Founder and CTO of LoopGenius where he uses State of the art AI to help people run digital ads on Meta, Google, and more. Sinan is a former lecturer of Data Science at Johns Hopkins University and the author of multiple textbooks on data science and machine learning. Additionally, he is the founder of the recently acquired Kylie.ai, an enterprise-grade conversational AI platform with RPA capabilities. He holds a master’s degree in Pure Mathematics from Johns Hopkins University and is based in San Francisco, CA.

awesome-quantum-machine-learning
github
LLM Vibe Score0.64
Human Vibe Score1
krishnakumarsekarMar 27, 2025

awesome-quantum-machine-learning

Awesome Quantum Machine Learning A curated list of awesome quantum machine learning algorithms,study materials,libraries and software (by language). Table of Contents INTRODUCTION Why Quantum Machine Learning? BASICS What is Quantum Mechanics? What is Quantum Computing? What is Topological Quantum Computing? Quantum Computing vs Classical Computing QUANTUM COMPUTING Atom Structure Photon wave Electron Fluctuation or spin States SuperPosition SuperPosition specific for machine learning(Quantum Walks) Classical Bit Quantum Bit or Qubit or Qbit Basic Gates in Quantum Computing Quantum Diode Quantum Transistor Quantum Processor Quantum Registery QRAM Quantum Entanglement QUANTUM COMPUTING MACHINE LEARNING BRIDGE Complex Numbers Tensors Tensors Network Oracle Hadamard transform Hilbert Space eigenvalues and eigenvectors Schr¨odinger Operators Quantum lambda calculus Quantum Amplitute Phase Qubits Encode and Decode convert classical bit to qubit Quantum Dirac and Kets Quantum Complexity Arbitrary State Generation QUANTUM ALGORITHMS Quantum Fourier Transform Variational-Quantum-Eigensolver Grovers Algorithm Shor's algorithm Hamiltonian Oracle Model Bernstein-Vazirani Algorithm Simon’s Algorithm Deutsch-Jozsa Algorithm Gradient Descent Phase Estimation Haar Tansform Quantum Ridgelet Transform Quantum NP Problem QUANTUM MACHINE LEARNING ALGORITHMS Quantum K-Nearest Neighbour Quantum K-Means Quantum Fuzzy C-Means Quantum Support Vector Machine Quantum Genetic Algorithm Quantum Hidden Morkov Models Quantum state classification with Bayesian methods Quantum Ant Colony Optimization Quantum Cellular Automata Quantum Classification using Principle Component Analysis Quantum Inspired Evolutionary Algorithm Quantum Approximate Optimization Algorithm Quantum Elephant Herding Optimization Quantum-behaved Particle Swarm Optimization Quantum Annealing Expectation-Maximization QAUNTUM NEURAL NETWORK Quantum perceptrons Qurons Quantum Auto Encoder Quantum Annealing Photonic Implementation of Quantum Neural Network Quantum Feed Forward Neural Network Quantum Boltzman Neural Network Quantum Neural Net Weight Storage Quantum Upside Down Neural Net Quantum Hamiltonian Neural Net QANN QPN SAL Quantum Hamiltonian Learning Compressed Quantum Hamiltonian Learning QAUNTUM STATISTICAL DATA ANALYSIS Quantum Probability Theory Kolmogorovian Theory Quantum Measurement Problem Intuitionistic Logic Heyting Algebra Quantum Filtering Paradoxes Quantum Stochastic Process Double Negation Quantum Stochastic Calculus Hamiltonian Calculus Quantum Ito's Formula Quantum Stochastic Differential Equations(QSDE) Quantum Stochastic Integration Itō Integral Quasiprobability Distributions Quantum Wiener Processes Quantum Statistical Ensemble Quantum Density Operator or Density Matrix Gibbs Canonical Ensemble Quantum Mean Quantum Variance Envariance Polynomial Optimization Quadratic Unconstrained Binary Optimization Quantum Gradient Descent Quantum Based Newton's Method for Constrained Optimization Quantum Based Newton's Method for UnConstrained Optimization Quantum Ensemble Quantum Topology Quantum Topological Data Analysis Quantum Bayesian Hypothesis Quantum Statistical Decision Theory Quantum Minimax Theorem Quantum Hunt-Stein Theorem Quantum Locally Asymptotic Normality Quantum Ising Model Quantum Metropolis Sampling Quantum Monte Carlo Approximation Quantum Bootstrapping Quantum Bootstrap Aggregation Quantum Decision Tree Classifier Quantum Outlier Detection Cholesky-Decomposition for Quantum Chemistry Quantum Statistical Inference Asymptotic Quantum Statistical Inference Quantum Gaussian Mixture Modal Quantum t-design Quantum Central Limit Theorem Quantum Hypothesis Testing Quantum Chi-squared and Goodness of Fit Testing Quantum Estimation Theory Quantum Way of Linear Regression Asymptotic Properties of Quantum Outlier Detection in Quantum Concepts QAUNTUM ARTIFICIAL INTELLIGENCE Heuristic Quantum Mechanics Consistent Quantum Reasoning Quantum Reinforcement Learning QAUNTUM COMPUTER VISION QUANTUM PROGRAMMING LANGUAGES , TOOLs and SOFTWARES ALL QUANTUM ALGORITHMS SOURCE CODES , GITHUBS QUANTUM HOT TOPICS Quantum Cognition Quantum Camera Quantum Mathematics Quantum Information Processing Quantum Image Processing Quantum Cryptography Quantum Elastic Search Quantum DNA Computing Adiabetic Quantum Computing Topological Big Data Anlytics using Quantum Hamiltonian Time Based Quantum Computing Deep Quantum Learning Quantum Tunneling Quantum Entanglment Quantum Eigen Spectrum Quantum Dots Quantum elctro dynamics Quantum teleportation Quantum Supremacy Quantum Zeno Effect Quantum Cohomology Quantum Chromodynamics Quantum Darwinism Quantum Coherence Quantum Decoherence Topological Quantum Computing Topological Quantum Field Theory Quantum Knots Topological Entanglment Boson Sampling Quantum Convolutional Code Stabilizer Code Quantum Chaos Quantum Game Theory Quantum Channel Tensor Space Theory Quantum Leap Quantum Mechanics for Time Travel Quantum Secured Block Chain Quantum Internet Quantum Optical Network Quantum Interference Quantum Optical Network Quantum Operating System Electron Fractionalization Flip-Flop Quantum Computer Quantum Information with Gaussian States Quantum Anomaly Detection Distributed Secure Quantum Machine Learning Decentralized Quantum Machine Learning Artificial Agents for Quantum Designs Light Based Quantum Chips for AI Training QUANTUM STATE PREPARATION ALGORITHM FOR MACHINE LEARNING Pure Quantum State Product State Matrix Product State Greenberger–Horne–Zeilinger State W state AKLT model Majumdar–Ghosh Model Multistate Landau–Zener Models Projected entangled-pair States Infinite Projected entangled-pair States Corner Transfer Matrix Method Tensor-entanglement Renormalization Tree Tensor Network for Supervised Learning QUANTUM MACHINE LEARNING VS DEEP LEARNING QUANTUM MEETUPS QUANTUM GOOGLE GROUPS QUANTUM BASED COMPANIES QUANTUM LINKEDLIN QUANTUM BASED DEGREES CONSOLIDATED QUANTUM ML BOOKS CONSOLIDATED QUANTUM ML VIDEOS CONSOLIDATED QUANTUM ML Reserach Papers CONSOLIDATED QUANTUM ML Reserach Scientist RECENT QUANTUM UPDATES FORUM ,PAGES AND NEWSLETTER INTRODUCTION Why Quantum Machine Learning? Machine Learning(ML) is just a term in recent days but the work effort start from 18th century. What is Machine Learning ? , In Simple word the answer is making the computer or application to learn themselves . So its totally related with computing fields like computer science and IT ? ,The answer is not true . ML is a common platform which is mingled in all the aspects of the life from agriculture to mechanics . Computing is a key component to use ML easily and effectively . To be more clear ,Who is the mother of ML ?, As no option Mathematics is the mother of ML . The world tremendous invention complex numbers given birth to this field . Applying mathematics to the real life problem always gives a solution . From Neural Network to the complex DNA is running under some specific mathematical formulas and theorems. As computing technology growing faster and faster mathematics entered into this field and makes the solution via computing to the real world . In the computing technology timeline once a certain achievements reached peoples interested to use advanced mathematical ideas such as complex numbers ,eigen etc and its the kick start for the ML field such as Artificial Neural Network ,DNA Computing etc. Now the main question, why this field is getting boomed now a days ? , From the business perspective , 8-10 Years before during the kick start time for ML ,the big barrier is to merge mathematics into computing field . people knows well in computing has no idea on mathematics and research mathematician has no idea on what is computing . The education as well as the Job Opportunities is like that in that time . Even if a person tried to study both then the business value for making a product be not good. Then the top product companies like Google ,IBM ,Microsoft decided to form a team with mathematician ,a physician and a computer science person to come up with various ideas in this field . Success of this team made some wonderful products and they started by providing cloud services using this product . Now we are in this stage. So what's next ? , As mathematics reached the level of time travel concepts but the computing is still running under classical mechanics . the companies understood, the computing field must have a change from classical to quantum, and they started working on the big Quantum computing field, and the market named this field as Quantum Information Science .The kick start is from Google and IBM with the Quantum Computing processor (D-Wave) for making Quantum Neural Network .The field of Quantum Computer Science and Quantum Information Science will do a big change in AI in the next 10 years. Waiting to see that........... .(google, ibm). References D-Wave - Owner of a quantum processor Google - Quantum AI Lab IBM - Quantum Computer Lab Quora - Question Regarding future of quantum AI NASA - NASA Quantum Works Youtube - Google Video of a Quantum Processor external-link - MIT Review microsoft new product - Newly Launched Microsoft Quantum Language and Development Kit microsoft - Microsoft Quantum Related Works Google2 - Google Quantum Machine Learning Blog BBC - About Google Quantum Supremacy,IBM Quantum Computer and Microsoft Q Google Quantum Supremacy - Latest 2019 Google Quantum Supremacy Achievement IBM Quantum Supremacy - IBM Talk on Quantum Supremacy as a Primer VICE on the fight - IBM Message on Google Quantum Supremacy IBM Zurich Quantum Safe Cryptography - An interesting startup to replace all our Certificate Authority Via Cloud and IBM Q BASICS What is Quantum Mechanics? In a single line study of an electron moved out of the atom then its classical mechanic ,vibrates inside the atom its quantum mechanics WIKIPEDIA - Basic History and outline LIVESCIENCE. - A survey YOUTUBE - Simple Animation Video Explanining Great. What is Quantum Computing? A way of parallel execution of multiple processess in a same time using qubit ,It reduces the computation time and size of the processor probably in neuro size WIKIPEDIA - Basic History and outline WEBOPEDIA. - A survey YOUTUBE - Simple Animation Video Explanining Great. Quantum Computing vs Classical Computing LINK - Basic outline Quantum Computing Atom Structure one line : Electron Orbiting around the nucleous in an eliptical format YOUTUBE - A nice animation video about the basic atom structure Photon Wave one line : Light nornmally called as wave transmitted as photons as similar as atoms in solid particles YOUTUBE - A nice animation video about the basic photon 1 YOUTUBE - A nice animation video about the basic photon 2 Electron Fluctuation or spin one line : When a laser light collide with solid particles the electrons of the atom will get spin between the orbitary layers of the atom ) YOUTUBE - A nice animation video about the basic Electron Spin 1 YOUTUBE - A nice animation video about the basic Electron Spin 2 YOUTUBE - A nice animation video about the basic Electron Spin 3 States one line : Put a point on the spinning electron ,if the point is in the top then state 1 and its in bottom state 0 YOUTUBE - A nice animation video about the Quantum States SuperPosition two line : During the spin of the electron the point may be in the middle of upper and lower position, So an effective decision needs to take on the point location either 0 or 1 . Better option to analyse it along with other electrons using probability and is called superposition YOUTUBE - A nice animation video about the Quantum Superposition SuperPosition specific for machine learning(Quantum Walks) one line : As due to computational complexity ,quantum computing only consider superposition between limited electrons ,In case to merge more than one set quantum walk be the idea YOUTUBE - A nice video about the Quantum Walks Classical Bits one line : If electron moved from one one atom to other ,from ground state to excited state a bit value 1 is used else bit value 0 used Qubit one line : The superposition value of states of a set of electrons is Qubit YOUTUBE - A nice video about the Quantum Bits 1 YOUTUBE - A nice video about the Bits and Qubits 2 Basic Gates in Quantum Computing one line : As like NOT, OR and AND , Basic Gates like NOT, Hadamard gate , SWAP, Phase shift etc can be made with quantum gates YOUTUBE - A nice video about the Quantum Gates Quantum Diode one line : Quantum Diodes using a different idea from normal diode, A bunch of laser photons trigger the electron to spin and the quantum magnetic flux will capture the information YOUTUBE - A nice video about the Quantum Diode Quantum Transistors one line : A transistor default have Source ,drain and gate ,Here source is photon wave ,drain is flux and gate is classical to quantum bits QUORA -Discussion about the Quantum Transistor YOUTUBE - Well Explained Quantum Processor one line : A nano integration circuit performing the quantum gates operation sorrounded by cooling units to reduce the tremendous amount of heat YOUTUBE - Well Explained Quantum Registery QRAM one line : Comapring the normal ram ,its ultrafast and very small in size ,the address location can be access using qubits superposition value ,for a very large memory set coherent superposition(address of address) be used PDF - very Well Explained QUANTUM COMPUTING MACHINE LEARNING BRIDGE Complex Numbers one line : Normally Waves Interference is in n dimensional structure , to find a polynomial equation n order curves ,better option is complex number YOUTUBE - Wonderful Series very super Explained Tensors one line : Vectors have a direction in 2D vector space ,If on a n dimensional vector space ,vectors direction can be specify with the tensor ,The best solution to find the superposition of a n vector electrons spin space is representing vectors as tensors and doing tensor calculus YOUTUBE - Wonderful super Explained tensors basics YOUTUBE - Quantum tensors basics Tensors Network one line : As like connecting multiple vectors ,multple tensors form a network ,solving such a network reduce the complexity of processing qubits YOUTUBE - Tensors Network Some ideas specifically for quantum algorithms QUANTUM MACHINE LEARNING ALGORITHMS Quantum K-Nearest Neighbour info : Here the centroid(euclidean distance) can be detected using the swap gates test between two states of the qubit , As KNN is regerssive loss can be tally using the average PDF1 from Microsoft - Theory Explanation PDF2 - A Good Material to understand the basics Matlab - Yet to come soon Python - Yet to come soon Quantum K-Means info : Two Approaches possible ,1. FFT and iFFT to make an oracle and calculate the means of superposition 2. Adiobtic Hamiltonian generation and solve the hamiltonian to determine the cluster PDF1 - Applying Quantum Kmeans on Images in a nice way PDF2 - Theory PDF3 - Explaining well the K-means clustering using hamiltonian Matlab - Yet to come soon Python - Yet to come soon Quantum Fuzzy C-Means info : As similar to kmeans fcm also using the oracle dialect ,but instead of means,here oracle optimization followed by a rotation gate is giving a good result PDF1 - Theory Matlab - Yet to come soon Python - Yet to come soon Quantum Support Vector Machine info : A little different from above as here kernel preparation is via classical and the whole training be in oracles and oracle will do the classification, As SVM is linear ,An optimal Error(Optimum of the Least Squares Dual Formulation) Based regression is needed to improve the performance PDF1 - Nice Explanation but little hard to understand :) PDF2 - Nice Application of QSVM Matlab - Yet to come soon Python - Yet to come soon Quantum Genetic Algorithm info : One of the best algorithm suited for Quantum Field ,Here the chromosomes act as qubit vectors ,the crossover part carrying by an evaluation and the mutation part carrying by the rotation of gates ![Flow Chart]() PDF1 - Very Beautiful Article , well explained and superp PDF2 - A big theory :) PDF3 - Super Comparison Matlab - Simulation Python1 - Simulation Python2 - Yet to come Quantum Hidden Morkov Models info : As HMM is already state based ,Here the quantum states acts as normal for the markov chain and the shift between states is using quantum operation based on probability distribution ![Flow Chart]() PDF1 - Nice idea and explanation PDF2 - Nice but a different concept little Matlab - Yet to come Python1 - Yet to come Python2 - Yet to come Quantum state classification with Bayesian methods info : Quantum Bayesian Network having the same states concept using quantum states,But here the states classification to make the training data as reusable is based on the density of the states(Interference) ![Bayesian Network Sample1]() ![Bayesian Network Sample2]() ![Bayesian Network Sample3]() PDF1 - Good Theory PDF2 - Good Explanation Matlab - Yet to come Python1 - Yet to come Python2 - Yet to come Quantum Ant Colony Optimization info : A good algorithm to process multi dimensional equations, ACO is best suited for Sales man issue , QACO is best suited for Sales man in three or more dimension, Here the quantum rotation circuit is doing the peromene update and qubits based colony communicating all around the colony in complex space ![Ant Colony Optimization 1]() PDF1 - Good Concept PDF2 - Good Application Matlab - Yet to come Python1 - Yet to come Python2 - Yet to come Quantum Cellular Automata info : One of the very complex algorithm with various types specifically used for polynomial equations and to design the optimistic gates for a problem, Here the lattice is formed using the quatum states and time calculation is based on the change of the state between two qubits ,Best suited for nano electronics ![Quantum Cellular Automata]() Wikipedia - Basic PDF1 - Just to get the keywords PDF2 - Nice Explanation and an easily understandable application Matlab - Yet to come Python1 - Yet to come Python2 - Yet to come QAUNTUM NEURAL NETWORK one line : Its really one of the hardest topic , To understand easily ,Normal Neural Network is doing parallel procss ,QNN is doing parallel of parallel processess ,In theory combination of various activation functions is possible in QNN ,In Normal NN more than one activation function reduce the performance and increase the complexity Quantum perceptrons info : Perceptron(layer) is the basic unit in Neural Network ,The quantum version of perceptron must satisfy both linear and non linear problems , Quantum Concepts is combination of linear(calculus of superposition) and nonlinear(State approximation using probability) ,To make a perceptron in quantum world ,Transformation(activation function) of non linearity to certain limit is needed ,which is carrying by phase estimation algorithm ![Quantum Perceptron 3]() PDF1 - Good Theory PDF2 - Good Explanation Matlab - Yet to come Python1 - Yet to come Python2 - Yet to come QAUNTUM STATISTICAL DATA ANALYSIS one line : An under research concept ,It can be seen in multiple ways, one best way if you want to apply n derivative for a problem in current classical theory its difficult to compute as its serialization problem instead if you do parallelization of differentiation you must estimate via probability the value in all flows ,Quantum Probability Helps to achieve this ,as the loss calculation is very less . the other way comparatively booming is Quantum Bayesianism, its a solution to solve most of the uncertainity problem in statistics to combine time and space in highly advanced physical research QUANTUM PROGRAMMING LANGUAGES , TOOLs and SOFTWARES All info : All Programming languages ,softwares and tools in alphabetical order Software - Nice content of all Python library - A python library Matlab based python library - Matlab Python Library Quantum Tensor Network Github - Tensor Network Bayesforge - A Beautiful Amazon Web Service Enabled Framework for Quantum Alogorithms and Data Analytics Rigetti - A best tools repository to use quantum computer in real time Rigetti Forest - An API to connect Quantum Computer quil/pyQuil - A quantum instruction language to use forest framework Grove - Grove is a repository to showcase quantum Fourier transform, phase estimation, the quantum approximate optimization algorithm, and others developed using Forest QISKit - A IBM Kit to access quantum computer and mainly for quantum circuits IBM Bluemix Simulator - A Bluemix Simulator for Quantum Circuits Microsoft Quantum Development Kit - Microsoft Visual Studio Enbaled Kit for Quantum Circuit Creation Microsoft "Q#" - Microsoft Q Sharp a new Programming Language for Quantum Circuit Creation qiskit api python - An API to connect IBM Quantum Computer ,With the generated token its easy to connect ,but very limited utils ,Lot of new utils will come soon Cyclops Tensor Framework - A framework to do tensor network simulations Python ToolKit for chemistry and physics Quantum Algorithm simulations - A New Started Project for simulating molecule and solids Bayesian Based Quatum Projects Repository - A nice repository and the kickstarter of bayesforge Google Fermion Products - A newly launched product specifivally for chemistry simulation Tree Tensor Networks - Interesting Tensor Network in Incubator Deep Tensor Neural Network - Some useful information about Tensor Neural Network in Incubator Generative Tensorial Networks - A startup to apply machine learning via tensor network for drug discovery Google Bristlecone - A new Quantum Processor from Google , Aimed for Future Hardwares with full fledged AI support XANADU - A Light based Quantum Hardware(chips supports) and Software Company Started in Preparation Stage. Soon will be in market fathom computing - A new concept to train the ai in a processor using light and quantum based concepts. soon products will be launch Alibaba Quantum Computing Cloud Service - Cloud Service to access 11 Bit Quantum Computing Processor Atomistic Machine Learning Project - Seems something Interesting with Deep Tensor Network for Quantum Chemistry Applications circQ and Google Works - Google Top Efforts on Tools IBM Safe Cryptography on Cloud - IBM Started and Developing a Quantm Safe Cryptography to replace all our Certificate Authority via Cloud Google Tensor Network Open Source - Google Started the Most Scientist Preferred Way To Use a Quantum Computer Circuit. Tensor Flow Which Makes Easy to Design the Network and Will Leave the Work Effect Of Gates, Processor Preparation and also going to tell the beauty of Maths Google Tensor Network Github - Github Project of Google Tensor Network Quantum Tensorflow - Yet to come soon Quantum Spark - Yet to come soon Quatum Map Reduce - Yet to come soon Quantum Database - Yet to come soon Quantum Server - Yet to come soon Quantum Data Analytics - Yet to come soon QUANTUM HOT TOPICS Deep Quantum Learning why and what is deep learning? In one line , If you know deep learning you can get a good job :) ,Even a different platform undergraduated and graduated person done a master specialization in deep learning can work in this big sector :), Practically speaking machine learning (vector mathematics) , deep learning (vector space(Graphics) mathematics) and big data are the terms created by big companies to make a trend in the market ,but in science and research there is no word such that , Now a days if you ask a junior person working in this big companies ,what is deep learning ,you will get some reply as "doing linear regression with stochastic gradient for a unsupervised data using Convolutional Neural Network :)" ,They knows the words clearly and knows how to do programming using that on a bunch of "relative data" , If you ask them about the FCM , SVM and HMM etc algorithms ,they will simply say these are olden days algorithms , deep learning replaced all :), But actually they dont know from the birth to the till level and the effectiveness of algorithms and mathematics ,How many mathematical theorems in vector, spaces , tensors etc solved to find this "hiding the complexity technology", They did not played with real non relative data like medical images, astro images , geology images etc , finding a relation and features is really complex and looping over n number of images to do pattern matching is a giant work , Now a days the items mentioned as deep learning (= multiple hidden artifical neural network) is not suitable for that why quantum deep learning or deep quantum learning? In the mid of Artificial Neural Network Research people realised at the maximum extreme only certain mathematical operations possible to do with ANN and the aim of this ANN is to achieve parallel execution of many mathematical operations , In artificial Intelligence ,the world intelligence stands for mathematics ,how effective if a probem can be solvable is based on the mathematics logic applying on the problem , more the logic will give more performance(more intelligent), This goal open the gate for quantum artificial neural network, On applying the ideas behind the deep learning to quantum mechanics environment, its possible to apply complex mathematical equations to n number of non relational data to find more features and can improve the performance Quantum Machine Learning vs Deep Learning Its fun to discuss about this , In recent days most of the employees from Product Based Companies Like google,microsoft etc using the word deep learning ,What actually Deep Learning ? and is it a new inventions ? how to learn this ? Is it replacing machine learning ? these question come to the mind of junior research scholars and mid level employees The one answer to all questions is deep learning = parallel "for" loops ,No more than that ,Its an effective way of executing multiple tasks repeatly and to reduce the computation cost, But it introduce a big cap between mathematics and computerscience , How ? All classical algorithms based on serial processing ,Its depends on the feedback of the first loop ,On applying a serial classical algorithm in multiple clusters wont give a good result ,but some light weight parallel classical algorithms(Deep learning) doing the job in multiple clusters and its not suitable for complex problems, What is the solution for then? As in the title Quantum Machine Learning ,The advantage behind is deep learning is doing the batch processing simply on the data ,but quantum machine learning designed to do batch processing as per the algorithm The product companies realised this one and they started migrating to quantum machine learning and executing the classical algorithms on quantum concept gives better result than deep learning algorithms on classical computer and the target to merge both to give very wonderful result References Quora - Good Discussion Quora - The Bridge Discussion Pdf - Nice Discussion Google - Google Research Discussion Microsoft - Microsoft plan to merge both IBM - IBM plan to merge both IBM Project - IBM Project idea MIT and Google - Solutions for all questions QUANTUM MEETUPS Meetup 1 - Quantum Physics Meetup 2 - Quantum Computing London Meetup 3 - Quantum Computing New York Meetup 4 - Quantum Computing Canada Meetup 5 - Quantum Artificial Intelligence Texas Meetup 6 - Genarl Quantum Mechanics , Mathematics New York Meetup 7 - Quantum Computing Mountain View California Meetup 8 - Statistical Analysis New York Meetup 9 - Quantum Mechanics London UK Meetup 10 - Quantum Physics Sydney Australia Meetup 11 - Quantum Physics Berkeley CA Meetup 12 - Quantum Computing London UK Meetup 13 - Quantum Mechanics Carmichael CA Meetup 14 - Maths and Science Group Portland Meetup 15 - Quantum Physics Santa Monica, CA Meetup 16 - Quantum Mechanics London Meetup 17 - Quantum Computing London Meetup 18 - Quantum Meta Physics ,Kansas City , Missouri ,US Meetup 19 - Quantum Mechanics and Physics ,Boston ,Massachusetts ,US Meetup 20 - Quantum Physics and Mechanics ,San Francisco ,California Meetup 21 - Quantum Mechanics ,Langhorne, Pennsylvania Meetup 22 - Quantum Mechanics ,Portland QUANTUM BASED DEGREES Plenty of courses around the world and many Universities Launching it day by day ,Instead of covering only Quantum ML , Covering all Quantum Related topics gives more idea in the order below Available Courses Quantum Mechanics for Science and Engineers Online Standford university - Nice Preparatory Course edx - Quantum Mechanics for Everyone NPTEL 1 - Nice Series of Courses to understand basics and backbone of quantum mechanics NPTEL 2 NPTEL 3 NPTEL 4 NPTEL 5 Class Based Course UK Bristol Australia Australian National University Europe Maxs Planks University Quantum Physics Online MIT - Super Explanation and well basics NPTEL - Nice Series of Courses to understand basics and backbone of quantum Physics Class Based Course Europe University of Copenhagen Quantum Chemistry Online NPTEL 1 - Nice Series of Courses to understand basics and backbone of quantum Chemistry NPTEL 2 - Class Based Course Europe UGent Belgium Quantum Computing Online MIT - Super Explanation and well basics edx - Nice Explanation NPTEL - Nice Series of Courses to understand basics and backbone of quantum Computing Class Based Course Canada uwaterloo Singapore National University Singapore USA Berkley China Baidu Quantum Technology Class Based Course Canada uwaterloo Singapore National University Singapore Europe Munich Russia Skoltech Quantum Information Science External Links quantwiki Online MIT - Super Explanation and well basics edx - Nice Explanation NPTEL - Nice Series of Courses to understand basics and backbone of quantum information and computing Class Based Course USA MIT Standford University Joint Center for Quantum Information and Computer Science - University of Maryland Canada Perimeter Institute Singapore National University Singapore Europe ULB Belgium IQOQI Quantum Electronics Online MIT - Wonderful Course NPTEL - Nice Series of Courses to understand basics and backbone of quantum Electronics Class Based Course USA Texas Europe Zurich ICFO Asia Tata Institute Quantum Field Theory Online Standford university - Nice Preparatory Course edx - Some QFT Concepts available Class Based Course UK Imperial Europe Vrije Quantum Computer Science Class Based Course USA Oxford Joint Center for Quantum Information and Computer Science - University of Maryland Quantum Artificial Intelligence and Machine Learning External Links Quora 1 Quora 1 Artificial Agents Research for Quantum Designs Quantum Mathematics Class Based Course USA University of Notre CONSOLIDATED Quantum Research Papers scirate - Plenty of Quantum Research Papers Available Peter Wittek - Famous Researcher for the Quantum Machine Leanrning , Published a book in this topic [Murphy Yuezhen Niu] (https://scholar.google.com/citations?user=0wJPxfkAAAAJ&hl=en) - A good researcher published some nice articles Recent Quantum Updates forum ,pages and newsletter Quantum-Tech - A Beautiful Newsletter Page Publishing Amazing Links facebook Quantum Machine Learning - Running By me . Not that much good :). You can get some ideas Linkedlin Quantum Machine Learning - A nice page running by experts. Can get plenty of ideas FOSDEM 2019 Quantum Talks - A one day talk in fosdem 2019 with more than 10 research topics,tools and ideas FOSDEM 2020 Quantum Talks - Live talk in fosdem 2020 with plenty new research topics,tools and ideas License Dedicated Opensources ![Dedicated Opensources]() Source code of plenty of Algortihms in Image Processing , Data Mining ,etc in Matlab, Python ,Java and VC++ Scripts Good Explanations of Plenty of algorithms with flow chart etc Comparison Matrix of plenty of algorithms Is Quantum Machine Learning Will Reveal the Secret Maths behind Astrology? Awesome Machine Learning and Deep Learning Mathematics is online Published Basic Presentation of the series Quantum Machine Learning Contribution If you think this page might helpful. Please help for World Education Charity or kids who wants to learn

Ultimate-Data-Science-Toolkit---From-Python-Basics-to-GenerativeAI
github
LLM Vibe Score0.555
Human Vibe Score0.3470230117125603
bansalkanavMar 27, 2025

Ultimate-Data-Science-Toolkit---From-Python-Basics-to-GenerativeAI

Getting started with Machine Learning and Deep Learning Star this repo if you find it useful :star: Module 1 - Python Programming | Topic Name | What's Covered | | :---: | :---: | | Intro to Python | Applications and Features of Python, Hello World Program, Identifiers and Rules to define identifiers, Data Types (numeric, boolean, strings, list, tuple, set and dict), Comments, Input and Output, Operators - Arithmatic, Reltaional, Equality, Logical, Bitwise, Assignment, Ternary, Identity and Membership | | Data Structures in Python (Strings, List, Tuple, Set, Dictionary) | Strings - Creating a string, Indexing, Slicing, Split, Join, etc, List - Initialization, Indexing, Slicing, Sorting, Appending, etc, Tuple - Initialization, Indexing, Slicing, Count, Index, etc, Set - Initialization, Unordered Sequence, Set Opertaions, etc, Dictionary - Initialization, Updating, Keys, Values, Items, etc | | Control Statements (Conditionals and Loops) | Conditional Statements - Introducing Indentation, if statement, if...else statement, if..elif...else statement, Nested if else statement, Loops - while loops, while...else loop, Membership operator, for loop, for...else loop, Nested Loops, Break and Continue Statement, Why else? | | Functions and Modules | Functions - Introduction to Python Functions, Function Definition and Calling, Functions with Arguments/Parameters, Return Statement, Scope of a Variable, Global Variables, Modules - Introduction to Modules, Importing a Module, Aliasing, from...import statement, import everything, Some important modules - math, platform, random, webbrowser, etc | | Object Oriented Programming | Classes and Objects - Creating a class, Instantiating an Object, Constructor, Class Members - Variables and Mentods, Types of Variables - Instance, Static and Local Variables, Types of Methods - Instance, Class and Static Methods, Access Modifiers - Public, Private and Protected, Pillars of Object Oriented Programming - Inheritance, Polymorphism, Abstraction and Encapsulation, Setters and Getters, Inheritance vs Association | | Exception Handling | Errors vs Exception, Syntax and Indentation Errors, try...except block, Control Flow in try...except block, try with multiple except, finally block, try...except...else, Nested try...except...finally, User Defined Exception | | File Handling | Introduction to File Handling, Opening and Closing a File, File Object Properties, Read Data from Text Files, Write Data to Text Files, with statement, Renaming and Deleting Files | | Web API | Application Programming Interface, Indian Space Station API, API Request, Status Code, Query Parameters, Getting JSON from an API Request, Working with JSON - dump and load, Working with Twitter API | | Databases | Introduction to Databases, SQLite3 - Connecting Python with SQLite3, Performing CRUD Opertations, MySQL - Connecting Python with MySQL, Performing CRUD Opertations, MongoDB - Connecting Python with MongoDB, Performing CRUD Opertations, Object Relation Mapping - SQLAlchemy ORM, CRUD operations and Complex DB operations | | List Comprehension, Lambda, Filter, Map, Reduce) | List Comprehension, Anonymous Functions, Filter, Map, Reduce, Function Aliasing | | Problem Solving for Interviews | Swapping two numbers, Factorial of a number, Prime Number, Fibbonnacci Sequence, Armstrong Number, Palindrome Number, etc | Module 2 - Python for Data Analysis | Topic Name | What's Covered | | :---: | :---: | | Data Analytics Framework | Data Collection, Business Understanding, Exploratory Data Analysis, Data Preparation, Model Building, Model Evaluation, Deployment, Understanding Cross Industry Standard Process for Data Mining (CRISP-DM) and Microsoft's Team Data Science Process (TDSP) | | Numpy | Array Oriented Numerical Computations using Numpy, Creating a Numpy Array, Basic Operations on Numpy Array - Check Dimensions, Shape, Datatypes and ItemSize, Why Numpy, Various ways to create Numpy Array, Numpy arange() function, Numpy Random Module - rand(), randn(), randint(), uniform(), etc, Indexing and Slicing in Numpy Arrays, Applying Mathematical Operations on Numpy Array - add(), subtract(), multiply(), divide(), dot(), matmul(), sum(), log(), exp(), etc, Statistical Operations on Numpy Array - min(), max(), mean(), median(), var(), std(), corrcoef(), etc, Reshaping a Numpy Array, Miscellaneous Topics - Linspace, Sorting, Stacking, Concatenation, Append, Where and Numpy Broadcasting | | Pandas for Beginners | Pandas Data Structures - Series, Dataframe and Panel, Creating a Series, Data Access, Creating a Dataframe using Tuples and Dictionaries, DataFrame Attributes - columns, shape, dtypes, axes, values, etc, DataFrame Methods - head(), tail(), info(), describe(), Working with .csv and .xlsx - readcsv() and readexcel(), DataFrame to .csv and .xlsx - tocsv() and toexcel() | | Advance Pandas Operations | What's Covered | | Case Study - Pandas Manipulation | What's Covered | | Missing Value Treatment | What's Covered | | Visuallization Basics - Matplotlib and Seaborn | What's Covered | | Case Study - Covid19TimeSeries | What's Covered | | Plotly and Express | What's Covered | | Outliers - Coming Soon | What's Covered | Module 3 - Statistics for Data Analysis | Topic Name | What's Covered | | :---: | :---: | | Normal Distribution | What's Covered | | Central Limit Theorem | What's Covered | | Hypothesis Testing | What's Covered | | Chi Square Testing | What's Covered | | Performing Statistical Test | What's Covered | Module 4 - Machine Learning Data Preparation and Modelling with SKLearn Working with Text Data Working with Image Data Supervised ML Algorithms K - Nearest Neighbours Linear Regression Logistic Regression Gradient Descent Decision Trees Support Vector Machines Models with Feature Engineering Hyperparameter Tuning Ensembles Unsupervised ML Algorithms Clustering Principal Component Analysis Module 5 - MLOPs | Topic Name | What's Covered | | :---: | :---: | | Model Serialization and Deserialization | What's Covered | | Application Integration | What's Covered | | MLFlow - Experiment Tracking and Model Management | What's Covered | | Prefect - Orchestrate ML Pipeline | What's Covered | Module 6 - Case Studies | Topic Name | What's Covered | | :---: | :---: | | Car Price Prediction (Regression) | What's Covered | | Airline Sentiment Analysis (NLP - Classification) | What's Covered | | Adult Income Prediction (Classification) | What's Covered | | Web App Development + Serialization and Deserialization | What's Covered | | AWS Deployment | What's Covered | | Streamlit Heroku Deployment | What's Covered | | Customer Segmentation | What's Covered | | Web Scrapping | What's Covered | Module 7 - Deep Learning | Topic Name | What's Covered | | :---: | :---: | | Introduction to Deep Learning | What's Covered | | Training a Deep Neural Network + TensorFlow.Keras | What's Covered | | Convolutional Neural Network + TensorFlow.Keras | What's Covered | | Auto Encoders for Image Compression) | What's Covered | | Recurrent Neural Network (Coming Soon) | What's Covered |

OpenAI-CLIP
github
LLM Vibe Score0.507
Human Vibe Score0.015912940499642817
moein-shariatniaMar 27, 2025

OpenAI-CLIP

Update (December 2023) I am happy to find out that this code has been used and cited in the following papers: Domino: Discovering Systematic Errors with Cross-Modal Embeddings by Eyuboglu et. al. at ICLR 2022 GSCLIP : A Framework for Explaining Distribution Shifts in Natural Language by Zhu et. al. at ICML 2022 UIC-NLP at SemEval-2022 Task 5: Exploring Contrastive Learning for Multimodal Detection of Misogynistic Memes by Cuervo et. al. at SemEval-2022 cdsBERT - Extending Protein Language Models with Codon Awareness by Hallee et. al. from University of Delaware (Sep 2023) ENIGMA-51: Towards a Fine-Grained Understanding of Human-Object Interactions in Industrial Scenarios by Ragusa et. al. (Nov 2023) You can find the citation info on the right section of this GitHub repo page named: Cite this repository or use the below citation info. Introduction It was in January of 2021 that OpenAI announced two new models: DALL-E and CLIP, both multi-modality models connecting texts and images in some way. In this article we are going to implement CLIP model from scratch in PyTorch. OpenAI has open-sourced some of the code relating to CLIP model but I found it intimidating and it was far from something short and simple. I also came across a good tutorial inspired by CLIP model on Keras code examples and I translated some parts of it into PyTorch to build this tutorial totally with our beloved PyTorch! What does CLIP do? Why is it fun? In Learning Transferable Visual Models From Natural Language Supervision paper, OpenAI introduces their new model which is called CLIP, for Contrastive Language-Image Pre-training. In a nutshell, this model learns the relationship between a whole sentence and the image it describes; in a sense that when the model is trained, given an input sentence it will be able to retrieve the most related images corresponding to that sentence. The important thing here is that it is trained on full sentences instead of single classes like car, dog, etc. The intuition is that when trained on whole sentences, the model can learn a lot more things and finds some pattern between images and texts. They also show that when this model is trained on a huge dataset of images and their corresponding texts, it can also act as a classifier too. I encourage you to study the paper to learn more about this exciting model and their astonishing results on benchmarking datasets . To mention just one, CLIP model trained with this strategy classifies ImageNet better than those SOTA models trained on the ImageNet itself optimized for the only task of classification! As a teaser (!), let's see what the final model that we will build in this article from scratch is capable of: given a query (raw text) like "a boy jumping with skateboard" or "a girl jumping from swing", the model will retrieve the most relevant images: !title_img Let's see some more outputs: Config A note on config and CFG: I wrote the codes with python scripts and then converted it into a Jupyter Notebook. So, in case of python scripts, config is a normal python file where I put all the hyperparameters and in the case of Jupyter Notebook, its a class defined in the beginning of the notebook to keep all the hyperparameters. Utils Dataset As you can see in the tittle image of this article, we need to encode both images and their describing texts. So, the dataset needs to return both images and texts. Of course we are not going to feed raw text to our text encoder! We will use DistilBERT model (which is smaller than BERT but performs nearly as well as BERT) from HuggingFace library as our text encoder; so, we need to tokenize the sentences (captions) with DistilBERT tokenizer and then feed the token ids (input_ids) and the attention masks to DistilBERT. Therefore, the dataset needs to take care of the tokenization as well. Below you can see the dataset's code. Below that I'll explain the most important things that is happening in the code. In the \\init\\ we receive a tokenizer object which is actually a HuggingFace tokinzer; this tokenizer will be loaded when running the model. We are padding and truncating the captions to a specified maxlength. In the \\getitem\\ we will first load an encoded caption which is a dictionary with keys inputids and attention_mask, make tensors out of its values and after that we will load the corresponding image, transform and augment it (if there is any!) and then we make it a tensor and put it in the dictionary with "image" as the key. Finally we put the raw text of the caption with the key "caption" in the dictionary only for visualization purposes. I did not use additional data augmentations but you can add them if you want to improve the model's performance. Image Encoder The image encoder code is straight forward. I'm using PyTorch Image Models library (timm) here which makes a lot of different image models available from ResNets to EfficientNets and many more. Here we will use a ResNet50 as our image encoder. You can easily use torchvision library to use ResNets if you don't want to install a new library. The code encodes each image to a fixed size vector with the size of the model's output channels (in case of ResNet50 the vector size will be 2048). This is the output after the nn.AdaptiveAvgPool2d() layer. Text Encoder As I mentioned before, I'll use DistilBERT as the text encoder. Like its bigger brother BERT, two special tokens will be added to the actual input tokens: CLS and SEP which mark the start and end of a sentence. To grab the whole representation of a sentence (as the related BERT and DistilBERT papers point out) we use the final representations of the CLS token and we hope that this representation captures the overall meaning of the sentence (caption). Thinking it in this way, it is similar to what we did to images and converted them into a fixed size vector. In the case of DistilBERT (and also BERT) the output hidden representation for each token is a vector with size 768. So, the whole caption will be encoded in the CLS token representation whose size is 768. Projection Head I used Keras code example implementation of projection head to write the following in PyTorch. Now that we have encoded both our images and texts into fixed size vectors (2048 for image and 768 for text) we need to bring (project) them into a new world (!) with similar dimensions for both images and texts in order to be able to compare them and push apart the non-relevant image and texts and pull together those that match. So, the following code will bring the 2048 and 768 dimensional vectors into a 256 (projection_dim) dimensional world, where we can compare them. "embeddingdim" is the size of the input vector (2048 for images and 768 for texts) and "projectiondim" is the the size of the output vector which will be 256 for our case. For understanding the details of this part you can refer to the CLIP paper. CLIP This part is where all the fun happens! I'll also talk about the loss function here. I translated some of the code from Keras code examples into PyTorch for writing this part. Take a look at the code and then read the explanation below this code block. Here we will use the previous modules that we built to implement the main model. The \\init\\ function is self-explanatory. In the forward function, we first encode the images and texts separately into fixed size vectors (with different dimensionalities). After that, using separate projection modules we project them to that shared world (space) that I talked about previously. Here the encodings will become of similar shape (256 in our case). After that we will compute the loss. Again I recommend reading CLIP paper to get it better but I'll try my best to explain this part. In Linear Algebra, one common way to measure if two vectors are of similar characteristics (they are like each other) is to calculate their dot product (multiplying the matching entries and take the sum of them); if the final number is big, they are alike and if it is small they are not (relatively speaking)! Okay! What I just said is the most important thing to have in mind to understand this loss function. Let's continue. We talked about two vectors, but, what do we have here? We have imageembeddings, a matrix with shape (batchsize, 256) and textembeddings with shape (batchsize, 256). Easy enough! it means we have two groups of vectors instead of two single vectors. How do we measure how similar two groups of vectors (two matrices) are to each other? Again, with dot product (@ operator in PyTorch does the dot product or matrix multiplication in this case). To be able to multiply these two matrices together, we transpose the second one. Okay, we get a matrix with shape (batchsize, batchsize) which we will call logits. (temperature is equal to 1.0 in our case, so, it does not make a difference. You can play with it and see what difference it makes. Also look at the paper to see why it is here!). I hope you are still with me! If not it's okay, just review the code and check their shapes. Now that we have our logits, we need targets. I need to say that there is a more straight forward way to obtain targets but I had to do this for our case (I'll talk about why in a next paragraph). Let's consider what we hope that this model learns: we want it to learn "similar representations (vectors)" for a given image and the caption describing it. Meaning that either we give it an image or the text describing it, we want it to produce same 256 sized vectors for both. Check the cell below this code block for the continue of the explanations So, in the best case scenario, textembeddings and imageembedding matricies should be the same because they are describing similar things. Let's think now: if this happens, what would the logits matrix be like? Let's see with a simple example! So logits, in the best case, will be a matrix that if we take its softmax, will have 1.0s in the diagonal (An identity matrix to call it with fancy words!). As the loss function's job is to make model's predictions similar to targets (at least in most cases!), we want such a matrix as our target. That's the reason why we are calculating imagessimilarity and textssimilarity matrices in the code block above. Now that we've got our targets matrix, we will use simple cross entropy to calculate the actual loss. I've written the full matrix form of cross entropy as a function which you can see in the bottom of the code block. Okay! We are done! Wasn't it simple?! Alright, you can ignore the next paragraph but if you are curious, there is an important note in that. Here's why I didn't use a simpler approach: I need to admit that there's a simpler way to calculate this loss in PyTorch; by doing this: nn.CrossEntropyLoss()(logits, torch.arange(batch_size)). Why I did not use it here? For 2 reasons. 1- The dataset we are using has multiple captions for a single image; so, there is the possibility that two identical images with their similar captions exist in a batch (it is rare but it can happen). Taking the loss with this easier method will ignore this possibility and the model learns to pull apart two representations (assume them different) that are actually the same. Obviously, we don't want this to happen so I calculated the whole target matrix in a way that takes care of these edge cases. 2- Doing it the way I did, gave me a better understanding of what is happening in this loss function; so, I thought it would give you a better intuition as well! Train Here are some funtions to help us load train and valid dataloaders, our model and then train and evaluate our model on those. There's not much going on here; just simple training loop and utility functions Here's a handy function to train our model. There's not much happening here; just loading the batches, feeding them to the model and stepping the optimizer and lr_scheduler. Running the next cell start training the model. Put the kernel on GPU mode. Every epoch should take about 24 minutes on GPU (even one epoch is enough!). It can take one minute before training actually starts because we are going to encode all the captions once in the train and valid dataset, so please don't stop it! Every thing is working fine. Inference Okay! We are done with training the model. Now, we need to do inference which in our case will be giving the model a piece of text and want it to retrieve the most relevant images from an unseen validation (or test) set. Getting Image Embeddings In this function, we are loading the model that we saved after training, feeding it images in validation set and returning the imageembeddings with shape (validset_size, 256) and the model itself. Finding Matches This function does the final task that we wished our model would be capable of: it gets the model, image_embeddings, and a text query. It will display the most relevant images from the validation set! Isn't it amazing? Let's see how it performs after all! This is how we use this function. Aaaannnndddd the results: Final words I hope you have enjoyed this article. Implementing this paper was a really interesting experience for me. I want to thank Khalid Salama for the great Keras code example he provided which inspired me to write something similar in PyTorch.

machine-learning-blackjack-solution
github
LLM Vibe Score0.42
Human Vibe Score0.022610872675250356
GregSommervilleMar 27, 2025

machine-learning-blackjack-solution

machine-learning-blackjack-solution Introduction A genetic algorithm is a type of artificial intelligence programming that uses ideas from evolution to solve complex problems. It works by creating a population of (initially random) candidate solutions, then repeatedly selecting pairs of candidates and combining their solutions using a process similar to genetic crossover. Sometimes candidate solutions even go through mutation, just to introduce new possibilities into the population. After a large number of generations, the best solution found up to that point is often the optimal, best solution possible. Genetic algorithms are particularly well-suited for combinatorial problems, where there are huge numbers of potential solutions to a problem. The evolutionary process they go through is, in essence, a search through a huge solution space. A solution space so large that you simply could never use a brute force approach. This project is a demonstration of using a genetic algorithm to find an optimal strategy for playing the casino game Blackjack. Please see this article for a story about how this program was used, and what the results were. The article describes some of the available settings, and shows how different values for those settings affect the final result. The source code is for a Windows application written in Cthat allows you to play with different settings like population size, selection style and mutation rate. Each generation's best solution is displayed, so you can watch the program literally evolve a solution. !blackjack strategy tester screenshot The property grid located at the upper left of the screen is where you adjust settings. There's an informational area below that, and the right side of the screen is the display area for the three tables that represent a strategy for playing Blackjack. The tall table on the left is for hard hands, the table in the upper right is for soft hands, and the table in the lower right is for pairs. We'll talk more about how to interpret this strategy in a bit. The columns along the tops of the three tables are for the dealer upcard. When you play Blackjack the dealer has one of his two cards initially turned face up, and the rank of that card has a big impact on recommended strategy. Notice that the upcard ranks don't include Jack, Queen or King. That's because those cards all count 10, so we group them and the Ten together and simplify the tables. To use the tables, first, determine if you have a pair, soft hand, or hard hand. Then look in the appropriate table, with the correct dealer upcard column. The cell in the table will be "H" when the correct strategy is to hit, "S" when the correct strategy is to stand, "D" for double-down, and (in the pairs table only) "P" for split. A Word About This "Optimal" Strategy Before we go any further, it needs to be stated that this problem of finding an optimal Blackjack strategy has already been solved. Back in the 1960s, a mathematician named Edward O. Thorp authored a book called Beat the Dealer, which included charts showing the optimal "Basic" strategy. That strategy looks like this: !optimal blackjack strategy So we're solving a problem that has already been solved, but that's actually good. That means we can compare our results to the known best solution. For example, if our result strategy tells us to do anything but stand when holding a pair of Tens, Jacks, Queens or Kings, we know there's a problem. There's one other thing to get out of the way before we go any further, and that's the idea of nondeterministic code. That means that if we run the same code twice in a row, we're likely to get two different results. That's something that happens with genetic algorithms due to their inherent randomness. There's no guarantee you'll find the absolute optimal solution, but it is assured that you will find an optimal or near-optimal solution. It's something that isn't typical when writing code, so it takes some adjustment for most programmers. Genetic Algorithms Now let's talk about the details of a genetic algorithm. Fitness Scores First of all, we need a way to evaluate candidates so we can compare them to each other. That means a numeric fitness score, which in this case is quite simple: you simulate playing a certain number of hands using the strategy, and then count the number of chips you have at the end. The big question is, how many hands should we test with? The challenge of trying to test a strategy is that due to the innate randomness of Blackjack, you could use the same strategy ten times and get ten completely different results. Obviously, the more hands you play, the more the randomness gets smoothed out, and the quality of the underlying strategy starts to emerge. If you doubt this, just think about flipping a coin. If you only flip it five times, there's certainly a possibility that it'll come up heads all five times (in fact, that happens just over 3% of the time). However, if you flip it 500 times, there's no way it's going to end up all heads - the odds of it happening are 0.5500, which works out to be roughly once every 3 x 10150 times you try it. After some testing and analysis, it was determined that a minimum of 100,000 hands per test is needed for a reasonable level of accuracy. There's still variance even at that number, but in order to cut the variance in half, you'd need to bump the number of hands to 500,000. One reason this accuracy is important is that in the later generations, the differences between candidates are very small. Evolution has caused the main parts of the strategy to converge on a particular approach, and towards the end all it's doing is refining the minor details. In those cases it's important to accurately determine the difference between two similar candidates. Representation Representation is simply the idea that we need to use a data structure for a candidate solution that can be combined via crossover, and possibly mutated. In this case, that's also quite simple because the way that human beings represent a Blackjack strategy is to use three tables, as we've seen. Representing those in code with three two-dimensional arrays is the obvious approach. Each cell in those three tables will have "Hit", "Stand", "Double-Down", or (only for pairs) "Split". By the way, since there are 160 cells in the hard hands table, and 80 cells in the soft hands table, and 100 cells in the pairs table, we can calculate exactly how many possible distinct strategies there are for Blackjack: 4100 x 380 x 3160 = 5 x 10174 possible Blackjack strategies That's a big number, which is obviously impossible to search using brute force. Genetic algorithms (GAs) are extremely helpful when trying to find an optimal solution from a very large set of possible solutions like this. Blackjack Rules and Strategies The rules of Blackjack are fairly simple. The dealer and the player both are dealt two cards. The player sees both of their cards (they are usually dealt face up), and one of the dealer's cards is dealt face up. Each card has a value - for cards between 2 and 10, the value is the same as the card's rank (so an Eight of Spades counts as 8, for example). All face cards count as 10, and an Ace can either be 1 or 11 (it counts as 11 only when that does not result in a hand that exceeds 21). The suit of a card does not matter. After the cards are dealt, if the player has Blackjack (a total of 21) and the dealer does not, the player is immediately paid 1.5 times their original bet, and a new hand is dealt. If the player has 21 and the dealer does also, then it's a tie and the player gets their original bet back, and a new hand is dealt. If the player wasn't dealt a Blackjack, then play continues with the player deciding whether to Stand (not get any more cards), Hit (receive an additional card), Double-down (place an additional bet, and receive one and only one more card), or, in the case of holding a pair, splitting the hand, which means placing an additional bet and receiving two new cards, so the end result is that the player is now playing two (or, in the case of multiple splits, more than two) hands simultaneously. If the player hits or double-downs and has a resulting hand that exceeds 21, then they lose and play continues with the next hand. If not, then the dealer draws until their hand totals at least 17. If the dealer exceeds 21 at this point, the player receives a payment equal to twice their original bet. If the dealer doesn't exceed 21, then the hands are compared and the player with the highest total that doesn't exceed 21 wins. Because of these rules, certain effective strategies emerge. One common strategy is that if you hold a hard hand with a value of 20, 19 or 18, you should Stand, since you avoid busting by going over 21, and you have a nice hand total that might win in a showdown with the dealer. Another common strategy is to split a pair of Aces, since Aces are so powerful (due to the fact that count as 11 or 1, you can often Hit a hand with a soft Ace with no risk of busting). Likewise, splitting a pair of 8s is a good idea because with a hard total of 16, it's likely you will bust if you take a Hit (since so many cards count as 10). As a human being, all it takes is a little knowledge about the rules in order to construct a strategy. The GA program doesn't have that advantage, and operates completely without any pre-programmed knowledge of Blackjack. It simply uses the relative fitness scores and the mechanism of evolution to find the solution. GA Settings There are many variables or settings for a GA. You can adjust population size, how parent candidates are selected, how the resulting children may be mutated, and several other items. The following sections describe some of these settings: Setting: Selection Style Once we've solved representation and have a fitness function, the next step is to select two candidates for crossover during the process of building a new generation. There are three common styles for selection, and this program supports all of them. First, you can choose Roulette Wheel selection. It's named for a Roulette wheel because you can imagine each candidate's fitness score being a wedge in a pie chart, with a size proportionate to its relative fitness compared to the other candidates. (Of course, this assumes that all fitness scores are positive, which we will talk about shortly). The main benefit of Roulette Wheel selection is that selection is fitness-proportionate. Imagine if you had only three candidates, with fitness scores of 1, 3, and 8. The relative selection probabilities for those candidates will be 1/12, 3/12, and 8/12. The downside of Roulette Wheel selection is that it tends to be somewhat slow in terms of processing. The selection process is done by iterating through the candidates until a particular condition is matched - in other words, O(N) performance. Another potential problem with Roulette Wheel selection is that there may be situations where fitness scores vary widely, to such an extent that only certain candidates have any reasonable chance of being selected. This happens frequently in early generations, since the majority of candidates are mostly random. Although this might sound like a positive (since you ultimately want to select candidates with high fitness scores), it also results in a loss of genetic diversity. In other words, even though a particular candidate may have a low fitness score in an early generation, it may contain elements that are needed to find the ultimate solution in later generations. Ranked Selection is the solution to this problem. Instead of using raw fitness scores during the selection process, the candidates are sorted by fitness, with the worst candidate receiving a score of 0, the second worse receiving 1, and so forth, all the way to the best candidate, which has a score equal to the population size - 1. Ranked Selection is quite slow, since it combines the O(N) performance of Roulette Wheel, with the additional requirement that the candidates be sorted before selection. However, there may be circumstances where it performs better than other selection approaches. Finally, the fastest selection method of all is called Tournament Selection. This method simply selects N random candidates from the current generation, and then uses the one with the best fitness score. A tournament size of 2 means two random candidates are selected, and the best of those two is used. If you have a large tournament size (like 10), then 10 different candidates will be selected, with the best of those being the ultimate selection. That obviously tilts the balance between randomness and quality. Tournament selection works well in most cases, but it does require some experimentation to find the best tourney size. Setting: Elitism Elitism is a technique that helps ensure that the best candidates are always maintained. Since all selection methods are random to some degree, it is possible to completely lose the best candidates from one generation to another. By using Elitism, we automatically advance a certain percentage of the best candidates to the next generation. Elitism does have a negative impact on performance since all of the candidates must be sorted by fitness score. Typically Elitism is done before filling the rest of a new generation with new candidates created by crossover. Crossover Details Once two candidate solutions have been selected, the next step in building a new generation is to combine those two into a single new candidate, hopefully using the best of both parent strategies. There are a number of ways to do crossover, but the method used in this program is quite straightforward - the two fitness scores are compared, and crossover happens in a relatively proportionate way. If one candidate has a fitness of 10, and the other has a fitness of 5, then the one with fitness 10 contributes twice as much to the child as the parent with a fitness of 5. Since the fitness scores in this program are based on how much the strategy would win over thousands of hands, almost all fitness scores will be negative. (This is obviously because the rules are set up so the house always wins.) This makes it difficult to calculate relative fitnesses (how do you compare a positive number with a negative, and find relative proportions?), and also causes problems with selection methods like Roulette Wheel or Ranked. To solve this, we find the lowest fitness score of the generation and add that value to each candidate. This results in an adjusted fitness score of 0 for the very worse candidate, so it never gets selected. Mutation As has been mentioned a few times, maintaining genetic diversity in our population of candidate solutions is a good thing. It helps the GA ultimately find the very best solution, by occasionally altering a candidate in a positive direction. There are two settings for mutation. MutationRate controls what percentage of new candidates have mutation done on them. MutationImpact controls what percentage of their strategy is randomized. Population Size Population size has a significant impact on performance. The smaller the population size, the faster the GA will execute. On the other hand, if the size is too low the population may not have enough genetic diversity to find the ultimate solution. During testing, it looks like 700 to 1000 is a good balance between speed and correctness. Performance Notes This program consumes a lot of processing power. Running tests of hundreds of thousands of hands of Blackjack for hundreds or thousands of candidates consumes a lot of time. It's really imperative to write the code so that it works as efficiently as possible. If your CPU isn't consistently at or above 95% usage, there's still room for improvement. Multi-threading is a natural fit for genetic algorithms because we often want to perform the same action on each candidate. The best example of this is when we calculate fitness scores. This is often an operation that takes quite a bit of time. In our case, we're dealing out 100,000 hands, and each hand has to be played until the end. If we're single-threading that code, it's going to take a long time. Multi-threading is really the way to go. Luckily, there's a ridiculously simple way to efficiently use all of your processors for an operation like this. This code loops over all of the candidates in the currentGeneration list, calls the fitness function and sets the fitness property for each: Regardless of the number of items in the list or the number of processors on your machine, the code will efficiently run the code in a multi-threaded manner, and continue only when all of the threads are complete. One of the side effects of making this code multi-threaded is that all of the code relating to evaluating a candidate must be thread-safe, including any Singleton objects. When making code thread-safe, pay attention that you don't accidentally introduce code that will slow your program down unintentionally, because sometimes it can be quite subtle. Random numbers are central to how genetic algorithms work, so it's critical that they can be used correctly from a multithreaded environment. That means that each random number generator must be separate from the others, and it also means that each must produce a distinct series of random numbers. Random number generators use seed values which are usually time-based, like the number of milliseconds the computer has been turned on. Starting with that seed, subsequent calls will return a series of numbers that look random, but really aren't. If you start with the same seed, you get the same sequence. And that's a problem because if you create multiple random number generator objects in a loop using the default time-based seed, several of them will have the same time-based initial seed value, which will result in the same sequence of "random" numbers. That's a bug, because it can reduce the true randomness of the program a great deal, and that's vital to a genetic algorithm. There are a couple of ways to solve this problem. First, you can make the random object truly a singleton, and restrict access to it by using a Clock statement. The makes all access serialized for any random number need, which reduces performance. Another approach is to make the variable static per thread. By declaring the variable as static and also marking it with the [ThreadStatic] attribute, the .NET runtime allocates one static variable per thread. That eliminates the locking/serialization, but also has performance issues. The approach used in this application is to use a non-default seed value. In this case we call Guid.NewGuid().GetHashCode(), which generates a new, unique GUID, then gets an integer hashcode value that should be unique, depending on how GetHashCode is implemented. While multithreading really helps performance, there are also other things we can do to improve performance. For example, when dealing with large populations, the hundreds or thousands of objects that will be generated each generation can quickly turn into a huge problem related to garbage collection. In the end, the easiest way to solve that is to look through the code and find objects being allocate inside a loop. It's better to declare the variable outside of the loop, and then clear it in the loop, rather than reallocate it. In a program like this one where you could be looping hundreds of thousands of times, this can result in a very significant performance boost. For example, in an early version of this code, a Deck object was created for each hand. Since there are hundreds of candidate solutions running hundreds of thousands of trial hands, this was a huge inefficiency. The code was changed to allocate one deck per test sequence. The deck was shuffled as needed, so it never needs to be reallocated. Beyond the cards in the deck, another object type that was repeatedly created and destroyed were the candidate strategies. To mitigate this problem, a StrategyPool class was created that handles allocation and deallocation. This means that strategy objects are reused, rather than dynamically created when needed. The pool class has to be thread-safe, so it does serialize access to its methods via a Clock statement, but overall using the pool approach produced a good performance increase. Finally, a subtle form of object allocation is conversion. In an early version of the code, a utility card function used Convert.ToInt32(rankEnum). Obviously, the easiest way to convert from an enum to an int is simply to cast it, like (int)rankEnum. But it's hard to know exactly what the difference is between that approach, int.Parse(), int.TryParse(), or Convert.ToInt32(), since they can all be used and are roughly equivalent. Perhaps the compiler was boxing the enum value before passing it to Convert.ToInt32(), because the profiler identified this as a function that had large amounts of thread contention waiting - and the problem got much, much worse as the generations passed. By rewriting the conversion to use a simple cast, the program performance increased threefold (3x). Contributing Please read CONTRIBUTING.md for details on our code of conduct, and the process for submitting pull requests to us. Author Greg Sommerville - Initial work* License This project is licensed under the Apache 2.0 License - see the LICENSE.md file for details

ai-flow
github
LLM Vibe Score0.461
Human Vibe Score0.01809909681901274
DahnM20Mar 25, 2025

ai-flow

Open-source tool to seamlessly connect multiple AI model APIs into repeatable workflows. 🔗 Website • 📚 Documentation 🎉🚀 Latest Release: v0.10.0 🚀🎉 New Nodes: Claude 3.7, OpenRouter, Generate Random Number Configuration can now be done entirely in the UI !AI-Flow Intro Overview AI-Flow is an open-source, user-friendly UI that lets you visually design, manage, and monitor AI-driven workflows by seamlessly connecting multiple AI model APIs (e.g., OpenAI, StabilityAI, Replicate, Claude, Deepseek). Features Visual Workflow Builder: Drag-and-drop interface for crafting AI workflows. Real-Time Monitoring: Watch your workflow execute and track results. Parallel Processing: Nodes run in parallel whenever possible. Model Management: Easily organize and manage diverse AI models. Import/Export: Share or back up your workflows effortlessly. Supported Models Replicate: LLaMa, Mistral, FaceSwap, InstantMesh, MusicGen, and more. OpenAI: GPT-4o, TTS, o1, o3. StabilityAI: Stable Diffusion 3.5, SDXL, Stable Video Diffusion, plus additional tools. Others: Claude, Deepseek. !Scenario Example Open Source vs. Cloud AI-Flow is fully open source and available under the MIT License, empowering you to build and run your AI workflows on your personal machine. For those seeking enhanced functionality and a polished experience, AI-Flow Pro on our cloud platform (app.ai-flow.net) offers advanced features, including: Subflows & Loops: Create complex, nested workflows and iterate tasks effortlessly. API-Triggered Flows: Initiate workflows via API calls for seamless automation. Integrated Services: Connect with external services such as Google Search, Airtable, Zapier, and Make. Simplified Interface: Transform workflows into streamlined tools with an intuitive UI. !Pro VS Open Source The cloud version builds upon the foundation of the open-source project, giving you more power and flexibility while still letting you use your own API keys. Installation Note: To unlock full functionality, AI-Flow requires S3-compatible storage (with proper CORS settings) to host resources. Without it, features like File Upload or nodes that rely on external providers (e.g., StabilityAI) may not work as expected. Also, set REPLICATEAPIKEY in your environment to use the Replicate node. Local Installation (Without Docker) Clone the Repository: UI Setup: Backend Setup: Windows Users: Run the Application: Start the backend: In a new terminal, start the UI: Open your browser and navigate to http://localhost:3000. Docker Installation Prepare Docker Compose: Navigate to the docker directory: Update the REPLICATEAPIKEY in the YAML file. Launch with Docker Compose: Access the Application: Open http://localhost:80 in your browser. To stop, run: Contributing We welcome contributions! If you encounter issues or have feature ideas, please open an issue or submit a pull request. License This project is released under the MIT License.

sqlalchemy_aio
github
LLM Vibe Score0.432
Human Vibe Score0.06443138549576317
RazerMMar 17, 2025

sqlalchemy_aio

sqlalchemy_aio ============== |PyPI Version| |Documentation| |Travis| |Coverage| |MIT License| `sqlalchemyaio adds asyncio and Trio support to SQLAlchemy core, derived from alchimia_. +-------------------------------------------------------------------------------+ | ⚠️ Compatibility Note | +===============================================================================+ | SQLAlchemy 1.3 is the latest supported version. SQLAlchemy 1.4 | | brings native asyncio support_, so you should consider using that instead. | +-------------------------------------------------------------------------------+ .. _alchimia: https://github.com/alex/alchimia .. _Trio: https://github.com/python-trio/trio .. _native asyncio support: https://docs.sqlalchemy.org/en/14/orm/extensions/asyncio.html Getting started .. code-block:: python import asyncio from sqlalchemyaio import ASYNCIOSTRATEGY from sqlalchemy import ( Column, Integer, MetaData, Table, Text, create_engine, select) from sqlalchemy.schema import CreateTable, DropTable async def main(): engine = create_engine( In-memory sqlite database cannot be accessed from different threads, use file. 'sqlite:///test.db', strategy=ASYNCIO_STRATEGY ) metadata = MetaData() users = Table( 'users', metadata, Column('id', Integer, primary_key=True), Column('name', Text), ) Create the table await engine.execute(CreateTable(users)) conn = await engine.connect() Insert some users await conn.execute(users.insert().values(name='Jeremy Goodwin')) await conn.execute(users.insert().values(name='Natalie Hurley')) await conn.execute(users.insert().values(name='Dan Rydell')) await conn.execute(users.insert().values(name='Casey McCall')) await conn.execute(users.insert().values(name='Dana Whitaker')) result = await conn.execute(users.select(users.c.name.startswith('D'))) d_users = await result.fetchall() await conn.close() Print out the users for user in d_users: print('Username: %s' % user[users.c.name]) Supports context async managers async with engine.connect() as conn: async with conn.begin() as trans: assert await conn.scalar(select([1])) == 1 await engine.execute(DropTable(users)) if name == 'main': loop = asyncio.geteventloop() loop.rununtilcomplete(main()) Getting started with Trio To use the above example with Trio_, just change the following: .. code-block:: python import trio from sqlalchemyaio import TRIOSTRATEGY async def main(): engine = createengine('sqlite:///test.db', strategy=TRIOSTRATEGY) ... trio.run(main) What is this? It's not an asyncio implementation of SQLAlchemy or the drivers it uses. sqlalchemy_aio lets you use SQLAlchemy by running operations in a separate thread. If you're already using runinexecutor_ to execute SQLAlchemy tasks, sqlalchemy_aio will work well with similar performance. If performance is critical, perhaps asyncpg_ can help. .. _asyncpg: https://github.com/MagicStack/asyncpg .. runinexecutor: https://docs.python.org/3/library/asyncio-eventloop.html#asyncio.AbstractEventLoop.runin_executor Documentation The documentation`_ has more information, including limitations of the API. .. _The documentation: https://sqlalchemy-aio.readthedocs.io/en/latest/ .. |PyPI Version| image:: https://img.shields.io/pypi/v/sqlalchemy_aio.svg?style=flat-square :target: https://pypi.python.org/pypi/sqlalchemy_aio/ .. |Documentation| image:: https://img.shields.io/badge/docs-latest-brightgreen.svg?style=flat-square :target: https://sqlalchemy-aio.readthedocs.io/en/latest/ .. |Travis| image:: http://img.shields.io/travis/RazerM/sqlalchemy_aio/master.svg?style=flat-square&label=travis :target: https://travis-ci.org/RazerM/sqlalchemy_aio .. |Coverage| image:: https://img.shields.io/codecov/c/github/RazerM/sqlalchemy_aio/master.svg?style=flat-square :target: https://codecov.io/github/RazerM/sqlalchemy_aio?branch=master .. |MIT License| image:: http://img.shields.io/badge/license-MIT-blue.svg?style=flat-square :target: https://raw.githubusercontent.com/RazerM/sqlalchemy_aio/master/LICENSE

Vibe Coding is the Future (?)
youtube
LLM Vibe Score0.365
Human Vibe Score0.69
Code MonkeyMar 13, 2025

Vibe Coding is the Future (?)

✅ FREE Game Dev Report Newsletter https://cmonkey.co/gamedevreportnewsletter ❤️ FREE Complete Courses https://cmonkey.co/freecourses ✅ Get my CComplete Course! https://cmonkey.co/csharpcourse 🎮 Play my Steam game! https://cmonkey.co/dinkyguardians ❤️ Watch my FREE Complete Courses https://www.youtube.com/watch?v=oZCbmB6opxY 🌍 Get my Complete Courses! ✅ https://unitycodemonkey.com/courses 👍 Learn to make awesome games step-by-step from start to finish. 🎮 Get my Steam Games https://unitycodemonkey.com/gamebundle Andrej Karpathy Twitter Post https://x.com/karpathy/status/1886192184808149383 Vibe Coding with AI in 2025 https://www.youtube.com/shorts/1_rSrkXovOk Vibe Coding is The Future https://www.youtube.com/watch?v=IACHfKmZMr8 🔴 RELATED VIDEOS 🔴 AI is creating illiterate programmers! (you?) https://www.youtube.com/watch?v=2H4ouL4bCUs AI Game Engine replacing Game Developers? https://www.youtube.com/watch?v=97C7xScuzTk Unity for NOT Game Dev? https://www.youtube.com/watch?v=yo7sFIahYQo How to SURVIVE as a Game Dev for a DECADE! (Over $1,000,000 Revenue!) https://www.youtube.com/watch?v=sfD4MMFcebE 💬 There is a new term popping up named Vibe Coding, this is apparently where you put your faith entirely in AI generated code and you never even look at it. You just prompt the AI, perhaps even with voice so you don't even use the keyboard, and you just blindly accept whatever answer the AI gives you. Is this really the future of coding? I definitely have some thoughts on this. 📝 Some Links are Affiliate links which means it costs the same to you and I get a nice commission. 🌍 Get Code Monkey on Steam! 👍 Interactive Tutorials, Complete Games and More! ✅ https://store.steampowered.com/app/1294220/ If you have any questions post them in the comments and I'll do my best to answer them. 🔔 Subscribe for more Unity Tutorials https://www.youtube.com/channel/UCFK6NCbuCIVzA6Yj1GZqCg?subconfirmation=1 See you next time! 📍 Support on Patreon https://www.patreon.com/unitycodemonkey 🎮 Grab the Game Bundle at https://unitycodemonkey.com/gameBundle.php 📝 Get the Code Monkey Utilities at https://unitycodemonkey.com/utils.php Hello and Welcome! I'm your Code Monkey and here you will learn everything about Game Development in Unity using C#. I've been developing games for several years with 8 published games on Steam and now I'm sharing my knowledge to help you on your own game development journey. I do Unity Tutorials on just about every topic, Unity Tutorials for Beginners and Unity Tutorials for Advanced users. Website: https://unitycodemonkey.com/ Twitter: https://twitter.com/UnityCodeMonkey Steam: https://store.steampowered.com/developer/EndlessLoopStudios

The only video you need to Master N8N + AI agents (For complete beginners)
youtube
LLM Vibe Score0.396
Human Vibe Score0.64
Simon Scrapes | AI Agents & AutomationFeb 21, 2025

The only video you need to Master N8N + AI agents (For complete beginners)

Serious about Implementing AI? Shortcut your Path HERE, and connect with +300 entrepreneurs on the same mission: https://www.skool.com/scrapes This is a comprehensive 4hr course with all the secrets I've learned from 8 months of building out N8N workflows for my clients (over 100+ workflows!). During this course we'll cover everything you need to shortcut your journey into building automations with N8N, AI Agents & workflow automation! 🛠️ Links (affiliate) • n8n: https://n8n.partnerlinks.io/scrapesai 📧 Curated roundups of real-world AI implementations 📧 https://scrapes-ai.kit.com/b6b1a73dfd Want more? https://www.youtube.com/@simonscrapes?sub_confirmation=1 🚧 Looking for custom built AI agents for your business? 🚧 https://automake.io 💬 Share in the comments what you learnt during the video! 0:00:00 - Course Overview 0:04:12 - SECTION 1 - Getting started 0:09:57 - 1.1. Setting up N8N 0:15:10 - 1.2. Building blocks of N8N 0:16:52 - 1.3. The N8N Canvas 0:19:02 - 1.4. Triggers & Actions 0:24:55 - 1.5. Connect nodes 0:30:09 - 1.6. Visualising Data 0:32:13 - 1.7. JSON vs Table vs Schema 0:35:12 - 1.8. Mastering Static Data 0:38:10 - 1.9. Dynamic Data 0:43:21 - 1.10. Referencing Nodes (Foolproof) 0:47:05 - 1.11. Pinning Data 0:49:26 - 1.12. Simple Retry Logic 0:52:15 - 1.13. Node Naming 0:57:38 - SECTION 2 - Building Your First Automation with Data From Your Business 0:58:45 - 2.1. Planning Your Workflow 1:02:05 - 2.2. Monitoring Your Gmail 1:04:15 - 2.3. Setting up Google Credentials 1:09:01 - 2.4. Manipulating Data with Set 1:13:11 - 2.5. Data Format Comparison (HTML, Markdown) 1:15:55 - 2.6. Your First Automation 1:20:46 - 2.7. Building an Invoice Parsing System & Tackling File Formats 1:30:42 - 2.8. Cleaning Data with Code Node 1:39:19 - 2.9. Conditionals (IF) 1:44:24 - 2.10. Multiple Inputs 1:46:04 - 2.11. Merging Data 1:50:03 - 2.12. Memory Management 1:51:15 - 2.13. Large Data Sets (Loops) 1:54:52 - 2.14. Rounding Up Our Automation 1:55:16 - SECTION 3 - Agentic Workflows & AI Agents 1:56:07 - 3.1. Agentic vs Non-Agentic Workflows 1:59:28 - 3.2. Agentic Examples You Might Use 2:05:16 - 3.3. N8N AI Nodes 2:12:55 - 3.4. AI Agents - So What Are They? 2:20:42 - 3.5. AI Agents - What Business Use Do They Have? 2:25:05 - 3.6. Setting Up AI in Our Workflow 2:27:58 - 3.7. Prompting for Beginners 2:33:29 - 3.8. Openrouter for AI Models 2:39:10 - 3.9. Getting Consistent Outputs 2:45:53 - 3.10. Rounding Up Your Invoice Parsing Workflow 2:46:49 - 3.11. Mapping Back to Your Database 2:54:00 - SECTION 4 - Data From Outside Your Business 2:59:10 - 4.1. Connecting to an API with N8N 3:01:29 - 4.2. Reading API Docs Made Easy 3:04:24 - 4.3. API Authorisation 3:06:50 - 4.4. POST Request - PDFco 3:12:47 - 4.5. Uploading Our Files via API 3:22:18 - 4.6. Completing Our API Uploads 3:25:37 - 4.7. Connect to ANY API in 2 mins 3:29:30 - 4.8. Push Data Back to Our Table 3:35:03 - SECTION 5 - Making Your Life Easy & Scalable 3:37:27 - 5.1. Naming Workflows & Tagging 3:38:43 - 5.2. Workflow Separation 3:41:11 - 5.3. Modular Design 3:48:12 - 5.4. Error Handling 3:52:31 - 5.5. Debugging (easy Mode!) 3:53:31 - 5.6. Community Nodes 3:56:31 - 5.7. N8N Template Library 3:59:14 - 5.8. Getting Help #N8N #n8ntutorial #N8NBeginner

airbnb
github
LLM Vibe Score0.414
Human Vibe Score0.013305067808012168
dmcgloneFeb 4, 2025

airbnb

Notes on Airbnb business in New York and elsewhere ================================================== Disclaimer The script scrapes the Airbnb web site to collect data about the shape of the company's business. No guarantees are made about the quality of data obtained using this script, statistically or about an individual page. So please check your results. Changelog 2014-12-02 Tom Slee More robustness fixes. 2014-09-23 Tom Slee Bug fixes that solve problems where over-eager exception handling caused the script to exit too early. 2014-08-26 Tom Slee Version 2.1 is updated to be able to collect data from Airbnb's updated web site. Not all cities have the new format, but the script should handle both versions. It will not, however, handle cities without neighborhoods. 2014-05-26 Tom Slee Version 2 (May 2014) is much more thorough and efficient about searching Airbnb's web site for a given city and has more options. I have moved it to python 3 for better handling of unicode multi-lingual data. It is also ported to SAP SQL Anywhere to allow more flexible reporting and better concurrency than SQLite can provide. A free developer edition is available from the SAP web site. You may need to configure the python driver following the instructions given in http://dcx.sybase.com/index.html#sa160/en/dbprogramming/pg-python.html. airbnb.py is the python script to collect data. plot.py just produces some charts. airbnb.db is the data. The basic data is in the table room. A complete search of a given city's listings is a "survey" and the surveys are tracked in table survey. Using the script To create the database: python airbnb.py -dbi. This command does two things: initializes a database file (dbnb.db in the current directory) runs the reload.sql script against the database to create the tables, views, and stored procedures that make up the database. No data is added. On Windows, the reload.sql script does not always run. If that fails, try this to create the database tables: dbisql -c "uid=dba;pwd=sql;dbf=dbnb.db;eng=db" From Interactive SQL, click File > Open and choose reload.sql from the current directory. Hit F5 to execute the script and create the tables. Test that you can connect to the database file: run python airbnb.py --dbping and confirm that there are no errors. If there are errors, check the database file setting near the top of the script and change its location. To run a survey: add a city (search area) to the database, by running ./airbnb.py -asa "city-name". It scans the Airbnb web site and adds the neighborhoods for the city. add a survey to the database by running ./airbnb.py -asv "city-name". The command lists the survey_id value that was created. collect the roomids for the survey by running ./airbnb.py -s surveyid. The survey_id can be seen by running ./airbnb -ls. This search loops over neighborhoods, property types, and pages of listings in the Airbnb search pages. fill in the details of the rooms by running ./airbnb -f. If any step fails: If the -s step or the -f step fails (say because the internet connection was lost), you can just run it again, and it will pick up from where it left off without losing data. Continue until the script completes.

I built an AI Agent in 43 min to automate my workflows (Zero Coding)
youtube
LLM Vibe Score0.459
Human Vibe Score0.88
Greg IsenbergJan 31, 2025

I built an AI Agent in 43 min to automate my workflows (Zero Coding)

In this episode, Max Brodeur-Urbas, Gumloop's CEO, where we dive deep into how to build AI agents and how to automate any workflow. We cover various use cases, from automated sales outreach to content generation. Max shows us how Gumloop makes complex automations accessible to everyone by having user-friendly UI/UX, intuitive workflow buildouts, and easy custom integration creation. Timestamps: 00:00 - Intro 02:29 - Gumloop Workflow Overview 05:00 - Example: Lead Automation Workflow 10:23 - Templates for Workflows 12:21 - Example: YouTube to Blog Post Automation Workflow 21:03 - Gumloop Interfaces Demonstration 21:40 - Example: Media Ad Library Analyzer Automation Workflow 24:38 - Using Gumloop for SaaS Products 26:25 - Example: Analyze Daily Calendar Automation Workflow 27:47 - Output of Media Ad Library Analyzer Automation Workflow 28:43 - Cost of Running Gumloop 30:34 - Custom Node Builder Demonstration 34:18 - Gumloop Chrome Extension 37:06 - Final thoughts on business automation Gumloop Templates: https://www.gumloop.com/templates Key Points: • Demonstration of Gumloop's automation platform for building AI-powered workflows • Showcase of features including custom nodes, Chrome extension, and interface builder • Real-world examples of automated processes for sales, recruitment, and content generation • Discussion of practical business applications and cost-effectiveness of automation: Key Features Demonstrated: • Visual workflow builder • AI-powered content generation • Custom integration creation • Chrome extension functionality • Interface builder for non-technical users • Webhook integration capabilities 1) Gumloop is a visual workflow builder that lets you create powerful AI automations by connecting "nodes" - think Zapier meets ChatGPT, but WAY more powerful. Key features that stood out: 2) SUBFLOWS: Create reusable workflow components Build once, use everywhere Share with team members Perfect for complex operations Makes scaling easier 3) The YouTube Blog Post Generator is INSANE: Takes any YT video link Extracts transcript Generates TLDR summary Creates full blog post Adds video embed Posts to CMS Cost? About $1.62 per post 4) Competitor Ad Analysis automation: Scrapes competitor FB/IG ads Uses Gemini to analyze videos/images Generates strategy insights Sends beautiful email reports Runs on schedule Save 40+ hours/month 5) Custom Node Builder = game changer Create your own integrations No coding required AI helps write the code Share with your team Endless possibilities 6) Chrome Extension feature: Turn any workflow into a 1-click tool Works on any webpage Perfect for LinkedIn outreach Data enrichment Email automation 7) Why this matters: Most companies (even $1B+ ones) are still doing things manually that could be automated. The competitive advantage isn't just having AI - it's automating your workflows at scale. 8) Pricing & Getting Started: Free to try No CC required 1000 free credits with tutorial Build custom workflows Join their community Notable Quotes: "If you can list it as a list of steps, like for an intern, you would hand off a little sticky note being like, you do these 15 things in a row and that's the entire workflow, then you can 100% automate it." - Max "Being in business is a game of unfair advantages... And that means it's always about how do you save time as founders and executive teams." - Greg LCA helps Fortune 500s and fast-growing startups build their future - from Warner Music to Fortnite to Dropbox. We turn 'what if' into reality with AI, apps, and next-gen products https://latecheckout.agency/ BoringAds — ads agency that will build you profitable ad campaigns http://boringads.com/ BoringMarketing — SEO agency and tools to get your organic customers http://boringmarketing.com/ Startup Empire - a membership for builders who want to build cash-flowing businesses https://www.startupempire.co FIND ME ON SOCIAL X/Twitter: https://twitter.com/gregisenberg Instagram: https://instagram.com/gregisenberg/ LinkedIn: https://www.linkedin.com/in/gisenberg/ FIND MAX ON SOCIAL Gumloop: https://www.gumloop.com X/Twitter: https://x.com/maxbrodeururbas?lang=en LinkedIn: https://www.linkedin.com/in/max-brodeur-urbas-1a4b25172/

airflow-tutorial
github
LLM Vibe Score0.508
Human Vibe Score0.13240553426231688
hgrifJan 19, 2025

airflow-tutorial

Airflow tutorial This tutorial is loosely based on the Airflow tutorial in the official documentation. It will walk you through the basics of setting up Airflow and creating an Airflow workflow. This tutorial was published on the blog of GoDataDriven. Setup You can skip this section if Airflow is already set up. Make sure that you can run airflow commands, know where to put your DAGs and have access to the web UI. Install Airflow Airflow is installable with pip via a simple pip install apache-airflow. Either use a separate python virtual environment or install it in your default python environment. To use the conda virtual environment as defined in environment.yml in this git-repo: Install miniconda. Make sure that conda is on your path: Create the virtual environment from environment.yml: Activate the virtual environment: You should now have an (almost) working Airflow installation. Alternatively, install Airflow yourself by running: Airflow used to be packaged as airflow but is packaged as apache-airflow since version 1.8.1. Make sure that you install any extra packages with the right Python package: e.g. use pip install apache-airflow[dask] if you've installed apache-airflow and do not use pip install airflow[dask]. Leaving out the prefix apache- will install an old version of Airflow next to your current version, leading to a world of hurt. You may run into problems if you don't have the right binaries or Python packages installed for certain backends or operators. When specifying support for e.g. PostgreSQL when installing extra Airflow packages, make sure the database is installed; do a brew install postgresql or apt-get install postgresql before the pip install apache-airflow[postgres]. Similarly, when running into HiveOperator errors, do a pip install apache-airflow[hive] and make sure you can use Hive. Run Airflow Before you can use Airflow you have to initialize its database. The database contains information about historical & running workflows, connections to external data sources, user management, etc. Once the database is set up, Airflow's UI can be accessed by running a web server and workflows can be started. The default database is a SQLite database, which is fine for this tutorial. In a production setting you'll probably be using something like MySQL or PostgreSQL. You'll probably want to back it up as this database stores the state of everything related to Airflow. Airflow will use the directory set in the environment variable AIRFLOW_HOME to store its configuration and our SQlite database. This directory will be used after your first Airflow command. If you don't set the environment variable AIRFLOW_HOME, Airflow will create the directory ~/airflow/ to put its files in. Set environment variable AIRFLOW_HOME to e.g. your current directory $(pwd): or any other suitable directory. Next, initialize the database: Now start the web server and go to localhost:8080 to check out the UI: It should look something like this: With the web server running workflows can be started from a new terminal window. Open a new terminal, activate the virtual environment and set the environment variable AIRFLOW_HOME for this terminal as well: Make sure that you're an in the same directory as before when using $(pwd). Run a supplied example: And check in the web UI that it has run by going to Browse -> Task Instances. This concludes all the setting up that you need for this tutorial. Tips Both Python 2 and 3 are be supported by Airflow. However, some of the lesser used parts (e.g. operators in contrib) might not support Python 3. For more information on configuration check the sections on Configuration and Security of the Airflow documentation. Check the Airflow repository for upstart and systemd templates. Airflow logs extensively, so pick your log folder carefully. Set the timezone of your production machine to UTC: Airflow assumes it's UTC. Workflows We'll create a workflow by specifying actions as a Directed Acyclic Graph (DAG) in Python. The tasks of a workflow make up a Graph; the graph is Directed because the tasks are ordered; and we don't want to get stuck in an eternal loop so the graph also has to be Acyclic. The figure below shows an example of a DAG: The DAG of this tutorial is a bit easier. It will consist of the following tasks: print 'hello' wait 5 seconds print 'world and we'll plan daily execution of this workflow. Create a DAG file Go to the folder that you've designated to be your AIRFLOWHOME and find the DAGs folder located in subfolder dags/ (if you cannot find, check the setting dagsfolder in $AIRFLOW_HOME/airflow.cfg). Create a Python file with the name airflow_tutorial.py that will contain your DAG. Your workflow will automatically be picked up and scheduled to run. First we'll configure settings that are shared by all our tasks. Settings for tasks can be passed as arguments when creating them, but we can also pass a dictionary with default values to the DAG. This allows us to share default arguments for all the tasks in our DAG is the best place to set e.g. the owner and start date of our DAG. Add the following import and dictionary to airflow_tutorial.py to specify the owner, start time, and retry settings that are shared by our tasks: Configure common settings These settings tell Airflow that this workflow is owned by 'me', that the workflow is valid since June 1st of 2017, it should not send emails and it is allowed to retry the workflow once if it fails with a delay of 5 minutes. Other common default arguments are email settings on failure and the end time. Create the DAG We'll now create a DAG object that will contain our tasks. Name it airflowtutorialv01 and pass default_args: With schedule_interval='0 0 *' we've specified a run at every hour 0; the DAG will run each day at 00:00. See crontab.guru for help deciphering cron schedule expressions. Alternatively, you can use strings like '@daily' and '@hourly'. We've used a context manager to create a DAG (new since 1.8). All the tasks for the DAG should be indented to indicate that they are part of this DAG. Without this context manager you'd have to set the dag parameter for each of your tasks. Airflow will generate DAG runs from the startdate with the specified scheduleinterval. Once a DAG is active, Airflow continuously checks in the database if all the DAG runs have successfully ran since the start_date. Any missing DAG runs are automatically scheduled. When you initialize on 2016-01-04 a DAG with a startdate at 2016-01-01 and a daily scheduleinterval, Airflow will schedule DAG runs for all the days between 2016-01-01 and 2016-01-04. A run starts after the time for the run has passed. The time for which the workflow runs is called the execution_date. The daily workflow for 2016-06-02 runs after 2016-06-02 23:59 and the hourly workflow for 2016-07-03 01:00 starts after 2016-07-03 01:59. From the ETL viewpoint this makes sense: you can only process the daily data for a day after it has passed. This can, however, ask for some juggling with date for other workflows. For Machine Learning models you may want to use all the data up to a given date, you'll have to add the scheduleinterval to your executiondate somewhere in the workflow logic. Because Airflow saves all the (scheduled) DAG runs in its database, you should not change the startdate and scheduleinterval of a DAG. Instead, up the version number of the DAG (e.g. airflowtutorialv02) and avoid running unnecessary tasks by using the web interface or command line tools Timezones and especially daylight savings can mean trouble when scheduling things, so keep your Airflow machine in UTC. You don't want to skip an hour because daylight savings kicks in (or out). Create the tasks Tasks are represented by operators that either perform an action, transfer data, or sense if something has been done. Examples of actions are running a bash script or calling a Python function; of transfers are copying tables between databases or uploading a file; and of sensors are checking if a file exists or data has been added to a database. We'll create a workflow consisting of three tasks: we'll print 'hello', wait for 10 seconds and finally print 'world'. The first two are done with the BashOperator and the latter with the PythonOperator. Give each operator an unique task ID and something to do: Note how we can pass bash commands in the BashOperator and that the PythonOperator asks for a Python function that can be called. Dependencies in tasks are added by setting other actions as upstream (or downstream). Link the operations in a chain so that sleep will be run after printhello and is followed by printworld; printhello -> sleep -> printworld: After rearranging the code your final DAG should look something like: Test the DAG First check that DAG file contains valid Python code by executing the file with Python: You can manually test a single task for a given execution_date with airflow test: This runs the task locally as if it was for 2017-07-01, ignoring other tasks and without communicating to the database. Activate the DAG Now that you're confident that your dag works, let's set it to run automatically! To do so, the scheduler needs to be turned on; the scheduler monitors all tasks and all DAGs and triggers the task instances whose dependencies have been met. Open a new terminal, activate the virtual environment and set the environment variable AIRFLOW_HOME for this terminal, and type Once the scheduler is up and running, refresh the DAGs page in the web UI. You should see airflowtutorialv01 in the list of DAGs with an on/off switch next to it. Turn on the DAG in the web UI and sit back while Airflow starts backfilling the dag runs! Tips Make your DAGs idempotent: rerunning them should give the same results. Use the the cron notation for schedule_interval instead of @daily and @hourly. @daily and @hourly always run after respectively midnight and the full hour, regardless of the hour/minute specified. Manage your connections and secrets with the Connections and/or Variables. Exercises You now know the basics of setting up Airflow, creating a DAG and turning it on; time to go deeper! Change the interval to every 30 minutes. Use a sensor to add a delay of 5 minutes before starting. Implement templating for the BashOperator: print the executiondate instead of 'hello' (check out the original tutorial and the example DAG). Implement templating for the PythonOperator: print the executiondate with one hour added in the function printworld() (check out the documentation of the PythonOperator). Resources Data Pipelines with Apache Airflow Airflow documentation ETL best practices with Airflow Airflow: Tips, Tricks, and Pitfalls Kubernetes Custom controller for deploying Airflow

coursera-practical-data-science-specialization
github
LLM Vibe Score0.465
Human Vibe Score0.0230635140825568
honghanhhOct 9, 2024

coursera-practical-data-science-specialization

Solutions on Practical Data Science Specialization Access all courses in the Coursera Practical Data Science Specialization Specialization offered by deeplearning.ai. This repo contains the SOLUTIONS of exercises/labs to achieve the badge. Course keynotes and solutions of related quizzes, assignments Practical Data Science Specialization on Coursera contains three courses: Course 1: Analyze Datasets and Train ML Models using AutoML Week 1: Artificial Intelligence (AI) mimics human behavior. Machine Learning (ML) is a subset of AI that uses statistical methods and algorithms that are able to learn from data without being explicitly programmed. Deep learning (DL) is a subset of machine learning that uses artificial neural networks to learn from data. AWS SageMaker --> [x] Practice Quiz: Week 1. [x] Graded External Tool: Register and visualize dataset. Week 2: Statistical Bias: Training data does not comprehensively represent the underlying problem space. Statistical Bias Causes: Activity Bias, Societal Bias, Selection Bias, Data Drift/Shift, ... Class Imbalance (CI) measures the imbalance in the number of members between different facet values. Detecting Statistical Bias by AWS SageMaker DataWrangler and AWS SageMaker Clarify. Feature Importance explains the features that make up the training data using a score. How useful or valuable the feature is relative to other features? SHAP (SHapley Additive exPlanations) --> [x] Practice Quiz: Week 2. [x] Graded External Tool: Detect data bias with Amazon SageMaker Clarify. Week 3: Data Prepreration includes Ingesting & Analyzing, Prepraring & Transforming, Training & Tuning, and Deploying & Managing. AutoML aims at automating the process of building a model. Model Hosting. --> [x] Practice Quiz: Week 3. [x] Graded External Tool: Train a model with Amazon SageMaker Autopilot. Week 4: Built-in Alogrithms in AWS SageMaker supports Classification, Regression, and Clustering problems. Text Analysis Evolution: Word2Vec (CBOW & Skip-gram), GloVe, FastText, Transformer, BlazingText, ELMo, GPT, BERT, ... --> [x] Practice Quiz: Week 4. [x] Graded External Tool: Train a text classifier using Amazon SageMaker BlazingText built-in algorithm. Course 2: Build, Train, and Deploy ML Pipelines using BERT Week 1 Feature Engineering involves converting raw data from one or more sources into meaningful features that can be used for training machine learning models. Feature Engineering Step includes feature selection, creation, and transformation. BERT is Transformer-based pretrained language models that sucessfully capture bidirectional contexts in word representation. Feature Store: centralized, reusable, discoverable. --> [x] Practice Quiz: Week 1. [x] Graded External Tool: Feature transformation with Amazon SageMaker processing job and Feature Store. Week 2 Learn how to train a customized Pretrained BERT and its variant models, debug, and profile with AWS SageMaker. --> [x] Practice Quiz: Week 2. [x] Graded External Tool: Train a review classifier with BERT and Amazon SageMaker. Week 3 MLOps builds on DevOps practices that encompass people, process, and technology. MLOps also includes considerations and practices that are really unique to machine learning workloads. --> [x] Practice Quiz: Week 3. [x] Graded External Tool: SageMaker pipelines to train a BERT-Based text classifier. Course 3: Optimize ML Models and Deploy Human-in-the-Loop Pipelines Week 1 Model Tuning aims to fit the model to the underlying data patterns in your training data and learn the best possible parameters for your model. Automatic Model Tuning includes grid search, random search, bayesian optimization, hyperband. Challenges: checkpointing, distribution training strategy. --> [x] Practice Quiz: Week 1. [x] Graded External Tool: Optimize models using Automatic Model Tuning. Week 2 [x] Practice Quiz: Week 2. [x] Graded External Tool: A/B testing, traffic shifting and autoscaling. Week 3 [x] Practice Quiz: Week 3. [x] Graded External Tool: Data labeling and human-in-the-loop pipelines with Amazon Augmented AI (A2I). Disclaimer The solutions here are ONLY FOR REFERENCE to guide you if you get stuck somewhere. Highly recommended to try out the quizzes and assignments yourselves first before referring to the solutions here. Feel free to discuss further with me on .