VibeBuilders.ai Logo
VibeBuilders.ai

Package

Explore resources related to package to help implement AI solutions for your business.

[P] I created a package implementing a SOTA technique for XAI ( Explainable AI)
reddit
LLM Vibe Score0
Human Vibe Score0
PressureDry1111This week

[P] I created a package implementing a SOTA technique for XAI ( Explainable AI)

This is the package https://github.com/mfumagalli68/xi-method Follow the README and install directly from pypi. From the paper: " \[..\]To bridge this gap we propose a family of measures of statistical association whose definition is well-posed also for nonordered data. Our intuition is to rely on separation measurements between probability mass functions. Here, by separation measurement we mean any distance or divergence between probability mass functions that is positive, and that is null if and only if the probability mass functions coincide. Then, we show that the new class of sensitivity indices complies with Renyi’s postulate D of measures of statistical dependence (Renyi, 1959). This postulate, called zero-independence property in the following, requires that a measure of association is null if and only if the two random variables are statistically independent. We address the estimation of this new class of indicators for generic samples, and discuss their asymptotic convergence. We then use these probabilistic sensitivity measures in the context of explainability. A relevant aspect related to measures of statistical association is that they can be computed directly on the original dataset without the need of actually fitting a machine learning model. Thus, not only are they model agnostic in explaining the behavior of a black box, but they also provide pre-hoc explanations. Our intuition is then to compare explanations provided by measures of statistical association first calculated on the original data (the pre-hoc explanations) and then on the forecasts of the machine learning model fitted to the data (post-hoc explanations). This comparison provides an indication on whether the ML model predictions capture the statistical dependence originally present in the data. We call the resulting approach Xi-method\[...\] " The paper can't be shared freely, but as always with a little bit of research you can find it online. If you find it interesting, star the repo. ​ Thanks

How a founder built a B2B AI startup to serve with 65+ global brands (including Fortune500 companies) (I will not promote)
reddit
LLM Vibe Score0
Human Vibe Score1
Royal_Rest8409This week

How a founder built a B2B AI startup to serve with 65+ global brands (including Fortune500 companies) (I will not promote)

AI Palette is an AI-driven platform that helps food and beverage companies predict emerging product trends. I had the opportunity recently to sit down with the founder to get his advice on building an AI-first startup, which he'll be going through in this post. (I will not promote) About AI Palette: Co-founders: >!2 (Somsubhra GanChoudhuri, Himanshu Upreti)!!100+!!$12.7M USD!!AI-powered predictive analytics for the CPG (Consumer Packaged Goods) industry!!Signed first paying customer in the first year!!65+ global brands, including Cargill, Diageo, Ajinomoto, Symrise, Mondelez, and L’Oréal, use AI Palette!!Every new product launched has secured a paying client within months!!Expanded into Beauty & Personal Care (BPC), onboarding one of India’s largest BPC companies within weeks!!Launched multiple new product lines in the last two years, creating a unified suite for brand innovation!Identify the pain points in your industry for ideas* When I was working in the flavour and fragrance industry, I noticed a major issue CPG companies faced: launching a product took at least one to two years. For instance, if a company decided today to launch a new juice, it wouldn’t hit the market until 2027. This long timeline made it difficult to stay relevant and on top of trends. Another big problem I noticed was that companies relied heavily on market research to determine what products to launch. While this might work for current consumer preferences, it was highly inefficient since the product wouldn’t actually reach the market for several years. By the time the product launched, the consumer trends had already shifted, making that research outdated. That’s where AI can play a crucial role. Instead of looking at what consumers like today, we realised that companies should use AI to predict what they will want next. This allows businesses to create products that are ahead of the curve. Right now, the failure rate for new product launches is alarmingly high, with 8 out of 10 products failing. By leveraging AI, companies can avoid wasting resources on products that won’t succeed, leading to better, more successful launches. Start by talking to as many industry experts as possible to identify the real problems When we first had the idea for AI Palette, it was just a hunch, a gut feeling—we had no idea whether people would actually pay for it. To validate the idea, we reached out to as many people as we could within the industry. Since our focus area was all about consumer insights, we spoke to professionals in the CPG sector, particularly those in the insights departments of CPG companies. Through these early conversations, we began to see a common pattern emerge and identified the exact problem we wanted to solve. Don’t tell people what you’re building—listen to their frustrations and challenges first. Going into these early customer conversations, our goal was to listen and understand their challenges without telling them what we were trying to build. This is crucial as it ensures that you can gather as much data about the problem to truly understand it and that you aren't biasing their answers by showing your solution. This process helped us in two key ways: First, it validated that there was a real problem in the industry through the number of people who spoke about experiencing the same problem. Second, it allowed us to understand the exact scale and depth of the problem—e.g., how much money companies were spending on consumer research, what kind of tools they were currently using, etc. Narrow down your focus to a small, actionable area to solve initially. Once we were certain that there was a clear problem worth solving, we didn’t try to tackle everything at once. As a small team of two people, we started by focusing on a specific area of the problem—something big enough to matter but small enough for us to handle. Then, we approached customers with a potential solution and asked them for feedback. We learnt that our solution seemed promising, but we wanted to validate it further. If customers are willing to pay you for the solution, it’s a strong validation signal for market demand. One of our early customer interviewees even asked us to deliver the solution, which we did manually at first. We used machine learning models to analyse the data and presented the results in a slide deck. They paid us for the work, which was a critical moment. It meant we had something with real potential, and we had customers willing to pay us before we had even built the full product. This was the key validation that we needed. By the time we were ready to build the product, we had already gathered crucial insights from our early customers. We understood the specific information they wanted and how they wanted the results to be presented. This input was invaluable in shaping the development of our final product. Building & Product Development Start with a simple concept/design to validate with customers before building When we realised the problem and solution, we began by designing the product, but not by jumping straight into coding. Instead, we created wireframes and user interfaces using tools like InVision and Figma. This allowed us to visually represent the product without the need for backend or frontend development at first. The goal was to showcase how the product would look and feel, helping potential customers understand its value before we even started building. We showed these designs to potential customers and asked for feedback. Would they want to buy this product? Would they pay for it? We didn’t dive into actual development until we found a customer willing to pay a significant amount for the solution. This approach helped us ensure we were on the right track and didn’t waste time or resources building something customers didn’t actually want. Deliver your solution using a manual consulting approach before developing an automated product Initially, we solved problems for customers in a more "consulting" manner, delivering insights manually. Recall how I mentioned that when one of our early customer interviewees asked us to deliver the solution, we initially did it manually by using machine learning models to analyse the data and presenting the results to them in a slide deck. This works for the initial stages of validating your solution, as you don't want to invest too much time into building a full-blown MVP before understanding the exact features and functionalities that your users want. However, after confirming that customers were willing to pay for what we provided, we moved forward with actual product development. This shift from a manual service to product development was key to scaling in a sustainable manner, as our building was guided by real-world feedback and insights rather than intuition. Let ongoing customer feedback drive iteration and the product roadmap Once we built the first version of the product, it was basic, solving only one problem. But as we worked closely with customers, they requested additional features and functionalities to make it more useful. As a result, we continued to evolve the product to handle more complex use cases, gradually developing new modules based on customer feedback. Product development is a continuous process. Our early customers pushed us to expand features and modules, from solving just 20% of their problems to tackling 50–60% of their needs. These demands shaped our product roadmap and guided the development of new features, ultimately resulting in a more complete solution. Revenue and user numbers are key metrics for assessing product-market fit. However, critical mass varies across industries Product-market fit (PMF) can often be gauged by looking at the size of your revenue and the number of customers you're serving. Once you've reached a certain critical mass of customers, you can usually tell that you're starting to hit product-market fit. However, this critical mass varies by industry and the type of customers you're targeting. For example, if you're building an app for a broad consumer market, you may need thousands of users. But for enterprise software, product-market fit may be reached with just a few dozen key customers. Compare customer engagement and retention with other available solutions on the market for product-market fit Revenue and the number of customers alone isn't always enough to determine if you're reaching product-market fit. The type of customer and the use case for your product also matter. The level of engagement with your product—how much time users are spending on the platform—is also an important metric to track. The more time they spend, the more likely it is that your product is meeting a crucial need. Another way to evaluate product-market fit is by assessing retention, i.e whether users are returning to your platform and relying on it consistently, as compared to other solutions available. That's another key indication that your solution is gaining traction in the market. Business Model & Monetisation Prioritise scalability Initially, we started with a consulting-type model where we tailor-made specific solutions for each customer use-case we encountered and delivered the CPG insights manually, but we soon realized that this wasn't scalable. The problem with consulting is that you need to do the same work repeatedly for every new project, which requires a large team to handle the workload. That is not how you sustain a high-growth startup. To solve this, we focused on building a product that would address the most common problems faced by our customers. Once built, this product could be sold to thousands of customers without significant overheads, making the business scalable. With this in mind, we decided on a SaaS (Software as a Service) business model. The benefit of SaaS is that once you create the software, you can sell it to many customers without adding extra overhead. This results in a business with higher margins, where the same product can serve many customers simultaneously, making it much more efficient than the consulting model. Adopt a predictable, simplistic business model for efficiency. Look to industry practices for guidance When it came to monetisation, we considered the needs of our CPG customers, who I knew from experience were already accustomed to paying annual subscriptions for sales databases and other software services. We decided to adopt the same model and charge our customers an annual upfront fee. This model worked well for our target market, aligning with industry standards and ensuring stable, recurring revenue. Moreover, our target CPG customers were already used to this business model and didn't have to choose from a huge variety of payment options, making closing sales a straightforward and efficient process. Marketing & Sales Educate the market to position yourself as a thought leader When we started, AI was not widely understood, especially in the CPG industry. We had to create awareness around both AI and its potential value. Our strategy focused on educating potential users and customers about AI, its relevance, and why they should invest in it. This education was crucial to the success of our marketing efforts. To establish credibility, we adopted a thought leadership approach. We wrote blogs on the importance of AI and how it could solve problems for CPG companies. We also participated in events and conferences to demonstrate our expertise in applying AI to the industry. This helped us build our brand and reputation as leaders in the AI space for CPG, and word-of-mouth spread as customers recognized us as the go-to company for AI solutions. It’s tempting for startups to offer products for free in the hopes of gaining early traction with customers, but this approach doesn't work in the long run. Free offerings don’t establish the value of your product, and customers may not take them seriously. You should always charge for pilots, even if the fee is minimal, to ensure that the customer is serious about potentially working with you, and that they are committed and engaged with the product. Pilots/POCs/Demos should aim to give a "flavour" of what you can deliver A paid pilot/POC trial also gives you the opportunity to provide a “flavour” of what your product can deliver, helping to build confidence and trust with the client. It allows customers to experience a detailed preview of what your product can do, which builds anticipation and desire for the full functionality. During this phase, ensure your product is built to give them a taste of the value you can provide, which sets the stage for a broader, more impactful adoption down the line. Fundraising & Financial Management Leverage PR to generate inbound interest from VCs When it comes to fundraising, our approach was fairly traditional—we reached out to VCs and used connections from existing investors to make introductions. However, looking back, one thing that really helped us build momentum during our fundraising process was getting featured in Tech in Asia. This wasn’t planned; it just so happened that Tech in Asia was doing a series on AI startups in Southeast Asia and they reached out to us for an article. During the interview, they asked if we were fundraising, and we mentioned that we were. As a result, several VCs we hadn’t yet contacted reached out to us. This inbound interest was incredibly valuable, and we found it far more effective than our outbound efforts. So, if you can, try to generate some PR attention—it can help create inbound interest from VCs, and that interest is typically much stronger and more promising than any outbound strategies because they've gone out of their way to reach out to you. Be well-prepared and deliberate about fundraising. Keep trying and don't lose heart When pitching to VCs, it’s crucial to be thoroughly prepared, as you typically only get one shot at making an impression. If you mess up, it’s unlikely they’ll give you a second chance. You need to have key metrics at your fingertips, especially if you're running a SaaS company. Be ready to answer questions like: What’s your retention rate? What are your projections for the year? How much will you close? What’s your average contract value? These numbers should be at the top of your mind. Additionally, fundraising should be treated as a structured process, not something you do on the side while juggling other tasks. When you start, create a clear plan: identify 20 VCs to reach out to each week. By planning ahead, you’ll maintain momentum and speed up the process. Fundraising can be exhausting and disheartening, especially when you face multiple rejections. Remember, you just need one investor to say yes to make it all worthwhile. When using funds, prioritise profitability and grow only when necessary. Don't rely on funding to survive. In the past, the common advice for startups was to raise money, burn through it quickly, and use it to boost revenue numbers, even if that meant operating at a loss. The idea was that profitability wasn’t the main focus, and the goal was to show rapid growth for the next funding round. However, times have changed, especially with the shift from “funding summer” to “funding winter.” My advice now is to aim for profitability as soon as possible and grow only when it's truly needed. For example, it’s tempting to hire a large team when you have substantial funds in the bank, but ask yourself: Do you really need 10 new hires, or could you get by with just four? Growing too quickly can lead to unnecessary expenses, so focus on reaching profitability as soon as possible, rather than just inflating your team or burn rate. The key takeaway is to spend your funds wisely and only when absolutely necessary to reach profitability. You want to avoid becoming dependent on future VC investments to keep your company afloat. Instead, prioritize reaching break-even as quickly as you can, so you're not reliant on external funding to survive in the long run. Team-Building & Leadership Look for complementary skill sets in co-founders When choosing a co-founder, it’s important to find someone with a complementary skill set, not just someone you’re close to. For example, I come from a business and commercial background, so I needed someone with technical expertise. That’s when I found my co-founder, Himanshu, who had experience in machine learning and AI. He was a great match because his technical knowledge complemented my business skills, and together we formed a strong team. It might seem natural to choose your best friend as your co-founder, but this can often lead to conflict. Chances are, you and your best friend share similar interests, skills, and backgrounds, which doesn’t bring diversity to the table. If both of you come from the same industry or have the same strengths, you may end up butting heads on how things should be done. Having diverse skill sets helps avoid this and fosters a more collaborative working relationship. Himanshu (left) and Somsubhra (right) co-founded AI Palette in 2018 Define roles clearly to prevent co-founder conflict To avoid conflict, it’s essential that your roles as co-founders are clearly defined from the beginning. If your co-founder and you have distinct responsibilities, there is no room for overlap or disagreement. This ensures that both of you can work without stepping on each other's toes, and there’s mutual respect for each other’s expertise. This is another reason as to why it helps to have a co-founder with a complementary skillset to yours. Not only is having similar industry backgrounds and skillsets not particularly useful when building out your startup, it's also more likely to lead to conflicts since you both have similar subject expertise. On the other hand, if your co-founder is an expert in something that you're not, you're less likely to argue with them about their decisions regarding that aspect of the business and vice versa when it comes to your decisions. Look for employees who are driven by your mission, not salary For early-stage startups, the first hires are crucial. These employees need to be highly motivated and excited about the mission. Since the salary will likely be low and the work demanding, they must be driven by something beyond just the paycheck. The right employees are the swash-buckling pirates and romantics, i.e those who are genuinely passionate about the startup’s vision and want to be part of something impactful beyond material gains. When employees are motivated by the mission, they are more likely to stick around and help take the startup to greater heights. A litmus test for hiring: Would you be excited to work with them on a Sunday? One of the most important rounds in the hiring process is the culture fit round. This is where you assess whether a candidate shares the same values as you and your team. A key question to ask yourself is: "Would I be excited to work with this person on a Sunday?" If there’s any doubt about your answer, it’s likely not a good fit. The idea is that you want employees who align with the company's culture and values and who you would enjoy collaborating with even outside of regular work hours. How we structure the team at AI Palette We have three broad functions in our organization. The first two are the big ones: Technical Team – This is the core of our product and technology. This team is responsible for product development and incorporating customer feedback into improving the technology Commercial Team – This includes sales, marketing, customer service, account managers, and so on, handling everything related to business growth and customer relations. General and Administrative Team – This smaller team supports functions like finance, HR, and administration. As with almost all businesses, we have teams that address the two core tasks of building (technical team) and selling (commercial team), but given the size we're at now, having the administrative team helps smoothen operations. Set broad goals but let your teams decide on execution What I've done is recruit highly skilled people who don't need me to micromanage them on a day-to-day basis. They're experts in their roles, and as Steve Jobs said, when you hire the right person, you don't have to tell them what to do—they understand the purpose and tell you what to do. So, my job as the CEO is to set the broader goals for them, review the plans they have to achieve those goals, and periodically check in on progress. For example, if our broad goal is to meet a certain revenue target, I break it down across teams: For the sales team, I’ll look at how they plan to hit that target—how many customers they need to sell to, how many salespeople they need, and what tactics and strategies they plan to use. For the technical team, I’ll evaluate our product offerings—whether they think we need to build new products to attract more customers, and whether they think it's scalable for the number of customers we plan to serve. This way, the entire organization's tasks are cascaded in alignment with our overarching goals, with me setting the direction and leaving the details of execution to the skilled team members that I hire.

Upselling from $8/mo to $2k/mo
reddit
LLM Vibe Score0
Human Vibe Score1
Afraid-Astronomer130This week

Upselling from $8/mo to $2k/mo

I just closed a client for $1947/mo. But 5 months ago he was spending only $8/mo. Most customers have way more purchasing power than you think. Unlock it with the power of stacking. Here's my 3-steps stacking formula: Step 1 - Build trust with a low-ticket product In a world full of scams and deceit, building trust is damn hard. The best way to combat skepticism is through a free or low-ticket product, where you can go above and beyond to demonstrate your credibility. When I first onboarded this client onto my SaaS, an AI to help you with HARO link-building, my product was at a very early stage with many rough edges. He gave me lots of great feedback. I implemented his suggestions the same day and got more feedback from him. After a couple of back-and-forths, I established myself as a trustworthy hustler, instead of just a stranger online. This is easy to do for an agile startup but impossible for big companies, so make good use of opportunities like this to build long-term relationships. Turn your customers into raving fans. Step 2 - Validate a mid-ticket offer Three months into his subscription, he told me he wanted to cancel. When digging into the why, he suggested a performance-based DFY service to remove all the work on his end. Inspired by his suggestion, I took on him and 6 other clients for $237, a one-time package for 1 backlink. It's sold through my newsletter email blast to 300 subscribers, with a total CAC of $0. I wrote about the details of this launch in another long form. At this price range, impulsive purchases can still happen if you have a strong offer and good copywriting. Use this mid-ticket offer to validate your offer and positioning, build out a team, and establish trust. We went beyond the 1 link for almost all our clients, including this one in particular. For $237, we got him on Forbes, HubSpot, 2 DR50+ sites, and a few other smaller media outlets. By doing this, we further built trust into the relationship and established authority in what we do. Step 3 - Create a high-ticket subscription-based offer By now, you'll hopefully have built enough trust to get through the skepticism filter for something high-ticket. Now, it's time to develop an offer that amplifies your previous one. Something that allows you to let your clients achieve their goals to the maximum extent. For me, this is pitching every relevant media query on every platform for this client every day, to leverage HARO link-building to its full extent, all for a fixed price of $1947/mo. This customized offer is based on direct client feedback, isn't publicized on our website, but we're confident it will directly contribute to achieving this client's goal. A subscription-based offer is much superior because it allows you to create a stable source of revenue, especially at the early stage. That's how I created 3 different offers to solve the same problem for one client. By stacking each offer on top of the previous one, I was able to guide clients from one option to the next. This formula isn't some new rocket science I came up with. It's proven over and over again by other agency owners building in public, like Nick from Baked Design who started with a $9 design kit and now sells $9k/mo design subscriptions at $1M ARR. By stacking offers, you position yourself as a committed partner in your client's long-term success. Lastly, I want to address a common objection: "My customers can't afford $2k/month." But consider this: most people are reading your site on their $3000 MacBook or $1000 iPhone. It's not that they lack the funds, it's more likely that your service isn't meeting their expectations. Talk to them to discover the irresistible offer they'll gladly pay for. Update: lots of DM asking about more specifics so I wrote about it here. https://coldstartblueprint.com/p/ai-agent-email-list-building

Why you should consider using small open source fine-tuned models
reddit
LLM Vibe Score0
Human Vibe Score0.929
hamada0001This week

Why you should consider using small open source fine-tuned models

Context I want to start off by giving some context on what fine-tuning is, why it's useful and who it would be useful for: What is fine-tuning? When controlling the output of an LLM there are, broadly, three levels. Prompt engineering, RAG and fine-tuning. Most of you are likely familiar with the first two. Prompt engineering is when you try to optimize the prompt to get the model to do what you want better. RAG (retrieval augmented generation) is when you first do a search on some data (usually stored in a vector database which allows you to search by similarity), then you insert the results into the prompt so that the model can use that context to more accurately answer any questions. It's like letting the LLM access external information right before answering, using that additional context to improve its response Fine-tuning is when you want to fundamentally teach a model something new or teach it to behave in a particular way. You would provide the model with high quality data (i.e. inputs and outputs) which it will train on. Why is it useful? At the moment, many of you use the largest and best LLMs because they give the best results. However, for a lot of use cases you are likely using a sledgehammer for a small nail. Does it do a great job? Damn yeah! Well... why not use a smaller hammer? Because it might miss or hit your finger. The solution shouldn't be to use a sledgehammer, but rather to learn how to use a smaller hammer properly so you never miss! That's exactly what fine-tuning a smaller model is like. Once you fine-tune it on a specific task with good high quality data, it can surpass even the best models at that specific task. It'll be 10x cheaper to run, much faster and, if you use an open source model, you'll own the model (no vendor lock-in!). If you run a SaaS and your biggest expense is AI costs then you should definitely consider fine-tuning. It'll take some time to set up but it'll be well worth it in the medium/long term (a bit like SEO). You can always resort to the best models for more complex tasks. How to fine-tune? I'm going to give you a breakdown of the process from beginning to end. You do need to be (a bit) technical in order to do this. Getting the data Let's suppose we want to fine-tune a model to make high-quality SEO content. At the moment, you might be using a large sophisticated prompt or using multiple large LLMs to write different parts or utilizing RAG. This is all slow and expensive but might be giving you great results. Our goal is to replace this with a fine-tuned model that is great at one thing: writing high-quality SEO content quickly at a much lower cost. The first step is gathering the appropriate data. If you want the model to write 3 or 4 paragraphs based on a prompt that contains the topic and a few keywords, then your data should match that. There are a few way you can do this: You can manually gather high-quality SEO content. You'd write the prompt and the response that the model should give. You can use a larger more powerful LLM to generate the content for you (also known as synthetic data). It'll be expensive but remember that it'll be a larger one-off cost to get the data. If you already have a pipeline that works great then you can use the prompts and the generated content that you already have from that pipeline. You can buy a high-quality dataset or get someone to make it for you. The data is the most important part of this process. Remember, garbage in garbage out. Your data needs to have a good variety and should not contain any bad examples. You should aim for around 1000 examples. The more the better! The actual fine-tuning. At this stage you are now ready to choose a model and setup the fine-tuning. If you are unsure I'd stick to the Llama 3.1 family of models. They are great and reliable. There are three models: 8b, 70b and 405b. Depending on the complexity of the task you should select an appropriate size. However, to really reap the cost saving benefits and the speed you should try to stick with the 8b model or the the 70b model if the 8b is not good enough. For our SEO example, let's use the 8b model. Important note on selecting a model: You might see multiple models with the 8b flag. You might see 4bit-bnb or instruct. The instruct version of the models have basically been trained to be chatbots. So if you want to keep the chatbot-like instruction-following functionality then you should use the instruct version as the base. The non-instruct version simply generates text. It won't 'act' like a chatbot which is better for use cases like creative writing. The 4bit-bnb means that the model has been 'quantized'. Basically it has been made 4x smaller (the original is in 16 bits) so that it is faster to download and faster to fine-tune. This slightly reduces the accuracy of the model but it's usually fine for most use cases :) Fine-tuning should be done on a good GPU. CPU aren't good enough. So you can't spin up a droplet on digital ocean and use that. You'll specifically need to spin up a GPU. One website that I think is great is Runpod .io (I am not affiliated with them). You simply pay for the GPU by the hour. If you want the training to be fast you can use the H100, if you want something cheaper but slower you can use the A40. Although the A40 won't be good enough to run the 70b parameter model. For the 405b model you'll need multiple H100s but let's leave that for more advanced use cases. Once you've spun up your H100 and ssh-ed into it. I would recommend using the unsloth open source library to do the fine-tuning. They have great docs and good boilerplate code. You want to train using a method called QLoRA. This won't train the entire model but only "part of it". I don't want to get into the technical details as t3hat isn't important but essentially it's a very efficient and effective way of fine-tuning models. When fine-tuning you can provide something called a 'validation set'. As your model is training it will be tested against the 'validation set' to see how well it's doing. You'll get an 'eval loss' which basically means how well is your model doing when compared with the unseen validation data. If you have 1000 training examples I'd recommend taking out 100-200 so it can act as the validation set. Your model may start off with an eval loss of 1.1 and by the end of the training (e.g. 3 epochs - the number of epochs is the number of times your model will be trained on the entire dataset. It's like reading a book more than once so you can understand it better. Usually 3-5 epochs is enough) the eval loss would drop to 0.6 or 0.7 which means your model has made great progress in learning your dataset! You don't want it to be too low as that means it is literally memorizing which isn't good. Post fine-tuning You'll want to save the model with the best eval loss. You actually won't have the whole model, just something called the "QLoRA adapters". These are basically like the new neurons that contain the "understanding" of the data you trained the model on. You can combine these with the base model (using unsloth again) to prompt the model. You can also (and I recommend this) convert the model to GGUF format (using unsloth again). This basically packages the QLoRA adapters and model together into an optimized format so you can easily and efficiently run it and prompt it (using unsloth again... lol). I would then recommend running some evaluations on the new model. You can do this by simply prompting the new model and a more powerful model (or using your old pipeline) and then asking a powerful model e.g. Claude to judge which is better. If your model consistently does better then you've hit a winner! You can then use runpod again to deploy the model to their serverless AI endpoint so you only pay when it's actually being inferenced. (Again, I'm not affiliated with them) I hope this was useful and you at least got a good idea of what fine-tuning is and how you might go about doing it. By the way, I've just launched a website where you can easily fine-tune Llama 3.1 models. I'm actually hoping to eventually automate this entire process as I believe small fine-tuned models will be much more common in the future. If you want more info, feel free to DM me :)

36 startup ideas found by analyzing podcasts (problem, solution & source episode)
reddit
LLM Vibe Score0
Human Vibe Score1
joepigeonThis week

36 startup ideas found by analyzing podcasts (problem, solution & source episode)

Hey, I've been a bit of a podcast nerd for a long time. Around a year ago I began experimenting with transcription of podcasts for a SaaS I was running. I realized pretty quickly that there's a lot of knowledge and value in podcast discussions that is for all intents and purposes entirely unsearchable or discoverable to most people. I ended up stopping work on that SaaS product (party for lack of product/market fit, and partly because podcasting was far more interesting), and focusing on the podcast technology full-time instead. I'm a long-time lurker and poster of r/startups and thought this would make for some interesting content and inspiration for folks. Given I'm in this space, have millions of transcripts, and transcribe thousands daily... I've been exploring fun ways to expose some of the interesting knowledge and conversations taking place that utilize our own data/API. I'm a big fan of the usual startup podcasts (My First Million, Greg Isenberg, etc. etc.) and so I built an automation that turns all of the startup ideas discussed into a weekly email digest. I always struggle to listen to as many episodes as I'd actually like to, so I thought I'd summarise the stuff I care about instead (startup opportunities being discussed). I thought it would be interesting to post some of the ideas extracted so far. They range from being completely whacky and blue sky, to pretty boring but realistic. A word of warning before anyone complains – this is a big mixture of tech, ai, non-tech, local services, etc. ideas: Some of the ideas are completely mundane, but realistic (e.g. local window cleaning service) Some of the ideas are completely insane, blue sky, but sound super interesting Here's the latest 36 ideas: |Idea Name|Problem|Solution|Source| |:-|:-|:-|:-| |SalesForce-as-a-Service - White Label Enterprise Sales Teams|White-label enterprise sales teams for B2B SaaS. Companies need sales but can't hire/train. Recruit retail sellers, train for tech, charge 30% of deals closed.|Create a white-label enterprise sales team by recruiting natural salespeople from retail and direct sales backgrounds (e.g. mall kiosks, cutco knives). Train them specifically in B2B SaaS sales techniques and processes. Offer this trained sales force to tech companies on a contract basis.|My First Million - "Life Hacks From The King of Introverts + 7 Business Ideas| |TechButler - Mobile Device Maintenance Service|Mobile tech maintenance service. Clean/optimize devices, improve WiFi, basic support. $100/visit to homes. Target affluent neighborhoods.|Mobile tech support service providing in-home device cleaning, optimization, and setup. Focus on common issues like WiFi improvement, device maintenance, and basic tech support.|My First Million - "Life Hacks From The King of Introverts + 7 Business Ideas| |MemoryBox - At-Home Video Digitization Service|Door-to-door VHS conversion service. Parents have boxes of old tapes. Pick up, digitize, deliver. $30/tape with minimum order. Going extinct.|Door-to-door VHS to digital conversion service that handles everything from pickup to digital delivery. Make it extremely convenient for customers to preserve their memories.|My First Million - "Life Hacks From The King of Introverts + 7 Business Ideas| |Elite Match Ventures - Success-Based Luxury Matchmaking|High-end matchmaking for 50M+ net worth individuals. Only charge $1M+ when they get married. No upfront fees. Extensive vetting process.|Premium matchmaking service exclusively for ultra-high net worth individuals with a pure contingency fee model - only get paid ($1M+) upon successful marriage. Focus on quality over quantity with extensive vetting and personalized matching.|My First Million - "Life Hacks From The King of Introverts + 7 Business Ideas| |LocalHost - Simple Small Business Websites|Simple WordPress sites for local businesses. $50/month includes hosting, updates, security. Target restaurants and shops. Recurring revenue play.|Simplified web hosting and WordPress management service targeting local small businesses. Focus on basic sites with standard templates, ongoing maintenance, and reliable support for a fixed monthly fee.|My First Million - "Life Hacks From The King of Introverts + 7 Business Ideas| |VoiceJournal AI - Voice-First Smart Journaling|Voice-to-text journaling app with AI insights. 8,100 monthly searches. $15/month subscription. Partners with journaling YouTubers.|AI-powered journaling app that combines voice recording, transcription, and intelligent insights. Users can speak their thoughts, which are automatically transcribed and analyzed for patterns, emotions, and actionable insights.|Where It Happens - "7 $1M+ AI startup ideas you can launch tomorrow with $0"| |AIGenAds - AI-Generated UGC Content Platform|AI platform turning product briefs into UGC-style video ads. Brands spending $500/video for human creators. Generate 100 variations for $99/month.|AI platform that generates UGC-style video ads using AI avatars and scripting. System would allow rapid generation of multiple ad variations at a fraction of the cost. Platform would use existing AI avatar technology combined with script generation to create authentic-looking testimonial-style content.|Where It Happens - "7 $1M+ AI startup ideas you can launch tomorrow with $0"| |InfographAI - Automated Infographic Generation Platform|AI turning blog posts into branded infographics. Marketers spending hours on design. $99/month unlimited generation.|AI-powered platform that automatically converts blog posts and articles into visually appealing infographics. System would analyze content, extract key points, and generate professional designs using predefined templates and brand colors.|Where It Happens - "7 $1M+ AI startup ideas you can launch tomorrow with $0"| |KidFinance - Children's Financial Education Entertainment|Children's media franchise teaching financial literacy. Former preschool teacher creating 'Dora for money'. Books, videos, merchandise potential.|Character-driven financial education content for kids, including books, videos, and potentially TV show. Focus on making money concepts fun and memorable.|The Side Hustle Show - "How a Free Challenge Turned Into a $500,000 a Year Business (Greatest Hits)"| |FinanceTasker - Daily Financial Task Challenge|Free 30-day financial challenge with daily action items. People overwhelmed by money management. Makes $500k/year through books, speaking, and premium membership.|A free 30-day financial challenge delivering one simple, actionable task per day via email. Each task includes detailed scripts and instructions. Participants join a Facebook community for support and accountability. The program focuses on quick wins to build momentum. Automated delivery allows scaling.|The Side Hustle Show - "How a Free Challenge Turned Into a $500,000 a Year Business (Greatest Hits)"| |FinanceAcademy - Expert Financial Training Platform|Premium financial education platform. $13/month for expert-led courses and live Q&As. 4000+ members generating $40k+/month.|Premium membership site with expert-led courses, live Q&As, and community support. Focus on specific topics like real estate investing, business creation, and advanced money management.|The Side Hustle Show - "How a Free Challenge Turned Into a $500,000 a Year Business (Greatest Hits)"| |SecurityFirst Compliance - Real Security + Compliance Platform|Security-first compliance platform built by hackers. Companies spending $50k+ on fake security. Making $7M/year showing why current solutions don't work.|A compliance platform built by security experts that combines mandatory compliance requirements with real security measures. The solution includes hands-on security testing, expert guidance, and a focus on actual threat prevention rather than just documentation. It merges traditional compliance workflows with practical security implementations.|In the Pit with Cody Schneider| |LinkedInbound - Automated Professional Visibility Engine|LinkedIn automation for inbound job offers. Professionals spending hours on manual outreach. $99/month per job seeker.|Automated system for creating visibility and generating inbound interest on LinkedIn through coordinated profile viewing and engagement. Uses multiple accounts to create visibility patterns that trigger curiosity and inbound messages.|In the Pit with Cody Schneider| |ConvoTracker - Community Discussion Monitoring Platform|Community discussion monitoring across Reddit, Twitter, HN. Companies missing sales opportunities. $499/month per brand tracked.|Comprehensive monitoring system that tracks competitor mentions and industry discussions across multiple platforms (Reddit, Twitter, Hacker News, etc.) with automated alerts and engagement suggestions.|In the Pit with Cody Schneider| |ContentAds Pro - Smart Display Ad Implementation|Display ad implementation service for content creators. Bloggers losing thousands in ad revenue monthly. Makes $3-5k per site setup plus ongoing optimization fees.|Implementation of professional display advertising through networks like Mediavine that specialize in optimizing ad placement and revenue while maintaining user experience. Include features like turning off ads for email subscribers and careful placement to minimize impact on core metrics.|The Side Hustle Show - "636: Is Business Coaching Worth It? A Look Inside the last 12 months of Side Hustle Nation"| |MoneyAppReviews - Professional Side Hustle App Testing|Professional testing service for money-making apps. People wasting time on low-paying apps. Makes $20k/month from affiliate commissions and ads.|Professional app testing service that systematically reviews money-making apps and creates detailed, honest reviews including actual earnings data, time investment, and practical tips.|The Side Hustle Show - "636: Is Business Coaching Worth It? A Look Inside the last 12 months of Side Hustle Nation"| |LightPro - Holiday Light Installation Service|Professional Christmas light installation service. Homeowners afraid of ladders. $500-2000 per house plus storage.|Professional Christmas light installation service targeting residential and commercial properties. Full-service offering including design, installation, maintenance, removal and storage. Focus on safety and premium aesthetic results.|The Side Hustle Show - "639: 30 Ways to Make Extra Money for the Holidays"| |FocusMatch - Research Participant Marketplace|Marketplace connecting companies to paid research participants. Companies spending weeks finding people. $50-150/hour per study.|Online platform connecting companies directly with paid research participants. Participants create detailed profiles and get matched to relevant studies. Companies get faster access to their target demographic while participants earn money sharing opinions.|The Side Hustle Show - "639: 30 Ways to Make Extra Money for the Holidays"| |SolarShine Pro - Specialized Solar Panel Cleaning Service|Solar panel cleaning service using specialized equipment. Panels lose 50% efficiency when dirty. $650 per job, automated scheduling generates $18k/month from repeat customers.|Professional solar panel cleaning service using specialized deionized water system and European cleaning equipment. Includes automated 6-month scheduling, professional liability coverage, and warranty-safe cleaning processes. Service is bundled with inspection and performance monitoring.|The UpFlip Podcast - "156. $18K/Month with This ONE Service — Niche Business Idea"| |ExteriorCare Complete - One-Stop Exterior Maintenance Service|One-stop exterior home cleaning service (solar, windows, gutters, bird proofing). Automated scheduling. $650 average ticket. 60% repeat customers on 6-month contracts.|All-in-one exterior cleaning service offering comprehensive maintenance packages including solar, windows, gutters, roof cleaning and bird proofing. Single point of contact, consistent quality, and automated scheduling for all services.|The UpFlip Podcast - "156. $18K/Month with This ONE Service — Niche Business Idea"| |ContentMorph - Automated Cross-Platform Content Adaptation|AI platform converting blog posts into platform-optimized social content. Marketing teams spending 5hrs/post on manual adaptation. $199/mo per brand with 50% margins.|An AI-powered platform that automatically transforms long-form content (blog posts, podcasts, videos) into platform-specific formats (Instagram reels, TikToks, tweets). The system would preserve brand voice while optimizing for each platform's unique requirements and best practices.|Entrepreneurs on Fire - "Digital Threads: The Entrepreneur Playbook for Digital-First Marketing with Neal Schaffer"| |MarketerMatch - Verified Digital Marketing Talent Marketplace|Marketplace for pre-vetted digital marketing specialists. Entrepreneurs spending 15hrs/week on marketing tasks. Platform takes 15% commission averaging $900/month per active client.|A specialized marketplace exclusively for digital marketing professionals, pre-vetted for specific skills (video editing, social media, SEO, etc.). Platform includes skill verification, portfolio review, and specialization matching.|Entrepreneurs on Fire - "Digital Threads: The Entrepreneur Playbook for Digital-First Marketing with Neal Schaffer"| |Tiger Window Cleaning - Premium Local Window Service|Local window cleaning service targeting homeowners. Traditional companies charging 2x market rate. Making $10k/month from $200 initial investment.|Local window cleaning service combining competitive pricing ($5/pane), excellent customer service, and quality guarantees. Uses modern tools like water-fed poles for efficiency. Implements systematic approach to customer communication and follow-up.|The Side Hustle Show - "630: How this College Student’s Side Hustle Brings in $10k a Month"| |RealViz3D - Real Estate Visualization Platform|3D visualization service turning architectural plans into photorealistic renderings for real estate agents. Agents struggling with unbuilt property sales. Making $30-40k/year per operator.|Professional 3D modeling and rendering service that creates photorealistic visualizations of properties before they're built or renovated. The service transforms architectural plans into immersive 3D representations that show lighting, textures, and realistic details. This helps potential buyers fully understand and connect with the space before it physically exists.|Side Hustle School - "#2861 - TBT: An Architect’s Side Hustle in 3D Real Estate Modeling"| |Somewhere - Global Talent Marketplace|Platform connecting US companies with vetted overseas talent. Tech roles costing $150k locally filled for 50% less. Grew from $15M to $52M valuation in 9 months.|Platform connecting US companies with pre-vetted overseas talent at significantly lower rates while maintaining high quality. Handles payments, contracts, and quality assurance to remove friction from global hiring.|My First Million - "I Lost Everything Twice… Then Made $26M In 18 Months| |GymLaunch - Rapid Gym Turnaround Service|Consultants flying to struggling gyms to implement proven member acquisition systems. Gym owners lacking sales expertise. Made $100k in first 21 days.|Expert consultants fly in to implement proven member acquisition systems, train staff, and rapidly fill gyms with new members. The service combines sales training, marketing automation, and proven conversion tactics to transform struggling gyms into profitable businesses within weeks.|My First Million - "I Lost Everything Twice… Then Made $26M In 18 Months| |PublishPlus - Publishing Backend Monetization|Backend monetization system for publishing companies. One-time customers becoming recurring revenue. Grew business from $2M to $110M revenue.|Add complementary backend products and services to increase customer lifetime value. Develop software tools and additional services that natural extend from initial publishing product. Focus on high-margin recurring revenue streams.|My First Million - "I Lost Everything Twice… Then Made $26M In 18 Months| |WelcomeBot - Automated Employee Onboarding Platform|Automated employee welcome platform. HR teams struggling with consistent onboarding. $99/month per 100 employees.|An automated onboarding platform that creates personalized welcome experiences through pre-recorded video messages, scheduled check-ins, and automated swag delivery. The platform would ensure consistent high-quality onboarding regardless of timing or location.|Entrepreneurs on Fire - "Free Training on Building Systems and Processes to Scale Your Business with Chris Ronzio: An EOFire Classic from 2021"| |ProcessBrain - Business Knowledge Documentation Platform|SaaS platform turning tribal knowledge into documented processes. Business owners spending hours training new hires. $199/month per company.|A software platform that makes it easy to document and delegate business processes and procedures. The platform would include templates, guided documentation flows, and tools to easily share and update procedures. It would help businesses create a comprehensive playbook of their operations.|Entrepreneurs on Fire - "Free Training on Building Systems and Processes to Scale Your Business with Chris Ronzio: An EOFire Classic from 2021"| |TradeMatch - Modern Manufacturing Job Marketplace|Modern job board making manufacturing sexy again. Factory jobs paying $40/hr but can't recruit. $500 per successful referral.|A specialized job marketplace and recruitment platform focused exclusively on modern manufacturing and trade jobs. The platform would combine TikTok-style content marketing, referral programs, and modern UX to make manufacturing jobs appealing to Gen Z and young workers. Would leverage existing $500 referral fees and industry demand.|My First Million - "He Sold His Company For $15M, Then Got A Job At McDonald’s"| |GroundLevel - Executive Immersion Program|Structured program putting CEOs in front-line jobs. Executives disconnected from workers. $25k per placement.|A structured program that places executives and founders in front-line jobs (retail, warehouse, service) for 2-4 weeks with documentation and learning framework. Similar to Scott Heiferman's McDonald's experience but productized.|My First Million - "He Sold His Company For $15M, Then Got A Job At McDonald’s"| |OneStepAhead - Micro-Mentorship Marketplace|Marketplace for 30-min mentorship calls with people one step ahead. Professionals seeking specific guidance. Takes 15% of session fees.|MicroMentor Marketplace - Platform connecting people with mentors who are just one step ahead in their journey for focused, affordable micro-mentorship sessions.|Entrepreneurs on Fire - "How to Create an Unbroken Business with Michael Unbroken: An EOFire Classic from 2021"| |VulnerableLeader - Leadership Authenticity Training Platform|Leadership vulnerability training platform. Leaders struggling with authentic communication. $2k/month per company subscription.|Leadership Vulnerability Platform - A digital training platform combining assessment tools, guided exercises, and peer support to help leaders develop authentic communication skills. The platform would include real-world scenarios, video coaching, and measurable metrics for tracking leadership growth through vulnerability.|Entrepreneurs on Fire - "How to Create an Unbroken Business with Michael Unbroken: An EOFire Classic from 2021"| |NetworkAI - Smart Network Intelligence Platform|AI analyzing your network to find hidden valuable connections. Professionals missing opportunities in existing contacts. $49/month per user.|AI Network Navigator - Smart tool that analyzes your professional network across platforms, identifies valuable hidden connections, and suggests specific actionable ways to leverage relationships for mutual benefit.|Entrepreneurs on Fire - "How to Create an Unbroken Business with Michael Unbroken: An EOFire Classic from 2021"| |Porch Pumpkins - Seasonal Decoration Service|Full-service porch pumpkin decoration. Homeowners spend $300-1350 per season. One operator making $1M in 8 weeks seasonal revenue.|Full-service seasonal porch decoration service focused on autumn/Halloween, including design, installation, maintenance, and removal. Offering premium curated pumpkin arrangements with various package tiers.|My First Million - "The guy who gets paid $80K/yr to do nothing"| |Silent Companion - Professional Presence Service|Professional silent companions for lonely people. Huge problem in Japan/globally. $68/session, $80k/year per companion. Non-sexual, just presence.|A professional companion service where individuals can rent a non-judgmental, quiet presence for various activities. The companion provides silent company without the pressure of conversation or social performance. They accompany clients to events, meals, or just sit quietly together.|My First Million - "The guy who gets paid $80K/yr to do nothing"| Hope this is useful. If anyone would like to ensure I include any particular podcasts or episodes etc. in future posts, very happy to do so. I'll generally send \~5 ideas per week in a short weekly digest format (you can see the format I'd usually use in here: podcastmarketwatch.beehiiv.com). I find it mindblowing that the latest models with large context windows make it even possible to analyze full transcripts at such scale. It's a very exciting time we're living through! Would love some feedback on this stuff, happy to iterate and improve the analysis/ideas... or create a new newsletter on a different topic if anyone would like. Cheers!

How a founder built a B2B AI startup to serve with 65+ global brands (including Fortune500 companies) (I will not promote)
reddit
LLM Vibe Score0
Human Vibe Score1
Royal_Rest8409This week

How a founder built a B2B AI startup to serve with 65+ global brands (including Fortune500 companies) (I will not promote)

AI Palette is an AI-driven platform that helps food and beverage companies predict emerging product trends. I had the opportunity recently to sit down with the founder to get his advice on building an AI-first startup, which he'll be going through in this post. (I will not promote) About AI Palette: Co-founders: >!2 (Somsubhra GanChoudhuri, Himanshu Upreti)!!100+!!$12.7M USD!!AI-powered predictive analytics for the CPG (Consumer Packaged Goods) industry!!Signed first paying customer in the first year!!65+ global brands, including Cargill, Diageo, Ajinomoto, Symrise, Mondelez, and L’Oréal, use AI Palette!!Every new product launched has secured a paying client within months!!Expanded into Beauty & Personal Care (BPC), onboarding one of India’s largest BPC companies within weeks!!Launched multiple new product lines in the last two years, creating a unified suite for brand innovation!Identify the pain points in your industry for ideas* When I was working in the flavour and fragrance industry, I noticed a major issue CPG companies faced: launching a product took at least one to two years. For instance, if a company decided today to launch a new juice, it wouldn’t hit the market until 2027. This long timeline made it difficult to stay relevant and on top of trends. Another big problem I noticed was that companies relied heavily on market research to determine what products to launch. While this might work for current consumer preferences, it was highly inefficient since the product wouldn’t actually reach the market for several years. By the time the product launched, the consumer trends had already shifted, making that research outdated. That’s where AI can play a crucial role. Instead of looking at what consumers like today, we realised that companies should use AI to predict what they will want next. This allows businesses to create products that are ahead of the curve. Right now, the failure rate for new product launches is alarmingly high, with 8 out of 10 products failing. By leveraging AI, companies can avoid wasting resources on products that won’t succeed, leading to better, more successful launches. Start by talking to as many industry experts as possible to identify the real problems When we first had the idea for AI Palette, it was just a hunch, a gut feeling—we had no idea whether people would actually pay for it. To validate the idea, we reached out to as many people as we could within the industry. Since our focus area was all about consumer insights, we spoke to professionals in the CPG sector, particularly those in the insights departments of CPG companies. Through these early conversations, we began to see a common pattern emerge and identified the exact problem we wanted to solve. Don’t tell people what you’re building—listen to their frustrations and challenges first. Going into these early customer conversations, our goal was to listen and understand their challenges without telling them what we were trying to build. This is crucial as it ensures that you can gather as much data about the problem to truly understand it and that you aren't biasing their answers by showing your solution. This process helped us in two key ways: First, it validated that there was a real problem in the industry through the number of people who spoke about experiencing the same problem. Second, it allowed us to understand the exact scale and depth of the problem—e.g., how much money companies were spending on consumer research, what kind of tools they were currently using, etc. Narrow down your focus to a small, actionable area to solve initially. Once we were certain that there was a clear problem worth solving, we didn’t try to tackle everything at once. As a small team of two people, we started by focusing on a specific area of the problem—something big enough to matter but small enough for us to handle. Then, we approached customers with a potential solution and asked them for feedback. We learnt that our solution seemed promising, but we wanted to validate it further. If customers are willing to pay you for the solution, it’s a strong validation signal for market demand. One of our early customer interviewees even asked us to deliver the solution, which we did manually at first. We used machine learning models to analyse the data and presented the results in a slide deck. They paid us for the work, which was a critical moment. It meant we had something with real potential, and we had customers willing to pay us before we had even built the full product. This was the key validation that we needed. By the time we were ready to build the product, we had already gathered crucial insights from our early customers. We understood the specific information they wanted and how they wanted the results to be presented. This input was invaluable in shaping the development of our final product. Building & Product Development Start with a simple concept/design to validate with customers before building When we realised the problem and solution, we began by designing the product, but not by jumping straight into coding. Instead, we created wireframes and user interfaces using tools like InVision and Figma. This allowed us to visually represent the product without the need for backend or frontend development at first. The goal was to showcase how the product would look and feel, helping potential customers understand its value before we even started building. We showed these designs to potential customers and asked for feedback. Would they want to buy this product? Would they pay for it? We didn’t dive into actual development until we found a customer willing to pay a significant amount for the solution. This approach helped us ensure we were on the right track and didn’t waste time or resources building something customers didn’t actually want. Deliver your solution using a manual consulting approach before developing an automated product Initially, we solved problems for customers in a more "consulting" manner, delivering insights manually. Recall how I mentioned that when one of our early customer interviewees asked us to deliver the solution, we initially did it manually by using machine learning models to analyse the data and presenting the results to them in a slide deck. This works for the initial stages of validating your solution, as you don't want to invest too much time into building a full-blown MVP before understanding the exact features and functionalities that your users want. However, after confirming that customers were willing to pay for what we provided, we moved forward with actual product development. This shift from a manual service to product development was key to scaling in a sustainable manner, as our building was guided by real-world feedback and insights rather than intuition. Let ongoing customer feedback drive iteration and the product roadmap Once we built the first version of the product, it was basic, solving only one problem. But as we worked closely with customers, they requested additional features and functionalities to make it more useful. As a result, we continued to evolve the product to handle more complex use cases, gradually developing new modules based on customer feedback. Product development is a continuous process. Our early customers pushed us to expand features and modules, from solving just 20% of their problems to tackling 50–60% of their needs. These demands shaped our product roadmap and guided the development of new features, ultimately resulting in a more complete solution. Revenue and user numbers are key metrics for assessing product-market fit. However, critical mass varies across industries Product-market fit (PMF) can often be gauged by looking at the size of your revenue and the number of customers you're serving. Once you've reached a certain critical mass of customers, you can usually tell that you're starting to hit product-market fit. However, this critical mass varies by industry and the type of customers you're targeting. For example, if you're building an app for a broad consumer market, you may need thousands of users. But for enterprise software, product-market fit may be reached with just a few dozen key customers. Compare customer engagement and retention with other available solutions on the market for product-market fit Revenue and the number of customers alone isn't always enough to determine if you're reaching product-market fit. The type of customer and the use case for your product also matter. The level of engagement with your product—how much time users are spending on the platform—is also an important metric to track. The more time they spend, the more likely it is that your product is meeting a crucial need. Another way to evaluate product-market fit is by assessing retention, i.e whether users are returning to your platform and relying on it consistently, as compared to other solutions available. That's another key indication that your solution is gaining traction in the market. Business Model & Monetisation Prioritise scalability Initially, we started with a consulting-type model where we tailor-made specific solutions for each customer use-case we encountered and delivered the CPG insights manually, but we soon realized that this wasn't scalable. The problem with consulting is that you need to do the same work repeatedly for every new project, which requires a large team to handle the workload. That is not how you sustain a high-growth startup. To solve this, we focused on building a product that would address the most common problems faced by our customers. Once built, this product could be sold to thousands of customers without significant overheads, making the business scalable. With this in mind, we decided on a SaaS (Software as a Service) business model. The benefit of SaaS is that once you create the software, you can sell it to many customers without adding extra overhead. This results in a business with higher margins, where the same product can serve many customers simultaneously, making it much more efficient than the consulting model. Adopt a predictable, simplistic business model for efficiency. Look to industry practices for guidance When it came to monetisation, we considered the needs of our CPG customers, who I knew from experience were already accustomed to paying annual subscriptions for sales databases and other software services. We decided to adopt the same model and charge our customers an annual upfront fee. This model worked well for our target market, aligning with industry standards and ensuring stable, recurring revenue. Moreover, our target CPG customers were already used to this business model and didn't have to choose from a huge variety of payment options, making closing sales a straightforward and efficient process. Marketing & Sales Educate the market to position yourself as a thought leader When we started, AI was not widely understood, especially in the CPG industry. We had to create awareness around both AI and its potential value. Our strategy focused on educating potential users and customers about AI, its relevance, and why they should invest in it. This education was crucial to the success of our marketing efforts. To establish credibility, we adopted a thought leadership approach. We wrote blogs on the importance of AI and how it could solve problems for CPG companies. We also participated in events and conferences to demonstrate our expertise in applying AI to the industry. This helped us build our brand and reputation as leaders in the AI space for CPG, and word-of-mouth spread as customers recognized us as the go-to company for AI solutions. It’s tempting for startups to offer products for free in the hopes of gaining early traction with customers, but this approach doesn't work in the long run. Free offerings don’t establish the value of your product, and customers may not take them seriously. You should always charge for pilots, even if the fee is minimal, to ensure that the customer is serious about potentially working with you, and that they are committed and engaged with the product. Pilots/POCs/Demos should aim to give a "flavour" of what you can deliver A paid pilot/POC trial also gives you the opportunity to provide a “flavour” of what your product can deliver, helping to build confidence and trust with the client. It allows customers to experience a detailed preview of what your product can do, which builds anticipation and desire for the full functionality. During this phase, ensure your product is built to give them a taste of the value you can provide, which sets the stage for a broader, more impactful adoption down the line. Fundraising & Financial Management Leverage PR to generate inbound interest from VCs When it comes to fundraising, our approach was fairly traditional—we reached out to VCs and used connections from existing investors to make introductions. However, looking back, one thing that really helped us build momentum during our fundraising process was getting featured in Tech in Asia. This wasn’t planned; it just so happened that Tech in Asia was doing a series on AI startups in Southeast Asia and they reached out to us for an article. During the interview, they asked if we were fundraising, and we mentioned that we were. As a result, several VCs we hadn’t yet contacted reached out to us. This inbound interest was incredibly valuable, and we found it far more effective than our outbound efforts. So, if you can, try to generate some PR attention—it can help create inbound interest from VCs, and that interest is typically much stronger and more promising than any outbound strategies because they've gone out of their way to reach out to you. Be well-prepared and deliberate about fundraising. Keep trying and don't lose heart When pitching to VCs, it’s crucial to be thoroughly prepared, as you typically only get one shot at making an impression. If you mess up, it’s unlikely they’ll give you a second chance. You need to have key metrics at your fingertips, especially if you're running a SaaS company. Be ready to answer questions like: What’s your retention rate? What are your projections for the year? How much will you close? What’s your average contract value? These numbers should be at the top of your mind. Additionally, fundraising should be treated as a structured process, not something you do on the side while juggling other tasks. When you start, create a clear plan: identify 20 VCs to reach out to each week. By planning ahead, you’ll maintain momentum and speed up the process. Fundraising can be exhausting and disheartening, especially when you face multiple rejections. Remember, you just need one investor to say yes to make it all worthwhile. When using funds, prioritise profitability and grow only when necessary. Don't rely on funding to survive. In the past, the common advice for startups was to raise money, burn through it quickly, and use it to boost revenue numbers, even if that meant operating at a loss. The idea was that profitability wasn’t the main focus, and the goal was to show rapid growth for the next funding round. However, times have changed, especially with the shift from “funding summer” to “funding winter.” My advice now is to aim for profitability as soon as possible and grow only when it's truly needed. For example, it’s tempting to hire a large team when you have substantial funds in the bank, but ask yourself: Do you really need 10 new hires, or could you get by with just four? Growing too quickly can lead to unnecessary expenses, so focus on reaching profitability as soon as possible, rather than just inflating your team or burn rate. The key takeaway is to spend your funds wisely and only when absolutely necessary to reach profitability. You want to avoid becoming dependent on future VC investments to keep your company afloat. Instead, prioritize reaching break-even as quickly as you can, so you're not reliant on external funding to survive in the long run. Team-Building & Leadership Look for complementary skill sets in co-founders When choosing a co-founder, it’s important to find someone with a complementary skill set, not just someone you’re close to. For example, I come from a business and commercial background, so I needed someone with technical expertise. That’s when I found my co-founder, Himanshu, who had experience in machine learning and AI. He was a great match because his technical knowledge complemented my business skills, and together we formed a strong team. It might seem natural to choose your best friend as your co-founder, but this can often lead to conflict. Chances are, you and your best friend share similar interests, skills, and backgrounds, which doesn’t bring diversity to the table. If both of you come from the same industry or have the same strengths, you may end up butting heads on how things should be done. Having diverse skill sets helps avoid this and fosters a more collaborative working relationship. Himanshu (left) and Somsubhra (right) co-founded AI Palette in 2018 Define roles clearly to prevent co-founder conflict To avoid conflict, it’s essential that your roles as co-founders are clearly defined from the beginning. If your co-founder and you have distinct responsibilities, there is no room for overlap or disagreement. This ensures that both of you can work without stepping on each other's toes, and there’s mutual respect for each other’s expertise. This is another reason as to why it helps to have a co-founder with a complementary skillset to yours. Not only is having similar industry backgrounds and skillsets not particularly useful when building out your startup, it's also more likely to lead to conflicts since you both have similar subject expertise. On the other hand, if your co-founder is an expert in something that you're not, you're less likely to argue with them about their decisions regarding that aspect of the business and vice versa when it comes to your decisions. Look for employees who are driven by your mission, not salary For early-stage startups, the first hires are crucial. These employees need to be highly motivated and excited about the mission. Since the salary will likely be low and the work demanding, they must be driven by something beyond just the paycheck. The right employees are the swash-buckling pirates and romantics, i.e those who are genuinely passionate about the startup’s vision and want to be part of something impactful beyond material gains. When employees are motivated by the mission, they are more likely to stick around and help take the startup to greater heights. A litmus test for hiring: Would you be excited to work with them on a Sunday? One of the most important rounds in the hiring process is the culture fit round. This is where you assess whether a candidate shares the same values as you and your team. A key question to ask yourself is: "Would I be excited to work with this person on a Sunday?" If there’s any doubt about your answer, it’s likely not a good fit. The idea is that you want employees who align with the company's culture and values and who you would enjoy collaborating with even outside of regular work hours. How we structure the team at AI Palette We have three broad functions in our organization. The first two are the big ones: Technical Team – This is the core of our product and technology. This team is responsible for product development and incorporating customer feedback into improving the technology Commercial Team – This includes sales, marketing, customer service, account managers, and so on, handling everything related to business growth and customer relations. General and Administrative Team – This smaller team supports functions like finance, HR, and administration. As with almost all businesses, we have teams that address the two core tasks of building (technical team) and selling (commercial team), but given the size we're at now, having the administrative team helps smoothen operations. Set broad goals but let your teams decide on execution What I've done is recruit highly skilled people who don't need me to micromanage them on a day-to-day basis. They're experts in their roles, and as Steve Jobs said, when you hire the right person, you don't have to tell them what to do—they understand the purpose and tell you what to do. So, my job as the CEO is to set the broader goals for them, review the plans they have to achieve those goals, and periodically check in on progress. For example, if our broad goal is to meet a certain revenue target, I break it down across teams: For the sales team, I’ll look at how they plan to hit that target—how many customers they need to sell to, how many salespeople they need, and what tactics and strategies they plan to use. For the technical team, I’ll evaluate our product offerings—whether they think we need to build new products to attract more customers, and whether they think it's scalable for the number of customers we plan to serve. This way, the entire organization's tasks are cascaded in alignment with our overarching goals, with me setting the direction and leaving the details of execution to the skilled team members that I hire.

Am I Offering Too Many Services as a New Agency? Need Advice!
reddit
LLM Vibe Score0
Human Vibe Score1
DinoR6This week

Am I Offering Too Many Services as a New Agency? Need Advice!

Hey everyone, I recently made a big decision to close my marketing agency (which was doing well) and reopen with my partner, who’s a talented graphic designer specializing in logos and branding. We’re excited to start fresh, but I’m worried we might be biting off more than we can chew. Our current plan is to offer a wide range of services, including: Graphic Design (logos, branding, social media graphics) Web Design & Development (landing pages, e-commerce, custom websites) Digital Marketing (SEO, PPC, email marketing, social media management) Copywriting (website copy, blogs, product descriptions) AI Solutions (chatbots, virtual assistants) To make it easier for clients, we’ve grouped these services into 3 main packages: Business Packages (websites, branding, and marketing essentials) Branding Packages (logos, brand identity, and design assets) Social Media Packages (content creation, management, and strategy) I used to focus solely on marketing, but now we’re combining our skills to offer a full-service experience. While this feels like a natural progression, I’m concerned about spreading ourselves too thin, especially as a new agency. Has anyone been in a similar situation? Should we narrow our focus initially and expand later, or is offering a broad range of services (with clear packages) a good strategy to attract more clients? Any advice or experiences would be super helpful! Thanks in advance!

Restarting My Agency / Compared To Full Time Corporate
reddit
LLM Vibe Score0
Human Vibe Score1
nomadpaddyThis week

Restarting My Agency / Compared To Full Time Corporate

I’m currently thinking about going back to consulting / agency work compared to my current tech job I have. Over a year ago I signed this tech client and they wanted more and more from me which ended up becoming a full time role. At the time, the challenge excited me as it was working on a very large project on a global scale, competing with some of the biggest brands in the world. I was making good money before working in my agency and consulting with lots of different brands on their paid media, websites and e-commerce. I have a healthy package where I’m at at the moment but want more. Working with different clients always created curiosity, no day was the same and that what I loved about it. So now I’m considering going to back to starting the business and working with clients again. My question is: What do businesses ACTUALLY want? Everyone wants great roas and an amazing site but what are core things people are looking for in a growth partner / agency? I’m thinking of relaunching with three pillars in mind: Digital (Paid Media, Lead Gen, Web Dev) AI implementation as a lot of businesses don’t know how to leverage AI completely for cost saving and efficiencies. Content (Video, SEO, Content Writing) for modern day ranking I’m currently rebuilding my pitch deck and thought I would ask the question here before I go back to my network and start opening up conversations again. Would love to hear people’s thoughts in addition to anyone that’s done the same?

Randomly asked ChatGPT and Claude for a 4 year roadmap for an ML Engineer
reddit
LLM Vibe Score0
Human Vibe Score1
Brilliant_Fishing110This week

Randomly asked ChatGPT and Claude for a 4 year roadmap for an ML Engineer

Title, Is it actually a good plan ?? If no, why not ?? \\🚀 4-Year Roadmap to Becoming a High-Earning ML Engineer & Entrepreneur\\ \\(With Smartwork & Realistic 60-70% Execution Feasibility)\\ \\🟢 Year 1: Strong Foundation & Initial Projects (0-12 Months)\\ 🎯 \\Goal: Master Python & ML Fundamentals\\ \\🔹 1-4 Months (Python & Math Strengthening)\\ ✅ Python Mastery \- Daily LeetCode Easy problems (minimum 2) \- Build automation projects \- NumPy & Pandas mastery \- DSA fundamentals ✅ Mathematics Foundation \- Linear Algebra basics \- Statistics fundamentals \- Basic calculus concepts ✅ First Mini-Hackathon Participation \- Join beginner-friendly hackathons \- Focus on Python-based challenges \- Team up with other beginners 💡 \\Smart Move:\\ \- Join Discord/Slack hackathon communities \- Practice collaborative coding \- Build network with fellow participants \\🔹 5-8 Months (ML Foundations)\\ ✅ Machine Learning Basics \- Supervised Learning \- Model evaluation \- Feature engineering \- scikit-learn projects ✅ Participate in 2-3 ML Hackathons \- Kaggle Getting Started competitions \- Local ML hackathons \- University hackathons ✅ Start LinkedIn & GitHub Portfolio 💡 \\Smart Move:\\ \- Document hackathon experiences \- Share learnings on LinkedIn \- Focus on completion over winning \\🔹 9-12 Months (Deep Learning Introduction)\\ ✅ Basic Deep Learning \- Neural network fundamentals \- PyTorch basics \- Computer vision tasks \- Basic NLP ✅ Advanced Hackathon Participation \- AI/ML specific hackathons \- Team lead in 1-2 hackathons \- Start mentoring beginners \\🔵 Year 1 Expected Outcome (60-70% Execution)\\ ✔ \\Strong Python & ML foundations\\ ✔ \\5-6 hackathon participations\\ ✔ \\Active GitHub (100+ commits)\\ ✔ \\Growing LinkedIn (300+ connections)\\ 💰 \\Earning Expectation → ₹8K-₹20K per month (Projects/Internship)\\ \\🟢 Year 2: Professional Growth & Specialization (12-24 Months)\\ 🎯 \\Goal: Build Professional Experience & Recognition\\ \\🔹 1-6 Months (Technical Depth)\\ ✅ Advanced ML Topics \- Deep Learning architectures \- Computer Vision OR NLP \- MLOps basics (Docker, FastAPI) \- Cloud fundamentals (AWS/GCP) ✅ Hackathon Achievements \- Win minor prizes in 2-3 hackathons \- Lead teams in major hackathons \- Network with sponsors ✅ Start Technical Blogging 💡 \\Smart Move:\\ \- Focus on hackathon projects that align with career goals \- Build relationships with companies at hackathons \- Create detailed project documentation \\🔹 7-12 Months (Professional Experience)\\ ✅ Secure ML Role/Internship ✅ Advanced Project Building ✅ Open Source Contributions ✅ Organize Small Hackathons 💡 \\Smart Move:\\ \- Use hackathon network for job referrals \- Convert hackathon projects into full products \- Build mentor reputation \\🔵 Year 2 Expected Outcome (60-70% Execution)\\ ✔ \\Professional ML experience\\ ✔ \\10+ hackathon participations\\ ✔ \\1-2 hackathon wins\\ ✔ \\Strong industry network\\ 💰 \\Earning Expectation → ₹40K-₹70K per month (Job/Freelancing)\\ \\🟢 Year 3: Scaling & Business Foundation (24-36 Months)\\ 🎯 \\Goal: Establish Multiple Income Streams\\ \\🔹 1-4 Months (Expertise Building)\\ ✅ Choose Specialization \- MLOps \- Computer Vision \- NLP/LLMs \- Generative AI ✅ Advanced Competitions \- International hackathons \- High-prize competitions \- Corporate ML challenges ✅ Start Consulting Services 💡 \\Smart Move:\\ \- Use hackathon wins for marketing \- Build service packages around expertise \- Network with corporate sponsors \\🔹 5-8 Months (Business Development)\\ ✅ Scale Services ✅ Build Client Network ✅ Create Training Programs ✅ Hackathon Mentorship Program 💡 \\Smart Move:\\ \- Convert hackathon projects to products \- Use event networks for client acquisition \- Build authority through speaking \\🔹 9-12 Months (Growth & Innovation)\\ ✅ Product Development ✅ Team Building ✅ Innovation Focus ✅ Hackathon Organization \\🔵 Year 3 Expected Outcome (60-70% Execution)\\ ✔ \\Established ML business/career\\ ✔ \\Known in hackathon community\\ ✔ \\Multiple income streams\\ ✔ \\Strong industry presence\\ 💰 \\Earning Expectation → ₹1L-₹2L per month (Multiple Streams)\\ \\🟢 Year 4: Scale & Leadership (36-48 Months)\\ 🎯 \\Goal: Build AI Company & Achieve Financial Freedom\\ \\🔹 1-4 Months (Business Scaling)\\ ✅ Company Formation \- AI consulting firm \- Product development \- Training programs ✅ Hackathon Innovation \- Launch own hackathon series \- Corporate partnerships \- Prize sponsorships ✅ Team Expansion 💡 \\Smart Move:\\ \- Use hackathon network for hiring \- Create unique event formats \- Build corporate relationships \\🔹 5-8 Months (Market Leadership)\\ ✅ Product Launch ✅ Service Expansion ✅ International Presence ✅ Innovation Hub Creation 💡 \\Smart Move:\\ \- Create hackathon-to-hiring pipeline \- Build educational programs \- Establish thought leadership \\🔹 9-12 Months (Empire Building)\\ ✅ Multiple Revenue Streams \- AI products \- Consulting services \- Educational programs \- Event organization \- Investment returns ✅ Industry Leadership \- Conference speaking \- Published content \- Community leadership \\🔵 Year 4 Expected Outcome (60-70% Execution)\\ ✔ \\Established AI company\\ ✔ \\Major hackathon organizer\\ ✔ \\Multiple product lines\\ ✔ \\Industry authority status\\ 💰 \\Earning Expectation → ₹3L-₹5L+ per month (Business Income)\\ \\📊 FINAL RATING\\ ✅ \\Comprehensive growth plan\\ ✅ \\Strong community focus\\ ✅ \\Multiple income pathways\\ 💡 \\If 100% Execution → 8.5/10 Feasibility\\ 💡 \\If 50% Execution → 6/10 Feasibility\\ 🔥 \\Conclusion: A balanced path to ML mastery and entrepreneurship, built through consistent growth and community engagement!\\ 🚀 \\Key Success Factors:\\ Regular hackathon participation Strong community involvement Consistent skill development Strategic network building Focus on both technical and business growth

Randomly asked ChatGPT and Claude for a 4 year roadmap for an ML Engineer
reddit
LLM Vibe Score0
Human Vibe Score1
Brilliant_Fishing110This week

Randomly asked ChatGPT and Claude for a 4 year roadmap for an ML Engineer

Title, Is it actually a good plan ?? If no, why not ?? \\🚀 4-Year Roadmap to Becoming a High-Earning ML Engineer & Entrepreneur\\ \\(With Smartwork & Realistic 60-70% Execution Feasibility)\\ \\🟢 Year 1: Strong Foundation & Initial Projects (0-12 Months)\\ 🎯 \\Goal: Master Python & ML Fundamentals\\ \\🔹 1-4 Months (Python & Math Strengthening)\\ ✅ Python Mastery \- Daily LeetCode Easy problems (minimum 2) \- Build automation projects \- NumPy & Pandas mastery \- DSA fundamentals ✅ Mathematics Foundation \- Linear Algebra basics \- Statistics fundamentals \- Basic calculus concepts ✅ First Mini-Hackathon Participation \- Join beginner-friendly hackathons \- Focus on Python-based challenges \- Team up with other beginners 💡 \\Smart Move:\\ \- Join Discord/Slack hackathon communities \- Practice collaborative coding \- Build network with fellow participants \\🔹 5-8 Months (ML Foundations)\\ ✅ Machine Learning Basics \- Supervised Learning \- Model evaluation \- Feature engineering \- scikit-learn projects ✅ Participate in 2-3 ML Hackathons \- Kaggle Getting Started competitions \- Local ML hackathons \- University hackathons ✅ Start LinkedIn & GitHub Portfolio 💡 \\Smart Move:\\ \- Document hackathon experiences \- Share learnings on LinkedIn \- Focus on completion over winning \\🔹 9-12 Months (Deep Learning Introduction)\\ ✅ Basic Deep Learning \- Neural network fundamentals \- PyTorch basics \- Computer vision tasks \- Basic NLP ✅ Advanced Hackathon Participation \- AI/ML specific hackathons \- Team lead in 1-2 hackathons \- Start mentoring beginners \\🔵 Year 1 Expected Outcome (60-70% Execution)\\ ✔ \\Strong Python & ML foundations\\ ✔ \\5-6 hackathon participations\\ ✔ \\Active GitHub (100+ commits)\\ ✔ \\Growing LinkedIn (300+ connections)\\ 💰 \\Earning Expectation → ₹8K-₹20K per month (Projects/Internship)\\ \\🟢 Year 2: Professional Growth & Specialization (12-24 Months)\\ 🎯 \\Goal: Build Professional Experience & Recognition\\ \\🔹 1-6 Months (Technical Depth)\\ ✅ Advanced ML Topics \- Deep Learning architectures \- Computer Vision OR NLP \- MLOps basics (Docker, FastAPI) \- Cloud fundamentals (AWS/GCP) ✅ Hackathon Achievements \- Win minor prizes in 2-3 hackathons \- Lead teams in major hackathons \- Network with sponsors ✅ Start Technical Blogging 💡 \\Smart Move:\\ \- Focus on hackathon projects that align with career goals \- Build relationships with companies at hackathons \- Create detailed project documentation \\🔹 7-12 Months (Professional Experience)\\ ✅ Secure ML Role/Internship ✅ Advanced Project Building ✅ Open Source Contributions ✅ Organize Small Hackathons 💡 \\Smart Move:\\ \- Use hackathon network for job referrals \- Convert hackathon projects into full products \- Build mentor reputation \\🔵 Year 2 Expected Outcome (60-70% Execution)\\ ✔ \\Professional ML experience\\ ✔ \\10+ hackathon participations\\ ✔ \\1-2 hackathon wins\\ ✔ \\Strong industry network\\ 💰 \\Earning Expectation → ₹40K-₹70K per month (Job/Freelancing)\\ \\🟢 Year 3: Scaling & Business Foundation (24-36 Months)\\ 🎯 \\Goal: Establish Multiple Income Streams\\ \\🔹 1-4 Months (Expertise Building)\\ ✅ Choose Specialization \- MLOps \- Computer Vision \- NLP/LLMs \- Generative AI ✅ Advanced Competitions \- International hackathons \- High-prize competitions \- Corporate ML challenges ✅ Start Consulting Services 💡 \\Smart Move:\\ \- Use hackathon wins for marketing \- Build service packages around expertise \- Network with corporate sponsors \\🔹 5-8 Months (Business Development)\\ ✅ Scale Services ✅ Build Client Network ✅ Create Training Programs ✅ Hackathon Mentorship Program 💡 \\Smart Move:\\ \- Convert hackathon projects to products \- Use event networks for client acquisition \- Build authority through speaking \\🔹 9-12 Months (Growth & Innovation)\\ ✅ Product Development ✅ Team Building ✅ Innovation Focus ✅ Hackathon Organization \\🔵 Year 3 Expected Outcome (60-70% Execution)\\ ✔ \\Established ML business/career\\ ✔ \\Known in hackathon community\\ ✔ \\Multiple income streams\\ ✔ \\Strong industry presence\\ 💰 \\Earning Expectation → ₹1L-₹2L per month (Multiple Streams)\\ \\🟢 Year 4: Scale & Leadership (36-48 Months)\\ 🎯 \\Goal: Build AI Company & Achieve Financial Freedom\\ \\🔹 1-4 Months (Business Scaling)\\ ✅ Company Formation \- AI consulting firm \- Product development \- Training programs ✅ Hackathon Innovation \- Launch own hackathon series \- Corporate partnerships \- Prize sponsorships ✅ Team Expansion 💡 \\Smart Move:\\ \- Use hackathon network for hiring \- Create unique event formats \- Build corporate relationships \\🔹 5-8 Months (Market Leadership)\\ ✅ Product Launch ✅ Service Expansion ✅ International Presence ✅ Innovation Hub Creation 💡 \\Smart Move:\\ \- Create hackathon-to-hiring pipeline \- Build educational programs \- Establish thought leadership \\🔹 9-12 Months (Empire Building)\\ ✅ Multiple Revenue Streams \- AI products \- Consulting services \- Educational programs \- Event organization \- Investment returns ✅ Industry Leadership \- Conference speaking \- Published content \- Community leadership \\🔵 Year 4 Expected Outcome (60-70% Execution)\\ ✔ \\Established AI company\\ ✔ \\Major hackathon organizer\\ ✔ \\Multiple product lines\\ ✔ \\Industry authority status\\ 💰 \\Earning Expectation → ₹3L-₹5L+ per month (Business Income)\\ \\📊 FINAL RATING\\ ✅ \\Comprehensive growth plan\\ ✅ \\Strong community focus\\ ✅ \\Multiple income pathways\\ 💡 \\If 100% Execution → 8.5/10 Feasibility\\ 💡 \\If 50% Execution → 6/10 Feasibility\\ 🔥 \\Conclusion: A balanced path to ML mastery and entrepreneurship, built through consistent growth and community engagement!\\ 🚀 \\Key Success Factors:\\ Regular hackathon participation Strong community involvement Consistent skill development Strategic network building Focus on both technical and business growth

MarkDrop
reddit
LLM Vibe Score0
Human Vibe Score1
Willing-Ear-8271This week

MarkDrop

I’m excited to share my Python package, Markdrop, which has hit 5.01k+ downloads in just a month, so updated it just now! 🚀 It’s a powerful tool for converting PDF documents into structured formats like Markdown (.md) and HTML (.html) while automatically processing images and tables into descriptions for downstream use. Here's what Markdrop does: Key Features: PDF to Markdown/HTML Conversion: Converts PDFs into clean, structured Markdown files (.md) or HTML outputs, preserving the content layout. AI-Powered Descriptions: Replaces tables and images with descriptive summaries generated by LLM, making the content fully textual and easy to analyze. Earlier I added support of 6 different LLM Clients, but to improve the inference time, now this supports only GEMINI\API\KEY and OPENAI\API\KEY. Downloadable Tables: Can add accurate download buttons in HTML for tables, allowing users to download them as Excel files. Seamless Table and Image Handling: Extracts tables and images, generating detailed summaries for each, which are then embedded into the final Markdown document. At the end, one can have a .md file that contains only textual data, including the AI-generated summaries of tables, images, graphs, etc. This results in a highly portable format that can be used directly for several downstream tasks, such as: Can be directly integrated into a RAG pipeline for enhanced content understanding and querying on documents containg useful images and tabular data. Ideal for automated content summarization and report generation. Facilitates extracting key data points from tables and images for further analysis. The .md files can serve as input for machine learning tasks or data-driven projects. Ideal for data extraction, simplifying the task of gathering key data from tables and images. The downloadable table feature is perfect for analysts, reducing the manual task of copying tables into Excel. Markdrop streamlines workflows for document processing, saving time and enhancing productivity. You can easily install it via: pip install markdrop There’s also a Colab demo available to try it out directly: Open in Colab. Github Repo If you've used Markdrop or plan to, I’d love to hear your feedback! Share your experience, any improvements, or how it helped in your workflow. Check it out on PyPI and let me know your thoughts!

Day 1 of my BIP for my AdonisJS Boilerplate (turbosaas) [Built in public]
reddit
LLM Vibe Score0
Human Vibe Score0.5
Ok_Bread_6005This week

Day 1 of my BIP for my AdonisJS Boilerplate (turbosaas) [Built in public]

Hello everyone, here is day 1 (not really, I started a bit earlier) of my project: A boilerplate using AdonisJS, Inertia What technologies are used/present? AdonisJS Inertia Stripe OpenAI TailwindCSS Vite (React) Why? Firstly, I want to save time when launching my projects, and I think you do too, so I've included as many relevant features as possible. I'm tired of seeing attitudes like 'develop your SaaS in 1 hour and produce terrible code!' The purpose of this codebase is to provide the highest quality code possible and to maintain that standard throughout the development process. You might spend an extra 20 minutes doing things right, but you'll save 2 hours on refactoring. And no, you won't have to pay for updates. (WTF by the way?) Why these technologies? I've seen a lot of NextJS for boilerplates, and I've also used NextJS before, but I quickly abandoned it. It quickly becomes a mess You lose track of what is what, and start doing anything Every update breaks your application Whereas with AdonisJS, life is beautiful. There are plenty of community packages already available, and everything you need is here. What am I offering? Authentication: Social authentication, OTP, Magic Links, and credentials, along with complete account management features like password recovery. Payment & Mailing Integration: Seamless integration from start to finish, with multiple options to choose from. Detailed Documentation: Thorough explanations of every aspect, covering even the smallest, potentially confusing details in the code. Maintainable & Scalable Code: Organized by features, allowing you to easily drag and drop features to extend functionality. Developer Tools: Handy commands for generating new features and automatically adding necessary imports; a complete config to enable/disable a feature in less than 10 seconds... Pre-made Pages: Ready-to-use pages such as an admin dashboard for tasks like automatically updating products on Stripe. Extensive Component Library: A variety of components to streamline development. I've designed this boilerplate to be as developer-friendly and robust as possible, aiming to support maintainability and scalability from the get-go. Summary of today and previous days Day 2 Stripe is a nightmare to set up if you've never done it before, it quickly becomes tedious. But I've finally finished setting everything up: one-time payments, subscriptions, and subscription updates. It was complicated. Today I finally implemented the 'forgot password' option, and I've completed all the authentication by adding magic links (working with OTP). I also set up automatic deployment with GitHub Actions, and everything works well. The build runs with the action to ensure everything goes smoothly, then using SSH, I pull the project, build it, and launch it. Tomorrow: What I want to do tomorrow Tomorrow, I want to create the blog, because yes, I want to include a blog as well, and especially complete it as soon as possible so it can be available on turbosaas(dot)dev, and write my build in public. It will probably use markdown. Thank you for reading this short build in public, you can also check out how it's going on turbosaas(dot)dev.

Just reached 300 users in 3 months!!!
reddit
LLM Vibe Score0
Human Vibe Score1
w-elm_This week

Just reached 300 users in 3 months!!!

Just reached 300 users after 3 months live!!! My co-founder has been posting a bit here and always got some strong support and he suggested I share my side of things so here it is: How it started I co-founded AirMedia almost a year ago and we both didn’t know much about design/marketing/coding (just studied programming during my 6-month exchange period. The quickest way to get started seemed to get a no-code product that we could put in front of users and get feedback. My co-founder then started learning about bubble and we put together a basic platform to show users. I was working on a custom-code database in the meantime and decided after month 2 that we wanted to get something better I.e. AI would be interacting with the UI and had to do everything custom-code for it. We’re now month 3 and started from scratch again. While I was working on the code, we started talking to some potential users and selling lifetime deals to validate the idea (this is where I would start if I had to do it over again). Well I progressively found out it was more complicated than expected and we only released our first beta product last August (6 months later) Some challenges pre-launch: Getting the Meta/LinkedIn permissions for scheduling took around 1 month As the whole process took more time than expected, the waitlist of 300 that we managed to put together only converted by 10% (into free users). Please don’t make our mistakes and always keep your waitlist updated on what’s going on. Some challenges post-launch: Getting the right feedback and how to prioritise Getting users Monetising (yes - we’re bootstrapped) To get the best feedback we implemented some tracking (according to GDPR of course) on the platform and implemented Microsoft Clarity. The latter is a game-changer, if you have a SaaS and don’t use it you’re missing out. I wasn’t really into getting users as my co-founder handled that but it’s mainly manual and personalised LinkedIn outreach at the beginning and Reddit sharing about the progress, answering questions and getting some feedback at the same time. To monetise we realised we’re too common and there are 100+ other nice schedulers around so we’re now focusing on cracking the content creation side of AI (to be released next week 👀) as there’s much less competitors and it seems like that’s our users want. In the meantime of growing the company, we had to find a way to pay the bills as it’s two of us living together. So my co-founder started using the bubble skills gained and doing some freelance. He did around 7 platforms the last 6 months and we’re now just launching a bubble agency as a part of the main company to get your idea of a SaaS done in 30 days. That’s QuickMVP. It seemed like the right move to help other people (I met many non-technical founder looking for someone to bring their idea to life that didn’t cost $10k and was reliable) and include the AirMedia subscription in the package so let’s see how this next step plays out. Thanks for reading until here :)

[Discussion] When ML and Data Science are the death of a good company: A cautionary tale.
reddit
LLM Vibe Score0
Human Vibe Score0.6
AlexSnakeKingThis week

[Discussion] When ML and Data Science are the death of a good company: A cautionary tale.

TD;LR: At Company A, Team X does advanced analytics using on-prem ERP tools and older programming languages. Their tools work very well and are designed based on very deep business and domain expertise. Team Y is a new and ambitious Data Science team that thinks they can replace Team X's tools with a bunch of R scripts and a custom built ML platform. Their models are simplistic, but more "fashionable" compared to the econometric models used by Team X, and team Y benefits from the ML/DS moniker so leadership is allowing Team Y to start a large scale overhaul of the analytics platform in question. Team Y doesn't have the experience for such a larger scale transformation, and is refusing to collaborate with team X. This project is very likely going to fail, and cause serious harm to the company as a whole financially and from a people perspective. I argue that this is not just because of bad leadership, but also because of various trends and mindsets in the DS community at large. Update (Jump to below the line for the original story): Several people in the comments are pointing out that this just a management failure, not something due to ML/DS, and that you can replace DS with any buzz tech and the story will still be relevant. My response: Of course, any failure at an organization level is ultimately a management failure one way or the other. Moreover, it is also the case that ML/DS when done correctly, will always improve a company's bottom line. There is no scenario where the proper ML solution, delivered at a reasonable cost and in a timely fashion, will somehow hurt the company's bottom line. My point is that in this case management is failing because of certain trends and practices that are specific to the ML/DS community, namely: The idea that DS teams should operate independently of tech and business orgs -- too much autonomy for DS teams The disregard for domain knowledge that seems prevalent nowadays thanks to the ML hype, that DS can be generalists and someone with good enough ML chops can solve any business problem. That wasn't the case when I first left academia for the industry in 2009 (back then nobody would even bother with a phone screen if you didn't have the right domain knowledge). Over reliance on resources who check all the ML hype related boxes (knows Python, R, Tensorflow, Shiny, etc..., has the right Coursera certifications, has blogged on the topic, etc...), but are lacking in depth of experience. DS interviews nowadays all seem to be: Can you tell me what a p-value is? What is elastic net regression? Show me how to fit a model in sklearn? How do you impute NAs in an R dataframe? Any smart person can look those up on Stackoverflow or Cross-Validated,.....Instead teams should be asking stuff like: why does portfolio optimization use QP not LP? How does a forecast influence a customer service level? When should a recommendation engine be content based and when should it use collaborative filtering? etc... (This is a true story, happening to the company I currently work for. Names, domains, algorithms, and roles have been shuffled around to protect my anonymity)  Company A has been around for several decades. It is not the biggest name in its domain, but it is a well respected one. Risk analysis and portfolio optimization have been a core of Company A's business since the 90s. They have a large team of 30 or so analysts who perform those tasks on a daily basis. These analysts use ERP solutions implemented for them by one the big ERP companies (SAP, Teradata, Oracle, JD Edwards,...) or one of the major tech consulting companies (Deloitte, Accenture, PWC, Capgemini, etc...) in collaboration with their own in house engineering team. The tools used are embarrassingly old school: Classic RDBMS running on on-prem servers or maybe even on mainframes, code written in COBOL, Fortran, weird proprietary stuff like ABAP or SPSS.....you get the picture. But the models and analytic functions were pretty sophisticated, and surprisingly cutting edge compared to the published academic literature. Most of all, they fit well with the company's enterprise ecosystem, and were honed based on years of deep domain knowledge.  They have a tech team of several engineers (poached from the aforementioned software and consulting companies) and product managers (who came from the experienced pools of analysts and managers who use the software, or poached from business rivals) maintaining and running this software. Their technology might be old school, but collectively, they know the domain and the company's overall architecture very, very well. They've guided the company through several large scale upgrades and migrations and they have a track record of delivering on time, without too much overhead. The few times they've stumbled, they knew how to pick themselves up very quickly. In fact within their industry niche, they have a reputation for their expertise, and have very good relations with the various vendors they've had to deal with. They were the launching pad of several successful ERP consulting careers.  Interestingly, despite dealing on a daily basis with statistical modeling and optimization algorithms, none of the analysts, engineers, or product managers involved describe themselves as data scientists or machine learning experts. It is mostly a cultural thing: Their expertise predates the Data Science/ML hype that started circa 2010, and they got most of their chops using proprietary enterprise tools instead of the open source tools popular nowadays. A few of them have formal statistical training, but most of them came from engineering or domain backgrounds and learned stats on the fly while doing their job. Call this team "Team X".  Sometime around the mid 2010s, Company A started having some serious anxiety issues: Although still doing very well for a company its size, overall economic and demographic trends were shrinking its customer base, and a couple of so called disruptors came up with a new app and business model that started seriously eating into their revenue. A suitable reaction to appease shareholders and Wall Street was necessary. The company already had a decent website and a pretty snazzy app, what more could be done? Leadership decided that it was high time that AI and ML become a core part of the company's business. An ambitious Manager, with no science or engineering background, but who had very briefly toyed with a recommender system a couple of years back, was chosen to build a data science team, call it team "Y" (he had a bachelor's in history from the local state college and worked for several years in the company's marketing org). Team "Y" consists mostly of internal hires who decided they wanted to be data scientists and completed a Coursera certification or a Galvanize boot camp, before being brought on to the team, along with a few of fresh Ph.D or M.Sc holders who didn't like academia and wanted to try their hand at an industry role. All of them were very bright people, they could write great Medium blog posts and give inspiring TED talks, but collectively they had very little real world industry experience. As is the fashion nowadays, this group was made part of a data science org that reported directly to the CEO and Board, bypassing the CIO and any tech or business VPs, since Company A wanted to claim the monikers "data driven" and "AI powered" in their upcoming shareholder meetings. In 3 or 4 years of existence, team Y produced a few Python and R scripts. Their architectural experience  consisted almost entirely in connecting Flask to S3 buckets or Redshift tables, with a couple of the more resourceful ones learning how to plug their models into Tableau or how to spin up a Kuberneties pod.  But they needn't worry: The aforementioned manager, who was now a director (and was also doing an online Masters to make up for his qualifications gap and bolster his chances of becoming VP soon - at least he now understands what L1 regularization is), was a master at playing corporate politics and self-promotion. No matter how few actionable insights team Y produced or how little code they deployed to production, he always had their back and made sure they had ample funding. In fact he now had grandiose plans for setting up an all-purpose machine learning platform that can be used to solve all of the company's data problems.  A couple of sharp minded members of team Y, upon googling their industry name along with the word "data science", realized that risk analysis was a prime candidate for being solved with Bayesian models, and there was already a nifty R package for doing just that, whose tutorial they went through on R-Bloggers.com. One of them had even submitted a Bayesian classifier Kernel for a competition on Kaggle (he was 203rd on the leaderboard), and was eager to put his new-found expertise to use on a real world problem. They pitched the idea to their director, who saw a perfect use case for his upcoming ML platform. They started work on it immediately, without bothering to check whether anybody at Company A was already doing risk analysis. Since their org was independent, they didn't really need to check with anybody else before they got funding for their initiative. Although it was basically a Naive Bayes classifier, the term ML was added to the project tile, to impress the board.  As they progressed with their work however, tensions started to build. They had asked the data warehousing and CA analytics teams to build pipelines for them, and word eventually got out to team X about their project. Team X was initially thrilled: They offered to collaborate whole heartedly, and would have loved to add an ML based feather to their already impressive cap. The product owners and analysts were totally onboard as well: They saw a chance to get in on the whole Data Science hype that they kept hearing about. But through some weird mix of arrogance and insecurity, team Y refused to collaborate with them or share any of their long term goals with them, even as they went to other parts of the company giving brown bag presentations and tutorials on the new model they created.  Team X got resentful: from what they saw of team Y's model, their approach was hopelessly naive and had little chances of scaling or being sustainable in production, and they knew exactly how to help with that. Deploying the model to production would have taken them a few days, given how comfortable they were with DevOps and continuous delivery (team Y had taken several months to figure out how to deploy a simple R script to production). And despite how old school their own tech was, team X were crafty enough to be able to plug it in to their existing architecture. Moreover, the output of the model was such that it didn't take into account how the business will consume it or how it was going to be fed to downstream systems, and the product owners could have gone a long way in making the model more amenable to adoption by the business stakeholders. But team Y wouldn't listen, and their leads brushed off any attempts at communication, let alone collaboration. The vibe that team Y was giving off was "We are the cutting edge ML team, you guys are the legacy server grunts. We don't need your opinion.", and they seemed to have a complete disregard for domain knowledge, or worse, they thought that all that domain knowledge consisted of was being able to grasp the definitions of a few business metrics.  Team X got frustrated and tried to express their concerns to leadership. But despite owning a vital link in Company A's business process, they were only \~50 people in a large 1000 strong technology and operations org, and they were several layers removed from the C-suite, so it was impossible for them to get their voices heard.  Meanwhile, the unstoppable director was doing what he did best: Playing corporate politics. Despite how little his team had actually delivered, he had convinced the board that all analysis and optimization tasks should now be migrated to his yet to be delivered ML platform. Since most leaders now knew that there was overlap between team Y and team X's objectives, his pitch was no longer that team Y was going to create a new insight, but that they were going to replace (or modernize) the legacy statistics based on-prem tools with more accurate cloud based ML tools. Never mind that there was no support in the academic literature for the idea that Naive Bayes works better than the Econometric approaches used by team X, let alone the additional wacky idea that Bayesian Optimization would definitely outperform the QP solvers that were running in production.  Unbeknownst to team X, the original Bayesian risk analysis project has now grown into a multimillion dollar major overhaul initiative, which included the eventual replacement of all of the tools and functions supported by team X along with the necessary migration to the cloud. The CIO and a couple of business VPs are on now board, and tech leadership is treating it as a done deal. An outside vendor, a startup who nobody had heard of, was contracted to help build the platform, since team Y has no engineering skills. The choice was deliberate, as calling on any of the established consulting or software companies would have eventually led leadership to the conclusion that team X was better suited for a transformation on this scale than team Y.  Team Y has no experience with any major ERP deployments, and no domain knowledge, yet they are being tasked with fundamentally changing the business process that is at the core of Company A's business. Their models actually perform worse than those deployed by team X, and their architecture is hopelessly simplistic, compared to what is necessary for running such a solution in production.  Ironically, using Bayesian thinking and based on all the evidence, the likelihood that team Y succeeds is close to 0%. At best, the project is going to end up being a write off of 50 million dollars or more. Once the !@#$!@hits the fan, a couple of executive heads are going to role, and dozens of people will get laid off. At worst, given how vital risk analysis and portfolio optimization is to Company A's revenue stream, the failure will eventually sink the whole company. It probably won't go bankrupt, but it will lose a significant portion of its business and work force. Failed ERP implementations can and do sink large companies: Just see what happened to National Grid US, SuperValu or Target Canada.  One might argue that this is more about corporate disfunction and bad leadership than about data science and AI. But I disagree. I think the core driver of this debacle is indeed the blind faith in Data Scientists, ML models and the promise of AI, and the overall culture of hype and self promotion that is very common among the ML crowd.  We haven't seen the end of this story: I sincerely hope that this ends well for the sake of my colleagues and all involved. Company A is a good company, and both its customers and its employees deserver better. But the chances of that happening are negligible given all the information available, and this failure will hit my company hard.

[R] Analysis of 400+ ML competitions in 2024
reddit
LLM Vibe Score0
Human Vibe Score1
hcarlensThis week

[R] Analysis of 400+ ML competitions in 2024

I run mlcontests.com, a website that lists ML competitions from across multiple platforms - Kaggle, DrivenData, AIcrowd, Zindi, etc… I’ve just spent a few months looking through all the info I could find on last year’s competitions, as well as winning solutions.  I found over 400 competitions that happened last year, plus info on the #1 winning solution for 70 of those.  Some highlights: Kaggle is still the biggest platform by total prize money, and also has a much bigger user base than the other platforms - though there are well over a dozen other platforms worth keeping track of, with regular interesting competitions and meaningful prize money. An increase in competitions with $1m+ prize pools (ARC Prize, AI Mathematical Olympiad, Vesuvius Challenge, AI Cyber Challenge) compared to previous years. Python continues to be the language of choice among competition winners, with almost everyone using Python as their main language. One winner used Rust, two used R.  Convolutional neural nets continue to do well in computer vision competitions, and are still more common among competition winners than transformer-based vision models.  PyTorch is still used a lot more than TensorFlow, roughly 9:1. Didn’t find any competition winners implementing neural nets in JAX or other libraries.  There were a few competition winners using AutoML packages, which seem to be getting increasingly useful. Any claims of generalist autonomous grandmaster-level agents seem premature though.  In language/text/sequence-related competitions, quantisation was key for making use of limited resources effectively. Usually 4-, 5-, or 8-bit. LoRA/QLoRA was also used quite often, though not always.  Gradient-boosted decision trees continue to win a lot of tabular/time-series competitions. They’re often ensembled with deep learning models. No tabular/time-series pre-trained foundation models were used by winners in 2024, as far as I can tell.  Starting to see more uptake of Polars for dataframes, with 7 winners using Polars in 2024 (up from 3 in 2023) vs 58 using Pandas. All those who used Polars also still used Pandas in some parts of their code.  In terms of hardware, competition winners almost entirely used NVIDIA GPUs to train their models. Some trained on CPU-only, or used a TPU through Colab. No AMD GPUs. The NVIDIA A100 was the most commonly used GPU among winners. Two of the $1m+ prize pool competitions were won by teams using 8xH100 nodes for training. A lot of other GPUs too though: T4/P100 (through Kaggle Notebooks), or consumer GPUs like RTX 3090/4090/3080/3060. Some spent hundreds of dollars on cloud compute to train their solutions.  An emerging pattern: using generative models to create additional synthetic training data to augment the training data provided.  There’s way more detail in the full report, which you can read here (no paywall): https://mlcontests.com/state-of-machine-learning-competitions-2024?ref=mlcr Processing img xmm4ywg9h9le1... The full report also features: A deep dive into the ARC Prize and the AI Mathematical Olympiad An overview of winning solutions to NLP/sequence competitions A breakdown of Python packages used in winning solutions (e.g. relative popularity of various gradient-boosted tree libraries) If you’d like to support this research, I’d really appreciate it if you could share it with anyone else who might find it interesting. You can also check out my newly-launched online magazine, Jolt ML \- featuring news from top ML conferences as well as long-read articles (just one so far, more to come!).  Thanks to the competition winners who shared info on their solutions, and also to the competition platforms who shared high-level data on their competitions.

[N] How Stability AI’s Founder Tanked His Billion-Dollar Startup
reddit
LLM Vibe Score0
Human Vibe Score0.667
milaworldThis week

[N] How Stability AI’s Founder Tanked His Billion-Dollar Startup

forbes article: https://www.forbes.com/sites/kenrickcai/2024/03/29/how-stability-ais-founder-tanked-his-billion-dollar-startup/ archive no paywall: https://archive.is/snbeV How Stability AI’s Founder Tanked His Billion-Dollar Startup Mar 29, 2024 Stability AI founder Emad Mostaque took the stage last week at the Terranea Resort in Palos Verdes, California to roaring applause and an introduction from an AI-generated Aristotle who announced him as “a modern Prometheus” with “the astuteness of Athena and the vision of Daedalus.” “Under his stewardship, AI becomes the Herculean force poised to vanquish the twin serpents of illness and ailment and extend the olive branch of longevity,” the faux Aristotle proclaimed. “I think that’s the best intro I’ve ever had,” Mostaque said. But behind Mostaque's hagiographic introduction lay a grim and fast metastasizing truth. Stability, once one of AI’s buzziest startups, was floundering. It had been running out of money for months and Mostaque had been unable to secure enough additional funding. It had defaulted on payments to Amazon whose cloud service undergirded Stability’s core offerings. The star research team behind its flagship text-to-image generator Stable Diffusion had tendered their resignations just three days before — as Forbes would first report — and other senior leaders had issued him an ultimatum: resign, or we walk too. Still, onstage before a massive audience of peers and acolytes, Mostaque talked a big game. “AI is jet planes for the mind,” he opined. “AI is our collective intelligence. It's the human Colossus.” He claimed a new, faster version of the Stable Diffusion image generator released earlier this month could generate “200 cats with hats per second.” But later, when he was asked about Stability’s financial model, Mostaque fumbled. “I can’t say that publicly,” he replied. “But it’s going well. We’re ahead of forecast.” Four days later, Mostaque stepped down as CEO of Stability, as Forbes first reported. In a post to X, the service formerly known as Twitter, he claimed he’d voluntarily abdicated his role to decentralize “the concentration of power in AI.” But sources told Forbes that was hardly the case. Behind the scenes, Mostaque had fought to maintain his position and control despite mounting pressure externally and internally to step down. Company documents and interviews with 32 current and former employees, investors, collaborators and industry observers suggest his abrupt exit was the result of poor business judgment and wild overspending that undermined confidence in his vision and leadership, and ultimately kneecapped the company. Mostaque, through his attorneys, declined to comment on record on a detailed list of questions about the reporting in this story. But in an email to Forbes earlier this week he broadly disputed the allegations. “Nobody tells you how hard it is to be a CEO and there are better CEOs than me to scale a business,” he said in a statement. “I am not sure anyone else would have been able to build and grow the research team to build the best and most widely used models out there and I’m very proud of the team there. I look forward to moving onto the next problem to handle and hopefully move the needle.” In an emailed statement, Christian Laforte and Shan Shan Wong, the interim co-CEOs who replaced Mostaque, said, "the company remains focused on commercializing its world leading technology” and providing it “to partners across the creative industries." After starting Stability in 2019, Mostaque built the company into an early AI juggernaut by seizing upon a promising research project that would become Stable Diffusion and funding it into a business reality. The ease with which the software generated detailed images from the simplest text prompts immediately captivated the public: 10 million people used it on any given day, the company told Forbes in early 2023. For some true believers, Mostaque was a crucial advocate for open-source AI development in a space dominated by the closed systems of OpenAI, Google and Anthropic. But his startup’s rise to one of the buzziest in generative AI was in part built on a series of exaggerations and misleading claims, as Forbes first reported last year (Mostaque disputed some points at the time). And they continued after he raised $100 million at a $1 billion valuation just days after launching Stable Diffusion in 2022. His failure to deliver on an array of grand promises, like building bespoke AI models for nation states, and his decision to pour tens of millions into research without a sustainable business plan, eroded Stability’s foundations and jeopardized its future. "He was just giving shit away,” one former employee told Forbes. “That man legitimately wanted to transform the world. He actually wanted to train AI models for kids in Malawi. Was it practical? Absolutely not." By October 2023, Stability would have less than $4 million left in the bank, according to an internal memo prepared for a board meeting and reviewed by Forbes. And mounting debt, including months of overdue Amazon Web Services payments, had already left it in the red. To avoid legal penalties for skipping Americans staff’s payroll, the document explained, the London-based startup was considering delaying tax payments to the U.K. government. It was Stability’s armada of GPUs, the wildly powerful and equally expensive chips undergirding AI, that were so taxing the company’s finances. Hosted by AWS, they had long been one of Mostaque’s bragging points; he often touted them as one of the world’s 10 largest supercomputers. They were responsible for helping Stability’s researchers build and maintain one of the top AI image generators, as well as break important new ground on generative audio, video and 3D models. “Undeniably, Stability has continued to ship a lot of models,” said one former employee. “They may not have profited off of it, but the broader ecosystem benefitted in a huge, huge way.” But the costs associated with so much compute were now threatening to sink the company. According to an internal October financial forecast seen by Forbes, Stability was on track to spend $99 million on compute in 2023. It noted as well that Stability was “underpaying AWS bills for July (by $1M)” and “not planning to pay AWS at the end of October for August usage ($7M).” Then there were the September and October bills, plus $1 million owed to Google Cloud and $600,000 to GPU cloud data center CoreWeave. (Amazon, Google and CoreWeave declined to comment.) With an additional $54 million allocated to wages and operating expenses, Stability’s total projected costs for 2023 were $153 million. But according to its October financial report, its projected revenue for the calendar year was just $11 million. Stability was on track to lose more money per month than it made in an entire year. The company’s dire financial position had thoroughly soured Stability’s current investors, including Coatue, which had invested tens of millions in the company during its $101 million funding round in 2022. In the middle of 2023, Mostaque agreed to an independent audit after Coatue raised a series of concerns, according to a source with direct knowledge of the matter. The outcome of the investigation is unclear. Coatue declined to comment. Within a week of an early October board meeting where Mostaque shared that financial forecast, Lightspeed Venture Partners, another major investor, sent a letter to the board urging them to sell the company. The distressing numbers had “severely undermined” the firm’s confidence in Mostaque’s ability to lead the company. “In particular, we are surprised and deeply concerned by a cash position just now disclosed to us that is inconsistent with prior discussions on this topic,” Lightspeed’s general counsel Brett Nissenberg wrote in the letter, a copy of which was viewed by Forbes. “Lightspeed believes that the company is not likely financeable on terms that would assure the company’s long term sound financial position.” (Lightspeed declined a request for comment.) The calls for a sale led Stability to quietly begin looking for a buyer. Bloomberg reported in November that Stability approached AI startups Cohere and Jasper to gauge their interest. Stability denied this, and Jasper CEO Timothy Young did the same when reached for comment by Forbes. A Cohere representative declined to comment. But one prominent AI company confirmed that Mostaque’s representatives had reached out to them to test the waters. Those talks did not advance because “the numbers didn’t add up,” this person, who declined to be named due to the confidential nature of the talks, told Forbes. Stability also tried to court Samsung as a buyer, going so far as to redecorate its office in advance of a planned meeting with the Korean electronics giant. (Samsung said that it invested in Stability in 2023 and that it does not comment on M&A discussions.) Coatue had been calling for Mostaque’s resignation for months, according to a source with direct knowledge. But it and other investors were unable to oust him because he was the company’s majority shareholder. When they tried a different tact by rallying other investors to offer him a juicy equity package to resign, Mostaque refused, said two sources. By October, Coatue and Lightspeed had had enough. Coatue left the board and Lightspeed resigned its observer seat. “Emad infuriated our initial investors so much it’s just making it impossible for us to raise more money under acceptable terms,” one current Stability executive told Forbes. The early months of 2024 saw Stability’s already precarious position eroding further still. Employees were quietly laid off. Three people in a position to know estimated that at least 10% of staff were cut. And cash reserves continued to dwindle. Mostaque mentioned a lifeline at the October board meeting: $95 million in tentative funding from new investors, pending due diligence. But in the end, only a fraction of it was wired, two sources say, much of it from Intel, which Forbes has learned invested $20 million, a fraction of what was reported. (Intel did not return a request for comment by publication time.) Two hours after Forbes broke the news of Mostaque’s plans to step down as CEO, Stability issued a press release confirming his resignation. Chief operating officer Wong and chief technology officer Laforte have taken over in the interim. Mostaque, who said on X that he still owns a majority of the company, also stepped down from the board, which has now initiated a search for a permanent CEO. There is a lot of work to be done to turn things around, and very little time in which to do it. Said the current Stability executive, “There’s still a possibility of a turnaround story, but the odds drop by the day.” In July of 2023, Mostaque still thought he could pull it off. Halfway through the month, he shared a fundraising plan with his lieutenants. It was wildly optimistic, detailing the raise of $500 million in cash and another $750 million in computing facilities from marquee investors like Nvidia, Google, Intel and the World Bank (Nvidia and Google declined comment. Intel did not respond. The World Bank said it did not invest in Stability). In a Slack message reviewed by Forbes, Mostaque said Google was “willing to move fast” and the round was “likely to be oversubscribed.” It wasn’t. Three people with direct knowledge of these fundraising efforts told Forbes that while there was some interest in Stability, talks often stalled when it came time to disclose financials. Two of them noted that earlier in the year, Mostaque had simply stopped engaging with VCs who asked for numbers. Only one firm invested around that time: actor Ashton Kutcher’s Sound Ventures, which invested $35 million in the form of a convertible SAFE note during the second quarter, according to an internal document. (Sound Ventures did not respond to a request for comment.) And though he’d managed to score a meeting with Nvidia and its CEO Jensen Huang, it ended in disaster, according to two sources. “Under Jensen's microscopic questions, Emad just fell apart,” a source in position to know told Forbes. Huang quickly concluded Stability wasn’t ready for an investment from Nvidia, the sources said. Mostaque told Forbes in an email that he had not met with Huang since 2022, except to say “hello and what’s up a few times after.” His July 2023 message references a plan to raise $150 million from Nvidia. (Nvidia declined to comment.) After a June Forbes investigation citing more than 30 sources revealed Mostaque’s history of misleading claims, Mostaque struggled to raise funding, a Stability investor told Forbes. (Mostaque disputed the story at the time and called it "coordinated lies" in his email this week to Forbes). Increasingly, investors scrutinized his assertions and pressed for data. And Young, now the CEO of Jasper, turned down a verbal offer to be Stability’s president after reading the article, according to a source with direct knowledge of the matter. The collapse of the talks aggravated the board and other executives, who had hoped Young would compensate for the sales and business management skills that Mostaque lacked, according to four people in a position to know. (Young declined to comment.) When Stability’s senior leadership convened in London for the CogX conference in September, the financing had still not closed. There, a group of executives confronted Mostaque asking questions about the company’s cash position and runway, according to three people with direct knowledge of the incident. They did not get the clarity they’d hoped for. By October, Mostaque had reduced his fundraising target by more than 80%. The months that followed saw a steady drumbeat of departures — general counsel Adam Avrunin, vice presidents Mike Melnicki, Ed Newton-Rex and Joe Penna, chief people officer Ozden Onder — culminating in the demoralizing March exit of Stable Diffusion’s primary developers Robin Rombach, Andreas Blattmann, Patrick Esser and Dominik Lorenz. Rombach, who led the team, had been angling to leave for months, two sources said, first threatening to resign last summer because of the fundraising failures. Others left over concerns about cash flow, as well as liabilities — including what four people described as Mostaque’s lax approach to ensuring that Stability products could not be used to produce child sexual abuse imagery. “Stability AI is committed to preventing the misuse of AI and prohibits the use of our image models and services for unlawful activity, including attempts to edit or create CSAM,” Ella Irwin, senior vice president of integrity, said in a statement. Newton-Rex told Forbes he resigned because he disagreed with Stability’s position that training AI on copyrighted work without consent is fair use. Melnicki and Penna declined to comment. Avrunin and Onder could not be reached for comment. None of the researchers responded to requests for comment. The Stable Diffusion researchers’ departure as a cohort says a lot about the state of Stability AI. The company’s researchers were widely viewed as its crown jewels, their work subsidized with a firehose of pricey compute power that was even extended to people outside the company. Martino Russi, an artificial intelligence researcher, told Forbes that though he was never formally employed by Stability, the company provided him a “staggering” amount of compute between January and April 2023 to play around with developing an AI video generator that Stability might someday use. “It was Candy Land or Coney Island,” said Russi, who estimates that his experiment, which was ultimately shelved, cost the company $2.5 million. Stable Diffusion was simultaneously Stability’s marquee product and its existential cash crisis. One current employee described it to Forbes as “a giant vacuum that absorbed everything: money, compute, people.” While the software was widely used, with Mostaque claiming downloads reaching into the hundreds of millions, Stability struggled to translate that wild success into revenue. Mostaque knew it could be done — peers at Databricks, Elastic and MongoDB had all turned a free product into a lucrative business — he just couldn’t figure out how. His first attempt was Stability’s API, which allowed paying customers to integrate Stable Diffusion into their own products. In early 2023, a handful of small companies, like art generator app NightCafe and presentation software startup Tome, signed on, according to four people with knowledge of the deals. But Stability’s poor account management services soured many, and in a matter of months NightCafe and Tome canceled their contracts, three people said. NightCafe founder Angus Russell told Forbes that his company switched to a competitor which “offered much cheaper inference costs and a broader service.” Tome did not respond to a request for comment. Meanwhile, Mostaque’s efforts to court larger companies like Samsung and Snapchat were failing, according to five people familiar with the effort. Canva, which was already one of the heaviest users of open-sourced Stable Diffusion, had multiple discussions with Stability, which was angling for a contract it hoped would generate several millions in annual revenue. But the deal never materialized, four sources said. “These three companies wanted and needed us,” one former employee told Forbes. “They would have been the perfect customers.” (Samsung, Snap and Canva declined to comment.) “It’s not that there was not an appetite to pay Stability — there were tons of companies that would have that wanted to,” the former employee said. “There was a huge opportunity and demand, but just a resistance to execution.” Mostaque’s other big idea was to provide governments with bespoke national AI models that would invigorate their economies and citizenry. “Emad envisions a world where AI through 100 national models serves not as a tool of the few, but as a benefactor to all promising to confront great adversaries, cancer, autism, and the sands of time itself,” the AI avatar of Aristotle said in his intro at the conference. Mostaque told several prospective customers that he could deliver such models within 60 days — an untenable timeline, according to two people in position to know. Stability attempted to develop a model for the Singaporean government over the protestation of employees who questioned its technical feasibility, three sources familiar with the effort told Forbes. But it couldn’t pull it off and Singapore never became a customer. (The government of Singapore confirmed it did not enter into a deal with Stability, but declined to answer additional questions.) As Stability careened from one new business idea to another, resources were abruptly reallocated and researchers reassigned. The whiplash shifts in a largely siloed organization demoralized and infuriated employees. “There were ‘urgent’ things, ‘urgent urgent’ things and ‘most urgent,’” one former employee complained. “None of these things seem important if everything is important.” Another former Stability executive was far more pointed in their assessment. “Emad is the most disorganized leader I have ever worked with in my career,” this person told Forbes. “He has no vision, and changes directions every week, often based on what he sees on Twitter.” In a video interview posted shortly before this story was published, Mostaque explained his leadership style: “I'm particularly great at taking creatives, developers, researchers, others, and achieving their full potential in designing systems. But I should not be dealing with, you know, HR and operations and business development and other elements. There are far better people than me to do that.” By December 2023, Stability had partially abandoned its open-source roots and announced that any commercial use of Stable Diffusion would cost customers at least $20 per month (non-commercial and research use of Stable Diffusion would remain free). But privately, Stability was considering a potentially more lucrative source of revenue: reselling the compute it was leasing from providers like AWS, according to six people familiar with the effort. Though it was essentially GPU arbitrage, Stability framed the strategy to investors as a “managed services” offering. Its damning October financial report projected optimistically that such an offering would bring in $139 million in 2024 — 98% of its revenue. Multiple employees at the time told Forbes they feared reselling compute, even if the company called it “managed services,” would violate the terms of Stability’s contract with AWS. Amazon declined to comment. “The line internally was that we are not reselling compute,” one former employee said. “This was some of the dirtiest feeling stuff.” Stability also discussed reselling a cluster of Nvidia A100 chips, leased via CoreWeave, to the venture capital firm Andreessen Horowitz, three sources said. “It was under the guise of managed services, but there wasn’t any management happening,” one of these people told Forbes. Andreessen Horowitz and CoreWeave declined to comment. Stability did not respond to questions about if it plans to continue this strategy now that Mostaque is out of the picture. Regardless, interim co-CEOs Wong and Laforte are on a tight timeline to clean up his mess. Board chairman Jim O’Shaughnessy said in a statement that he was confident the pair “will adeptly steer the company forward in developing and commercializing industry-leading generative AI products.” But burn continues to far outpace revenue. The Financial Times reported Friday that the company made $5.4 million of revenue in February, against $8 million in costs. Several sources said there are ongoing concerns about making payroll for the roughly 150 remaining employees. Leadership roles have gone vacant for months amid the disarray, leaving the company increasingly directionless. Meanwhile, a potentially catastrophic legal threat looms over the company: A trio of copyright infringement lawsuits brought by Getty Images and a group of artists in the U.S. and U.K., who claim Stability illegally used their art and photography to train the AI models powering Stable Diffusion. A London-based court has already rejected the company’s bid to throw out one of the lawsuits on the basis that none of its researchers were based in the U.K. And Stability’s claim that Getty’s Delaware lawsuit should be blocked because it's a U.K.-based company was rejected. (Stability did not respond to questions about the litigation.) AI-related copyright litigation “could go on for years,” according to Eric Goldman, a law professor at Santa Clara University. He told Forbes that though plaintiffs suing AI firms face an uphill battle overcoming the existing legal precedent on copyright infringement, the quantity of arguments available to make are virtually inexhaustible. “Like in military theory, if there’s a gap in your lines, that’s where the enemy pours through — if any one of those arguments succeeds, it could completely change the generative AI environment,” he said. “In some sense, generative AI as an industry has to win everything.” Stability, which had more than $100 million in the bank just a year and a half ago, is in a deep hole. Not only does it need more funding, it needs a viable business model — or a buyer with the vision and chops to make it successful in a fast-moving and highly competitive sector. At an all hands meeting this past Monday, Stability’s new leaders detailed a path forward. One point of emphasis: a plan to better manage resources and expenses, according to one person in attendance. It’s a start, but Mostaque’s meddling has left them with little runway to execute. His resignation, though, has given some employees hope. “A few people are 100% going to reconsider leaving after today,” said one current employee. “And the weird gloomy aura of hearing Emad talking nonsense for an hour is gone.” Shortly before Mostaque resigned, one current Stability executive told Forbes that they were optimistic his departure could make Stability appealing enough to receive a small investment or sale to a friendly party. “There are companies that have raised hundreds of millions of dollars that have much less intrinsic value than Stability,” the person said. “A white knight may still appear.”

[N] How Stability AI’s Founder Tanked His Billion-Dollar Startup
reddit
LLM Vibe Score0
Human Vibe Score0.667
milaworldThis week

[N] How Stability AI’s Founder Tanked His Billion-Dollar Startup

forbes article: https://www.forbes.com/sites/kenrickcai/2024/03/29/how-stability-ais-founder-tanked-his-billion-dollar-startup/ archive no paywall: https://archive.is/snbeV How Stability AI’s Founder Tanked His Billion-Dollar Startup Mar 29, 2024 Stability AI founder Emad Mostaque took the stage last week at the Terranea Resort in Palos Verdes, California to roaring applause and an introduction from an AI-generated Aristotle who announced him as “a modern Prometheus” with “the astuteness of Athena and the vision of Daedalus.” “Under his stewardship, AI becomes the Herculean force poised to vanquish the twin serpents of illness and ailment and extend the olive branch of longevity,” the faux Aristotle proclaimed. “I think that’s the best intro I’ve ever had,” Mostaque said. But behind Mostaque's hagiographic introduction lay a grim and fast metastasizing truth. Stability, once one of AI’s buzziest startups, was floundering. It had been running out of money for months and Mostaque had been unable to secure enough additional funding. It had defaulted on payments to Amazon whose cloud service undergirded Stability’s core offerings. The star research team behind its flagship text-to-image generator Stable Diffusion had tendered their resignations just three days before — as Forbes would first report — and other senior leaders had issued him an ultimatum: resign, or we walk too. Still, onstage before a massive audience of peers and acolytes, Mostaque talked a big game. “AI is jet planes for the mind,” he opined. “AI is our collective intelligence. It's the human Colossus.” He claimed a new, faster version of the Stable Diffusion image generator released earlier this month could generate “200 cats with hats per second.” But later, when he was asked about Stability’s financial model, Mostaque fumbled. “I can’t say that publicly,” he replied. “But it’s going well. We’re ahead of forecast.” Four days later, Mostaque stepped down as CEO of Stability, as Forbes first reported. In a post to X, the service formerly known as Twitter, he claimed he’d voluntarily abdicated his role to decentralize “the concentration of power in AI.” But sources told Forbes that was hardly the case. Behind the scenes, Mostaque had fought to maintain his position and control despite mounting pressure externally and internally to step down. Company documents and interviews with 32 current and former employees, investors, collaborators and industry observers suggest his abrupt exit was the result of poor business judgment and wild overspending that undermined confidence in his vision and leadership, and ultimately kneecapped the company. Mostaque, through his attorneys, declined to comment on record on a detailed list of questions about the reporting in this story. But in an email to Forbes earlier this week he broadly disputed the allegations. “Nobody tells you how hard it is to be a CEO and there are better CEOs than me to scale a business,” he said in a statement. “I am not sure anyone else would have been able to build and grow the research team to build the best and most widely used models out there and I’m very proud of the team there. I look forward to moving onto the next problem to handle and hopefully move the needle.” In an emailed statement, Christian Laforte and Shan Shan Wong, the interim co-CEOs who replaced Mostaque, said, "the company remains focused on commercializing its world leading technology” and providing it “to partners across the creative industries." After starting Stability in 2019, Mostaque built the company into an early AI juggernaut by seizing upon a promising research project that would become Stable Diffusion and funding it into a business reality. The ease with which the software generated detailed images from the simplest text prompts immediately captivated the public: 10 million people used it on any given day, the company told Forbes in early 2023. For some true believers, Mostaque was a crucial advocate for open-source AI development in a space dominated by the closed systems of OpenAI, Google and Anthropic. But his startup’s rise to one of the buzziest in generative AI was in part built on a series of exaggerations and misleading claims, as Forbes first reported last year (Mostaque disputed some points at the time). And they continued after he raised $100 million at a $1 billion valuation just days after launching Stable Diffusion in 2022. His failure to deliver on an array of grand promises, like building bespoke AI models for nation states, and his decision to pour tens of millions into research without a sustainable business plan, eroded Stability’s foundations and jeopardized its future. "He was just giving shit away,” one former employee told Forbes. “That man legitimately wanted to transform the world. He actually wanted to train AI models for kids in Malawi. Was it practical? Absolutely not." By October 2023, Stability would have less than $4 million left in the bank, according to an internal memo prepared for a board meeting and reviewed by Forbes. And mounting debt, including months of overdue Amazon Web Services payments, had already left it in the red. To avoid legal penalties for skipping Americans staff’s payroll, the document explained, the London-based startup was considering delaying tax payments to the U.K. government. It was Stability’s armada of GPUs, the wildly powerful and equally expensive chips undergirding AI, that were so taxing the company’s finances. Hosted by AWS, they had long been one of Mostaque’s bragging points; he often touted them as one of the world’s 10 largest supercomputers. They were responsible for helping Stability’s researchers build and maintain one of the top AI image generators, as well as break important new ground on generative audio, video and 3D models. “Undeniably, Stability has continued to ship a lot of models,” said one former employee. “They may not have profited off of it, but the broader ecosystem benefitted in a huge, huge way.” But the costs associated with so much compute were now threatening to sink the company. According to an internal October financial forecast seen by Forbes, Stability was on track to spend $99 million on compute in 2023. It noted as well that Stability was “underpaying AWS bills for July (by $1M)” and “not planning to pay AWS at the end of October for August usage ($7M).” Then there were the September and October bills, plus $1 million owed to Google Cloud and $600,000 to GPU cloud data center CoreWeave. (Amazon, Google and CoreWeave declined to comment.) With an additional $54 million allocated to wages and operating expenses, Stability’s total projected costs for 2023 were $153 million. But according to its October financial report, its projected revenue for the calendar year was just $11 million. Stability was on track to lose more money per month than it made in an entire year. The company’s dire financial position had thoroughly soured Stability’s current investors, including Coatue, which had invested tens of millions in the company during its $101 million funding round in 2022. In the middle of 2023, Mostaque agreed to an independent audit after Coatue raised a series of concerns, according to a source with direct knowledge of the matter. The outcome of the investigation is unclear. Coatue declined to comment. Within a week of an early October board meeting where Mostaque shared that financial forecast, Lightspeed Venture Partners, another major investor, sent a letter to the board urging them to sell the company. The distressing numbers had “severely undermined” the firm’s confidence in Mostaque’s ability to lead the company. “In particular, we are surprised and deeply concerned by a cash position just now disclosed to us that is inconsistent with prior discussions on this topic,” Lightspeed’s general counsel Brett Nissenberg wrote in the letter, a copy of which was viewed by Forbes. “Lightspeed believes that the company is not likely financeable on terms that would assure the company’s long term sound financial position.” (Lightspeed declined a request for comment.) The calls for a sale led Stability to quietly begin looking for a buyer. Bloomberg reported in November that Stability approached AI startups Cohere and Jasper to gauge their interest. Stability denied this, and Jasper CEO Timothy Young did the same when reached for comment by Forbes. A Cohere representative declined to comment. But one prominent AI company confirmed that Mostaque’s representatives had reached out to them to test the waters. Those talks did not advance because “the numbers didn’t add up,” this person, who declined to be named due to the confidential nature of the talks, told Forbes. Stability also tried to court Samsung as a buyer, going so far as to redecorate its office in advance of a planned meeting with the Korean electronics giant. (Samsung said that it invested in Stability in 2023 and that it does not comment on M&A discussions.) Coatue had been calling for Mostaque’s resignation for months, according to a source with direct knowledge. But it and other investors were unable to oust him because he was the company’s majority shareholder. When they tried a different tact by rallying other investors to offer him a juicy equity package to resign, Mostaque refused, said two sources. By October, Coatue and Lightspeed had had enough. Coatue left the board and Lightspeed resigned its observer seat. “Emad infuriated our initial investors so much it’s just making it impossible for us to raise more money under acceptable terms,” one current Stability executive told Forbes. The early months of 2024 saw Stability’s already precarious position eroding further still. Employees were quietly laid off. Three people in a position to know estimated that at least 10% of staff were cut. And cash reserves continued to dwindle. Mostaque mentioned a lifeline at the October board meeting: $95 million in tentative funding from new investors, pending due diligence. But in the end, only a fraction of it was wired, two sources say, much of it from Intel, which Forbes has learned invested $20 million, a fraction of what was reported. (Intel did not return a request for comment by publication time.) Two hours after Forbes broke the news of Mostaque’s plans to step down as CEO, Stability issued a press release confirming his resignation. Chief operating officer Wong and chief technology officer Laforte have taken over in the interim. Mostaque, who said on X that he still owns a majority of the company, also stepped down from the board, which has now initiated a search for a permanent CEO. There is a lot of work to be done to turn things around, and very little time in which to do it. Said the current Stability executive, “There’s still a possibility of a turnaround story, but the odds drop by the day.” In July of 2023, Mostaque still thought he could pull it off. Halfway through the month, he shared a fundraising plan with his lieutenants. It was wildly optimistic, detailing the raise of $500 million in cash and another $750 million in computing facilities from marquee investors like Nvidia, Google, Intel and the World Bank (Nvidia and Google declined comment. Intel did not respond. The World Bank said it did not invest in Stability). In a Slack message reviewed by Forbes, Mostaque said Google was “willing to move fast” and the round was “likely to be oversubscribed.” It wasn’t. Three people with direct knowledge of these fundraising efforts told Forbes that while there was some interest in Stability, talks often stalled when it came time to disclose financials. Two of them noted that earlier in the year, Mostaque had simply stopped engaging with VCs who asked for numbers. Only one firm invested around that time: actor Ashton Kutcher’s Sound Ventures, which invested $35 million in the form of a convertible SAFE note during the second quarter, according to an internal document. (Sound Ventures did not respond to a request for comment.) And though he’d managed to score a meeting with Nvidia and its CEO Jensen Huang, it ended in disaster, according to two sources. “Under Jensen's microscopic questions, Emad just fell apart,” a source in position to know told Forbes. Huang quickly concluded Stability wasn’t ready for an investment from Nvidia, the sources said. Mostaque told Forbes in an email that he had not met with Huang since 2022, except to say “hello and what’s up a few times after.” His July 2023 message references a plan to raise $150 million from Nvidia. (Nvidia declined to comment.) After a June Forbes investigation citing more than 30 sources revealed Mostaque’s history of misleading claims, Mostaque struggled to raise funding, a Stability investor told Forbes. (Mostaque disputed the story at the time and called it "coordinated lies" in his email this week to Forbes). Increasingly, investors scrutinized his assertions and pressed for data. And Young, now the CEO of Jasper, turned down a verbal offer to be Stability’s president after reading the article, according to a source with direct knowledge of the matter. The collapse of the talks aggravated the board and other executives, who had hoped Young would compensate for the sales and business management skills that Mostaque lacked, according to four people in a position to know. (Young declined to comment.) When Stability’s senior leadership convened in London for the CogX conference in September, the financing had still not closed. There, a group of executives confronted Mostaque asking questions about the company’s cash position and runway, according to three people with direct knowledge of the incident. They did not get the clarity they’d hoped for. By October, Mostaque had reduced his fundraising target by more than 80%. The months that followed saw a steady drumbeat of departures — general counsel Adam Avrunin, vice presidents Mike Melnicki, Ed Newton-Rex and Joe Penna, chief people officer Ozden Onder — culminating in the demoralizing March exit of Stable Diffusion’s primary developers Robin Rombach, Andreas Blattmann, Patrick Esser and Dominik Lorenz. Rombach, who led the team, had been angling to leave for months, two sources said, first threatening to resign last summer because of the fundraising failures. Others left over concerns about cash flow, as well as liabilities — including what four people described as Mostaque’s lax approach to ensuring that Stability products could not be used to produce child sexual abuse imagery. “Stability AI is committed to preventing the misuse of AI and prohibits the use of our image models and services for unlawful activity, including attempts to edit or create CSAM,” Ella Irwin, senior vice president of integrity, said in a statement. Newton-Rex told Forbes he resigned because he disagreed with Stability’s position that training AI on copyrighted work without consent is fair use. Melnicki and Penna declined to comment. Avrunin and Onder could not be reached for comment. None of the researchers responded to requests for comment. The Stable Diffusion researchers’ departure as a cohort says a lot about the state of Stability AI. The company’s researchers were widely viewed as its crown jewels, their work subsidized with a firehose of pricey compute power that was even extended to people outside the company. Martino Russi, an artificial intelligence researcher, told Forbes that though he was never formally employed by Stability, the company provided him a “staggering” amount of compute between January and April 2023 to play around with developing an AI video generator that Stability might someday use. “It was Candy Land or Coney Island,” said Russi, who estimates that his experiment, which was ultimately shelved, cost the company $2.5 million. Stable Diffusion was simultaneously Stability’s marquee product and its existential cash crisis. One current employee described it to Forbes as “a giant vacuum that absorbed everything: money, compute, people.” While the software was widely used, with Mostaque claiming downloads reaching into the hundreds of millions, Stability struggled to translate that wild success into revenue. Mostaque knew it could be done — peers at Databricks, Elastic and MongoDB had all turned a free product into a lucrative business — he just couldn’t figure out how. His first attempt was Stability’s API, which allowed paying customers to integrate Stable Diffusion into their own products. In early 2023, a handful of small companies, like art generator app NightCafe and presentation software startup Tome, signed on, according to four people with knowledge of the deals. But Stability’s poor account management services soured many, and in a matter of months NightCafe and Tome canceled their contracts, three people said. NightCafe founder Angus Russell told Forbes that his company switched to a competitor which “offered much cheaper inference costs and a broader service.” Tome did not respond to a request for comment. Meanwhile, Mostaque’s efforts to court larger companies like Samsung and Snapchat were failing, according to five people familiar with the effort. Canva, which was already one of the heaviest users of open-sourced Stable Diffusion, had multiple discussions with Stability, which was angling for a contract it hoped would generate several millions in annual revenue. But the deal never materialized, four sources said. “These three companies wanted and needed us,” one former employee told Forbes. “They would have been the perfect customers.” (Samsung, Snap and Canva declined to comment.) “It’s not that there was not an appetite to pay Stability — there were tons of companies that would have that wanted to,” the former employee said. “There was a huge opportunity and demand, but just a resistance to execution.” Mostaque’s other big idea was to provide governments with bespoke national AI models that would invigorate their economies and citizenry. “Emad envisions a world where AI through 100 national models serves not as a tool of the few, but as a benefactor to all promising to confront great adversaries, cancer, autism, and the sands of time itself,” the AI avatar of Aristotle said in his intro at the conference. Mostaque told several prospective customers that he could deliver such models within 60 days — an untenable timeline, according to two people in position to know. Stability attempted to develop a model for the Singaporean government over the protestation of employees who questioned its technical feasibility, three sources familiar with the effort told Forbes. But it couldn’t pull it off and Singapore never became a customer. (The government of Singapore confirmed it did not enter into a deal with Stability, but declined to answer additional questions.) As Stability careened from one new business idea to another, resources were abruptly reallocated and researchers reassigned. The whiplash shifts in a largely siloed organization demoralized and infuriated employees. “There were ‘urgent’ things, ‘urgent urgent’ things and ‘most urgent,’” one former employee complained. “None of these things seem important if everything is important.” Another former Stability executive was far more pointed in their assessment. “Emad is the most disorganized leader I have ever worked with in my career,” this person told Forbes. “He has no vision, and changes directions every week, often based on what he sees on Twitter.” In a video interview posted shortly before this story was published, Mostaque explained his leadership style: “I'm particularly great at taking creatives, developers, researchers, others, and achieving their full potential in designing systems. But I should not be dealing with, you know, HR and operations and business development and other elements. There are far better people than me to do that.” By December 2023, Stability had partially abandoned its open-source roots and announced that any commercial use of Stable Diffusion would cost customers at least $20 per month (non-commercial and research use of Stable Diffusion would remain free). But privately, Stability was considering a potentially more lucrative source of revenue: reselling the compute it was leasing from providers like AWS, according to six people familiar with the effort. Though it was essentially GPU arbitrage, Stability framed the strategy to investors as a “managed services” offering. Its damning October financial report projected optimistically that such an offering would bring in $139 million in 2024 — 98% of its revenue. Multiple employees at the time told Forbes they feared reselling compute, even if the company called it “managed services,” would violate the terms of Stability’s contract with AWS. Amazon declined to comment. “The line internally was that we are not reselling compute,” one former employee said. “This was some of the dirtiest feeling stuff.” Stability also discussed reselling a cluster of Nvidia A100 chips, leased via CoreWeave, to the venture capital firm Andreessen Horowitz, three sources said. “It was under the guise of managed services, but there wasn’t any management happening,” one of these people told Forbes. Andreessen Horowitz and CoreWeave declined to comment. Stability did not respond to questions about if it plans to continue this strategy now that Mostaque is out of the picture. Regardless, interim co-CEOs Wong and Laforte are on a tight timeline to clean up his mess. Board chairman Jim O’Shaughnessy said in a statement that he was confident the pair “will adeptly steer the company forward in developing and commercializing industry-leading generative AI products.” But burn continues to far outpace revenue. The Financial Times reported Friday that the company made $5.4 million of revenue in February, against $8 million in costs. Several sources said there are ongoing concerns about making payroll for the roughly 150 remaining employees. Leadership roles have gone vacant for months amid the disarray, leaving the company increasingly directionless. Meanwhile, a potentially catastrophic legal threat looms over the company: A trio of copyright infringement lawsuits brought by Getty Images and a group of artists in the U.S. and U.K., who claim Stability illegally used their art and photography to train the AI models powering Stable Diffusion. A London-based court has already rejected the company’s bid to throw out one of the lawsuits on the basis that none of its researchers were based in the U.K. And Stability’s claim that Getty’s Delaware lawsuit should be blocked because it's a U.K.-based company was rejected. (Stability did not respond to questions about the litigation.) AI-related copyright litigation “could go on for years,” according to Eric Goldman, a law professor at Santa Clara University. He told Forbes that though plaintiffs suing AI firms face an uphill battle overcoming the existing legal precedent on copyright infringement, the quantity of arguments available to make are virtually inexhaustible. “Like in military theory, if there’s a gap in your lines, that’s where the enemy pours through — if any one of those arguments succeeds, it could completely change the generative AI environment,” he said. “In some sense, generative AI as an industry has to win everything.” Stability, which had more than $100 million in the bank just a year and a half ago, is in a deep hole. Not only does it need more funding, it needs a viable business model — or a buyer with the vision and chops to make it successful in a fast-moving and highly competitive sector. At an all hands meeting this past Monday, Stability’s new leaders detailed a path forward. One point of emphasis: a plan to better manage resources and expenses, according to one person in attendance. It’s a start, but Mostaque’s meddling has left them with little runway to execute. His resignation, though, has given some employees hope. “A few people are 100% going to reconsider leaving after today,” said one current employee. “And the weird gloomy aura of hearing Emad talking nonsense for an hour is gone.” Shortly before Mostaque resigned, one current Stability executive told Forbes that they were optimistic his departure could make Stability appealing enough to receive a small investment or sale to a friendly party. “There are companies that have raised hundreds of millions of dollars that have much less intrinsic value than Stability,” the person said. “A white knight may still appear.”

[P] Open-source Neural Search framework to implement semantic search & multimedia search. Just released 2.0, seeking your feedback.
reddit
LLM Vibe Score0
Human Vibe Score0
opensourcecolumbusThis week

[P] Open-source Neural Search framework to implement semantic search & multimedia search. Just released 2.0, seeking your feedback.

I heard your feedback on 1.0 release post on my project Jina, many people were keen to use Jina for multimedia search because that's where use of Neural Networks makes significant difference. So I focused on that part and I was able to transform it from 1.0 to 2.0 within 3 months. Last post on 1.0 release to give you some idea what this project is about Actually, I should say - "'we' made this", because there were more than 155 contributors who did it, not just me. The primary changes we made We saw MachineLearning beginners struggle in using Jina 1.0, so we separated the codebase where Machine Learning expertise is required(jina-hub) and the one which MachineLearning beginners can use(the jina core). Now ML beginners don't need to worry about jina-hub and can use jina hub packages directly to implement ML specific tasks without the need to understand advanced ML concepts. While advanced ML users can create their own jina-hub packages. We cut down a lots of abstractions to make it easy to use for beginners Made python APIs more intuitive to use Improved performance(3.6x faster on startup) Here's Jina 2.0 and here's Jina 1.0. I seek feedback from people who are looking at this project for the first time, as well as people who have tried their hands before but had some challenges in using it. Few questions, I'm seeking answers to Do you feel that we have reduced complexity by a lot of margin? How easy it is to use for a beginner now? What questions are still unanswered?

[N] How Stability AI’s Founder Tanked His Billion-Dollar Startup
reddit
LLM Vibe Score0
Human Vibe Score0.667
milaworldThis week

[N] How Stability AI’s Founder Tanked His Billion-Dollar Startup

forbes article: https://www.forbes.com/sites/kenrickcai/2024/03/29/how-stability-ais-founder-tanked-his-billion-dollar-startup/ archive no paywall: https://archive.is/snbeV How Stability AI’s Founder Tanked His Billion-Dollar Startup Mar 29, 2024 Stability AI founder Emad Mostaque took the stage last week at the Terranea Resort in Palos Verdes, California to roaring applause and an introduction from an AI-generated Aristotle who announced him as “a modern Prometheus” with “the astuteness of Athena and the vision of Daedalus.” “Under his stewardship, AI becomes the Herculean force poised to vanquish the twin serpents of illness and ailment and extend the olive branch of longevity,” the faux Aristotle proclaimed. “I think that’s the best intro I’ve ever had,” Mostaque said. But behind Mostaque's hagiographic introduction lay a grim and fast metastasizing truth. Stability, once one of AI’s buzziest startups, was floundering. It had been running out of money for months and Mostaque had been unable to secure enough additional funding. It had defaulted on payments to Amazon whose cloud service undergirded Stability’s core offerings. The star research team behind its flagship text-to-image generator Stable Diffusion had tendered their resignations just three days before — as Forbes would first report — and other senior leaders had issued him an ultimatum: resign, or we walk too. Still, onstage before a massive audience of peers and acolytes, Mostaque talked a big game. “AI is jet planes for the mind,” he opined. “AI is our collective intelligence. It's the human Colossus.” He claimed a new, faster version of the Stable Diffusion image generator released earlier this month could generate “200 cats with hats per second.” But later, when he was asked about Stability’s financial model, Mostaque fumbled. “I can’t say that publicly,” he replied. “But it’s going well. We’re ahead of forecast.” Four days later, Mostaque stepped down as CEO of Stability, as Forbes first reported. In a post to X, the service formerly known as Twitter, he claimed he’d voluntarily abdicated his role to decentralize “the concentration of power in AI.” But sources told Forbes that was hardly the case. Behind the scenes, Mostaque had fought to maintain his position and control despite mounting pressure externally and internally to step down. Company documents and interviews with 32 current and former employees, investors, collaborators and industry observers suggest his abrupt exit was the result of poor business judgment and wild overspending that undermined confidence in his vision and leadership, and ultimately kneecapped the company. Mostaque, through his attorneys, declined to comment on record on a detailed list of questions about the reporting in this story. But in an email to Forbes earlier this week he broadly disputed the allegations. “Nobody tells you how hard it is to be a CEO and there are better CEOs than me to scale a business,” he said in a statement. “I am not sure anyone else would have been able to build and grow the research team to build the best and most widely used models out there and I’m very proud of the team there. I look forward to moving onto the next problem to handle and hopefully move the needle.” In an emailed statement, Christian Laforte and Shan Shan Wong, the interim co-CEOs who replaced Mostaque, said, "the company remains focused on commercializing its world leading technology” and providing it “to partners across the creative industries." After starting Stability in 2019, Mostaque built the company into an early AI juggernaut by seizing upon a promising research project that would become Stable Diffusion and funding it into a business reality. The ease with which the software generated detailed images from the simplest text prompts immediately captivated the public: 10 million people used it on any given day, the company told Forbes in early 2023. For some true believers, Mostaque was a crucial advocate for open-source AI development in a space dominated by the closed systems of OpenAI, Google and Anthropic. But his startup’s rise to one of the buzziest in generative AI was in part built on a series of exaggerations and misleading claims, as Forbes first reported last year (Mostaque disputed some points at the time). And they continued after he raised $100 million at a $1 billion valuation just days after launching Stable Diffusion in 2022. His failure to deliver on an array of grand promises, like building bespoke AI models for nation states, and his decision to pour tens of millions into research without a sustainable business plan, eroded Stability’s foundations and jeopardized its future. "He was just giving shit away,” one former employee told Forbes. “That man legitimately wanted to transform the world. He actually wanted to train AI models for kids in Malawi. Was it practical? Absolutely not." By October 2023, Stability would have less than $4 million left in the bank, according to an internal memo prepared for a board meeting and reviewed by Forbes. And mounting debt, including months of overdue Amazon Web Services payments, had already left it in the red. To avoid legal penalties for skipping Americans staff’s payroll, the document explained, the London-based startup was considering delaying tax payments to the U.K. government. It was Stability’s armada of GPUs, the wildly powerful and equally expensive chips undergirding AI, that were so taxing the company’s finances. Hosted by AWS, they had long been one of Mostaque’s bragging points; he often touted them as one of the world’s 10 largest supercomputers. They were responsible for helping Stability’s researchers build and maintain one of the top AI image generators, as well as break important new ground on generative audio, video and 3D models. “Undeniably, Stability has continued to ship a lot of models,” said one former employee. “They may not have profited off of it, but the broader ecosystem benefitted in a huge, huge way.” But the costs associated with so much compute were now threatening to sink the company. According to an internal October financial forecast seen by Forbes, Stability was on track to spend $99 million on compute in 2023. It noted as well that Stability was “underpaying AWS bills for July (by $1M)” and “not planning to pay AWS at the end of October for August usage ($7M).” Then there were the September and October bills, plus $1 million owed to Google Cloud and $600,000 to GPU cloud data center CoreWeave. (Amazon, Google and CoreWeave declined to comment.) With an additional $54 million allocated to wages and operating expenses, Stability’s total projected costs for 2023 were $153 million. But according to its October financial report, its projected revenue for the calendar year was just $11 million. Stability was on track to lose more money per month than it made in an entire year. The company’s dire financial position had thoroughly soured Stability’s current investors, including Coatue, which had invested tens of millions in the company during its $101 million funding round in 2022. In the middle of 2023, Mostaque agreed to an independent audit after Coatue raised a series of concerns, according to a source with direct knowledge of the matter. The outcome of the investigation is unclear. Coatue declined to comment. Within a week of an early October board meeting where Mostaque shared that financial forecast, Lightspeed Venture Partners, another major investor, sent a letter to the board urging them to sell the company. The distressing numbers had “severely undermined” the firm’s confidence in Mostaque’s ability to lead the company. “In particular, we are surprised and deeply concerned by a cash position just now disclosed to us that is inconsistent with prior discussions on this topic,” Lightspeed’s general counsel Brett Nissenberg wrote in the letter, a copy of which was viewed by Forbes. “Lightspeed believes that the company is not likely financeable on terms that would assure the company’s long term sound financial position.” (Lightspeed declined a request for comment.) The calls for a sale led Stability to quietly begin looking for a buyer. Bloomberg reported in November that Stability approached AI startups Cohere and Jasper to gauge their interest. Stability denied this, and Jasper CEO Timothy Young did the same when reached for comment by Forbes. A Cohere representative declined to comment. But one prominent AI company confirmed that Mostaque’s representatives had reached out to them to test the waters. Those talks did not advance because “the numbers didn’t add up,” this person, who declined to be named due to the confidential nature of the talks, told Forbes. Stability also tried to court Samsung as a buyer, going so far as to redecorate its office in advance of a planned meeting with the Korean electronics giant. (Samsung said that it invested in Stability in 2023 and that it does not comment on M&A discussions.) Coatue had been calling for Mostaque’s resignation for months, according to a source with direct knowledge. But it and other investors were unable to oust him because he was the company’s majority shareholder. When they tried a different tact by rallying other investors to offer him a juicy equity package to resign, Mostaque refused, said two sources. By October, Coatue and Lightspeed had had enough. Coatue left the board and Lightspeed resigned its observer seat. “Emad infuriated our initial investors so much it’s just making it impossible for us to raise more money under acceptable terms,” one current Stability executive told Forbes. The early months of 2024 saw Stability’s already precarious position eroding further still. Employees were quietly laid off. Three people in a position to know estimated that at least 10% of staff were cut. And cash reserves continued to dwindle. Mostaque mentioned a lifeline at the October board meeting: $95 million in tentative funding from new investors, pending due diligence. But in the end, only a fraction of it was wired, two sources say, much of it from Intel, which Forbes has learned invested $20 million, a fraction of what was reported. (Intel did not return a request for comment by publication time.) Two hours after Forbes broke the news of Mostaque’s plans to step down as CEO, Stability issued a press release confirming his resignation. Chief operating officer Wong and chief technology officer Laforte have taken over in the interim. Mostaque, who said on X that he still owns a majority of the company, also stepped down from the board, which has now initiated a search for a permanent CEO. There is a lot of work to be done to turn things around, and very little time in which to do it. Said the current Stability executive, “There’s still a possibility of a turnaround story, but the odds drop by the day.” In July of 2023, Mostaque still thought he could pull it off. Halfway through the month, he shared a fundraising plan with his lieutenants. It was wildly optimistic, detailing the raise of $500 million in cash and another $750 million in computing facilities from marquee investors like Nvidia, Google, Intel and the World Bank (Nvidia and Google declined comment. Intel did not respond. The World Bank said it did not invest in Stability). In a Slack message reviewed by Forbes, Mostaque said Google was “willing to move fast” and the round was “likely to be oversubscribed.” It wasn’t. Three people with direct knowledge of these fundraising efforts told Forbes that while there was some interest in Stability, talks often stalled when it came time to disclose financials. Two of them noted that earlier in the year, Mostaque had simply stopped engaging with VCs who asked for numbers. Only one firm invested around that time: actor Ashton Kutcher’s Sound Ventures, which invested $35 million in the form of a convertible SAFE note during the second quarter, according to an internal document. (Sound Ventures did not respond to a request for comment.) And though he’d managed to score a meeting with Nvidia and its CEO Jensen Huang, it ended in disaster, according to two sources. “Under Jensen's microscopic questions, Emad just fell apart,” a source in position to know told Forbes. Huang quickly concluded Stability wasn’t ready for an investment from Nvidia, the sources said. Mostaque told Forbes in an email that he had not met with Huang since 2022, except to say “hello and what’s up a few times after.” His July 2023 message references a plan to raise $150 million from Nvidia. (Nvidia declined to comment.) After a June Forbes investigation citing more than 30 sources revealed Mostaque’s history of misleading claims, Mostaque struggled to raise funding, a Stability investor told Forbes. (Mostaque disputed the story at the time and called it "coordinated lies" in his email this week to Forbes). Increasingly, investors scrutinized his assertions and pressed for data. And Young, now the CEO of Jasper, turned down a verbal offer to be Stability’s president after reading the article, according to a source with direct knowledge of the matter. The collapse of the talks aggravated the board and other executives, who had hoped Young would compensate for the sales and business management skills that Mostaque lacked, according to four people in a position to know. (Young declined to comment.) When Stability’s senior leadership convened in London for the CogX conference in September, the financing had still not closed. There, a group of executives confronted Mostaque asking questions about the company’s cash position and runway, according to three people with direct knowledge of the incident. They did not get the clarity they’d hoped for. By October, Mostaque had reduced his fundraising target by more than 80%. The months that followed saw a steady drumbeat of departures — general counsel Adam Avrunin, vice presidents Mike Melnicki, Ed Newton-Rex and Joe Penna, chief people officer Ozden Onder — culminating in the demoralizing March exit of Stable Diffusion’s primary developers Robin Rombach, Andreas Blattmann, Patrick Esser and Dominik Lorenz. Rombach, who led the team, had been angling to leave for months, two sources said, first threatening to resign last summer because of the fundraising failures. Others left over concerns about cash flow, as well as liabilities — including what four people described as Mostaque’s lax approach to ensuring that Stability products could not be used to produce child sexual abuse imagery. “Stability AI is committed to preventing the misuse of AI and prohibits the use of our image models and services for unlawful activity, including attempts to edit or create CSAM,” Ella Irwin, senior vice president of integrity, said in a statement. Newton-Rex told Forbes he resigned because he disagreed with Stability’s position that training AI on copyrighted work without consent is fair use. Melnicki and Penna declined to comment. Avrunin and Onder could not be reached for comment. None of the researchers responded to requests for comment. The Stable Diffusion researchers’ departure as a cohort says a lot about the state of Stability AI. The company’s researchers were widely viewed as its crown jewels, their work subsidized with a firehose of pricey compute power that was even extended to people outside the company. Martino Russi, an artificial intelligence researcher, told Forbes that though he was never formally employed by Stability, the company provided him a “staggering” amount of compute between January and April 2023 to play around with developing an AI video generator that Stability might someday use. “It was Candy Land or Coney Island,” said Russi, who estimates that his experiment, which was ultimately shelved, cost the company $2.5 million. Stable Diffusion was simultaneously Stability’s marquee product and its existential cash crisis. One current employee described it to Forbes as “a giant vacuum that absorbed everything: money, compute, people.” While the software was widely used, with Mostaque claiming downloads reaching into the hundreds of millions, Stability struggled to translate that wild success into revenue. Mostaque knew it could be done — peers at Databricks, Elastic and MongoDB had all turned a free product into a lucrative business — he just couldn’t figure out how. His first attempt was Stability’s API, which allowed paying customers to integrate Stable Diffusion into their own products. In early 2023, a handful of small companies, like art generator app NightCafe and presentation software startup Tome, signed on, according to four people with knowledge of the deals. But Stability’s poor account management services soured many, and in a matter of months NightCafe and Tome canceled their contracts, three people said. NightCafe founder Angus Russell told Forbes that his company switched to a competitor which “offered much cheaper inference costs and a broader service.” Tome did not respond to a request for comment. Meanwhile, Mostaque’s efforts to court larger companies like Samsung and Snapchat were failing, according to five people familiar with the effort. Canva, which was already one of the heaviest users of open-sourced Stable Diffusion, had multiple discussions with Stability, which was angling for a contract it hoped would generate several millions in annual revenue. But the deal never materialized, four sources said. “These three companies wanted and needed us,” one former employee told Forbes. “They would have been the perfect customers.” (Samsung, Snap and Canva declined to comment.) “It’s not that there was not an appetite to pay Stability — there were tons of companies that would have that wanted to,” the former employee said. “There was a huge opportunity and demand, but just a resistance to execution.” Mostaque’s other big idea was to provide governments with bespoke national AI models that would invigorate their economies and citizenry. “Emad envisions a world where AI through 100 national models serves not as a tool of the few, but as a benefactor to all promising to confront great adversaries, cancer, autism, and the sands of time itself,” the AI avatar of Aristotle said in his intro at the conference. Mostaque told several prospective customers that he could deliver such models within 60 days — an untenable timeline, according to two people in position to know. Stability attempted to develop a model for the Singaporean government over the protestation of employees who questioned its technical feasibility, three sources familiar with the effort told Forbes. But it couldn’t pull it off and Singapore never became a customer. (The government of Singapore confirmed it did not enter into a deal with Stability, but declined to answer additional questions.) As Stability careened from one new business idea to another, resources were abruptly reallocated and researchers reassigned. The whiplash shifts in a largely siloed organization demoralized and infuriated employees. “There were ‘urgent’ things, ‘urgent urgent’ things and ‘most urgent,’” one former employee complained. “None of these things seem important if everything is important.” Another former Stability executive was far more pointed in their assessment. “Emad is the most disorganized leader I have ever worked with in my career,” this person told Forbes. “He has no vision, and changes directions every week, often based on what he sees on Twitter.” In a video interview posted shortly before this story was published, Mostaque explained his leadership style: “I'm particularly great at taking creatives, developers, researchers, others, and achieving their full potential in designing systems. But I should not be dealing with, you know, HR and operations and business development and other elements. There are far better people than me to do that.” By December 2023, Stability had partially abandoned its open-source roots and announced that any commercial use of Stable Diffusion would cost customers at least $20 per month (non-commercial and research use of Stable Diffusion would remain free). But privately, Stability was considering a potentially more lucrative source of revenue: reselling the compute it was leasing from providers like AWS, according to six people familiar with the effort. Though it was essentially GPU arbitrage, Stability framed the strategy to investors as a “managed services” offering. Its damning October financial report projected optimistically that such an offering would bring in $139 million in 2024 — 98% of its revenue. Multiple employees at the time told Forbes they feared reselling compute, even if the company called it “managed services,” would violate the terms of Stability’s contract with AWS. Amazon declined to comment. “The line internally was that we are not reselling compute,” one former employee said. “This was some of the dirtiest feeling stuff.” Stability also discussed reselling a cluster of Nvidia A100 chips, leased via CoreWeave, to the venture capital firm Andreessen Horowitz, three sources said. “It was under the guise of managed services, but there wasn’t any management happening,” one of these people told Forbes. Andreessen Horowitz and CoreWeave declined to comment. Stability did not respond to questions about if it plans to continue this strategy now that Mostaque is out of the picture. Regardless, interim co-CEOs Wong and Laforte are on a tight timeline to clean up his mess. Board chairman Jim O’Shaughnessy said in a statement that he was confident the pair “will adeptly steer the company forward in developing and commercializing industry-leading generative AI products.” But burn continues to far outpace revenue. The Financial Times reported Friday that the company made $5.4 million of revenue in February, against $8 million in costs. Several sources said there are ongoing concerns about making payroll for the roughly 150 remaining employees. Leadership roles have gone vacant for months amid the disarray, leaving the company increasingly directionless. Meanwhile, a potentially catastrophic legal threat looms over the company: A trio of copyright infringement lawsuits brought by Getty Images and a group of artists in the U.S. and U.K., who claim Stability illegally used their art and photography to train the AI models powering Stable Diffusion. A London-based court has already rejected the company’s bid to throw out one of the lawsuits on the basis that none of its researchers were based in the U.K. And Stability’s claim that Getty’s Delaware lawsuit should be blocked because it's a U.K.-based company was rejected. (Stability did not respond to questions about the litigation.) AI-related copyright litigation “could go on for years,” according to Eric Goldman, a law professor at Santa Clara University. He told Forbes that though plaintiffs suing AI firms face an uphill battle overcoming the existing legal precedent on copyright infringement, the quantity of arguments available to make are virtually inexhaustible. “Like in military theory, if there’s a gap in your lines, that’s where the enemy pours through — if any one of those arguments succeeds, it could completely change the generative AI environment,” he said. “In some sense, generative AI as an industry has to win everything.” Stability, which had more than $100 million in the bank just a year and a half ago, is in a deep hole. Not only does it need more funding, it needs a viable business model — or a buyer with the vision and chops to make it successful in a fast-moving and highly competitive sector. At an all hands meeting this past Monday, Stability’s new leaders detailed a path forward. One point of emphasis: a plan to better manage resources and expenses, according to one person in attendance. It’s a start, but Mostaque’s meddling has left them with little runway to execute. His resignation, though, has given some employees hope. “A few people are 100% going to reconsider leaving after today,” said one current employee. “And the weird gloomy aura of hearing Emad talking nonsense for an hour is gone.” Shortly before Mostaque resigned, one current Stability executive told Forbes that they were optimistic his departure could make Stability appealing enough to receive a small investment or sale to a friendly party. “There are companies that have raised hundreds of millions of dollars that have much less intrinsic value than Stability,” the person said. “A white knight may still appear.”

[Discussion] When ML and Data Science are the death of a good company: A cautionary tale.
reddit
LLM Vibe Score0
Human Vibe Score0.6
AlexSnakeKingThis week

[Discussion] When ML and Data Science are the death of a good company: A cautionary tale.

TD;LR: At Company A, Team X does advanced analytics using on-prem ERP tools and older programming languages. Their tools work very well and are designed based on very deep business and domain expertise. Team Y is a new and ambitious Data Science team that thinks they can replace Team X's tools with a bunch of R scripts and a custom built ML platform. Their models are simplistic, but more "fashionable" compared to the econometric models used by Team X, and team Y benefits from the ML/DS moniker so leadership is allowing Team Y to start a large scale overhaul of the analytics platform in question. Team Y doesn't have the experience for such a larger scale transformation, and is refusing to collaborate with team X. This project is very likely going to fail, and cause serious harm to the company as a whole financially and from a people perspective. I argue that this is not just because of bad leadership, but also because of various trends and mindsets in the DS community at large. Update (Jump to below the line for the original story): Several people in the comments are pointing out that this just a management failure, not something due to ML/DS, and that you can replace DS with any buzz tech and the story will still be relevant. My response: Of course, any failure at an organization level is ultimately a management failure one way or the other. Moreover, it is also the case that ML/DS when done correctly, will always improve a company's bottom line. There is no scenario where the proper ML solution, delivered at a reasonable cost and in a timely fashion, will somehow hurt the company's bottom line. My point is that in this case management is failing because of certain trends and practices that are specific to the ML/DS community, namely: The idea that DS teams should operate independently of tech and business orgs -- too much autonomy for DS teams The disregard for domain knowledge that seems prevalent nowadays thanks to the ML hype, that DS can be generalists and someone with good enough ML chops can solve any business problem. That wasn't the case when I first left academia for the industry in 2009 (back then nobody would even bother with a phone screen if you didn't have the right domain knowledge). Over reliance on resources who check all the ML hype related boxes (knows Python, R, Tensorflow, Shiny, etc..., has the right Coursera certifications, has blogged on the topic, etc...), but are lacking in depth of experience. DS interviews nowadays all seem to be: Can you tell me what a p-value is? What is elastic net regression? Show me how to fit a model in sklearn? How do you impute NAs in an R dataframe? Any smart person can look those up on Stackoverflow or Cross-Validated,.....Instead teams should be asking stuff like: why does portfolio optimization use QP not LP? How does a forecast influence a customer service level? When should a recommendation engine be content based and when should it use collaborative filtering? etc... (This is a true story, happening to the company I currently work for. Names, domains, algorithms, and roles have been shuffled around to protect my anonymity)  Company A has been around for several decades. It is not the biggest name in its domain, but it is a well respected one. Risk analysis and portfolio optimization have been a core of Company A's business since the 90s. They have a large team of 30 or so analysts who perform those tasks on a daily basis. These analysts use ERP solutions implemented for them by one the big ERP companies (SAP, Teradata, Oracle, JD Edwards,...) or one of the major tech consulting companies (Deloitte, Accenture, PWC, Capgemini, etc...) in collaboration with their own in house engineering team. The tools used are embarrassingly old school: Classic RDBMS running on on-prem servers or maybe even on mainframes, code written in COBOL, Fortran, weird proprietary stuff like ABAP or SPSS.....you get the picture. But the models and analytic functions were pretty sophisticated, and surprisingly cutting edge compared to the published academic literature. Most of all, they fit well with the company's enterprise ecosystem, and were honed based on years of deep domain knowledge.  They have a tech team of several engineers (poached from the aforementioned software and consulting companies) and product managers (who came from the experienced pools of analysts and managers who use the software, or poached from business rivals) maintaining and running this software. Their technology might be old school, but collectively, they know the domain and the company's overall architecture very, very well. They've guided the company through several large scale upgrades and migrations and they have a track record of delivering on time, without too much overhead. The few times they've stumbled, they knew how to pick themselves up very quickly. In fact within their industry niche, they have a reputation for their expertise, and have very good relations with the various vendors they've had to deal with. They were the launching pad of several successful ERP consulting careers.  Interestingly, despite dealing on a daily basis with statistical modeling and optimization algorithms, none of the analysts, engineers, or product managers involved describe themselves as data scientists or machine learning experts. It is mostly a cultural thing: Their expertise predates the Data Science/ML hype that started circa 2010, and they got most of their chops using proprietary enterprise tools instead of the open source tools popular nowadays. A few of them have formal statistical training, but most of them came from engineering or domain backgrounds and learned stats on the fly while doing their job. Call this team "Team X".  Sometime around the mid 2010s, Company A started having some serious anxiety issues: Although still doing very well for a company its size, overall economic and demographic trends were shrinking its customer base, and a couple of so called disruptors came up with a new app and business model that started seriously eating into their revenue. A suitable reaction to appease shareholders and Wall Street was necessary. The company already had a decent website and a pretty snazzy app, what more could be done? Leadership decided that it was high time that AI and ML become a core part of the company's business. An ambitious Manager, with no science or engineering background, but who had very briefly toyed with a recommender system a couple of years back, was chosen to build a data science team, call it team "Y" (he had a bachelor's in history from the local state college and worked for several years in the company's marketing org). Team "Y" consists mostly of internal hires who decided they wanted to be data scientists and completed a Coursera certification or a Galvanize boot camp, before being brought on to the team, along with a few of fresh Ph.D or M.Sc holders who didn't like academia and wanted to try their hand at an industry role. All of them were very bright people, they could write great Medium blog posts and give inspiring TED talks, but collectively they had very little real world industry experience. As is the fashion nowadays, this group was made part of a data science org that reported directly to the CEO and Board, bypassing the CIO and any tech or business VPs, since Company A wanted to claim the monikers "data driven" and "AI powered" in their upcoming shareholder meetings. In 3 or 4 years of existence, team Y produced a few Python and R scripts. Their architectural experience  consisted almost entirely in connecting Flask to S3 buckets or Redshift tables, with a couple of the more resourceful ones learning how to plug their models into Tableau or how to spin up a Kuberneties pod.  But they needn't worry: The aforementioned manager, who was now a director (and was also doing an online Masters to make up for his qualifications gap and bolster his chances of becoming VP soon - at least he now understands what L1 regularization is), was a master at playing corporate politics and self-promotion. No matter how few actionable insights team Y produced or how little code they deployed to production, he always had their back and made sure they had ample funding. In fact he now had grandiose plans for setting up an all-purpose machine learning platform that can be used to solve all of the company's data problems.  A couple of sharp minded members of team Y, upon googling their industry name along with the word "data science", realized that risk analysis was a prime candidate for being solved with Bayesian models, and there was already a nifty R package for doing just that, whose tutorial they went through on R-Bloggers.com. One of them had even submitted a Bayesian classifier Kernel for a competition on Kaggle (he was 203rd on the leaderboard), and was eager to put his new-found expertise to use on a real world problem. They pitched the idea to their director, who saw a perfect use case for his upcoming ML platform. They started work on it immediately, without bothering to check whether anybody at Company A was already doing risk analysis. Since their org was independent, they didn't really need to check with anybody else before they got funding for their initiative. Although it was basically a Naive Bayes classifier, the term ML was added to the project tile, to impress the board.  As they progressed with their work however, tensions started to build. They had asked the data warehousing and CA analytics teams to build pipelines for them, and word eventually got out to team X about their project. Team X was initially thrilled: They offered to collaborate whole heartedly, and would have loved to add an ML based feather to their already impressive cap. The product owners and analysts were totally onboard as well: They saw a chance to get in on the whole Data Science hype that they kept hearing about. But through some weird mix of arrogance and insecurity, team Y refused to collaborate with them or share any of their long term goals with them, even as they went to other parts of the company giving brown bag presentations and tutorials on the new model they created.  Team X got resentful: from what they saw of team Y's model, their approach was hopelessly naive and had little chances of scaling or being sustainable in production, and they knew exactly how to help with that. Deploying the model to production would have taken them a few days, given how comfortable they were with DevOps and continuous delivery (team Y had taken several months to figure out how to deploy a simple R script to production). And despite how old school their own tech was, team X were crafty enough to be able to plug it in to their existing architecture. Moreover, the output of the model was such that it didn't take into account how the business will consume it or how it was going to be fed to downstream systems, and the product owners could have gone a long way in making the model more amenable to adoption by the business stakeholders. But team Y wouldn't listen, and their leads brushed off any attempts at communication, let alone collaboration. The vibe that team Y was giving off was "We are the cutting edge ML team, you guys are the legacy server grunts. We don't need your opinion.", and they seemed to have a complete disregard for domain knowledge, or worse, they thought that all that domain knowledge consisted of was being able to grasp the definitions of a few business metrics.  Team X got frustrated and tried to express their concerns to leadership. But despite owning a vital link in Company A's business process, they were only \~50 people in a large 1000 strong technology and operations org, and they were several layers removed from the C-suite, so it was impossible for them to get their voices heard.  Meanwhile, the unstoppable director was doing what he did best: Playing corporate politics. Despite how little his team had actually delivered, he had convinced the board that all analysis and optimization tasks should now be migrated to his yet to be delivered ML platform. Since most leaders now knew that there was overlap between team Y and team X's objectives, his pitch was no longer that team Y was going to create a new insight, but that they were going to replace (or modernize) the legacy statistics based on-prem tools with more accurate cloud based ML tools. Never mind that there was no support in the academic literature for the idea that Naive Bayes works better than the Econometric approaches used by team X, let alone the additional wacky idea that Bayesian Optimization would definitely outperform the QP solvers that were running in production.  Unbeknownst to team X, the original Bayesian risk analysis project has now grown into a multimillion dollar major overhaul initiative, which included the eventual replacement of all of the tools and functions supported by team X along with the necessary migration to the cloud. The CIO and a couple of business VPs are on now board, and tech leadership is treating it as a done deal. An outside vendor, a startup who nobody had heard of, was contracted to help build the platform, since team Y has no engineering skills. The choice was deliberate, as calling on any of the established consulting or software companies would have eventually led leadership to the conclusion that team X was better suited for a transformation on this scale than team Y.  Team Y has no experience with any major ERP deployments, and no domain knowledge, yet they are being tasked with fundamentally changing the business process that is at the core of Company A's business. Their models actually perform worse than those deployed by team X, and their architecture is hopelessly simplistic, compared to what is necessary for running such a solution in production.  Ironically, using Bayesian thinking and based on all the evidence, the likelihood that team Y succeeds is close to 0%. At best, the project is going to end up being a write off of 50 million dollars or more. Once the !@#$!@hits the fan, a couple of executive heads are going to role, and dozens of people will get laid off. At worst, given how vital risk analysis and portfolio optimization is to Company A's revenue stream, the failure will eventually sink the whole company. It probably won't go bankrupt, but it will lose a significant portion of its business and work force. Failed ERP implementations can and do sink large companies: Just see what happened to National Grid US, SuperValu or Target Canada.  One might argue that this is more about corporate disfunction and bad leadership than about data science and AI. But I disagree. I think the core driver of this debacle is indeed the blind faith in Data Scientists, ML models and the promise of AI, and the overall culture of hype and self promotion that is very common among the ML crowd.  We haven't seen the end of this story: I sincerely hope that this ends well for the sake of my colleagues and all involved. Company A is a good company, and both its customers and its employees deserver better. But the chances of that happening are negligible given all the information available, and this failure will hit my company hard.

[N] How Stability AI’s Founder Tanked His Billion-Dollar Startup
reddit
LLM Vibe Score0
Human Vibe Score0.667
milaworldThis week

[N] How Stability AI’s Founder Tanked His Billion-Dollar Startup

forbes article: https://www.forbes.com/sites/kenrickcai/2024/03/29/how-stability-ais-founder-tanked-his-billion-dollar-startup/ archive no paywall: https://archive.is/snbeV How Stability AI’s Founder Tanked His Billion-Dollar Startup Mar 29, 2024 Stability AI founder Emad Mostaque took the stage last week at the Terranea Resort in Palos Verdes, California to roaring applause and an introduction from an AI-generated Aristotle who announced him as “a modern Prometheus” with “the astuteness of Athena and the vision of Daedalus.” “Under his stewardship, AI becomes the Herculean force poised to vanquish the twin serpents of illness and ailment and extend the olive branch of longevity,” the faux Aristotle proclaimed. “I think that’s the best intro I’ve ever had,” Mostaque said. But behind Mostaque's hagiographic introduction lay a grim and fast metastasizing truth. Stability, once one of AI’s buzziest startups, was floundering. It had been running out of money for months and Mostaque had been unable to secure enough additional funding. It had defaulted on payments to Amazon whose cloud service undergirded Stability’s core offerings. The star research team behind its flagship text-to-image generator Stable Diffusion had tendered their resignations just three days before — as Forbes would first report — and other senior leaders had issued him an ultimatum: resign, or we walk too. Still, onstage before a massive audience of peers and acolytes, Mostaque talked a big game. “AI is jet planes for the mind,” he opined. “AI is our collective intelligence. It's the human Colossus.” He claimed a new, faster version of the Stable Diffusion image generator released earlier this month could generate “200 cats with hats per second.” But later, when he was asked about Stability’s financial model, Mostaque fumbled. “I can’t say that publicly,” he replied. “But it’s going well. We’re ahead of forecast.” Four days later, Mostaque stepped down as CEO of Stability, as Forbes first reported. In a post to X, the service formerly known as Twitter, he claimed he’d voluntarily abdicated his role to decentralize “the concentration of power in AI.” But sources told Forbes that was hardly the case. Behind the scenes, Mostaque had fought to maintain his position and control despite mounting pressure externally and internally to step down. Company documents and interviews with 32 current and former employees, investors, collaborators and industry observers suggest his abrupt exit was the result of poor business judgment and wild overspending that undermined confidence in his vision and leadership, and ultimately kneecapped the company. Mostaque, through his attorneys, declined to comment on record on a detailed list of questions about the reporting in this story. But in an email to Forbes earlier this week he broadly disputed the allegations. “Nobody tells you how hard it is to be a CEO and there are better CEOs than me to scale a business,” he said in a statement. “I am not sure anyone else would have been able to build and grow the research team to build the best and most widely used models out there and I’m very proud of the team there. I look forward to moving onto the next problem to handle and hopefully move the needle.” In an emailed statement, Christian Laforte and Shan Shan Wong, the interim co-CEOs who replaced Mostaque, said, "the company remains focused on commercializing its world leading technology” and providing it “to partners across the creative industries." After starting Stability in 2019, Mostaque built the company into an early AI juggernaut by seizing upon a promising research project that would become Stable Diffusion and funding it into a business reality. The ease with which the software generated detailed images from the simplest text prompts immediately captivated the public: 10 million people used it on any given day, the company told Forbes in early 2023. For some true believers, Mostaque was a crucial advocate for open-source AI development in a space dominated by the closed systems of OpenAI, Google and Anthropic. But his startup’s rise to one of the buzziest in generative AI was in part built on a series of exaggerations and misleading claims, as Forbes first reported last year (Mostaque disputed some points at the time). And they continued after he raised $100 million at a $1 billion valuation just days after launching Stable Diffusion in 2022. His failure to deliver on an array of grand promises, like building bespoke AI models for nation states, and his decision to pour tens of millions into research without a sustainable business plan, eroded Stability’s foundations and jeopardized its future. "He was just giving shit away,” one former employee told Forbes. “That man legitimately wanted to transform the world. He actually wanted to train AI models for kids in Malawi. Was it practical? Absolutely not." By October 2023, Stability would have less than $4 million left in the bank, according to an internal memo prepared for a board meeting and reviewed by Forbes. And mounting debt, including months of overdue Amazon Web Services payments, had already left it in the red. To avoid legal penalties for skipping Americans staff’s payroll, the document explained, the London-based startup was considering delaying tax payments to the U.K. government. It was Stability’s armada of GPUs, the wildly powerful and equally expensive chips undergirding AI, that were so taxing the company’s finances. Hosted by AWS, they had long been one of Mostaque’s bragging points; he often touted them as one of the world’s 10 largest supercomputers. They were responsible for helping Stability’s researchers build and maintain one of the top AI image generators, as well as break important new ground on generative audio, video and 3D models. “Undeniably, Stability has continued to ship a lot of models,” said one former employee. “They may not have profited off of it, but the broader ecosystem benefitted in a huge, huge way.” But the costs associated with so much compute were now threatening to sink the company. According to an internal October financial forecast seen by Forbes, Stability was on track to spend $99 million on compute in 2023. It noted as well that Stability was “underpaying AWS bills for July (by $1M)” and “not planning to pay AWS at the end of October for August usage ($7M).” Then there were the September and October bills, plus $1 million owed to Google Cloud and $600,000 to GPU cloud data center CoreWeave. (Amazon, Google and CoreWeave declined to comment.) With an additional $54 million allocated to wages and operating expenses, Stability’s total projected costs for 2023 were $153 million. But according to its October financial report, its projected revenue for the calendar year was just $11 million. Stability was on track to lose more money per month than it made in an entire year. The company’s dire financial position had thoroughly soured Stability’s current investors, including Coatue, which had invested tens of millions in the company during its $101 million funding round in 2022. In the middle of 2023, Mostaque agreed to an independent audit after Coatue raised a series of concerns, according to a source with direct knowledge of the matter. The outcome of the investigation is unclear. Coatue declined to comment. Within a week of an early October board meeting where Mostaque shared that financial forecast, Lightspeed Venture Partners, another major investor, sent a letter to the board urging them to sell the company. The distressing numbers had “severely undermined” the firm’s confidence in Mostaque’s ability to lead the company. “In particular, we are surprised and deeply concerned by a cash position just now disclosed to us that is inconsistent with prior discussions on this topic,” Lightspeed’s general counsel Brett Nissenberg wrote in the letter, a copy of which was viewed by Forbes. “Lightspeed believes that the company is not likely financeable on terms that would assure the company’s long term sound financial position.” (Lightspeed declined a request for comment.) The calls for a sale led Stability to quietly begin looking for a buyer. Bloomberg reported in November that Stability approached AI startups Cohere and Jasper to gauge their interest. Stability denied this, and Jasper CEO Timothy Young did the same when reached for comment by Forbes. A Cohere representative declined to comment. But one prominent AI company confirmed that Mostaque’s representatives had reached out to them to test the waters. Those talks did not advance because “the numbers didn’t add up,” this person, who declined to be named due to the confidential nature of the talks, told Forbes. Stability also tried to court Samsung as a buyer, going so far as to redecorate its office in advance of a planned meeting with the Korean electronics giant. (Samsung said that it invested in Stability in 2023 and that it does not comment on M&A discussions.) Coatue had been calling for Mostaque’s resignation for months, according to a source with direct knowledge. But it and other investors were unable to oust him because he was the company’s majority shareholder. When they tried a different tact by rallying other investors to offer him a juicy equity package to resign, Mostaque refused, said two sources. By October, Coatue and Lightspeed had had enough. Coatue left the board and Lightspeed resigned its observer seat. “Emad infuriated our initial investors so much it’s just making it impossible for us to raise more money under acceptable terms,” one current Stability executive told Forbes. The early months of 2024 saw Stability’s already precarious position eroding further still. Employees were quietly laid off. Three people in a position to know estimated that at least 10% of staff were cut. And cash reserves continued to dwindle. Mostaque mentioned a lifeline at the October board meeting: $95 million in tentative funding from new investors, pending due diligence. But in the end, only a fraction of it was wired, two sources say, much of it from Intel, which Forbes has learned invested $20 million, a fraction of what was reported. (Intel did not return a request for comment by publication time.) Two hours after Forbes broke the news of Mostaque’s plans to step down as CEO, Stability issued a press release confirming his resignation. Chief operating officer Wong and chief technology officer Laforte have taken over in the interim. Mostaque, who said on X that he still owns a majority of the company, also stepped down from the board, which has now initiated a search for a permanent CEO. There is a lot of work to be done to turn things around, and very little time in which to do it. Said the current Stability executive, “There’s still a possibility of a turnaround story, but the odds drop by the day.” In July of 2023, Mostaque still thought he could pull it off. Halfway through the month, he shared a fundraising plan with his lieutenants. It was wildly optimistic, detailing the raise of $500 million in cash and another $750 million in computing facilities from marquee investors like Nvidia, Google, Intel and the World Bank (Nvidia and Google declined comment. Intel did not respond. The World Bank said it did not invest in Stability). In a Slack message reviewed by Forbes, Mostaque said Google was “willing to move fast” and the round was “likely to be oversubscribed.” It wasn’t. Three people with direct knowledge of these fundraising efforts told Forbes that while there was some interest in Stability, talks often stalled when it came time to disclose financials. Two of them noted that earlier in the year, Mostaque had simply stopped engaging with VCs who asked for numbers. Only one firm invested around that time: actor Ashton Kutcher’s Sound Ventures, which invested $35 million in the form of a convertible SAFE note during the second quarter, according to an internal document. (Sound Ventures did not respond to a request for comment.) And though he’d managed to score a meeting with Nvidia and its CEO Jensen Huang, it ended in disaster, according to two sources. “Under Jensen's microscopic questions, Emad just fell apart,” a source in position to know told Forbes. Huang quickly concluded Stability wasn’t ready for an investment from Nvidia, the sources said. Mostaque told Forbes in an email that he had not met with Huang since 2022, except to say “hello and what’s up a few times after.” His July 2023 message references a plan to raise $150 million from Nvidia. (Nvidia declined to comment.) After a June Forbes investigation citing more than 30 sources revealed Mostaque’s history of misleading claims, Mostaque struggled to raise funding, a Stability investor told Forbes. (Mostaque disputed the story at the time and called it "coordinated lies" in his email this week to Forbes). Increasingly, investors scrutinized his assertions and pressed for data. And Young, now the CEO of Jasper, turned down a verbal offer to be Stability’s president after reading the article, according to a source with direct knowledge of the matter. The collapse of the talks aggravated the board and other executives, who had hoped Young would compensate for the sales and business management skills that Mostaque lacked, according to four people in a position to know. (Young declined to comment.) When Stability’s senior leadership convened in London for the CogX conference in September, the financing had still not closed. There, a group of executives confronted Mostaque asking questions about the company’s cash position and runway, according to three people with direct knowledge of the incident. They did not get the clarity they’d hoped for. By October, Mostaque had reduced his fundraising target by more than 80%. The months that followed saw a steady drumbeat of departures — general counsel Adam Avrunin, vice presidents Mike Melnicki, Ed Newton-Rex and Joe Penna, chief people officer Ozden Onder — culminating in the demoralizing March exit of Stable Diffusion’s primary developers Robin Rombach, Andreas Blattmann, Patrick Esser and Dominik Lorenz. Rombach, who led the team, had been angling to leave for months, two sources said, first threatening to resign last summer because of the fundraising failures. Others left over concerns about cash flow, as well as liabilities — including what four people described as Mostaque’s lax approach to ensuring that Stability products could not be used to produce child sexual abuse imagery. “Stability AI is committed to preventing the misuse of AI and prohibits the use of our image models and services for unlawful activity, including attempts to edit or create CSAM,” Ella Irwin, senior vice president of integrity, said in a statement. Newton-Rex told Forbes he resigned because he disagreed with Stability’s position that training AI on copyrighted work without consent is fair use. Melnicki and Penna declined to comment. Avrunin and Onder could not be reached for comment. None of the researchers responded to requests for comment. The Stable Diffusion researchers’ departure as a cohort says a lot about the state of Stability AI. The company’s researchers were widely viewed as its crown jewels, their work subsidized with a firehose of pricey compute power that was even extended to people outside the company. Martino Russi, an artificial intelligence researcher, told Forbes that though he was never formally employed by Stability, the company provided him a “staggering” amount of compute between January and April 2023 to play around with developing an AI video generator that Stability might someday use. “It was Candy Land or Coney Island,” said Russi, who estimates that his experiment, which was ultimately shelved, cost the company $2.5 million. Stable Diffusion was simultaneously Stability’s marquee product and its existential cash crisis. One current employee described it to Forbes as “a giant vacuum that absorbed everything: money, compute, people.” While the software was widely used, with Mostaque claiming downloads reaching into the hundreds of millions, Stability struggled to translate that wild success into revenue. Mostaque knew it could be done — peers at Databricks, Elastic and MongoDB had all turned a free product into a lucrative business — he just couldn’t figure out how. His first attempt was Stability’s API, which allowed paying customers to integrate Stable Diffusion into their own products. In early 2023, a handful of small companies, like art generator app NightCafe and presentation software startup Tome, signed on, according to four people with knowledge of the deals. But Stability’s poor account management services soured many, and in a matter of months NightCafe and Tome canceled their contracts, three people said. NightCafe founder Angus Russell told Forbes that his company switched to a competitor which “offered much cheaper inference costs and a broader service.” Tome did not respond to a request for comment. Meanwhile, Mostaque’s efforts to court larger companies like Samsung and Snapchat were failing, according to five people familiar with the effort. Canva, which was already one of the heaviest users of open-sourced Stable Diffusion, had multiple discussions with Stability, which was angling for a contract it hoped would generate several millions in annual revenue. But the deal never materialized, four sources said. “These three companies wanted and needed us,” one former employee told Forbes. “They would have been the perfect customers.” (Samsung, Snap and Canva declined to comment.) “It’s not that there was not an appetite to pay Stability — there were tons of companies that would have that wanted to,” the former employee said. “There was a huge opportunity and demand, but just a resistance to execution.” Mostaque’s other big idea was to provide governments with bespoke national AI models that would invigorate their economies and citizenry. “Emad envisions a world where AI through 100 national models serves not as a tool of the few, but as a benefactor to all promising to confront great adversaries, cancer, autism, and the sands of time itself,” the AI avatar of Aristotle said in his intro at the conference. Mostaque told several prospective customers that he could deliver such models within 60 days — an untenable timeline, according to two people in position to know. Stability attempted to develop a model for the Singaporean government over the protestation of employees who questioned its technical feasibility, three sources familiar with the effort told Forbes. But it couldn’t pull it off and Singapore never became a customer. (The government of Singapore confirmed it did not enter into a deal with Stability, but declined to answer additional questions.) As Stability careened from one new business idea to another, resources were abruptly reallocated and researchers reassigned. The whiplash shifts in a largely siloed organization demoralized and infuriated employees. “There were ‘urgent’ things, ‘urgent urgent’ things and ‘most urgent,’” one former employee complained. “None of these things seem important if everything is important.” Another former Stability executive was far more pointed in their assessment. “Emad is the most disorganized leader I have ever worked with in my career,” this person told Forbes. “He has no vision, and changes directions every week, often based on what he sees on Twitter.” In a video interview posted shortly before this story was published, Mostaque explained his leadership style: “I'm particularly great at taking creatives, developers, researchers, others, and achieving their full potential in designing systems. But I should not be dealing with, you know, HR and operations and business development and other elements. There are far better people than me to do that.” By December 2023, Stability had partially abandoned its open-source roots and announced that any commercial use of Stable Diffusion would cost customers at least $20 per month (non-commercial and research use of Stable Diffusion would remain free). But privately, Stability was considering a potentially more lucrative source of revenue: reselling the compute it was leasing from providers like AWS, according to six people familiar with the effort. Though it was essentially GPU arbitrage, Stability framed the strategy to investors as a “managed services” offering. Its damning October financial report projected optimistically that such an offering would bring in $139 million in 2024 — 98% of its revenue. Multiple employees at the time told Forbes they feared reselling compute, even if the company called it “managed services,” would violate the terms of Stability’s contract with AWS. Amazon declined to comment. “The line internally was that we are not reselling compute,” one former employee said. “This was some of the dirtiest feeling stuff.” Stability also discussed reselling a cluster of Nvidia A100 chips, leased via CoreWeave, to the venture capital firm Andreessen Horowitz, three sources said. “It was under the guise of managed services, but there wasn’t any management happening,” one of these people told Forbes. Andreessen Horowitz and CoreWeave declined to comment. Stability did not respond to questions about if it plans to continue this strategy now that Mostaque is out of the picture. Regardless, interim co-CEOs Wong and Laforte are on a tight timeline to clean up his mess. Board chairman Jim O’Shaughnessy said in a statement that he was confident the pair “will adeptly steer the company forward in developing and commercializing industry-leading generative AI products.” But burn continues to far outpace revenue. The Financial Times reported Friday that the company made $5.4 million of revenue in February, against $8 million in costs. Several sources said there are ongoing concerns about making payroll for the roughly 150 remaining employees. Leadership roles have gone vacant for months amid the disarray, leaving the company increasingly directionless. Meanwhile, a potentially catastrophic legal threat looms over the company: A trio of copyright infringement lawsuits brought by Getty Images and a group of artists in the U.S. and U.K., who claim Stability illegally used their art and photography to train the AI models powering Stable Diffusion. A London-based court has already rejected the company’s bid to throw out one of the lawsuits on the basis that none of its researchers were based in the U.K. And Stability’s claim that Getty’s Delaware lawsuit should be blocked because it's a U.K.-based company was rejected. (Stability did not respond to questions about the litigation.) AI-related copyright litigation “could go on for years,” according to Eric Goldman, a law professor at Santa Clara University. He told Forbes that though plaintiffs suing AI firms face an uphill battle overcoming the existing legal precedent on copyright infringement, the quantity of arguments available to make are virtually inexhaustible. “Like in military theory, if there’s a gap in your lines, that’s where the enemy pours through — if any one of those arguments succeeds, it could completely change the generative AI environment,” he said. “In some sense, generative AI as an industry has to win everything.” Stability, which had more than $100 million in the bank just a year and a half ago, is in a deep hole. Not only does it need more funding, it needs a viable business model — or a buyer with the vision and chops to make it successful in a fast-moving and highly competitive sector. At an all hands meeting this past Monday, Stability’s new leaders detailed a path forward. One point of emphasis: a plan to better manage resources and expenses, according to one person in attendance. It’s a start, but Mostaque’s meddling has left them with little runway to execute. His resignation, though, has given some employees hope. “A few people are 100% going to reconsider leaving after today,” said one current employee. “And the weird gloomy aura of hearing Emad talking nonsense for an hour is gone.” Shortly before Mostaque resigned, one current Stability executive told Forbes that they were optimistic his departure could make Stability appealing enough to receive a small investment or sale to a friendly party. “There are companies that have raised hundreds of millions of dollars that have much less intrinsic value than Stability,” the person said. “A white knight may still appear.”

I run an AI automation agency (AAA). My honest overview and review of this new business model
reddit
LLM Vibe Score0
Human Vibe Score1
AI_Scout_OfficialThis week

I run an AI automation agency (AAA). My honest overview and review of this new business model

I started an AI tools directory in February, and then branched off that to start an AI automation agency (AAA) in June. So far I've come across a lot of unsustainable "ideas" to make money with AI, but at the same time a few diamonds in the rough that aren't fully tapped into yet- especially the AAA model. Thought I'd share this post to shine light into this new business model and share some ways you could potentially start your own agency, or at the very least know who you are dealing with and how to pick and choose when you (inevitably) get bombarded with cold emails from them down the line. Foreword Running an AAA does NOT involve using AI tools directly to generate and sell content directly. That ship has sailed, and unless you are happy with $5 from Fiverr every month or so, it is not a real business model. Cry me a river but generating generic art with AI and slapping it onto a T-shirt to sell on Etsy won't make you a dime. At the same time, the AAA model will NOT require you to have a deep theoretical knowledge of AI, or any academic degree, as we are more so dealing with the practical applications of generative AI and how we can implement these into different workflows and tech-stacks, rather than building AI models from the ground up. Regardless of all that, common sense and a willingness to learn will help (a shit ton), as with anything. Keep in mind - this WILL involve work and motivation as well. The mindset that AI somehow means everything can be done for you on autopilot is not the right way to approach things. The common theme of businesses I've seen who have successfully implemented AI into their operations is the willingess to work with AI in a way that augments their existing operations, rather than flat out replace a worker or team. And this is exactly the train of thought you need when working with AI as a business model. However, as the field is relatively unsaturated and hype surrounding AI is still fresh for enterprises, right now is the prime time to start something new if generative AI interests you at all. With that being said, I'll be going over three of the most successful AI-adjacent businesses I've seen over this past year, in addition to some tips and resources to point you in the right direction. so.. WTF is an AI Automation Agency? The AI automation agency (or as some YouTubers have coined it, the AAA model) at its core involves creating custom AI solutions for businesses. I have over 1500 AI tools listed in my directory, however the feedback I've received from some enterprise users is that ready-made SaaS tools are too generic to meet their specific needs. Combine this with the fact virtually no smaller companies have the time or skills required to develop custom solutions right off the bat, and you have yourself real demand. I would say in practice, the AAA model is quite similar to Wordpress and even web dev agencies, with the major difference being all solutions you develop will incorporate key aspects of AI AND automation. Which brings me to my second point- JUST AI IS NOT ENOUGH. Rather than reducing the amount of time required to complete certain tasks, I've seen many AI agencies make the mistake of recommending and (trying to) sell solutions that more likely than not increase the workload of their clients. For example, if you were to make an internal tool that has AI answer questions based on their knowledge base, but this knowledge base has to be updated manually, this is creating unnecessary work. As such I think one of the key components of building successful AI solutions is incorporating the new (Generative AI/LLMs) with the old (programmtic automation- think Zapier, APIs, etc.). Finally, for this business model to be successful, ideally you should target a niche in which you have already worked and understand pain points and needs. Not only does this make it much easier to get calls booked with prospects, the solutions you build will have much greater value to your clients (meaning you get paid more). A mistake I've seen many AAA operators make (and I blame this on the "Get Rich Quick" YouTubers) is focusing too much on a specific productized service, rather than really understanding the needs of businesses. The former is much done via a SaaS model, but when going the agency route the only thing that makes sense is building custom solutions. This is why I always take a consultant-first approach. You can only build once you understand what they actually need and how certain solutions may impact their operations, workflows, and bottom-line. Basics of How to Get Started Pick a niche. As I mentioned previously, preferably one that you've worked in before. Niches I know of that are actively being bombarded with cold emails include real estate, e-commerce, auto-dealerships, lawyers, and medical offices. There is a reason for this, but I will tell you straight up this business model works well if you target any white-collar service business (internal tools approach) or high volume businesses (customer facing tools approach). Setup your toolbox. If you wanted to start a pressure washing business, you would need a pressure-washer. This is no different. For those without programming knowledge, I've seen two common ways AAA get setup to build- one is having a network of on-call web developers, whether its personal contacts or simply going to Upwork or any talent sourcing agency. The second is having an arsenal of no-code tools. I'll get to this more in a second, but this works beecause at its core, when we are dealing with the practical applications of AI, the code is quite simple, simply put. Start cold sales. Unless you have a network already, this is not a step you can skip. You've already picked a niche, so all you have to do is find the right message. Keep cold emails short, sweet, but enticing- and it will help a lot if you did step 1 correctly and intimately understand who your audience is. I'll be touching base later about how you can leverage AI yourself to help you with outreach and closing. The beauty of gen AI and the AAA model You don't need to be a seasoned web developer to make this business model work. The large majority of solutions that SME clients want is best done using an API for an LLM for the actual AI aspect. The value we create with the solutions we build comes with the conceptual framework and design that not only does what they need it to but integrates smoothly with their existing tech-stack and workflow. The actual implementation is quite straightforward once you understand the high level design and know which tools you are going to use. To give you a sense, even if you plan to build out these apps yourself (say in Python) the large majority of the nitty gritty technical work has already been done for you, especially if you leverage Python libraries and packages that offer high level abstraction for LLM-related functions. For instance, calling GPT can be as little as a single line of code. (And there are no-code tools where these functions are simply an icon on a GUI). Aside from understanding the capabilities and limitations of these tools and frameworks, the only thing that matters is being able to put them in a way that makes sense for what you want to build. Which is why outsourcing and no-code tools both work in our case. Okay... but how TF am I suppposed to actually build out these solutions? Now the fun part. I highly recommend getting familiar with Langchain and LlamaIndex. Both are Python libraires that help a lot with the high-level LLM abstraction I mentioned previously. The two most important aspects include being able to integrate internal data sources/knowledge bases with LLMs, and have LLMs perform autonomous actions. The two most common methods respectively are RAG and output parsing. RAG (retrieval augmented Generation) If you've ever seen a tool that seemingly "trains" GPT on your own data, and wonder how it all works- well I have an answer from you. At a high level, the user query is first being fed to what's called a vector database to run vector search. Vector search basically lets you do semantic search where you are searching data based on meaning. The vector databases then retrieves the most relevant sections of text as it relates to the user query, and this text gets APPENDED to your GPT prompt to provide extra context to the AI. Further, with prompt engineering, you can limit GPT to only generate an answer if it can be found within this extra context, greatly limiting the chance of hallucination (this is where AI makes random shit up). Aside from vector databases, we can also implement RAG with other data sources and retrieval methods, for example SQL databses (via parsing the outputs of LLM's- more on this later). Autonomous Agents via Output Parsing A common need of clients has been having AI actually perform tasks, rather than simply spitting out text. For example, with autonomous agents, we can have an e-commerce chatbot do the work of a basic customer service rep (i.e. look into orders, refunds, shipping). At a high level, what's going on is that the response of the LLM is being used programmtically to determine which API to call. Keeping on with the e-commerce example, if I wanted a chatbot to check shipping status, I could have a LLM response within my app (not shown to the user) with a prompt that outputs a random hash or string, and programmatically I can determine which API call to make based on this hash/string. And using the same fundamental concept as with RAG, I can append the the API response to a final prompt that would spit out the answer for the user. How No Code Tools Can Fit In (With some example solutions you can build) With that being said, you don't necessarily need to do all of the above by coding yourself, with Python libraries or otherwise. However, I will say that having that high level overview will help IMMENSELY when it comes to using no-code tools to do the actual work for you. Regardless, here are a few common solutions you might build for clients as well as some no-code tools you can use to build them out. Ex. Solution 1: AI Chatbots for SMEs (Small and Medium Enterprises) This involves creating chatbots that handle user queries, lead gen, and so forth with AI, and will use the principles of RAG at heart. After getting the required data from your client (i.e. product catalogues, previous support tickets, FAQ, internal documentation), you upload this into your knowledge base and write a prompt that makes sense for your use case. One no-code tool that does this well is MyAskAI. The beauty of it especially for building external chatbots is the ability to quickly ingest entire websites into your knowledge base via a sitemap, and bulk uploading files. Essentially, they've covered the entire grunt work required to do this manually. Finally, you can create a inline or chat widget on your client's website with a few lines of HTML, or altneratively integrate it with a Slack/Teams chatbot (if you are going for an internal Q&A chatbot approach). Other tools you could use include Botpress and Voiceflow, however these are less for RAG and more for building out complete chatbot flows that may or may not incorporate LLMs. Both apps are essentially GUIs that eliminate the pain and tears and trying to implement complex flows manually, and both natively incoporate AI intents and a knowledge base feature. Ex. Solution 2: Internal Apps Similar to the first example, except we go beyond making just chatbots but tools such as report generation and really any sort of internal tool or automations that may incorporate LLM's. For instance, you can have a tool that automatically generates replies to inbound emails based on your client's knowledge base. Or an automation that does the same thing but for replies to Instagram comments. Another example could be a tool that generates a description and screeenshot based on a URL (useful for directory sites, made one for my own :P). Getting into more advanced implementations of LLMs, we can have tools that can generate entire drafts of reports (think 80+ pages), based not only on data from a knowledge base but also the writing style, format, and author voice of previous reports. One good tool to create content generation panels for your clients would be MindStudio. You can train LLM's via prompt engineering in a structured way with your own data to essentially fine tune them for whatever text you need it to generate. Furthermore, it has a GUI where you can dictate the entire AI flow. You can also upload data sources via multiple formats, including PDF, CSV, and Docx. For automations that require interactions between multiple apps, I recommend the OG zapier/make.com if you want a no-code solution. For instance, for the automatic email reply generator, I can have a trigger such that when an email is received, a custom AI reply is generated by MyAskAI, and finally a draft is created in my email client. Or, for an automation where I can create a social media posts on multiple platforms based on a RSS feed (news feed), I can implement this directly in Zapier with their native GPT action (see screenshot) As for more complex LLM flows that may require multiple layers of LLMs, data sources, and APIs working together to generate a single response i.e. a long form 100 page report, I would recommend tools such as Stack AI or Flowise (open-source alternative) to build these solutions out. Essentially, you get most of the functions and features of Python packages such as Langchain and LlamaIndex in a GUI. See screenshot for an example of a flow How the hell are you supposed to find clients? With all that being said, none of this matters if you can't find anyone to sell to. You will have to do cold sales, one way or the other, especially if you are brand new to the game. And what better way to sell your AI services than with AI itself? If we want to integrate AI into the cold outreach process, first we must identify what it's good at doing, and that's obviously writing a bunch of text, in a short amount of time. Similar to the solutions that an AAA can build for its clients, we can take advantage of the same principles in our own sales processes. How to do outreach Once you've identified your niche and their pain points/opportunities for automation, you want to craft a compelling message in which you can send via cold email and cold calls to get prospects booked on demos/consultations. I won't get into too much detail in terms of exactly how to write emails or calling scripts, as there are millions of resources to help with this, but I will tell you a few key points you want to keep in mind when doing outreach for your AAA. First, you want to keep in mind that many businesses are still hesitant about AI and may not understand what it really is or how it can benefit their operations. However, we can take advantage of how mass media has been reporting on AI this past year- at the very least people are AWARE that sooner or later they may have to implement AI into their businesses to stay competitive. We want to frame our message in a way that introduces generative AI as a technology that can have a direct, tangible, and positive impact on their business. Although it may be hard to quantify, I like to include estimates of man-hours saved or costs saved at least in my final proposals to prospects. Times are TOUGH right now, and money is expensive, so you need to have a compelling reason for businesses to get on board. Once you've gotten your messaging down, you will want to create a list of prospects to contact. Tools you can use to find prospects include Apollo.io, reply.io, zoominfo (expensive af), and Linkedin Sales Navigator. What specific job titles, etc. to target will depend on your niche but for smaller companies this will tend to be the owner. For white collar niches, i.e. law, the professional that will be directly benefiting from the tool (i.e. partners) may be better to contact. And for larger organizations you may want to target business improvement and digital transformation leads/directors- these are the people directly in charge of projects like what you may be proposing. Okay- so you have your message, and your list, and now all it comes down to is getting the good word out. I won't be going into the details of how to send these out, a quick Google search will give you hundreds of resources for cold outreach methods. However, personalization is key and beyond simple dynamic variables you want to make sure you can either personalize your email campaigns directly with AI (SmartWriter.ai is an example of a tool that can do this), or at the very least have the ability to import email messages programmatically. Alternatively, ask ChatGPT to make you a Python Script that can take in a list of emails, scrape info based on their linkedin URL or website, and all pass this onto a GPT prompt that specifies your messaging to generate an email. From there, send away. How tf do I close? Once you've got some prospects booked in on your meetings, you will need to close deals with them to turn them into clients. Call #1: Consultation Tying back to when I mentioned you want to take a consultant-first appraoch, you will want to listen closely to their goals and needs and understand their pain points. This would be the first call, and typically I would provide a high level overview of different solutions we could build to tacke these. It really helps to have a presentation available, so you can graphically demonstrate key points and key technologies. I like to use Plus AI for this, it's basically a Google Slides add-on that can generate slide decks for you. I copy and paste my default company messaging, add some key points for the presentation, and it comes out with pretty decent slides. Call #2: Demo The second call would involve a demo of one of these solutions, and typically I'll quickly prototype it with boilerplate code I already have, otherwise I'll cook something up in a no-code tool. If you have a niche where one type of solution is commonly demanded, it helps to have a general demo set up to be able to handle a larger volume of calls, so you aren't burning yourself out. I'll also elaborate on how the final product would look like in comparison to the demo. Call #3 and Beyond: Once the initial consultation and demo is complete, you will want to alleviate any remaining concerns from your prospects and work with them to reach a final work proposal. It's crucial you lay out exactly what you will be building (in writing) and ensure the prospect understands this. Furthermore, be clear and transparent with timelines and communication methods for the project. In terms of pricing, you want to take this from a value-based approach. The same solution may be worth a lot more to client A than client B. Furthermore, you can create "add-ons" such as monthly maintenance/upgrade packages, training sessions for employeees, and so forth, separate from the initial setup fee you would charge. How you can incorporate AI into marketing your businesses Beyond cold sales, I highly recommend creating a funnel to capture warm leads. For instance, I do this currently with my AI tools directory, which links directly to my AI agency and has consistent branding throughout. Warm leads are much more likely to close (and honestly, much nicer to deal with). However, even without an AI-related website, at the very least you will want to create a presence on social media and the web in general. As with any agency, you will want basic a professional presence. A professional virtual address helps, in addition to a Google Business Profile (GBP) and TrustPilot. a GBP (especially for local SEO) and Trustpilot page also helps improve the looks of your search results immensely. For GBP, I recommend using ProfilePro, which is a chrome extension you can use to automate SEO work for your GBP. Aside from SEO optimzied business descriptions based on your business, it can handle Q/A answers, responses, updates, and service descriptions based on local keywords. Privacy and Legal Concerns of the AAA Model Aside from typical concerns for agencies relating to service contracts, there are a few issues (especially when using no-code tools) that will need to be addressed to run a successful AAA. Most of these surround privacy concerns when working with proprietary data. In your terms with your client, you will want to clearly define hosting providers and any third party tools you will be using to build their solution, and a DPA with these third parties listed as subprocessors if necessary. In addition, you will want to implement best practices like redacting private information from data being used for building solutions. In terms of addressing concerns directly from clients, it helps if you host your solutions on their own servers (not possible with AI tools), and address the fact only ChatGPT queries in the web app, not OpenAI API calls, will be used to train OpenAI's models (as reported by mainstream media). The key here is to be open and transparent with your clients about ALL the tools you are using, where there data will be going, and make sure to get this all in writing. have fun, and keep an open mind Before I finish this post, I just want to reiterate the fact that this is NOT an easy way to make money. Running an AI agency will require hours and hours of dedication and work, and constantly rearranging your schedule to meet prospect and client needs. However, if you are looking for a new business to run, and have a knack for understanding business operations and are genuinely interested in the pracitcal applications of generative AI, then I say go for it. The time is ticking before AAA becomes the new dropshipping or SMMA, and I've a firm believer that those who set foot first and establish themselves in this field will come out top. And remember, while 100 thousand people may read this post, only 2 may actually take initiative and start.

How a founder built a B2B AI startup to serve with 65+ global brands (including Fortune500 companies)
reddit
LLM Vibe Score0
Human Vibe Score1
Royal_Rest8409This week

How a founder built a B2B AI startup to serve with 65+ global brands (including Fortune500 companies)

AI Palette is an AI-driven platform that helps food and beverage companies predict emerging product trends. I had the opportunity recently to sit down with the founder to get his advice on building an AI-first startup, which he'll be going through in this post. About AI Palette: Co-founders: >!2 (Somsubhra GanChoudhuri, Himanshu Upreti)!!100+!!$12.7M USD!!AI-powered predictive analytics for the CPG (Consumer Packaged Goods) industry!!Signed first paying customer in the first year!!65+ global brands, including Cargill, Diageo, Ajinomoto, Symrise, Mondelez, and L’Oréal, use AI Palette!!Every new product launched has secured a paying client within months!!Expanded into Beauty & Personal Care (BPC), onboarding one of India’s largest BPC companies within weeks!!Launched multiple new product lines in the last two years, creating a unified suite for brand innovation!Identify the pain points in your industry for ideas* When I was working in the flavour and fragrance industry, I noticed a major issue CPG companies faced: launching a product took at least one to two years. For instance, if a company decided today to launch a new juice, it wouldn’t hit the market until 2027. This long timeline made it difficult to stay relevant and on top of trends. Another big problem I noticed was that companies relied heavily on market research to determine what products to launch. While this might work for current consumer preferences, it was highly inefficient since the product wouldn’t actually reach the market for several years. By the time the product launched, the consumer trends had already shifted, making that research outdated. That’s where AI can play a crucial role. Instead of looking at what consumers like today, we realised that companies should use AI to predict what they will want next. This allows businesses to create products that are ahead of the curve. Right now, the failure rate for new product launches is alarmingly high, with 8 out of 10 products failing. By leveraging AI, companies can avoid wasting resources on products that won’t succeed, leading to better, more successful launches. Start by talking to as many industry experts as possible to identify the real problems When we first had the idea for AI Palette, it was just a hunch, a gut feeling—we had no idea whether people would actually pay for it. To validate the idea, we reached out to as many people as we could within the industry. Since our focus area was all about consumer insights, we spoke to professionals in the CPG sector, particularly those in the insights departments of CPG companies. Through these early conversations, we began to see a common pattern emerge and identified the exact problem we wanted to solve. Don’t tell people what you’re building—listen to their frustrations and challenges first. Going into these early customer conversations, our goal was to listen and understand their challenges without telling them what we were trying to build. This is crucial as it ensures that you can gather as much data about the problem to truly understand it and that you aren't biasing their answers by showing your solution. This process helped us in two key ways: First, it validated that there was a real problem in the industry through the number of people who spoke about experiencing the same problem. Second, it allowed us to understand the exact scale and depth of the problem—e.g., how much money companies were spending on consumer research, what kind of tools they were currently using, etc. Narrow down your focus to a small, actionable area to solve initially. Once we were certain that there was a clear problem worth solving, we didn’t try to tackle everything at once. As a small team of two people, we started by focusing on a specific area of the problem—something big enough to matter but small enough for us to handle. Then, we approached customers with a potential solution and asked them for feedback. We learnt that our solution seemed promising, but we wanted to validate it further. If customers are willing to pay you for the solution, it’s a strong validation signal for market demand. One of our early customer interviewees even asked us to deliver the solution, which we did manually at first. We used machine learning models to analyse the data and presented the results in a slide deck. They paid us for the work, which was a critical moment. It meant we had something with real potential, and we had customers willing to pay us before we had even built the full product. This was the key validation that we needed. By the time we were ready to build the product, we had already gathered crucial insights from our early customers. We understood the specific information they wanted and how they wanted the results to be presented. This input was invaluable in shaping the development of our final product. Building & Product Development Start with a simple concept/design to validate with customers before building When we realised the problem and solution, we began by designing the product, but not by jumping straight into coding. Instead, we created wireframes and user interfaces using tools like InVision and Figma. This allowed us to visually represent the product without the need for backend or frontend development at first. The goal was to showcase how the product would look and feel, helping potential customers understand its value before we even started building. We showed these designs to potential customers and asked for feedback. Would they want to buy this product? Would they pay for it? We didn’t dive into actual development until we found a customer willing to pay a significant amount for the solution. This approach helped us ensure we were on the right track and didn’t waste time or resources building something customers didn’t actually want. Deliver your solution using a manual consulting approach before developing an automated product Initially, we solved problems for customers in a more "consulting" manner, delivering insights manually. Recall how I mentioned that when one of our early customer interviewees asked us to deliver the solution, we initially did it manually by using machine learning models to analyse the data and presenting the results to them in a slide deck. This works for the initial stages of validating your solution, as you don't want to invest too much time into building a full-blown MVP before understanding the exact features and functionalities that your users want. However, after confirming that customers were willing to pay for what we provided, we moved forward with actual product development. This shift from a manual service to product development was key to scaling in a sustainable manner, as our building was guided by real-world feedback and insights rather than intuition. Let ongoing customer feedback drive iteration and the product roadmap Once we built the first version of the product, it was basic, solving only one problem. But as we worked closely with customers, they requested additional features and functionalities to make it more useful. As a result, we continued to evolve the product to handle more complex use cases, gradually developing new modules based on customer feedback. Product development is a continuous process. Our early customers pushed us to expand features and modules, from solving just 20% of their problems to tackling 50–60% of their needs. These demands shaped our product roadmap and guided the development of new features, ultimately resulting in a more complete solution. Revenue and user numbers are key metrics for assessing product-market fit. However, critical mass varies across industries Product-market fit (PMF) can often be gauged by looking at the size of your revenue and the number of customers you're serving. Once you've reached a certain critical mass of customers, you can usually tell that you're starting to hit product-market fit. However, this critical mass varies by industry and the type of customers you're targeting. For example, if you're building an app for a broad consumer market, you may need thousands of users. But for enterprise software, product-market fit may be reached with just a few dozen key customers. Compare customer engagement and retention with other available solutions on the market for product-market fit Revenue and the number of customers alone isn't always enough to determine if you're reaching product-market fit. The type of customer and the use case for your product also matter. The level of engagement with your product—how much time users are spending on the platform—is also an important metric to track. The more time they spend, the more likely it is that your product is meeting a crucial need. Another way to evaluate product-market fit is by assessing retention, i.e whether users are returning to your platform and relying on it consistently, as compared to other solutions available. That's another key indication that your solution is gaining traction in the market. Business Model & Monetisation Prioritise scalability Initially, we started with a consulting-type model where we tailor-made specific solutions for each customer use-case we encountered and delivered the CPG insights manually, but we soon realized that this wasn't scalable. The problem with consulting is that you need to do the same work repeatedly for every new project, which requires a large team to handle the workload. That is not how you sustain a high-growth startup. To solve this, we focused on building a product that would address the most common problems faced by our customers. Once built, this product could be sold to thousands of customers without significant overheads, making the business scalable. With this in mind, we decided on a SaaS (Software as a Service) business model. The benefit of SaaS is that once you create the software, you can sell it to many customers without adding extra overhead. This results in a business with higher margins, where the same product can serve many customers simultaneously, making it much more efficient than the consulting model. Adopt a predictable, simplistic business model for efficiency. Look to industry practices for guidance When it came to monetisation, we considered the needs of our CPG customers, who I knew from experience were already accustomed to paying annual subscriptions for sales databases and other software services. We decided to adopt the same model and charge our customers an annual upfront fee. This model worked well for our target market, aligning with industry standards and ensuring stable, recurring revenue. Moreover, our target CPG customers were already used to this business model and didn't have to choose from a huge variety of payment options, making closing sales a straightforward and efficient process. Marketing & Sales Educate the market to position yourself as a thought leader When we started, AI was not widely understood, especially in the CPG industry. We had to create awareness around both AI and its potential value. Our strategy focused on educating potential users and customers about AI, its relevance, and why they should invest in it. This education was crucial to the success of our marketing efforts. To establish credibility, we adopted a thought leadership approach. We wrote blogs on the importance of AI and how it could solve problems for CPG companies. We also participated in events and conferences to demonstrate our expertise in applying AI to the industry. This helped us build our brand and reputation as leaders in the AI space for CPG, and word-of-mouth spread as customers recognized us as the go-to company for AI solutions. It’s tempting for startups to offer products for free in the hopes of gaining early traction with customers, but this approach doesn't work in the long run. Free offerings don’t establish the value of your product, and customers may not take them seriously. You should always charge for pilots, even if the fee is minimal, to ensure that the customer is serious about potentially working with you, and that they are committed and engaged with the product. Pilots/POCs/Demos should aim to give a "flavour" of what you can deliver A paid pilot/POC trial also gives you the opportunity to provide a “flavour” of what your product can deliver, helping to build confidence and trust with the client. It allows customers to experience a detailed preview of what your product can do, which builds anticipation and desire for the full functionality. During this phase, ensure your product is built to give them a taste of the value you can provide, which sets the stage for a broader, more impactful adoption down the line. Fundraising & Financial Management Leverage PR to generate inbound interest from VCs When it comes to fundraising, our approach was fairly traditional—we reached out to VCs and used connections from existing investors to make introductions. However, looking back, one thing that really helped us build momentum during our fundraising process was getting featured in Tech in Asia. This wasn’t planned; it just so happened that Tech in Asia was doing a series on AI startups in Southeast Asia and they reached out to us for an article. During the interview, they asked if we were fundraising, and we mentioned that we were. As a result, several VCs we hadn’t yet contacted reached out to us. This inbound interest was incredibly valuable, and we found it far more effective than our outbound efforts. So, if you can, try to generate some PR attention—it can help create inbound interest from VCs, and that interest is typically much stronger and more promising than any outbound strategies because they've gone out of their way to reach out to you. Be well-prepared and deliberate about fundraising. Keep trying and don't lose heart When pitching to VCs, it’s crucial to be thoroughly prepared, as you typically only get one shot at making an impression. If you mess up, it’s unlikely they’ll give you a second chance. You need to have key metrics at your fingertips, especially if you're running a SaaS company. Be ready to answer questions like: What’s your retention rate? What are your projections for the year? How much will you close? What’s your average contract value? These numbers should be at the top of your mind. Additionally, fundraising should be treated as a structured process, not something you do on the side while juggling other tasks. When you start, create a clear plan: identify 20 VCs to reach out to each week. By planning ahead, you’ll maintain momentum and speed up the process. Fundraising can be exhausting and disheartening, especially when you face multiple rejections. Remember, you just need one investor to say yes to make it all worthwhile. When using funds, prioritise profitability and grow only when necessary. Don't rely on funding to survive. In the past, the common advice for startups was to raise money, burn through it quickly, and use it to boost revenue numbers, even if that meant operating at a loss. The idea was that profitability wasn’t the main focus, and the goal was to show rapid growth for the next funding round. However, times have changed, especially with the shift from “funding summer” to “funding winter.” My advice now is to aim for profitability as soon as possible and grow only when it's truly needed. For example, it’s tempting to hire a large team when you have substantial funds in the bank, but ask yourself: Do you really need 10 new hires, or could you get by with just four? Growing too quickly can lead to unnecessary expenses, so focus on reaching profitability as soon as possible, rather than just inflating your team or burn rate. The key takeaway is to spend your funds wisely and only when absolutely necessary to reach profitability. You want to avoid becoming dependent on future VC investments to keep your company afloat. Instead, prioritize reaching break-even as quickly as you can, so you're not reliant on external funding to survive in the long run. Team-Building & Leadership Look for complementary skill sets in co-founders When choosing a co-founder, it’s important to find someone with a complementary skill set, not just someone you’re close to. For example, I come from a business and commercial background, so I needed someone with technical expertise. That’s when I found my co-founder, Himanshu, who had experience in machine learning and AI. He was a great match because his technical knowledge complemented my business skills, and together we formed a strong team. It might seem natural to choose your best friend as your co-founder, but this can often lead to conflict. Chances are, you and your best friend share similar interests, skills, and backgrounds, which doesn’t bring diversity to the table. If both of you come from the same industry or have the same strengths, you may end up butting heads on how things should be done. Having diverse skill sets helps avoid this and fosters a more collaborative working relationship. Himanshu (left) and Somsubhra (right) co-founded AI Palette in 2018 Define roles clearly to prevent co-founder conflict To avoid conflict, it’s essential that your roles as co-founders are clearly defined from the beginning. If your co-founder and you have distinct responsibilities, there is no room for overlap or disagreement. This ensures that both of you can work without stepping on each other's toes, and there’s mutual respect for each other’s expertise. This is another reason as to why it helps to have a co-founder with a complementary skillset to yours. Not only is having similar industry backgrounds and skillsets not particularly useful when building out your startup, it's also more likely to lead to conflicts since you both have similar subject expertise. On the other hand, if your co-founder is an expert in something that you're not, you're less likely to argue with them about their decisions regarding that aspect of the business and vice versa when it comes to your decisions. Look for employees who are driven by your mission, not salary For early-stage startups, the first hires are crucial. These employees need to be highly motivated and excited about the mission. Since the salary will likely be low and the work demanding, they must be driven by something beyond just the paycheck. The right employees are the swash-buckling pirates and romantics, i.e those who are genuinely passionate about the startup’s vision and want to be part of something impactful beyond material gains. When employees are motivated by the mission, they are more likely to stick around and help take the startup to greater heights. A litmus test for hiring: Would you be excited to work with them on a Sunday? One of the most important rounds in the hiring process is the culture fit round. This is where you assess whether a candidate shares the same values as you and your team. A key question to ask yourself is: "Would I be excited to work with this person on a Sunday?" If there’s any doubt about your answer, it’s likely not a good fit. The idea is that you want employees who align with the company's culture and values and who you would enjoy collaborating with even outside of regular work hours. How we structure the team at AI Palette We have three broad functions in our organization. The first two are the big ones: Technical Team – This is the core of our product and technology. This team is responsible for product development and incorporating customer feedback into improving the technology Commercial Team – This includes sales, marketing, customer service, account managers, and so on, handling everything related to business growth and customer relations. General and Administrative Team – This smaller team supports functions like finance, HR, and administration. As with almost all businesses, we have teams that address the two core tasks of building (technical team) and selling (commercial team), but given the size we're at now, having the administrative team helps smoothen operations. Set broad goals but let your teams decide on execution What I've done is recruit highly skilled people who don't need me to micromanage them on a day-to-day basis. They're experts in their roles, and as Steve Jobs said, when you hire the right person, you don't have to tell them what to do—they understand the purpose and tell you what to do. So, my job as the CEO is to set the broader goals for them, review the plans they have to achieve those goals, and periodically check in on progress. For example, if our broad goal is to meet a certain revenue target, I break it down across teams: For the sales team, I’ll look at how they plan to hit that target—how many customers they need to sell to, how many salespeople they need, and what tactics and strategies they plan to use. For the technical team, I’ll evaluate our product offerings—whether they think we need to build new products to attract more customers, and whether they think it's scalable for the number of customers we plan to serve. This way, the entire organization's tasks are cascaded in alignment with our overarching goals, with me setting the direction and leaving the details of execution to the skilled team members that I hire.

Thoughts on FasterCapital VC?
reddit
LLM Vibe Score0
Human Vibe Score1
Momof3rascalsThis week

Thoughts on FasterCapital VC?

TLDR: I pitched to FasterCapital and got an "offer". Trying to figure out if this is a legitimate opportunity or a waste of my time. I'm not familiar with VCs and hadn't considered actually getting an investor on board with my plan. I sent my pitch deck to FasterCapital, honestly not expecting a response. It was my first pitch deck and a complete long shot. I ended up getting a response, they asked me for clarification on a few things. Than I get this email about what they are offering here's the main part We specialize in warm introductions to angel investors, VCs, and HNWIs, ensuring you connect with the right investors through personalized recommendations—not ineffective mass email campaigns. Cold outreach, such as LinkedIn messages, rarely succeeds, as investors receive hundreds of such requests and disregard them. To raise money, you need a strong partner like ourselves who has a wide network and direct connection with those angel investors built throughout 10 years. You can see some of the reviews of the startups we have helped attached and reviews on independent sites. Based on our experience and the matching that we have done already on our own AI system and for raising $55M-$65M in 5 years, a suitable package in your case is $50k - $64k and the chances of raising money is %87 - %93, but you were accepted in the exceptional rising star offer, where you pay half of that amount as an advance which is $25k-$32k and the other half ONLY when we raise you the first $1M. Other startups in our standard offers pays double that amount. First, I don't understand all of it, except for the "where you pay half of that amount as an advance which is $25k-$32k" I am no where near being able to come close to that, mostly because if I had that much, I wouldn't apply to a VC. I responded and politely told her that was not something our company could financially do right now. Than this email Thanks for your kind reply. We are flexible on paying this amount into monthly installments. We offer money back guarantee if we didn't raise the capital in 6 months from signing. This is how much we are confident with our approach of warm introductions. Raising the first amount of money and getting the first investor onboard is the most challenging part. You need time to build trust and network of investors. You need to have a good partner to help you. Please note that the down payment is for raising at least $55M over five years as we are interested in long-term partnership to raise multiple rounds because we make money through the commission. Companies take only commission or success fee are doing cold introductions and mass emails and this approach has low chances of success when it comes to raising capital. It is about the chances of success. You can talk to these companies and ask them about their success rate. Mass emails campaign has zero chances of success.  We have helped more than 742 startups raise more than $2.2B. Our network includes 155,000 angel investors and more than 50K funding institutions (VCs, HNI, family offices..etc). We have been in this business for more than 10 years. We have more than 92% success rate in our program so far. So if you are familiar with VC, Is this an actual opportunity. I have a tendency to jump or dive head first into things. As much as I want to get excited because this would be the jumpstart to most of my goals and ambitions. I'm not familiar with VCs. I have bootstrapped all my ventures so far.

Started a content marketing agency 6 years ago - $0 to $5,974,324 (2023 update)
reddit
LLM Vibe Score0
Human Vibe Score1
mr_t_forhireThis week

Started a content marketing agency 6 years ago - $0 to $5,974,324 (2023 update)

Hey friends, My name is Tyler and for the past 6 years, I’ve been documenting my experience building a content marketing agency called Optimist. Year 1 - 0 to $500k ARR Year 2 - $500k to $1MM ARR Year 3 - $1MM ARR to $1.5MM(ish) ARR Year 4 - $3,333,686 Revenue Year 5 - $4,539,659 Revenue How Optimist Works First, an overview/recap of the Optimist business model: We operate as a “collective” of full time/professional freelancers Everyone aside from me is a contractor Entirely remote/distributed team Each freelancer earns $65-85/hour Clients pay us a flat monthly fee for full-service content marketing (research, strategy, writing, editing, design/photography, reporting and analytics, targeted linkbuilding, and more) We recently introduced hourly engagements for clients who fit our model but have some existing in-house support Packages range in price from $10-20k/mo We offer profit share to everyone on our core team as a way to give everyone ownership in the company In 2022, we posted $1,434,665 in revenue. It was our highest revenue year to date and brings our lifetime total to $5,974,324. Here’s our monthly revenue from January 2017 to December of 2022. But, like every year, it was a mix of ups and downs. Here’s my dispatch for 2023. — Running a business is like spilling a drink. It starts as a small and simple thing. But, if you don’t clean it up, the spill will spread and grow — taking up more space, seeping into every crack. There’s always something you could be doing. Marketing you could be working on. Pitches you could be making. Networking you could be doing. Client work you could help with. It can be all-consuming. And it will be — if you don’t clean up the spill. I realized this year that I had no containment for the spill that I created. Running an agency was spilling over into nearly every moment of my life. When I wasn’t working, I was thinking about work. When I wasn’t thinking about work, I was dreaming about it. Over the years, I’ve shared about a lot of my personal feelings and experience as an entrepreneur. And I also discussed my reckoning with the limitations of running the business we’ve built. My acceptance that it was an airplane but not a rocket. And my plan to try to compartmentalize the agency to make room in my life for other things — new business ideas, new revenue streams, and maybe some non-income-producing activity. 🤷 What I found in 2022 was that the business wasn’t quite ready for me to make that move. It was still sucking up too much of my time and attention. There were still too many gaps to fill and I was the one who was often filling them. So what do you do? Ultimately you have two choices on the table anytime you run a business and it’s not going the way you want it: Walk away Turn the ship — slowly For a huge number of reasons (personal, professional, financial, etc), walking away from Optimist was not really even an option or the right move for me. But it did feel like things needed to change. I needed to keep turning the ship to get it to the place where it fit into my life — instead of my life fitting around the business. This means 2022 was a year of transition for the agency. (Again?) Refocusing on Profit Some money is better than no money. Right? Oddly, this was one of the questions I found myself asking in 2022. Over the years, we’ve been fortunate to have many clients who have stuck with us a long time. In some cases, we’ve had clients work with us for 2, 3, or even 4 years. (That’s over half of our existence!) But, things have gotten more expensive — we’ve all felt it. We’ve had to increase pay to remain competitive for top talent. Software costs have gone up. It’s eaten into our margin. Because of our increasing costs and evolving scope, many of our best, most loyal clients were our least profitable. In fact, many were barely profitable — if at all. We’ve tried to combat that by increasing rates on new, incoming clients to reflect our new costs and try to make up for shrinking margin on long-term clients. But we didn’t have a good strategy in place for updating pricing for current clients. And it bit us in the ass. Subsidizing lower-profit, long-term clients with new, higher-margin clients ultimately didn’t work out. Our margins continued to dwindle and some months we were barely breaking even while posting six-figures of monthly revenue. 2022 was our highest revenue year but one of our least profitable. It only left one option. We had to raise rates on some of our long-term clients. But, of course, raising rates on a great, long-term client can be delicate. You’ve built a relationship with these people over the years and you’re setting yourself up for an ultimatum — are you more valuable to the client or is the client more valuable to you? Who will blink first? We offered all of these clients the opportunity to move to updated pricing. Unfortunately, some of them weren’t on board. Again, we had 2 options: Keep them at a low/no profit rate Let them churn It seems intuitive that having a low-profit client is better than having no client. But we’ve learned an important lesson many times over the years. Our business doesn’t scale infinitely and we can only handle so many clients at a time. That means that low-profit clients are actually costing us money in some cases. Say our average client generates $2,500 per month in profit — $30,000 per year. If one of our clients is only generating $500/mo in profit, working with them means missing out on bringing on a more profitable client (assuming our team is currently at capacity). Instead of $30,000/year, we’re only making $6,000. Keeping that client costs us $24,000. That’s called opportunity cost. So it’s clear: We had to let these clients churn. We decided to churn about 25% of our existing clients. On paper, the math made sense. And we had a pretty consistent flow of new opportunities coming our way. At the time, it felt like a no-brainer decision. And I felt confident that we could quickly replace these low-profit clients with higher-margin ones. I was wrong. Eating Shit Right after we initiated proactively churning some of our clients, other clients — ones we planned to keep — gave us notice that they were planning to end the engagement. Ouch. Fuck. We went from a 25% planned drop in revenue to a nearly 40% cliff staring us right in the face. Then things got even worse. Around Q3 of this year, talk of recession and layoffs really started to intensify. We work primarily with tech companies and startups. And these were the areas most heavily impacted by the economic news. Venture funding was drying up. Our leads started to slow down. This put us in a tough position. Looking back now, I think it’s clear that I made the wrong decision. We went about this process in the wrong way. The reality sinks in when you consider the imbalance between losing a client and gaining a client. It takes 30 days for someone to fire us. It’s a light switch. But it could take 1-3 months to qualify, close, and onboard a new client. We have lots of upfront work, research, and planning that goes into the process. We have to learn a new brand voice, tone, and style. It’s a marathon. So, for every client we “trade”, there’s a lapse in revenue and work. This means that, in retrospect, I would probably have made this transition using some kind of staggered schedule rather than a cut-and-dry approach. We could have gradually off-boarded clients when we had more definitive work to replace them. I was too confident. But that’s a lesson I had to learn the hard way. Rebuilding & Resetting Most of the voluntary and involuntary churn happened toward the end of 2022. So we’re still dealing with the fall out. Right now, it feels like a period of rebuilding. We didn’t quite lose 50% of our revenue, but we definitely saw a big hit heading into 2023. To be transparent: It sucks. It feels like a gigantic mistake that I made which set us back significantly from our previous high point. I acted rashly and it cost us a lot of money — at least on the surface. But I remind myself of the situation we were in previously. Nearly twice the revenue but struggling to maintain profitability. Would it have been better to try to slowly fix that situation and battle through months of loss or barely-break-even profits? Or was ripping off the bandaid the right move after all? I’m an optimist. (Heh, heh) Plus, I know that spiraling over past decisions won’t change them or help me move forward. So I’m choosing to look at this as an opportunity — to rebuild, reset, and refocus the company. I get to take all of the tough lessons I’ve learned over the last 6 years and apply them to build the company in a way that better aligns with our new and current goals. It’s not quite a fresh, clean start, but by parting ways with some of our oldest clients, we’ve eliminated some of the “debt” that’s accumulated over the years. We get a chance to fully realize the new positioning that we rolled out last year. Many of those long-term clients who churned had a scope of work or engagement structure that didn’t fit with our new positioning and focus. So, by losing them, we’re able to completely close up shop on the SOWs that no longer align with the future version of Optimist. Our smaller roster of clients is a better fit for that future. My job is to protect that positioning by ensuring that while we’re rebuilding our new roster of clients we don’t get desperate. We maintain the qualifications we set out for future clients and only take on work that fits. How’s that for seeing the upside? Some other upside from the situation is that we got an opportunity to ask for candid feedback from clients who were leaving. We asked for insight about their decision, what factors they considered, how they perceived us, and the value of our work. Some of the reasons clients left were obvious and possibly unavoidable. Things like budget cuts, insourcing, and uncertainty about the economy all played at least some part of these decisions. But, reading between the lines, where was one key insight that really struck me. It’s one of those, “oh, yeah — duh — I already knew that,” things that can be difficult to learn and easy to forget…. We’re in the Relationship Business (Plan Accordingly) For all of our focus on things like rankings, keywords, content, conversions, and a buffet of relevant metrics, it can be easy to lose the forest for the trees. Yes, the work itself matters. Yes, the outcomes — the metrics — matter. But sometimes the relationship matters more. When you’re running an agency, you can live or die by someone just liking you. Admittedly, this feels totally unfair. It opens up all kinds of dilemmas, frustration, opportunity for bias and prejudice, and other general messiness. But it’s the real world. If a client doesn’t enjoy working with us — even if for purely personal reasons — they could easily have the power to end of engagement, regardless of how well we did our actual job. We found some evidence of this in the offboarding conversations we had with clients. In some cases, we had clients who we had driven triple- and quadruple-digital growth. Our work was clearly moving the needle and generating positive ROI and we had the data to prove it. But they decided to “take things in another direction” regardless. And when we asked about why they made the decision, it was clear that it was more about the working relationship than anything we could have improved about the service itself. The inverse is also often true. Our best clients have lasting relationships with our team. The work is important — and they want results. But even if things aren’t quite going according to plan, they’re patient and quick to forgive. Those relationships feel solid — unshakeable. Many of these folks move onto new roles or new companies and quickly look for an opportunity to work with us again. On both sides, relationships are often more important than the work itself. We’ve already established that we’re not building a business that will scale in a massive way. Optimist will always be a small, boutique service firm. We don’t need 100 new leads per month We need a small, steady roster of clients who are a great fit for the work we do and the value we create. We want them to stick around. We want to be their long-term partner. I’m not built for churn-and-burn agency life. And neither is the business. When I look at things through this lens, I realize how much I can cut from our overall business strategy. We don’t need an ultra-sophisticated, multi-channel marketing strategy. We just need strong relationships — enough of them to make our business work. There are a few key things we can take away from this as a matter of business strategy: Put most of our effort into building and strengthening relationships with our existing clients Be intentional about establishing a strong relationship with new clients as part of onboarding Focus on relationships as the main driver of future business development Embracing Reality: Theory vs Practice Okay, so with the big learnings out the way, I want to pivot into another key lesson from 2022. It’s the importance of understanding theory vs practice — specifically when it comes to thinking about time, work, and life. It all started when I was considering how to best structure my days and weeks around running Optimist, my other ventures, and my life goals outside of work. Over the years, I’ve dabbled in many different ways to block time and find focus — to compartmentalize all of the things that are spinning and need my attention. As I mapped this out, I realized that I often tried to spread myself too thin throughout the week. Not just that I was trying to do too much but that I was spreading that work into too many small chunks rather than carving out time for focus. In theory, 5 hours is 5 hours. If you have 5 hours of work to get done, you just fit into your schedule whenever you have an open time slot. In reality, a single 5-hour block of work is 10x more productive and satisfying than 10, 30-minute blocks of work spread out across the week. In part, this is because of context switching. Turning your focus from one thing to another thing takes time. Achieving flow and focus takes time. And the more you jump from one project to another, the more time you “lose” to switching. This is insightful for me both in the context of work and planning my day, but also thinking about my life outside of Optimist. One of my personal goals is to put a finite limit on my work time and give myself more freedom. I can structure that in many different ways. Is it better to work 5 days a week but log off 1 hour early each day? Or should I try to fit more hours into each workday so I can take a full day off? Of course, it’s the latter. Both because of the cost of context switching and spreading work into more, smaller chunks — but also because of the remainder that I end up with when I’m done working. A single extra hour in my day probably means nothing. Maybe I can binge-watch one more episode of a new show or do a few extra chores around the house. But it doesn’t significantly improve my life or help me find greater balance. Most things I want to do outside of work can’t fit into a single extra hour. A full day off from work unlocks many more options. I can take the day to go hiking or biking. I can spend the day with my wife, planning or playing a game. Or I can push it up against the weekend and take a 3-day trip. It gives me more of the freedom and balance that I ultimately want. So this has become a guiding principle for how I structure my schedule. I want to: Minimize context switching Maximize focused time for work and for non-work The idea of embracing reality also bleeds into some of the shifts in business strategy that I mentioned above. In theory, any time spent on marketing will have a positive impact on the company. In reality, focusing more on relationships than blasting tweets into the ether is much more likely to drive the kind of growth and stability that we’re seeking. As I think about 2023, I think this is a recurring theme. It manifests in many ways. Companies are making budget cuts and tough decisions about focus and strategy. Most of us are looking for ways to rein in the excess and have greater impact with a bit less time and money. We can’t do everything. We can’t even do most things. So our #1 priority should be to understand the reality of our time and our effort to make the most of every moment (in both work and leisure). That means thinking deeply about our strengths and our limitations. Being practical, even if it feels like sacrifice. Update on Other Businesses Finally, I want to close up by sharing a bit about my ventures outside of Optimist. I shared last year how I planned to shift some of my (finite) time and attention to new ventures and opportunities. And, while I didn’t get to devote as much as I hoped to these new pursuits, they weren’t totally in vain. I made progress across the board on all of the items I laid out in my post. Here’s what happened: Juice: The first Optimist spin-out agency At the end of 2021, we launched our first new service business based on demand from Optimist clients. Focused entirely on building links for SEO, we called the agency Juice. Overall, we made strong progress toward turning this into a legitimate standalone business in 2022. Relying mostly on existing Optimist clients and a few word-of-mouth opportunities (no other marketing), we built a team and set up a decent workflow and operations. There’s still many kinks and challenges that we’re working through on this front. All told, Juice posted almost $100,000 in revenue in our first full year. Monetizing the community I started 2022 with a focus on figuring out how to monetize our free community, Top of the Funnel. Originally, my plan was to sell sponsorships as the main revenue driver. And that option is still on the table. But, this year, I pivoted to selling paid content and subscriptions. We launched a paid tier for content and SEO entrepreneurs where I share more of my lessons, workflows, and ideas for building and running a freelance or agency business. It’s gained some initial traction — we reached \~$1,000 MRR from paid subscriptions. In total, our community revenue for 2022 was about $2,500. In 2023, I’m hoping to turn this into a $30,000 - $50,000 revenue opportunity. Right now, we’re on track for \~$15,000. Agency partnerships and referrals In 2022, we also got more serious about referring leads to other agencies. Any opportunity that was not a fit for Optimist or we didn’t have capacity to take on, we’d try to connect with another partner. Transparently, we struggled to operationalize this as effectively as I would have liked. In part, this was driven by my lack of focus here. With the other challenges throughout the year, I wasn’t able to dedicate as much time as I’d like to setting goals and putting workflows into place. But it wasn’t a total bust. We referred out several dozen potential clients to partner agencies. Of those, a handful ended up converting into sales — and referral commission. In total, we generated about $10,000 in revenue from referrals. I still see this as a huge opportunity for us to unlock in 2023. Affiliate websites Lastly, I mentioned spending some time on my new and existing affiliate sites as another big business opportunity in 2022. This ultimately fell to the bottom of my list and didn’t get nearly the attention I wanted. But I did get a chance to spend a few weeks throughout the year building this income stream. For 2022, I generated just under $2,000 in revenue from affiliate content. My wife has graciously agreed to dedicate some of her time and talent to these projects. So, for 2023, I think this will become a bit of a family venture. I’m hoping to build a solid and consistent workflow, expand the team, and develop a more solid business strategy. Postscript — AI, SEO, OMG As I’m writing this, much of my world is in upheaval. If you’re not in this space (and/or have possibly been living under a rock), the release of ChatGPT in late 2022 has sparked an arms race between Google, Bing, OpenAI, and many other players. The short overview: AI is likely to fundamentally change the way internet search works. This has huge impact on almost all of the work that I do and the businesses that I run. Much of our focus is on SEO and understanding the current Google algorithm, how to generate traffic for clients, and how to drive traffic to our sites and projects. That may all change — very rapidly. This means we’re standing at a very interesting point in time. On the one hand, it’s scary as hell. There’s a non-zero chance that this will fundamentally shift — possibly upturn — our core business model at Optimist. It could dramatically change how we work and/or reduce demand for our core services. No bueno. But it’s also an opportunity (there’s the optimist in me, again). I certainly see a world where we can become leaders in this new frontier. We can pivot, adjust, and capitalize on a now-unknown version of SEO that’s focused on understanding and optimizing for AI-as-search. With that, we may also be able to help others — say, those in our community? — also navigate this tumultuous time. See? It’s an opportunity. I wish I had the answers right now. But, it’s still a time of uncertainty. I just know that there’s a lot of change happening and I want to be in front of it rather than trying to play catch up. Wish me luck. — Alright friends — that's my update for 2023! I’ve always appreciated sharing these updates with the Reddit community, getting feedback, being asked tough questions, and even battling it out with some of my haters (hey!! 👋) As usual, I’m going to pop in throughout the next few days to respond to comments or answer questions. Feel free to share thoughts, ideas, and brutal takedowns in the comments. If you're interested in following the Optimist journey and the other projects I'm working on in 2023, you can follow me on Twitter. Cheers, Tyler P.S. - If you're running or launching a freelance or agency business and looking for help figuring it out, please DM me. Our subscription community, Middle of the Funnel, was created to provide feedback, lessons, and resources for other entrepreneurs in this space.

Started a content marketing agency 8 years ago - $0 to $7,863,052 (2025 update)
reddit
LLM Vibe Score0
Human Vibe Score0.882
mr_t_forhireThis week

Started a content marketing agency 8 years ago - $0 to $7,863,052 (2025 update)

Hey friends, My name is Tyler and for the past 8 years, I’ve been documenting my experience building a content marketing agency called Optimist. Year 1 — 0 to $500k ARR Year 2 — $500k to $1MM ARR Year 3 — $1MM ARR to $1.5MM(ish) ARR Year 4 — $3,333,686 Revenue Year 5 — $4,539,659 Revenue Year 6 — $5,974,324 Revenue Year 7 - $6,815,503 Revenue (Edit: Seems like links are banned now. You can check my post history for all of my previous updates with lessons and learnings.) How Optimist Works First, an overview/recap of the Optimist business model: We operate as a “collective” of full time/professional freelancers Everyone aside from me is a contractor Entirely remote/distributed team We pay freelancers a flat fee for most work, working out to roughly $65-100/hour. Clients pay us a flat monthly fee for full-service content marketing (research, strategy, writing, editing, design/photography, reporting and analytics, targeted linkbuilding, and more)\ Packages range in price from \~$10-20k/mo \This is something we are revisiting now* The Financials In 2024, we posted $1,032,035.34 in revenue. This brings our lifetime revenue to $7,863,052. Here’s our monthly revenue from January 2017 to December of 2024. (Edit: Seems like I'm not allowed to link to the chart.) The good news: Revenue is up 23% YoY. EBITDA in Q4 trending up 1-2 points. We hosted our first retreat in 4 years, going to Ireland with about half the team. The bad news: Our revenue is still historically low. At $1MM for the year, we’re down about 33% from our previous years over $1.5MM. Revenue has been rocky. It doesn’t feel like we’ve really “recovered” from the bumps last year. The trend doesn’t really look great. Even though, anecdotally, it feels like we are moving in a good direction. EBITDA is still hovering at around 7%. Would love to get that closer to 20%. (For those who may ask: I’m calculating EBITDA after paying taxes and W2 portion of my income.) — Almost every year, my update starts the same way: This has been a year of growth and change. Both for my business—and me personally. 2024 was no different. I guess that tells you something about entrepreneurship. It’s a lot more like sailing a ship than driving a car. You’re constantly adapting, tides are shifting, and any blip of calm is usually just a moment before the next storm. As with past years, there’s a lot to unpack from the last 12 months. Here we go again. Everything is Burning In the last 2 years, everything has turned upside down in the world of content and SEO. Back in 2020, we made a big decision to re-position the agency. (See post history) We decided to narrow our focus to our most successful, profitable, and consistent segment of clients and re-work our entire operation to focus on serving them. We defined our ICP as: \~Series A ($10mm+ funding) with 6-12 months runway to scale organic as a channel Product-led company with “simple” sales cycle involving fewer stakeholders Demonstrable opportunity to use SEO to drive business growth Our services: Content focused on growing organic search (SEO) Full-service engagements that included research, planning, writing, design, reporting And our engagement structure: Engaged directly with an executive; ownership over strategy and day-to-day execution 1-2 points of contact or stakeholders Strategic partner that drives business growth (not a service vendor who makes content) Most importantly, we decided that we were no longer going to offer a broader range of content that we used to sell. That included everything from thought leadership content to case studies and ebooks. We doubled-down on “SEO content” for product-led SaaS companies. And this worked phenomenally for us. We started bringing on more clients than ever. We developed a lot of internal system and processes that helped us scale and take on more work than we’ve ever had and drive great outcomes for our ideal clients. But in 2023 and 2024, things started going awry. One big change, of course, was the rise of AI. Many companies and executives (and writers) feel that AI can write content just as well as an agency like ours. That made it a lot harder to sell a $10,000 per month engagement when they feel like the bulk of the work could be “done for free.” (Lots of thoughts on this if you want my opinions.) But it wasn’t just that. Google also started tinkering with their algorithm, introducing new features like AI Overviews, and generally changing the rules of the game. This created 3 big shifts in our world: The perceived value of content (especially “SEO content”) dropped dramatically in many people’s minds because of AI’s writing capabilities SEO became less predictable as a source of traffic and revenue It’s harder than ever for startups and smaller companies to rank for valuable keywords (let alone generate any meaningful traffic or revenue from them) The effect? The middle of the content market has hollowed out. People—like us—providing good, human-crafted content aimed on driving SEO growth saw a dramatic decline in demand. We felt it all year. Fewer and fewer leads. The leads we did see usually scoffed at our prices. They were indexing us against the cost of content mills and mass-produced AI articles. It was a time of soul-searching and looking for a way forward. I spent the first half of the year convinced that the only way to survive was to run toward the fire. We have to build our own AI workflows. We have to cut our rates internally. We have to get faster and cheaper to stay competitive with the agencies offering the same number of deliverables for a fraction of our rates. It’s the only way forward. But then I asked myself a question… Is this the game I actually want to play? As an entrepreneur, do I want to run a business where I’m competing mostly on price and efficiency rather than quality and value? Do I want to hop into a race toward cheaper and cheaper content? Do I want to help people chase a dwindling amount of organic traffic that’s shrinking in value? No. That’s not the game I want to play. That’s not a business I want to run. I don’t want to be in the content mill business. So I decided to turn the wheel—again. Repositioning Part II: Electric Boogaloo What do you do when the whole world shifts around you and the things that used to work aren’t working anymore? You pivot. You re-position the business and move in another direction. So that’s what we decided to do. Again. There was only one problem: I honestly wasn’t sure what opportunities existed in the content marketing industry outside of what we were already doing. We lived in a little echo chamber of startups and SEO. It felt like the whole market was on fire and I had fight through the smoke to find an escape hatch. So I started making calls. Good ol’ fashioned market research. I reached out to a few dozen marketing and content leaders at a bunch of different companies. I got on the phone and just asked lots of questions about their content programs, their goals, and their pain points. I wanted to understand what was happening in the market and how we could be valuable. And, luckily, this process really paid off. I learned a lot about the fragmentation happening across content and how views were shifting. I noticed key trends and how our old target market really wasn’t buying what we were selling. Startups and small companies are no longer willing to invest in an agency like ours. If they were doing content and SEO at all, they were focused entirely on using AI to scale output and minimize costs. VC money is still scarce and venture-backed companies are more focused on profitability than pure growth and raising another round. Larger companies (\~500+ employees) are doing more content than ever and drowning in content production. They want to focus on strategy but can barely tread water keeping up with content requests from sales, demand gen, the CEO, and everyone else. Many of the companies still investing in content are looking at channels and formats outside of SEO. Things like thought leadership, data reports, interview-driven content, and more. They see it as a way to stand out from the crowd of “bland SEO content.” Content needs are constantly in flux. They range from data reports and blog posts to product one-pagers. The idea of a fixed-scope retainer is a total mismatch for the needs of most companies. All of this led to the logical conclusion: We were talking to the wrong people about the wrong things\.\ Many companies came to one of two logical conclusions: SEO is a risky bet, so it’s gotta be a moonshot—super-low cost with a possibility for a big upside (i.e., use AI to crank out lots of content. If it works, great. If it doesn’t, then at least we aren’t out much money.) SEO is a risky bet, so we should diversify into other strategies and channels to drive growth (i.e., shift our budget from SEO and keyword-focused content to video, podcasts, thought leadership, social, etc) Unless we were going to lean into AI and dramatically cut our costs and rates, our old buyers weren’t interested. And the segment of the market that needs our help most are looking primarily for production support across a big range of content types. They’re not looking for a team to run a full-blown program focused entirely on SEO. So we had to go back to the drawing board. I’ve written before about our basic approach to repositioning the business. But, ultimately it comes down to identifying our unique strengths as a team and then connecting them to needs in the market. After reviewing the insights from my discussions and taking another hard look at our business and our strengths, I decided on a new direction: Move upmarket: Serve mid-size to enterprise businesses with \~500-5,000 employees instead of startups Focus on content that supports a broader range of business goals instead of solely on SEO and organic growth (e.g., sales, demand gen, brand, etc) Shift back to our broader playbook of content deliverables, including thought leadership, data studies, and more Focus on content execution and production to support an internally-directed content strategy across multiple functions In a way, it’s sort of a reverse-niche move. Rather than zooming in specifically on driving organic growth for startups, we want to be more of an end-to-end content production partner that solves issues of execution and operations for all kinds of content teams. It’s early days, but the response here has been promising. We’ve seen an uptick in leads through Q4. And more companies in our pipeline fit the new ICP. They’re bigger, often have more budget. (But they move more slowly). We should know by the end of the quarter if this maneuver is truly paying off. Hopefully, this will work out. Hopefully our research and strategy are right and we’ll find a soft landing serving a different type of client. If it doesn’t? Then it will be time to make some harder decisions. As I already mentioned, I’m not interested in the race to the bottom of AI content. And if that’s the only game left in town, then it might be time to think hard about a much bigger change. — To be done: Build new content playbooks for expanded deliverables Build new showcase page for expanded deliverables Retooling the Operation It’s easy to say we’re doing something new. It’s a lot harder to actually do it—and do it well. Beyond just changing our positioning, we have to do open-heart surgery on the entire content operation behind the scenes. We need to create new systems that work for a broader range of content types, formats, and goals. Here’s the first rub: All of our workflows are tooled specifically for SEO-focused content. Every template, worksheet, and process that we’ve built and scaled in the last 5 years assumes that the primary goal of every piece of content is SEO. Even something as simple as requiring a target keyword is a blocker in a world where we’re not entirely focused on SEO. This is relatively easy to fix, but it requires several key changes: Update content calendars to make keywords optional Update workflows to determine whether we need an optimization report for each deliverable Next, we need to break down the deliverables into parts rather than a single line item. In our old system, we would plan content as a single row in a Content Calendar spreadsheet. It was a really wide sheet with lots of fields where we’d define the dimensions of each individual article. This was very efficient and simple to follow. But every article had the same overall scope when it came to the workflow. In Asana (our project management tool), all of the steps in the creation were strung together in a single task. We would create a few basic templates for each client, and then each piece would flow through the same steps: Briefing Writing Editing Design etc. If we had anything that didn’t fit into the “standard” workflow, we’d just tag it in the calendar with an unofficial notation \[USING BRACKETS\]. It worked. But it wasn’t ideal. Now we need the steps to be more modular. Imagine, for example, a client asks us to create a mix of deliverables: 1 article with writing + design 1 content brief 1 long-form ebook with an interview + writing + design Each of these would require its own steps and its own workflow. We need to break down the work to accommodate for a wider variety of workflows and variables. This means we need to update the fields and structure of our calendar to accommodate for the new dimensions—while also keeping the planning process simple and manageable. This leads to the next challenge: The number of “products” that we’re offering could be almost infinite. Just looking at the example scope above, you can mix and match all of these different building blocks to create a huge variety of different types of work, each requiring its own workflow. This is part of the reason we pivoted away from this model to focus on a productized, SEO-focused content service back in 2020. Take something as simple as a case study. On the surface, it seems like one deliverable that can be easily scoped and priced, right? Well, unpack what goes into a case study: Is there already source material from the customer or do we need to conduct an interview? How long is it? Is it a short overview case study or a long-form narrative? Does it need images and graphics? How many? Each of these variables opens up 2-3 possibilities. And when you combine them, we end up with something like 10 possible permutations for this single type of deliverable. It gets a bit messy. But not only do we have to figure out how to scope and price all for all of these variables, we also have to figure out how to account for these variables in the execution. We have to specify—for every deliverable—what type it is, how long, which steps are involved and not involved, the timeline for delivery, and all of the other factors. We’re approaching infinite complexity, here. We have to figure out a system that allows for a high level of flexibility to serve the diverse needs of our clients but is also productized enough that we can build workflows, process, and templates to deliver the work. I’ve spent the last few months designing that system. Failed Attempt #1: Ultra-Productization In my first pass, I tried to make it as straight forward as possible. Just sit down, make a list of all of the possible deliverables we could provide and then assign them specific scopes and services. Want a case study? Okay that’ll include an interview, up to 2,000 words of content, and 5 custom graphics. It costs $X. But this solution quickly fell apart when we started testing it against real-world scenarios. What if the client provided the brief instead of us creating one? What if they didn’t want graphics? What if this particular case study really needs to be 3,000 words but all of the others should be 2,000? In order for this system to work, we’d need to individual scope and price all of these permutations of each productized service. Then we’d need to somehow keep track of all of these and make sure that we accurately scope, price, and deliver them across dozens of clients. It’s sort of like a restaurant handling food allergies by creating separate versions of every single dish to account for every individual type of allergy. Most restaurants have figured out that it makes way more sense to have a “standard” and an “allergy-free” version. Then you only need 2 options to cover 100% of the cases. Onto the next option. Failed Attempt #2: Deliverable-Agnostic Services Next, I sat down with my head of Ops, Katy, to try to map it out. We took a big step back and said: Why does the deliverable itself even matter? At the end of the day, what we’re selling is just a few types of work (research, writing, editing, design, etc) that can be packaged up in an infinite number of ways. Rather than try to define deliverables, shouldn’t we leave it open ended for maximum flexibility? From there, we decided to break down everything into ultra-modular building blocks. We started working on this super complex system of modular deliverables where we would have services like writing, design, editing, etc—plus a sliding scale for different scopes like the length of writing or the number of images. In theory, it would allow us to mix and match any combination of services to create custom deliverables for the client. In fact, we wanted the work to be deliverable-agnostic. That way we could mold it to fit any client’s needs and deliver any type of content, regardless of the format or goal. Want a 5,000-word case study with 15 custom graphics? That’ll be $X. Want a 2,000-word blog post with an interview and no visuals? $Y. Just want us to create 10 briefs, you handle the writing, and we do design? It’s $Z. Again, this feels like a reasonable solution. But it quickly spiraled out of amuck. (That’s an Office reference.) For this to work, we need to have incredibly precise scoping process for every single deliverable. Before we can begin work (or even quote a price), we need to know pretty much the exact word count of the final article, for example. In the real world? This almost never happens. The content is as long as the content needs to be. Clients rarely know if the blog post should be 2,000 words or 3,000 words. They just want good content. We have a general ballpark, but we can rarely dial it in within just 1,000 words until we’ve done enough research to create the brief. Plus, from a packaging and pricing perspective, it introduces all kind of weird scenarios where clients will owe exactly $10,321 for this ultra-specific combination of services. We were building an open system that could accommodate any and all types of potential deliverables. On the face that seems great because it makes us incredibly flexible. In reality, the ambiguity actually works against us. It makes it harder for us to communicate to clients clearly about what they’ll get, how much it will cost, and how long it will take. That, of course, also means that it hurts our client relationships. (This actually kind of goes back to my personal learnings, which I’ll mention in a bit. I tend to be a “let’s leave things vague so we don’t have to limit our options” kind of person. But I’m working on fixing this to be more precise, specific, and clear in everything that we do.) Dialing It In: Building a Closed System We were trying to build an open system. We need to build a closed system. We need to force clarity and get specific about what we do, what we don’t do, and how much it all costs. Then we need a system to expand on that closed system—add new types of deliverables, new content playbooks, and new workflows if and when the need arises. With that in mind, we can start by mapping out the key dimensions of any type of deliverable that we would ever want to deliver. These are the universal dimensions that determine the scope, workflow, and price of any deliverable—regardless of the specific type output. Dimensions are: Brief scope Writing + editing scope Design scope Interview scope Revision (rounds) Scope, essentially, just tells us how many words, graphics, interviews, etc are required for the content we’re creating. In our first crack at the system, we got super granular with these scopes. But to help force a more manageable system, we realized that we didn’t need tiny increments for most of this work. Instead, we just need boundaries—you pay $X for up to Y words. We still need some variability around the scope of these articles. Obviously, most clients won’t be willing to pay the same price for a 1,000-word article as a 10,000-word article. But we can be smarter about the realistic break points. We boiled it down to the most common ranges: (Up to) 250 words 1,000 words 3,000 words 6,000 words 10,000 words This gives us a much more manageable number of variables. But we still haven’t exactly closed the system. We need one final dimension: Deliverable type. This tells us what we’re actually building with these building blocks. This is how we’ll put a cap on the potentially infinite number of combinations we could offer. The deliverable type will define what the final product should look like (e.g., blog post, case study, ebook, etc). And it will also give us a way to put standards and expectations around different types of deliverables that we want to offer. Then we can expand on this list of deliverables to offer new services. In the mean time, only the deliverables that we have already defined are, “on the menu,” so to speak. If a client comes to us and asks for something like a podcast summary article (which we don’t currently offer), we’ll have to either say we can’t provide that work or create a new deliverable type and define the dimensions of that specific piece. But here’s the kicker: No matter the deliverable type, it has to still fit within the scopes we’ve already defined. And the pricing will be the same. This means that if you’re looking for our team to write up to 1,000 words of content, it costs the same amount—whether it’s a blog post, an ebook, a LinkedIn post, or anything else. Rather than trying to retool our entire system to offer this new podcast summary article deliverable, we’ll just create the new deliverable type, add it to the list of options, and it’s ready to sell with the pre-defined dimensions we’ve already identified. To do: Update onboarding workflow Update contracts and scope documents Dial in new briefing process Know Thyself For the last year, I’ve been going through personal therapy. (Huge shout out to my wife, Laura, for her support and encouragement throughout the process.) It’s taught me a lot about myself and my tendencies. It’s helped me find some of my weaknesses and think about how I can improve as a person, as a partner, and as an entrepreneur. And it’s forced me to face a lot of hard truths. For example, consider some of the critical decisions I’ve made for my business: Unconventional freelance “collective” model No formal management structure Open-ended retainers with near-infinite flexibility General contracts without defined scope “Take it or leave it” approach to sales and marketing Over the years, I’ve talked about almost everything on this list as a huge advantage. I saw these things as a reflection of how I wanted to do things differently and better than other companies. But now, I see them more as a reflection of my fears and insecurities. Why did I design my business like this? Why do I want so much “flexibility” and why do I want things left open-ended rather than clearly defined? One reason that could clearly explain it: I’m avoidant. If you’re not steeped in the world of therapy, this basically means that my fight or flight response gets turned all the way to “flight.” If I’m unhappy or uncomfortable, my gut reaction is usually to withdraw from the situation. I see commitment and specificity as a prelude to future conflict. And I avoid conflict whenever possible. So I built my business to minimize it. If I don’t have a specific schedule of work that I’m accountable for delivering, then we can fudge the numbers a bit and hope they even out in the end. If I don’t set a specific standard for the length of an article, then I don’t have to let the client know when their request exceeds that limit. Conflict….avoided? Now, that’s not to say that everything I’ve built was wrong or bad. There is a lot of value in having flexibility in your business. For example, I would say that our flexible retainers are, overall, an advantage. Clients have changing needs. Having flexibility to quickly adapt to those needs can be a huge value add. And not everything can be clearly defined upfront (at least not without a massive amount of time and work just to decide how long to write an article). Overly-rigid structures and processes can be just as problematic as loosey-goosey ones. But, on the whole, I realized that my avoidant tendencies and laissez faire approach to management have left a vacuum in many areas. The places where I avoided specificity were often the places where there was the most confusion, uncertainty, and frustration from the team and from clients. People simply didn’t know what to expect or what was expected of them. Ironically, this often creates the conflict I’m trying to avoid. For example, if I don’t give feedback to people on my team, then they feel uneasy about their work. Or they make assumptions about expectations that don’t match what I’m actually expecting. Then the client might get upset, I might get upset, and our team members may be upset. Conflict definitely not avoided. This happens on the client side, too. If we don’t define a specific timeline when something will be delivered, the client might expect it sooner than we can deliver—creating frustration when we don’t meet their expectation. This conflict actually would have been avoided if we set clearer expectations upfront. But we didn’t do that. I didn’t do that. So it’s time to step up and close the gaps. Stepping Up and Closing the Gaps If I’m going to address these gaps and create more clarity and stability, I have to step up. Both personally and professionally. I have to actually face the fear and uncertainty that drives me to be avoidant. And then apply that to my business in meaningful ways that aren’t cop-out ways of kinda-sorta providing structure without really doing it. I’ve gotta be all in. This means: Fill the gaps where I rely on other people to do things that aren’t really their job but I haven’t put someone in place to do it Set and maintain expectations about our internal work processes, policies, and standards Define clear boundaries on things like roles, timelines, budgets, and scopes Now, this isn’t going to happen overnight. And just because I say that I need to step up to close these gaps doesn’t mean that I need to be the one who’s responsible for them (at least not forever). It just means that, as the business leader, I need to make sure the gaps get filled—by me or by someone else who has been specifically charged with owning that part of the operation. So, this is probably my #1 focus over the coming quarter. And it starts by identifying the gaps that exist. Then, step into those gaps myself, pay someone else to fill that role, or figure out how to eliminate the gap another way. This means going all the way back to the most basic decisions in our business. One of the foundational things about Optimist is being a “different kind” of agency. I always wanted to build something that solved for the bureaucracy, hierarchy, and siloed structure of agencies. If a client has feedback, they should be able to talk directly to the person doing the work rather than going through 3 layers of account management and creative directors. So I tried to be clever. I tried to design all kinds of systems and processes that eliminated these middle rungs. (In retrospect, what I was actually doing was designing a system that played into my avoidant tendencies and made it easy to abdicate responsibility for lots of things.) Since we didn’t want to create hierarchy, we never implemented things like Junior and Senior roles. We never hired someone to manage or direct the individual creatives. We didn’t have Directors or VPs. (Hell, we barely had a project manager for the first several years of existence.) This aversion to hierarchy aligned with our values around elevating ownership and collective contribution. I still believe in the value a flat structure. But a flat structure doesn’t eliminate the complexity of a growing business. No one to review writers and give them 1:1 feedback? I guess I’ll just have to do that….when I have some spare time. No Content Director? Okay, well someone needs to manage our content playbooks and roll out new ones. Just add it to my task list. Our flat structure didn’t eliminate the need for these roles. It just eliminated the people to do them. All of those unfilled roles ultimately fell back on me or our ops person, Katy. Of course, this isn’t the first time we’ve recognized this. We’ve known there were growing holes in our business as it’s gotten bigger and more complex. Over the years, we’ve experimented with different ways to solve for it. The Old Solution: Distributed Ops One system we designed was a “distributed ops” framework. Basically, we had one person who was the head of ops (at the time, we considered anything that was non-client-facing to be “ops”). They’d plan and organize all of the various things that needed to happen around Optimist. Then they’d assign out the work to whoever was able to help. We had a whole system for tying this into the our profit share and even gave people “Partner” status based on their contributions to ops. It worked—kinda. One big downfall is that all of the tasks and projects were ad hoc. People would pick up jobs, but they didn’t have much context or expertise to apply. So the output often varied. Since we were trying to maintain a flat structure, there was minimal oversight or management of the work. In other words, we didn’t always get the best results. But, more importantly, we still didn’t close all of the gaps entirely. Because everything was an ad-hoc list of tasks and projects, we never really had the “big picture” view of everything that needed to be done across the business. This also meant we rarely had clarity on what was important, what was trivial, and what was critical. We need a better system. Stop Reinventing the Wheel (And Create a Damn Org Chart) It’s time to get serious about filling the gaps in our business. It can’t be a half-fix or an ad hoc set of projects and tasks. We need clarity on the roles that need to be filled and then fill them. The first step here is to create an org chart. A real one. Map out all of the jobs that need to be done for Optimist to be successful besides just writers and designers. Roles like: Content director Design director SEO manager Reporting Finance Account management Business development Sales Marketing Project management It feels a bit laughable listing all of these roles. Because most are either empty or have my name attached to them. And that’s the problem. I can’t do everything. And all of the empty roles are gaps in our structure—places where people aren’t getting the direction, feedback, or guidance they need to do their best work. Or where things just aren’t being done consistently. Content director, for example, should be responsible for steering the output of our content strategists, writers, and editors. They’re not micromanaging every deliverable. But they give feedback, set overall policy, and help our team identify opportunities to get better. Right now we don’t have anyone in that role. Which means it’s my job—when I have time. Looking at the org chart (a real org chart that I actually built to help with this), it’s plain as day how many roles look like this. Even if we aren’t going to implement a traditional agency structure and a strict hierarchy, we still need to address these gaps. And the only way for that to happen is face the reality and then create a plan to close the gaps. Now that we have a list of theoretical roles, we need to clearly define the responsibilities and boundaries of those roles to make sure they cover everything that actually needs to happen. Then we can begin the process of delegating, assigning, hiring, and otherwise addressing each one. So that’s what I need to do. To be done: Create job descriptions for all of the roles we need to fill Hire Biz Dev role Hire Account Lead role(s) Hire Head of Content Playing Offense As we move into Q1 of 2025 and I reflect on the tumultuous few years we’ve had, one thought keeps running through my head. We need to play offense. Most of the last 1-2 years was reacting to changes that were happening around us. Trying to make sense and chart a new path forward. Reeling. But what I really want—as a person and as an entrepreneur—is to be proactive. I want to think and plan ahead. Figure out where we want to go before we’re forced to change course by something that’s out of our control. So my overarching focus for Q1 is playing offense. Thinking longer term. Getting ahead of the daily deluge and creating space to be more proactive, innovative, and forward thinking. To do: Pilot new content formats Audit and update our own content strategy Improve feedback workflows Build out long-term roadmap for 1-2 years for Optimist Final Note on Follow-Through and Cadence In my reflection this year, one of the things I’ve realized is how helpful these posts are for me. I process by writing. So I actually end up making a lot of decisions and seeing things more clearly each time I sit down to reflect and write my yearly recap. It also gives me a space to hold myself accountable for the things I said I would do. So, I’m doing two things a bit differently from here on out. First: I’m identifying clear action items that I’m holding myself accountable for getting done in the next 3 months (listed in the above sections). In each future update, I’ll do an accounting of what I got done and what wasn’t finished (and why). Second: I’m going to start writing shorter quarterly updates. This will gives me more chances each year to reflect, process, and make decisions. Plus it gives me a shorter feedback loop for the action items that I identified above. (See—playing offense.) — Okay friends, enemies, and frenemies. This is my first update for 2025. Glad to share with y’all. And thanks to everyone who’s read, commented, reached out, and shared their own experiences over the years. We are all the accumulation of our connections and our experiences. As always, I will pop in to respond to comments and answer questions. Feel free to share your thoughts, questions, and general disdain down below. Cheers, Tyler

Switching Gears: Implementing AI for My Agency’s Marketing After a Decade
reddit
LLM Vibe Score0
Human Vibe Score0.333
Alarming_Management3This week

Switching Gears: Implementing AI for My Agency’s Marketing After a Decade

Hi there, I’ve been running a software development and design agency for the last 10 years, mainly focusing on building custom solutions for businesses and SaaS. For the last 2 years, I’ve consistently recommended that clients use AI technologies, especially for social media and content creation to generate traffic. Funny enough, I wasn’t practicing what I preached. Most of my client projects came from platforms like Upwork and word-of-mouth referrals from clients or people from networking events. Background I started my journey in 2014, switching from an employee to a freelancer. Within the first 10 months, my initial projects grew beyond what I could handle alone, prompting me to hire additional developers. This shift turned my role from a full-stack developer to a team lead and developer. Over the years, my focus has been a blend of tech and product. About five years ago, I realized the importance of design, leading me to adding designers to the agency to provide full-cycle service development—from product ideation and design to development, testing, launch, and support. I still continue to set up dedicated teams for some clients, maintaining a strong technical role as a tech lead, solution architect, and head product designer. To enhance my skills, I even completed UI/UX design courses to offer better product solutions. Despite these changes, building products has always been the easy part. The challenge was ensuring these client products didn’t end up in the graveyard due to poor product-market fit, often caused by inadequate marketing and sales strategies but more often just absence of them. (we are talking about startup and first time founders here 🙂 ) My Journey and Observations Advising Clients: I often found myself advising clients on increasing traffic for their SaaS products and crafting strategic marketing plans. Learning: I’ve gained most of my knowledge from consuming internet materials, courses, and blog posts and learning from successful client project launches. Realization: Despite giving this advice, I wasn’t applying these strategies to my own business, leading to low visits to my agency’s website. Initial Solution: Hiring a Marketer Hiring: I brought in a marketer with a solid background in content creating and interview video editing from an educational organization. Goal: The aim was to increase website visits through a comprehensive marketing strategy. Outcome: Although the content produced was high-quality and useful for pitching services, it didn’t lead to significant traffic increases. Issue: The marketer focused more on content creation rather than distribution channels, which limited effectiveness. Shift to AI-Driven Strategy Experiment: I decided to try using AI for content creation and distribution, which aligns with my agency’s specialization in design-driven development and AI integrations. Implementation plan: I will be generating all content with minimal edits using AI and implementing a strategic backlinking approach. Backlinking Strategy Initial Plan: I initially thought of hiring a specialist for backlinks. Realization: The costs and profiles of freelancers didn’t seem promising. Solution: I found AI-driven services for backlinks, which seem more efficient and cost-effective. Plan: My plan is to use these tools for programmatic SEO-driven AI-generated articles and third-party backlinking services over the next two to three months. Current Approach Management: This approach can be managed and executed by 1 person and monitored weekly, reducing human error and optimizing efficiency. I will start it myself and then replace myself with an editor with managing skills. Reflection: It’s a bit ironic and funny that it took me 10 years to start implementing these strategies in my own agency business, but I now feel more confident with AI and automation in place. Why Increase Website Visitors? You might ask, why do I want to increase the number of visitors to the site, and how can I ensure these visitors will be qualified? Hands-On Experience: To gain hands-on experience and perform this exercise effectively. Introduce Packaged Services: I want to introduce a set of low-cost packaged services tailored for non-technical people who want to build things for themselves - the DIY kits for non-technical folks. These services will provide a foundational template for them to build upon on top of existing established solutions such as Wix, Square Why am I Posting and Sharing Here? You might also wonder, why am I posting it here and sharing this? Well, I'm doing this more for myself. Most of my career, the things I’ve done have been behind the curtains. With this small project, I want to make it public to see the reaction of the community. Perhaps there will be good and smart suggestions offered, and maybe some insights or highlights of tools I wasn’t aware of or didn’t consider. I’ll keep sharing updates on this journey of website promotion, marketing, and SEO. My current goal is to reach 2,000 visits per month, which is a modest start. Looking forward to any thoughts or advice from this community! Disclaimer: This content was not generated by AI, but it was edited by it 😛

I run an AI automation agency (AAA). My honest overview and review of this new business model
reddit
LLM Vibe Score0
Human Vibe Score1
AI_Scout_OfficialThis week

I run an AI automation agency (AAA). My honest overview and review of this new business model

I started an AI tools directory in February, and then branched off that to start an AI automation agency (AAA) in June. So far I've come across a lot of unsustainable "ideas" to make money with AI, but at the same time a few diamonds in the rough that aren't fully tapped into yet- especially the AAA model. Thought I'd share this post to shine light into this new business model and share some ways you could potentially start your own agency, or at the very least know who you are dealing with and how to pick and choose when you (inevitably) get bombarded with cold emails from them down the line. Foreword Running an AAA does NOT involve using AI tools directly to generate and sell content directly. That ship has sailed, and unless you are happy with $5 from Fiverr every month or so, it is not a real business model. Cry me a river but generating generic art with AI and slapping it onto a T-shirt to sell on Etsy won't make you a dime. At the same time, the AAA model will NOT require you to have a deep theoretical knowledge of AI, or any academic degree, as we are more so dealing with the practical applications of generative AI and how we can implement these into different workflows and tech-stacks, rather than building AI models from the ground up. Regardless of all that, common sense and a willingness to learn will help (a shit ton), as with anything. Keep in mind - this WILL involve work and motivation as well. The mindset that AI somehow means everything can be done for you on autopilot is not the right way to approach things. The common theme of businesses I've seen who have successfully implemented AI into their operations is the willingess to work with AI in a way that augments their existing operations, rather than flat out replace a worker or team. And this is exactly the train of thought you need when working with AI as a business model. However, as the field is relatively unsaturated and hype surrounding AI is still fresh for enterprises, right now is the prime time to start something new if generative AI interests you at all. With that being said, I'll be going over three of the most successful AI-adjacent businesses I've seen over this past year, in addition to some tips and resources to point you in the right direction. so.. WTF is an AI Automation Agency? The AI automation agency (or as some YouTubers have coined it, the AAA model) at its core involves creating custom AI solutions for businesses. I have over 1500 AI tools listed in my directory, however the feedback I've received from some enterprise users is that ready-made SaaS tools are too generic to meet their specific needs. Combine this with the fact virtually no smaller companies have the time or skills required to develop custom solutions right off the bat, and you have yourself real demand. I would say in practice, the AAA model is quite similar to Wordpress and even web dev agencies, with the major difference being all solutions you develop will incorporate key aspects of AI AND automation. Which brings me to my second point- JUST AI IS NOT ENOUGH. Rather than reducing the amount of time required to complete certain tasks, I've seen many AI agencies make the mistake of recommending and (trying to) sell solutions that more likely than not increase the workload of their clients. For example, if you were to make an internal tool that has AI answer questions based on their knowledge base, but this knowledge base has to be updated manually, this is creating unnecessary work. As such I think one of the key components of building successful AI solutions is incorporating the new (Generative AI/LLMs) with the old (programmtic automation- think Zapier, APIs, etc.). Finally, for this business model to be successful, ideally you should target a niche in which you have already worked and understand pain points and needs. Not only does this make it much easier to get calls booked with prospects, the solutions you build will have much greater value to your clients (meaning you get paid more). A mistake I've seen many AAA operators make (and I blame this on the "Get Rich Quick" YouTubers) is focusing too much on a specific productized service, rather than really understanding the needs of businesses. The former is much done via a SaaS model, but when going the agency route the only thing that makes sense is building custom solutions. This is why I always take a consultant-first approach. You can only build once you understand what they actually need and how certain solutions may impact their operations, workflows, and bottom-line. Basics of How to Get Started Pick a niche. As I mentioned previously, preferably one that you've worked in before. Niches I know of that are actively being bombarded with cold emails include real estate, e-commerce, auto-dealerships, lawyers, and medical offices. There is a reason for this, but I will tell you straight up this business model works well if you target any white-collar service business (internal tools approach) or high volume businesses (customer facing tools approach). Setup your toolbox. If you wanted to start a pressure washing business, you would need a pressure-washer. This is no different. For those without programming knowledge, I've seen two common ways AAA get setup to build- one is having a network of on-call web developers, whether its personal contacts or simply going to Upwork or any talent sourcing agency. The second is having an arsenal of no-code tools. I'll get to this more in a second, but this works beecause at its core, when we are dealing with the practical applications of AI, the code is quite simple, simply put. Start cold sales. Unless you have a network already, this is not a step you can skip. You've already picked a niche, so all you have to do is find the right message. Keep cold emails short, sweet, but enticing- and it will help a lot if you did step 1 correctly and intimately understand who your audience is. I'll be touching base later about how you can leverage AI yourself to help you with outreach and closing. The beauty of gen AI and the AAA model You don't need to be a seasoned web developer to make this business model work. The large majority of solutions that SME clients want is best done using an API for an LLM for the actual AI aspect. The value we create with the solutions we build comes with the conceptual framework and design that not only does what they need it to but integrates smoothly with their existing tech-stack and workflow. The actual implementation is quite straightforward once you understand the high level design and know which tools you are going to use. To give you a sense, even if you plan to build out these apps yourself (say in Python) the large majority of the nitty gritty technical work has already been done for you, especially if you leverage Python libraries and packages that offer high level abstraction for LLM-related functions. For instance, calling GPT can be as little as a single line of code. (And there are no-code tools where these functions are simply an icon on a GUI). Aside from understanding the capabilities and limitations of these tools and frameworks, the only thing that matters is being able to put them in a way that makes sense for what you want to build. Which is why outsourcing and no-code tools both work in our case. Okay... but how TF am I suppposed to actually build out these solutions? Now the fun part. I highly recommend getting familiar with Langchain and LlamaIndex. Both are Python libraires that help a lot with the high-level LLM abstraction I mentioned previously. The two most important aspects include being able to integrate internal data sources/knowledge bases with LLMs, and have LLMs perform autonomous actions. The two most common methods respectively are RAG and output parsing. RAG (retrieval augmented Generation) If you've ever seen a tool that seemingly "trains" GPT on your own data, and wonder how it all works- well I have an answer from you. At a high level, the user query is first being fed to what's called a vector database to run vector search. Vector search basically lets you do semantic search where you are searching data based on meaning. The vector databases then retrieves the most relevant sections of text as it relates to the user query, and this text gets APPENDED to your GPT prompt to provide extra context to the AI. Further, with prompt engineering, you can limit GPT to only generate an answer if it can be found within this extra context, greatly limiting the chance of hallucination (this is where AI makes random shit up). Aside from vector databases, we can also implement RAG with other data sources and retrieval methods, for example SQL databses (via parsing the outputs of LLM's- more on this later). Autonomous Agents via Output Parsing A common need of clients has been having AI actually perform tasks, rather than simply spitting out text. For example, with autonomous agents, we can have an e-commerce chatbot do the work of a basic customer service rep (i.e. look into orders, refunds, shipping). At a high level, what's going on is that the response of the LLM is being used programmtically to determine which API to call. Keeping on with the e-commerce example, if I wanted a chatbot to check shipping status, I could have a LLM response within my app (not shown to the user) with a prompt that outputs a random hash or string, and programmatically I can determine which API call to make based on this hash/string. And using the same fundamental concept as with RAG, I can append the the API response to a final prompt that would spit out the answer for the user. How No Code Tools Can Fit In (With some example solutions you can build) With that being said, you don't necessarily need to do all of the above by coding yourself, with Python libraries or otherwise. However, I will say that having that high level overview will help IMMENSELY when it comes to using no-code tools to do the actual work for you. Regardless, here are a few common solutions you might build for clients as well as some no-code tools you can use to build them out. Ex. Solution 1: AI Chatbots for SMEs (Small and Medium Enterprises) This involves creating chatbots that handle user queries, lead gen, and so forth with AI, and will use the principles of RAG at heart. After getting the required data from your client (i.e. product catalogues, previous support tickets, FAQ, internal documentation), you upload this into your knowledge base and write a prompt that makes sense for your use case. One no-code tool that does this well is MyAskAI. The beauty of it especially for building external chatbots is the ability to quickly ingest entire websites into your knowledge base via a sitemap, and bulk uploading files. Essentially, they've covered the entire grunt work required to do this manually. Finally, you can create a inline or chat widget on your client's website with a few lines of HTML, or altneratively integrate it with a Slack/Teams chatbot (if you are going for an internal Q&A chatbot approach). Other tools you could use include Botpress and Voiceflow, however these are less for RAG and more for building out complete chatbot flows that may or may not incorporate LLMs. Both apps are essentially GUIs that eliminate the pain and tears and trying to implement complex flows manually, and both natively incoporate AI intents and a knowledge base feature. Ex. Solution 2: Internal Apps Similar to the first example, except we go beyond making just chatbots but tools such as report generation and really any sort of internal tool or automations that may incorporate LLM's. For instance, you can have a tool that automatically generates replies to inbound emails based on your client's knowledge base. Or an automation that does the same thing but for replies to Instagram comments. Another example could be a tool that generates a description and screeenshot based on a URL (useful for directory sites, made one for my own :P). Getting into more advanced implementations of LLMs, we can have tools that can generate entire drafts of reports (think 80+ pages), based not only on data from a knowledge base but also the writing style, format, and author voice of previous reports. One good tool to create content generation panels for your clients would be MindStudio. You can train LLM's via prompt engineering in a structured way with your own data to essentially fine tune them for whatever text you need it to generate. Furthermore, it has a GUI where you can dictate the entire AI flow. You can also upload data sources via multiple formats, including PDF, CSV, and Docx. For automations that require interactions between multiple apps, I recommend the OG zapier/make.com if you want a no-code solution. For instance, for the automatic email reply generator, I can have a trigger such that when an email is received, a custom AI reply is generated by MyAskAI, and finally a draft is created in my email client. Or, for an automation where I can create a social media posts on multiple platforms based on a RSS feed (news feed), I can implement this directly in Zapier with their native GPT action (see screenshot) As for more complex LLM flows that may require multiple layers of LLMs, data sources, and APIs working together to generate a single response i.e. a long form 100 page report, I would recommend tools such as Stack AI or Flowise (open-source alternative) to build these solutions out. Essentially, you get most of the functions and features of Python packages such as Langchain and LlamaIndex in a GUI. See screenshot for an example of a flow How the hell are you supposed to find clients? With all that being said, none of this matters if you can't find anyone to sell to. You will have to do cold sales, one way or the other, especially if you are brand new to the game. And what better way to sell your AI services than with AI itself? If we want to integrate AI into the cold outreach process, first we must identify what it's good at doing, and that's obviously writing a bunch of text, in a short amount of time. Similar to the solutions that an AAA can build for its clients, we can take advantage of the same principles in our own sales processes. How to do outreach Once you've identified your niche and their pain points/opportunities for automation, you want to craft a compelling message in which you can send via cold email and cold calls to get prospects booked on demos/consultations. I won't get into too much detail in terms of exactly how to write emails or calling scripts, as there are millions of resources to help with this, but I will tell you a few key points you want to keep in mind when doing outreach for your AAA. First, you want to keep in mind that many businesses are still hesitant about AI and may not understand what it really is or how it can benefit their operations. However, we can take advantage of how mass media has been reporting on AI this past year- at the very least people are AWARE that sooner or later they may have to implement AI into their businesses to stay competitive. We want to frame our message in a way that introduces generative AI as a technology that can have a direct, tangible, and positive impact on their business. Although it may be hard to quantify, I like to include estimates of man-hours saved or costs saved at least in my final proposals to prospects. Times are TOUGH right now, and money is expensive, so you need to have a compelling reason for businesses to get on board. Once you've gotten your messaging down, you will want to create a list of prospects to contact. Tools you can use to find prospects include Apollo.io, reply.io, zoominfo (expensive af), and Linkedin Sales Navigator. What specific job titles, etc. to target will depend on your niche but for smaller companies this will tend to be the owner. For white collar niches, i.e. law, the professional that will be directly benefiting from the tool (i.e. partners) may be better to contact. And for larger organizations you may want to target business improvement and digital transformation leads/directors- these are the people directly in charge of projects like what you may be proposing. Okay- so you have your message, and your list, and now all it comes down to is getting the good word out. I won't be going into the details of how to send these out, a quick Google search will give you hundreds of resources for cold outreach methods. However, personalization is key and beyond simple dynamic variables you want to make sure you can either personalize your email campaigns directly with AI (SmartWriter.ai is an example of a tool that can do this), or at the very least have the ability to import email messages programmatically. Alternatively, ask ChatGPT to make you a Python Script that can take in a list of emails, scrape info based on their linkedin URL or website, and all pass this onto a GPT prompt that specifies your messaging to generate an email. From there, send away. How tf do I close? Once you've got some prospects booked in on your meetings, you will need to close deals with them to turn them into clients. Call #1: Consultation Tying back to when I mentioned you want to take a consultant-first appraoch, you will want to listen closely to their goals and needs and understand their pain points. This would be the first call, and typically I would provide a high level overview of different solutions we could build to tacke these. It really helps to have a presentation available, so you can graphically demonstrate key points and key technologies. I like to use Plus AI for this, it's basically a Google Slides add-on that can generate slide decks for you. I copy and paste my default company messaging, add some key points for the presentation, and it comes out with pretty decent slides. Call #2: Demo The second call would involve a demo of one of these solutions, and typically I'll quickly prototype it with boilerplate code I already have, otherwise I'll cook something up in a no-code tool. If you have a niche where one type of solution is commonly demanded, it helps to have a general demo set up to be able to handle a larger volume of calls, so you aren't burning yourself out. I'll also elaborate on how the final product would look like in comparison to the demo. Call #3 and Beyond: Once the initial consultation and demo is complete, you will want to alleviate any remaining concerns from your prospects and work with them to reach a final work proposal. It's crucial you lay out exactly what you will be building (in writing) and ensure the prospect understands this. Furthermore, be clear and transparent with timelines and communication methods for the project. In terms of pricing, you want to take this from a value-based approach. The same solution may be worth a lot more to client A than client B. Furthermore, you can create "add-ons" such as monthly maintenance/upgrade packages, training sessions for employeees, and so forth, separate from the initial setup fee you would charge. How you can incorporate AI into marketing your businesses Beyond cold sales, I highly recommend creating a funnel to capture warm leads. For instance, I do this currently with my AI tools directory, which links directly to my AI agency and has consistent branding throughout. Warm leads are much more likely to close (and honestly, much nicer to deal with). However, even without an AI-related website, at the very least you will want to create a presence on social media and the web in general. As with any agency, you will want basic a professional presence. A professional virtual address helps, in addition to a Google Business Profile (GBP) and TrustPilot. a GBP (especially for local SEO) and Trustpilot page also helps improve the looks of your search results immensely. For GBP, I recommend using ProfilePro, which is a chrome extension you can use to automate SEO work for your GBP. Aside from SEO optimzied business descriptions based on your business, it can handle Q/A answers, responses, updates, and service descriptions based on local keywords. Privacy and Legal Concerns of the AAA Model Aside from typical concerns for agencies relating to service contracts, there are a few issues (especially when using no-code tools) that will need to be addressed to run a successful AAA. Most of these surround privacy concerns when working with proprietary data. In your terms with your client, you will want to clearly define hosting providers and any third party tools you will be using to build their solution, and a DPA with these third parties listed as subprocessors if necessary. In addition, you will want to implement best practices like redacting private information from data being used for building solutions. In terms of addressing concerns directly from clients, it helps if you host your solutions on their own servers (not possible with AI tools), and address the fact only ChatGPT queries in the web app, not OpenAI API calls, will be used to train OpenAI's models (as reported by mainstream media). The key here is to be open and transparent with your clients about ALL the tools you are using, where there data will be going, and make sure to get this all in writing. have fun, and keep an open mind Before I finish this post, I just want to reiterate the fact that this is NOT an easy way to make money. Running an AI agency will require hours and hours of dedication and work, and constantly rearranging your schedule to meet prospect and client needs. However, if you are looking for a new business to run, and have a knack for understanding business operations and are genuinely interested in the pracitcal applications of generative AI, then I say go for it. The time is ticking before AAA becomes the new dropshipping or SMMA, and I've a firm believer that those who set foot first and establish themselves in this field will come out top. And remember, while 100 thousand people may read this post, only 2 may actually take initiative and start.

Switching Gears: Implementing AI for My Agency’s Marketing After a Decade
reddit
LLM Vibe Score0
Human Vibe Score0.333
Alarming_Management3This week

Switching Gears: Implementing AI for My Agency’s Marketing After a Decade

Hi there, I’ve been running a software development and design agency for the last 10 years, mainly focusing on building custom solutions for businesses and SaaS. For the last 2 years, I’ve consistently recommended that clients use AI technologies, especially for social media and content creation to generate traffic. Funny enough, I wasn’t practicing what I preached. Most of my client projects came from platforms like Upwork and word-of-mouth referrals from clients or people from networking events. Background I started my journey in 2014, switching from an employee to a freelancer. Within the first 10 months, my initial projects grew beyond what I could handle alone, prompting me to hire additional developers. This shift turned my role from a full-stack developer to a team lead and developer. Over the years, my focus has been a blend of tech and product. About five years ago, I realized the importance of design, leading me to adding designers to the agency to provide full-cycle service development—from product ideation and design to development, testing, launch, and support. I still continue to set up dedicated teams for some clients, maintaining a strong technical role as a tech lead, solution architect, and head product designer. To enhance my skills, I even completed UI/UX design courses to offer better product solutions. Despite these changes, building products has always been the easy part. The challenge was ensuring these client products didn’t end up in the graveyard due to poor product-market fit, often caused by inadequate marketing and sales strategies but more often just absence of them. (we are talking about startup and first time founders here 🙂 ) My Journey and Observations Advising Clients: I often found myself advising clients on increasing traffic for their SaaS products and crafting strategic marketing plans. Learning: I’ve gained most of my knowledge from consuming internet materials, courses, and blog posts and learning from successful client project launches. Realization: Despite giving this advice, I wasn’t applying these strategies to my own business, leading to low visits to my agency’s website. Initial Solution: Hiring a Marketer Hiring: I brought in a marketer with a solid background in content creating and interview video editing from an educational organization. Goal: The aim was to increase website visits through a comprehensive marketing strategy. Outcome: Although the content produced was high-quality and useful for pitching services, it didn’t lead to significant traffic increases. Issue: The marketer focused more on content creation rather than distribution channels, which limited effectiveness. Shift to AI-Driven Strategy Experiment: I decided to try using AI for content creation and distribution, which aligns with my agency’s specialization in design-driven development and AI integrations. Implementation plan: I will be generating all content with minimal edits using AI and implementing a strategic backlinking approach. Backlinking Strategy Initial Plan: I initially thought of hiring a specialist for backlinks. Realization: The costs and profiles of freelancers didn’t seem promising. Solution: I found AI-driven services for backlinks, which seem more efficient and cost-effective. Plan: My plan is to use these tools for programmatic SEO-driven AI-generated articles and third-party backlinking services over the next two to three months. Current Approach Management: This approach can be managed and executed by 1 person and monitored weekly, reducing human error and optimizing efficiency. I will start it myself and then replace myself with an editor with managing skills. Reflection: It’s a bit ironic and funny that it took me 10 years to start implementing these strategies in my own agency business, but I now feel more confident with AI and automation in place. Why Increase Website Visitors? You might ask, why do I want to increase the number of visitors to the site, and how can I ensure these visitors will be qualified? Hands-On Experience: To gain hands-on experience and perform this exercise effectively. Introduce Packaged Services: I want to introduce a set of low-cost packaged services tailored for non-technical people who want to build things for themselves - the DIY kits for non-technical folks. These services will provide a foundational template for them to build upon on top of existing established solutions such as Wix, Square Why am I Posting and Sharing Here? You might also wonder, why am I posting it here and sharing this? Well, I'm doing this more for myself. Most of my career, the things I’ve done have been behind the curtains. With this small project, I want to make it public to see the reaction of the community. Perhaps there will be good and smart suggestions offered, and maybe some insights or highlights of tools I wasn’t aware of or didn’t consider. I’ll keep sharing updates on this journey of website promotion, marketing, and SEO. My current goal is to reach 2,000 visits per month, which is a modest start. Looking forward to any thoughts or advice from this community! Disclaimer: This content was not generated by AI, but it was edited by it 😛

Detailed Guide - How I've Been Self Employed for 2 Years Selling Posters
reddit
LLM Vibe Score0
Human Vibe Score1
tommo278This week

Detailed Guide - How I've Been Self Employed for 2 Years Selling Posters

Hey everyone, bit of context before you read through this. I have been selling POD posters full time for over 2 years now. My next venture is that I have started my own Print on Demand company for posters, PrintShrimp. As one way of creating customers for our service, we are teaching people for free how to also sell posters. Here is a guide I have written on how to sell posters on Etsy. Feel free to have a read through and then check out PrintShrimp, hopefully can help some of you guys out (and get us some more customers!) All of this is also available in video format on our website too, if you prefer to learn that way. Thanks guys! And as some people asked in other subs, no this isn't written with AI 😅 This took a couple of weeks to put together! Through this guide, we will teach you everything you need to know about starting to sell posters and generate some income. We will also show you why PrintShrimp is the best POD supplier for all of your poster needs. Trust me, you won’t need much convincing.  So, why are posters the best product to sell? Also, just thought I’d quickly answer the question - why posters? If you’ve been researching Print on Demand you’ve probably come across the infinite options of t-shirts, mugs, hats, phone cases, and more. All of these are viable options, however we think posters are the perfect place to start. You can always expand into other areas further down the line! So a brief summary of why posters are the perfect product for Print on Demand: \-They are very easy to design! Posters are a very easy shape to deal with - can’t go wrong with a rectangle. This makes designing products very easy. \-Similarly to this, what you see is what you get with a poster. You can literally see your finished product as you design it in either canva or photoshop. With T-Shirts for example, you have to make your design, and then place it on a t-shirt. Then you have to coordinate with your printers the size you would like the design on the tshirt and many other variables like that. There is no messing about with posters - what you see is what you get. \-The same high quality, everywhere. With other products, if you want to reap the benefits of a printing in various countries, you need to ensure each of your global suppliers stocks the same t-shirts, is able to print in the same way, carries the same sizes etc. Again with posters you avoid all of this hassle- your products will come out the same, no matter which of our global locations are used. \-They have a very favorable profit margin. As you will see later, the cost price of posters is very low. And people are prepared to pay quite a lot for a decent bit of wall art! I have tried out other products, and the profit margin combined with the order quantity of posters makes them my most profitable product, every single time. Using PrintShrimp, you can be sure to enjoy profits of anywhere between £6 - £40 pure profit per sale.  \-They are one of the easiest to print white label. This makes them perfect for Print on Demand. Your posters are simply put in a tube, and off they go. There are no extras you need to faff around with, compared to the extra elements other products come with, such as clothing labels on t-shirts.  Picking your poster niche So, you are ready to start selling posters. Great! Now, the blessing and curse with selling posters is that there are infinite possibilities regarding what you can sell. So, it can easily be quite overwhelming at first.  The first thing I would recommend doing is having a look at what others are selling. Etsy is a wonderful place for this (and will likely be a key part of your poster selling journey). So, log on to Etsy and simply type in ‘poster’ in the search bar. Get ready to write a massive list of the broad categories and type of posters that people are selling.  If you do not have more than 50 categories written down by the end, you are doing something wrong. There are seriously an infinite amount of posters! For example, here are some popular ones to get you started: Star sign posters, Kitchen posters, World map posters, Custom Dog Portrait posters, Music posters, Movie posters, Fine art posters, Skiing posters, Girl Power posters and Football posters.  Now, you have a huge list of potential products to sell. What next? There are a few important things you need to bear in mind when picking your niche: \-Does this interest me?  Don’t make the mistake of going down a niche that didn’t actually interest you just because it would probably be a money maker. Before you know it, what can be a very fun process of making designs can become incredibly \\\monotonous, and feel like a chore\\\. You need to bear in mind that you will be spending a lot of time creating designs - if it is something you are interested in you are much less likely to get burnt out! As well, \\\creativity will flow\\\ far better if it is something you are interested in, which at the end of the day will lead to better designs that are more likely to be purchased by customers.  \-Is this within my design range? Don’t let this put you off too much. We will go through how to get started on design later on in this guide. However, it is important to note that the plain truth of it is that some niches and designs are a hell of a lot more complicated than others. For example, quote posters can essentially be designed by anyone when you learn about how to put nice fonts together in a good color scheme. On the other hand, some posters you see may have been designed with complex illustrations in a program like Illustrator. To start with, it may be better to pick a niche that seems a bit more simple to get into, as you can always expand your range with other stores further down the line. A good way of evaluating the design complexity is by identifying if this poster is \\\a lot of elements put together\\\ or is \\\a lot of elements created by the designer themselves\\\\\.\\ Design can in a lot of cases be like a jigsaw - putting colours, shapes and text together to create an image. This will be a lot easier to start with and can be learnt by anyone, compared to complex drawings and illustrations.  \-Is this niche subject to copyright issues? Time to delve deep into good old copyright. Now, when you go through Etsy, you will without a doubt see hundreds of sellers selling music album posters, car posters, movie posters and more. Obviously, these posters contain the property of musicians, companies and more and are therefore copyrighted. The annoying thing is - these are \\\a complete cash cow.\\\ If you go down the music poster route, I will honestly be surprised if you \\don’t\\ make thousands. However it is only a matter of time before the copyright strikes start rolling in and you eventually get banned from Etsy.  So I would highly recommend \\\not making this mistake\\\. Etsy is an incredible platform for selling posters, and it is a hell of a lot easier to make sales on there compared to advertising your own website. And, you \\\only get one chance on Etsy.\\\ Once you have been banned once, you are not allowed to sign up again (and they do ID checks - so you won’t be able to rejoin again under your own name).  So, don’t be shortsighted when it comes to entering Print on Demand. If you keep your designs legitimate, they will last you a lifetime and you will then later be able to crosspost them to other platforms, again without the worry of ever getting shut down.  So, how do I actually design posters? Now you have an idea of what kind of posters you want to be making, it’s time to get creative and make some designs! Photoshop (and the creative cloud in general) is probably the best for this. However, when starting out it can be a scary investment (it costs about £30 a month unless you can get a student rate!).  So, while Photoshop is preferable in the long term, when starting out you can learn the ropes of design and get going with Canva. This can be great at the start as they have a load of templates that you can use to get used to designing and experimenting (while it might be tempting to slightly modify these and sell them - this will be quite saturated on places like Etsy so we would recommend doing something new).  What size format should I use? The best design format to start with is arguably the A sizes - as all the A sizes (A5, A4, A3, A2, A1, A0) are scalable. This means that you can make all of your designs in one size, for example A3, and these designs will be ready to fit to all other A sizes. For example, if you design an A3 poster and someone orders A1, you can just upload this A3 file to PrintShrimp and it will be ready to print. There is a wide range of other sizes you should consider offering on your shop, especially as these sizes are very popular with the American market. They have a wide range of popular options, which unfortunately aren’t all scalable with each other. This does mean that you will therefore have to make some slight modifications to your design in order to be able to offer them in American sizing, in a few different aspect ratios. What you can do however is design all of your products in UK sizing, and simply redesign to fit American sizing once you have had an order. Essentially: design in UK sizing, but list in both UK and US sizing. Then when you get a non-A size order, you can quickly redesign it on demand. This means that you don’t have to make a few different versions of each poster when first designing, and can simply do a quick redesign for US sizing when you need to. Below is PrintShrimps standard size offering. We can also offer any custom sizing too, so please get in touch if you are looking for anything else. With these sizes, your poster orders will be dispatched domestically in whatever country your customer orders from. Our recommendations for starting design One thing that will not be featured in this guide is a written out explanation or guide on how to design. Honestly, I can’t think of a more boring, or frankly worse, way to learn design. When it comes to getting started, experimenting is your best friend! Just have a play around and see what you can do. It is a really fun thing to get started with, and the satisfaction of when a poster design comes together is like no other. A good way to start is honestly by straight up copying a poster you see for sale online. And we don’t mean copying to sell! But just trying to replicate other designs is a great way to get a feel for it and what you can do. We really think you will be surprised at how easy it is to pull together a lot of designs that at first can appear quite complicated! Your best friend throughout this whole process will be google. At the start you will not really know how to do anything - but learning how to look into things you want to know about design is all part of the process. At first, it can be quite hard to even know how to search for what you are trying to do, but this will come with time (we promise). Learning how to google is a skill that you will learn throughout this process.  Above all, what we think is most important is this golden rule: take inspiration but do not steal. You want to be selling similar products in your niche, but not copies. You need to see what is selling in your niche and get ideas from that, but if you make designs too similar to ones already available, you won’t have much luck. At the end of the day, if two very similar posters are for sale and one shop has 1000 reviews and your newer one has 2, which one is the customer going to buy? You need to make yours offer something different and stand out enough to attract customers. Etsy SEO and maximizing your sales You may have noticed in this guide we have mentioned Etsy quite a few times! That is because we think it is hands down the best place to start selling posters. Why? Etsy is a go to place for many looking to decorate their homes and also to buy gifts. It might be tempting to start selling with your own website straight away, however we recommend Etsy as it brings the customers to you. For example, say you start selling Bathroom Posters. It is going to be a hell of a lot easier to convert sales when you already have customers being shown your page after searching ‘bathroom decor’, compared to advertising your own website. This is especially true as it can be hard to identify your ideal target audience to then advertise to via Meta (Facebook/Instagram) for example. Websites are a great avenue to explore eventually like I now have, but we recommend starting with Etsy and going from there. What costs do I need to be aware of? So, setting up an Etsy sellers account is currently costs £15. The only other upfront cost you will have is the cost of listing a product - this is 20 cents per listing. From then on, every time you make a sale you will be charged a transaction fee of 6.5%, a small payment processing fee, plus another 20 cents for a renewed listing fee. It normally works out to about 10% of each order, a small price to pay for all the benefits Etsy brings. No matter what platform you sell on, you will be faced with some form of transaction fee. Etsy is actually quite reasonable especially as they do not charge you to use their platform on a monthly basis.  What do I need to get selling? Getting your shop looking pretty \-Think of a shop name and design (now you are a professional designer) a logo \-Design a banner for the top of your shop \-Add in some about me info/shop announcement \-I recommend running a sale wherein orders of 3+ items get a 20% of discount. Another big benefit of PrintShrimp is that you receive large discounts when ordering multiple posters. This is great for attracting buyers and larger orders.  Making your products look attractive That is the bulk of the ‘decor’ you will need to do. Next up is placing your posters in mock ups! As you may notice on Etsy, most shops show their posters framed and hanging on walls. These are 99% of the time not real photos, but digital mock ups. This is where Photoshop comes in really handy, as you can automate this process through a plug in called Bulk Mock Up. If you don’t have photoshop, you can do this on Canva, you will just have to do it manually which can be rather time consuming.  Now, where can you get the actual Mock Ups? One platform we highly recommend for design in general is platforms like Envato Elements. These are design marketplaces where you have access to millions of design resources that you are fully licensed to use!  Titles, tags, and descriptions  Now for the slightly more nitty gritty part. You could have the world's most amazing looking poster, however, if you do not get the Etsy SEO right, no one is going to see it! We will take you through creating a new Etsy listing field by field so you can know how to best list your products.  The key to Etsy listing optimisation is to maximise. Literally cram in as many key words as you possibly can! Before you start this process, create a word map of anything you can think of relating to your listing. And come at this from the point of view of, if I was looking for a poster like mine, what would I search? Titles \-Here you are blessed with 140 characters to title your listing. Essentially, start off with a concise way of properly describing your poster. And then afterwards, add in as many key words as you can! Here is an example of the title of a well selling Skiing poster: Les Arcs Skiing Poster, Les Arcs Print, Les Alpes, France Ski Poster, Skiing Poster, Snowboarding Poster, Ski Resort Poster Holiday, French This is 139 characters out of 140 - you should try and maximise this as much as possible! As you can see, this crams in a lot of key words and search terms both related to Skiing as a whole, the poster category, and then the specifics of the poster itself (Les Arcs resort in France). Bear in mind that if you are listing a lot of listings that are of the same theme, you won’t have to spend time creating an entirely new title. For example if your next poster was of a ski resort in Italy, you can copy this one over and just swap out the specifics. For example change “France ski poster” to “Italy ski poster”, change “Les Arcs” to “The Dolomites”, etc.  Description \-Same logic applies for descriptions - try and cram in as many key words as you can! Here is an example for a Formula One poster: George Russell, Mercedes Formula One Poster  - item specific keywords Bright, modern and vibrant poster to liven up your home.  - Describes the style of the poster All posters are printed on high quality, museum grade 200gsm poster paper. Suitable for framing and frames. - Shows the quality of the print. Mentions frames whilst showing it comes unframed Experience the thrill of the racetrack with this stunning Formula One poster. Printed on high-quality paper, this racing car wall art print features a dynamic image of a Formula One car in action, perfect for adding a touch of speed and excitement to any motorsports room or man cave. Whether you're a die-hard fan or simply appreciate the adrenaline of high-speed racing, this poster is sure to impress. Available in a range of sizes, it makes a great addition to your home or office, or as a gift for a fellow Formula One enthusiast. Each poster is carefully packaged to ensure safe delivery, so you can enjoy your new piece of art as soon as possible. - A nice bit of text really highlighting a lot of key words such as gift, motorsports, racetrack etc.  You could go further with this too, by adding in extra things related to the poster such as ‘Perfect gift for a Mercedes F1 fan’ etc.  Tags Now, these are actually probably the most important part of your listing! You get 13 tags (20 character limit for each) and there are essentially search terms that will match your listing with what customers search for when shopping.  You really need to maximize these - whilst Title and Description play a part, these are the main things that will bring buyers to your listing. Once again, it is important to think about what customers are likely to be searching when looking for a poster similar to yours. Life hack alert! You can actually see what tags other sellers are using. All you need to do is go to a listing similar to yours that is selling well, scroll down and you can actually see them listed out at the bottom of the page! Here is an example of what this may look like: So, go through a few listings of competitors and make notes on common denominators that you can integrate into your listing. As you can see here, this seller uses tags such as ‘Birthday Gift’ and ‘Poster Print’. When you first start out, you may be better off swapping these out for more listing specific tags. This seller has been on Etsy for a few years however and has 15,000+ sales, so are more likely to see success from these tags.  If it’s not clear why, think about it this way. If you searched ‘poster print’ on Etsy today, there will be 10s of thousands of results. However, if you searched ‘Russell Mercedes Poster’, you will (as of writing) get 336 results. Etsy is far more likely to push your product to the top of the latter tag, against 300 other listings, rather than the top of ‘Poster Print’ where it is incredibly competitive. It is only when you are a more successful shop pulling in a high quantity of orders that these larger and more generic tags will work for you, as Etsy has more trust in your shop and will be more likely to push you to the front.  SKUs \-One important thing you need to do is add SKUs to all of your products! This is worth doing at the start as it will make your life so much easier when it comes to making sales and using PrintShrimp further down the line. What is an SKU? It is a ‘stock keeping unit’, and is essentially just a product identifier. Your SKUs need to match your file name that you upload to PrintShrimp. For example, if you made a poster about the eiffel tower, you can literally name the SKU eiffel-tower. There is no need to complicate things! As long as your file name (as in the image name of your poster on your computer) matches your SKU, you will be good to go.  \-It may be more beneficial to set up a system with unique identifiers, to make organising your files a lot easier further down the line. Say you get to 1000 posters eventually, you’ll want to be able to quickly search a code, and also ensure every SKU is always unique, so you won’t run into accidentally using the same SKU twice further down the line. For example, you can set it up so at the start of each file name, you have \[unique id\]\[info\], so your files will look like -  A1eiffeltower A2france And further down the line: A99aperolspritz B1potatoart This not only removes the potential issue of duplicating SKUs accidentally (for example if you made a few posters of the same subject), but also keeps your files well organised. If you need to find a file, you can search your files according to the code, so just by searching ‘a1’ for example, rather than having to trawl through a load of different files until you find the correct one. \-If your poster has variations, for example color variations, you can set a different SKU for each variation. Just click the little box when setting up variations that says ‘SKUs vary for each (variation)’. So if you have a poster available either in a white or black background, you can name each file, and therefore each SKU, a1eiffel-tower-black and a1eiffel-tower-white for example. \-The same goes for different sizes. As different American sizes have different aspect ratios, as mentioned above you may have to reformat some posters if you get a sale for one of these sizes. You can then add in the SKU to your listing once you have reformatted your poster. So for example if you sell a 16x20” version of the eiffel tower poster, you can name this file eiffel-tower-white-1620. Whilst this involves a little bit of set up, the time it saves you overall is massive!  Variations and Prices \-So, when selling posters there is a huge variety of sizes that you can offer, as mentioned previously. Non-negotiable is that you should be offering A5-A1. These will likely be your main sellers! Especially in the UK. It is also a good idea to offer inch sizing to appeal to a global audience (as bear in mind with PrintShrimp you will be able to print in multiple countries around the world!).  Below is a recommended pricing structure of what to charge on Etsy. Feel free to mess around with these! You may notice on Etsy that many shops charge a whole lot more for sizes such as A1, 24x36” etc. In my experience I prefer charging a lower rate to attract more sales, but there is validity in going for a lower amount of sales with higher profits. As mentioned above, you can also offer different variations on items - for example different colour schemes on posters. This is always a decent idea (if it suits the design) as it provides the customer with more options, which might help to convert the sale. You can always add this in later however if you want to keep it simple while you start! Setting up shipping profiles Etsy makes it very easy to set up different shipping rates for different countries. However, luckily with PrintShrimp you can offer free shipping to the majority of the major countries that are active on Etsy!  Using PrintShrimp means that your production costs are low enough in each domestic market to justify this. If you look on Etsy you can see there are many shops that post internationally to countries such as the US or Australia. Therefore, they often charge £8-10 in postage, and have a delivery time of 1-2 weeks. This really limits their customer base to their domestic market.  Using PrintShrimp avoids this and means you can offer free shipping (as we absorb the shipping cost in our prices) to the major markets of the UK, Australia, and USA (Europe coming soon!).  We also offer a 1 day processing time, unlike many POD poster suppliers. This means you can set your Etsy processing time to just one day, which combined with our quick shipping, means you will be one of the quickest on Etsy at sending out orders. This is obviously very attractive for customers, who are often very impatient with wanting their orders!  Getting the sales and extra tips \-Don’t list an insane amount of listings when you first get started. Etsy will be like ‘hang on a second’ if a brand new shop suddenly has 200 items in the first week. Warm up your account, and take things slow as you get going. We recommend 5 a day for the first week or so, and then you can start uploading more. You don’t want Etsy to flag your account for suspicious bot-like activity when you first get going.  \-It is very easy to copy listings when creating a new one. Simply select an old listing and press copy, and then you can just change the listing specific details to create a new one, rather than having to start from scratch. It can feel like a bit of a ball-ache setting up your first ever listing, but from then on you can just copy it over and just change the specifics.  \-Try and organize your listings into sections! This really helps the customer journey. Sometimes a customer will click onto your shop after seeing one of your listings, so it really helps if they can easily navigate your shop for what they are looking for. So, you now have a fully fledged Etsy shop. Well done! Time to start making £3,000 a month straight away right? Not quite. Please bear in mind, patience is key when starting out. If you started doing this because you are £10,000 in debt to the Albanian mafia and need to pay it off next week, you have come into this in the wrong frame of mind. If you have however started this to slowly build up a side hustle which hopefully one day become your full time gig, then winner winner chicken dinner.  Starting out on Etsy isn’t always easy. It takes time for your shop to build up trust! As I’ve said before, a buyer is far more likely to purchase from a shop with 1000s of reviews, than a brand new one with 0. But before you know it, you can become one of these shops! One thing you can do at the very start is to encourage your friends and family to buy your posters! This is a slightly naughty way of getting a few sales at the start, of course followed by a few glowing 5\* reviews. It really helps to give your shop this little boost at the start, so if this is something you can do then I recommend it.  Okay, so once you have a fully fledged shop with a decent amount of listings, you might be expecting the sales to start rolling in. And, if you are lucky, they indeed might. However, in my experience, you need to give your listings a little boost. So let us introduce you to: The wonderful world of Etsy ads Ads!! Oh no, that means money!! We imagine some of you more risk averse people are saying to yourself right now. And yes, it indeed does. But more often than not unfortunately you do have to spend money to make money.  Fortunately, in my experience anyway, Etsy ads do tend to work. This does however only apply if your products are actually good however, so if you’re back here after paying for ads for 2 months and are losing money at the same rate as your motivation, maybe go back to the start of this guide and pick another niche.  When you first start out, there are two main strategies.  Number 1: The Safer Option So, with PrintShrimp, you will essentially be making a minimum of £6 profit per order. With this in mind, I normally start a new shop with a safer strategy of advertising my products with a budget of $3-5 dollars a day. This then means that at the start, you only need to make 1 sale to break even, and anything above that is pure profit! This might not seem like the most dazzling proposition right now, but again please bear in mind that growth will be slow at the start. This means that you can gradually grow your shop, and therefore the trust that customers have in your shop, over time with a very small risk of ever actually losing money. Number 2: The Billy Big Balls Option If you were yawning while reading the first option, then this strategy may be for you. This will be better suited to those of you that are a bit more risk prone, and it also helps if you have a bit more cash to invest at the start. Through this strategy, you can essentially pay your way to the top of Etsy's rankings. For this, you’ll probably be looking at spending $20 a day on ads. So, this can really add up quickly and is definitely the riskier option. In my experience, the level of sales with this may not always match up to your spend every day. You may find that some days you rake in about 10 sales, and other days only one. But what this does mean is that as your listings get seen and purchased more, they will begin to rank higher in Etsy’s organic search rankings, at a much quicker rate than option one. This is the beauty of Etsy’s ads. You can pay to boost your products, but then results from this paid promotion feed into the organic ranking of your products. So you may find that you can splash the cash for a while at the start in order to race to the top, and then drop your ad spending later on when your products are already ranking well.  Sending your poster orders So, you’ve now done the hard bit. You have a running Etsy store, and essentially all you need to now on a daily basis is send out your orders and reply to customer messages! This is where it really becomes passive income.  \-Check out the PrintShrimp order portal. Simply sign up, and you can place individual orders through there. \-Bulk upload: We have an option to bulk upload your Esty orders via csv.  Seriously, when you are up and running with your first store, it is really as easy as that.  Once you have your first Etsy store up and running, you can think about expanding. There are many ways to expand your income. You can set up other Etsy stores, as long as the type of posters you are selling varies. You can look into setting up your own Shopify stores, and advertise them through Facebook, Instagram etc. Through this guide, we will teach you everything you need to know about starting to sell posters and generate some income. We will also show you why PrintShrimp is the best POD supplier for all of your poster needs. Trust me, you won’t need much convincing.

I run an AI automation agency (AAA). My honest overview and review of this new business model
reddit
LLM Vibe Score0
Human Vibe Score1
AI_Scout_OfficialThis week

I run an AI automation agency (AAA). My honest overview and review of this new business model

I started an AI tools directory in February, and then branched off that to start an AI automation agency (AAA) in June. So far I've come across a lot of unsustainable "ideas" to make money with AI, but at the same time a few diamonds in the rough that aren't fully tapped into yet- especially the AAA model. Thought I'd share this post to shine light into this new business model and share some ways you could potentially start your own agency, or at the very least know who you are dealing with and how to pick and choose when you (inevitably) get bombarded with cold emails from them down the line. Foreword Running an AAA does NOT involve using AI tools directly to generate and sell content directly. That ship has sailed, and unless you are happy with $5 from Fiverr every month or so, it is not a real business model. Cry me a river but generating generic art with AI and slapping it onto a T-shirt to sell on Etsy won't make you a dime. At the same time, the AAA model will NOT require you to have a deep theoretical knowledge of AI, or any academic degree, as we are more so dealing with the practical applications of generative AI and how we can implement these into different workflows and tech-stacks, rather than building AI models from the ground up. Regardless of all that, common sense and a willingness to learn will help (a shit ton), as with anything. Keep in mind - this WILL involve work and motivation as well. The mindset that AI somehow means everything can be done for you on autopilot is not the right way to approach things. The common theme of businesses I've seen who have successfully implemented AI into their operations is the willingess to work with AI in a way that augments their existing operations, rather than flat out replace a worker or team. And this is exactly the train of thought you need when working with AI as a business model. However, as the field is relatively unsaturated and hype surrounding AI is still fresh for enterprises, right now is the prime time to start something new if generative AI interests you at all. With that being said, I'll be going over three of the most successful AI-adjacent businesses I've seen over this past year, in addition to some tips and resources to point you in the right direction. so.. WTF is an AI Automation Agency? The AI automation agency (or as some YouTubers have coined it, the AAA model) at its core involves creating custom AI solutions for businesses. I have over 1500 AI tools listed in my directory, however the feedback I've received from some enterprise users is that ready-made SaaS tools are too generic to meet their specific needs. Combine this with the fact virtually no smaller companies have the time or skills required to develop custom solutions right off the bat, and you have yourself real demand. I would say in practice, the AAA model is quite similar to Wordpress and even web dev agencies, with the major difference being all solutions you develop will incorporate key aspects of AI AND automation. Which brings me to my second point- JUST AI IS NOT ENOUGH. Rather than reducing the amount of time required to complete certain tasks, I've seen many AI agencies make the mistake of recommending and (trying to) sell solutions that more likely than not increase the workload of their clients. For example, if you were to make an internal tool that has AI answer questions based on their knowledge base, but this knowledge base has to be updated manually, this is creating unnecessary work. As such I think one of the key components of building successful AI solutions is incorporating the new (Generative AI/LLMs) with the old (programmtic automation- think Zapier, APIs, etc.). Finally, for this business model to be successful, ideally you should target a niche in which you have already worked and understand pain points and needs. Not only does this make it much easier to get calls booked with prospects, the solutions you build will have much greater value to your clients (meaning you get paid more). A mistake I've seen many AAA operators make (and I blame this on the "Get Rich Quick" YouTubers) is focusing too much on a specific productized service, rather than really understanding the needs of businesses. The former is much done via a SaaS model, but when going the agency route the only thing that makes sense is building custom solutions. This is why I always take a consultant-first approach. You can only build once you understand what they actually need and how certain solutions may impact their operations, workflows, and bottom-line. Basics of How to Get Started Pick a niche. As I mentioned previously, preferably one that you've worked in before. Niches I know of that are actively being bombarded with cold emails include real estate, e-commerce, auto-dealerships, lawyers, and medical offices. There is a reason for this, but I will tell you straight up this business model works well if you target any white-collar service business (internal tools approach) or high volume businesses (customer facing tools approach). Setup your toolbox. If you wanted to start a pressure washing business, you would need a pressure-washer. This is no different. For those without programming knowledge, I've seen two common ways AAA get setup to build- one is having a network of on-call web developers, whether its personal contacts or simply going to Upwork or any talent sourcing agency. The second is having an arsenal of no-code tools. I'll get to this more in a second, but this works beecause at its core, when we are dealing with the practical applications of AI, the code is quite simple, simply put. Start cold sales. Unless you have a network already, this is not a step you can skip. You've already picked a niche, so all you have to do is find the right message. Keep cold emails short, sweet, but enticing- and it will help a lot if you did step 1 correctly and intimately understand who your audience is. I'll be touching base later about how you can leverage AI yourself to help you with outreach and closing. The beauty of gen AI and the AAA model You don't need to be a seasoned web developer to make this business model work. The large majority of solutions that SME clients want is best done using an API for an LLM for the actual AI aspect. The value we create with the solutions we build comes with the conceptual framework and design that not only does what they need it to but integrates smoothly with their existing tech-stack and workflow. The actual implementation is quite straightforward once you understand the high level design and know which tools you are going to use. To give you a sense, even if you plan to build out these apps yourself (say in Python) the large majority of the nitty gritty technical work has already been done for you, especially if you leverage Python libraries and packages that offer high level abstraction for LLM-related functions. For instance, calling GPT can be as little as a single line of code. (And there are no-code tools where these functions are simply an icon on a GUI). Aside from understanding the capabilities and limitations of these tools and frameworks, the only thing that matters is being able to put them in a way that makes sense for what you want to build. Which is why outsourcing and no-code tools both work in our case. Okay... but how TF am I suppposed to actually build out these solutions? Now the fun part. I highly recommend getting familiar with Langchain and LlamaIndex. Both are Python libraires that help a lot with the high-level LLM abstraction I mentioned previously. The two most important aspects include being able to integrate internal data sources/knowledge bases with LLMs, and have LLMs perform autonomous actions. The two most common methods respectively are RAG and output parsing. RAG (retrieval augmented Generation) If you've ever seen a tool that seemingly "trains" GPT on your own data, and wonder how it all works- well I have an answer from you. At a high level, the user query is first being fed to what's called a vector database to run vector search. Vector search basically lets you do semantic search where you are searching data based on meaning. The vector databases then retrieves the most relevant sections of text as it relates to the user query, and this text gets APPENDED to your GPT prompt to provide extra context to the AI. Further, with prompt engineering, you can limit GPT to only generate an answer if it can be found within this extra context, greatly limiting the chance of hallucination (this is where AI makes random shit up). Aside from vector databases, we can also implement RAG with other data sources and retrieval methods, for example SQL databses (via parsing the outputs of LLM's- more on this later). Autonomous Agents via Output Parsing A common need of clients has been having AI actually perform tasks, rather than simply spitting out text. For example, with autonomous agents, we can have an e-commerce chatbot do the work of a basic customer service rep (i.e. look into orders, refunds, shipping). At a high level, what's going on is that the response of the LLM is being used programmtically to determine which API to call. Keeping on with the e-commerce example, if I wanted a chatbot to check shipping status, I could have a LLM response within my app (not shown to the user) with a prompt that outputs a random hash or string, and programmatically I can determine which API call to make based on this hash/string. And using the same fundamental concept as with RAG, I can append the the API response to a final prompt that would spit out the answer for the user. How No Code Tools Can Fit In (With some example solutions you can build) With that being said, you don't necessarily need to do all of the above by coding yourself, with Python libraries or otherwise. However, I will say that having that high level overview will help IMMENSELY when it comes to using no-code tools to do the actual work for you. Regardless, here are a few common solutions you might build for clients as well as some no-code tools you can use to build them out. Ex. Solution 1: AI Chatbots for SMEs (Small and Medium Enterprises) This involves creating chatbots that handle user queries, lead gen, and so forth with AI, and will use the principles of RAG at heart. After getting the required data from your client (i.e. product catalogues, previous support tickets, FAQ, internal documentation), you upload this into your knowledge base and write a prompt that makes sense for your use case. One no-code tool that does this well is MyAskAI. The beauty of it especially for building external chatbots is the ability to quickly ingest entire websites into your knowledge base via a sitemap, and bulk uploading files. Essentially, they've covered the entire grunt work required to do this manually. Finally, you can create a inline or chat widget on your client's website with a few lines of HTML, or altneratively integrate it with a Slack/Teams chatbot (if you are going for an internal Q&A chatbot approach). Other tools you could use include Botpress and Voiceflow, however these are less for RAG and more for building out complete chatbot flows that may or may not incorporate LLMs. Both apps are essentially GUIs that eliminate the pain and tears and trying to implement complex flows manually, and both natively incoporate AI intents and a knowledge base feature. Ex. Solution 2: Internal Apps Similar to the first example, except we go beyond making just chatbots but tools such as report generation and really any sort of internal tool or automations that may incorporate LLM's. For instance, you can have a tool that automatically generates replies to inbound emails based on your client's knowledge base. Or an automation that does the same thing but for replies to Instagram comments. Another example could be a tool that generates a description and screeenshot based on a URL (useful for directory sites, made one for my own :P). Getting into more advanced implementations of LLMs, we can have tools that can generate entire drafts of reports (think 80+ pages), based not only on data from a knowledge base but also the writing style, format, and author voice of previous reports. One good tool to create content generation panels for your clients would be MindStudio. You can train LLM's via prompt engineering in a structured way with your own data to essentially fine tune them for whatever text you need it to generate. Furthermore, it has a GUI where you can dictate the entire AI flow. You can also upload data sources via multiple formats, including PDF, CSV, and Docx. For automations that require interactions between multiple apps, I recommend the OG zapier/make.com if you want a no-code solution. For instance, for the automatic email reply generator, I can have a trigger such that when an email is received, a custom AI reply is generated by MyAskAI, and finally a draft is created in my email client. Or, for an automation where I can create a social media posts on multiple platforms based on a RSS feed (news feed), I can implement this directly in Zapier with their native GPT action (see screenshot) As for more complex LLM flows that may require multiple layers of LLMs, data sources, and APIs working together to generate a single response i.e. a long form 100 page report, I would recommend tools such as Stack AI or Flowise (open-source alternative) to build these solutions out. Essentially, you get most of the functions and features of Python packages such as Langchain and LlamaIndex in a GUI. See screenshot for an example of a flow How the hell are you supposed to find clients? With all that being said, none of this matters if you can't find anyone to sell to. You will have to do cold sales, one way or the other, especially if you are brand new to the game. And what better way to sell your AI services than with AI itself? If we want to integrate AI into the cold outreach process, first we must identify what it's good at doing, and that's obviously writing a bunch of text, in a short amount of time. Similar to the solutions that an AAA can build for its clients, we can take advantage of the same principles in our own sales processes. How to do outreach Once you've identified your niche and their pain points/opportunities for automation, you want to craft a compelling message in which you can send via cold email and cold calls to get prospects booked on demos/consultations. I won't get into too much detail in terms of exactly how to write emails or calling scripts, as there are millions of resources to help with this, but I will tell you a few key points you want to keep in mind when doing outreach for your AAA. First, you want to keep in mind that many businesses are still hesitant about AI and may not understand what it really is or how it can benefit their operations. However, we can take advantage of how mass media has been reporting on AI this past year- at the very least people are AWARE that sooner or later they may have to implement AI into their businesses to stay competitive. We want to frame our message in a way that introduces generative AI as a technology that can have a direct, tangible, and positive impact on their business. Although it may be hard to quantify, I like to include estimates of man-hours saved or costs saved at least in my final proposals to prospects. Times are TOUGH right now, and money is expensive, so you need to have a compelling reason for businesses to get on board. Once you've gotten your messaging down, you will want to create a list of prospects to contact. Tools you can use to find prospects include Apollo.io, reply.io, zoominfo (expensive af), and Linkedin Sales Navigator. What specific job titles, etc. to target will depend on your niche but for smaller companies this will tend to be the owner. For white collar niches, i.e. law, the professional that will be directly benefiting from the tool (i.e. partners) may be better to contact. And for larger organizations you may want to target business improvement and digital transformation leads/directors- these are the people directly in charge of projects like what you may be proposing. Okay- so you have your message, and your list, and now all it comes down to is getting the good word out. I won't be going into the details of how to send these out, a quick Google search will give you hundreds of resources for cold outreach methods. However, personalization is key and beyond simple dynamic variables you want to make sure you can either personalize your email campaigns directly with AI (SmartWriter.ai is an example of a tool that can do this), or at the very least have the ability to import email messages programmatically. Alternatively, ask ChatGPT to make you a Python Script that can take in a list of emails, scrape info based on their linkedin URL or website, and all pass this onto a GPT prompt that specifies your messaging to generate an email. From there, send away. How tf do I close? Once you've got some prospects booked in on your meetings, you will need to close deals with them to turn them into clients. Call #1: Consultation Tying back to when I mentioned you want to take a consultant-first appraoch, you will want to listen closely to their goals and needs and understand their pain points. This would be the first call, and typically I would provide a high level overview of different solutions we could build to tacke these. It really helps to have a presentation available, so you can graphically demonstrate key points and key technologies. I like to use Plus AI for this, it's basically a Google Slides add-on that can generate slide decks for you. I copy and paste my default company messaging, add some key points for the presentation, and it comes out with pretty decent slides. Call #2: Demo The second call would involve a demo of one of these solutions, and typically I'll quickly prototype it with boilerplate code I already have, otherwise I'll cook something up in a no-code tool. If you have a niche where one type of solution is commonly demanded, it helps to have a general demo set up to be able to handle a larger volume of calls, so you aren't burning yourself out. I'll also elaborate on how the final product would look like in comparison to the demo. Call #3 and Beyond: Once the initial consultation and demo is complete, you will want to alleviate any remaining concerns from your prospects and work with them to reach a final work proposal. It's crucial you lay out exactly what you will be building (in writing) and ensure the prospect understands this. Furthermore, be clear and transparent with timelines and communication methods for the project. In terms of pricing, you want to take this from a value-based approach. The same solution may be worth a lot more to client A than client B. Furthermore, you can create "add-ons" such as monthly maintenance/upgrade packages, training sessions for employeees, and so forth, separate from the initial setup fee you would charge. How you can incorporate AI into marketing your businesses Beyond cold sales, I highly recommend creating a funnel to capture warm leads. For instance, I do this currently with my AI tools directory, which links directly to my AI agency and has consistent branding throughout. Warm leads are much more likely to close (and honestly, much nicer to deal with). However, even without an AI-related website, at the very least you will want to create a presence on social media and the web in general. As with any agency, you will want basic a professional presence. A professional virtual address helps, in addition to a Google Business Profile (GBP) and TrustPilot. a GBP (especially for local SEO) and Trustpilot page also helps improve the looks of your search results immensely. For GBP, I recommend using ProfilePro, which is a chrome extension you can use to automate SEO work for your GBP. Aside from SEO optimzied business descriptions based on your business, it can handle Q/A answers, responses, updates, and service descriptions based on local keywords. Privacy and Legal Concerns of the AAA Model Aside from typical concerns for agencies relating to service contracts, there are a few issues (especially when using no-code tools) that will need to be addressed to run a successful AAA. Most of these surround privacy concerns when working with proprietary data. In your terms with your client, you will want to clearly define hosting providers and any third party tools you will be using to build their solution, and a DPA with these third parties listed as subprocessors if necessary. In addition, you will want to implement best practices like redacting private information from data being used for building solutions. In terms of addressing concerns directly from clients, it helps if you host your solutions on their own servers (not possible with AI tools), and address the fact only ChatGPT queries in the web app, not OpenAI API calls, will be used to train OpenAI's models (as reported by mainstream media). The key here is to be open and transparent with your clients about ALL the tools you are using, where there data will be going, and make sure to get this all in writing. have fun, and keep an open mind Before I finish this post, I just want to reiterate the fact that this is NOT an easy way to make money. Running an AI agency will require hours and hours of dedication and work, and constantly rearranging your schedule to meet prospect and client needs. However, if you are looking for a new business to run, and have a knack for understanding business operations and are genuinely interested in the pracitcal applications of generative AI, then I say go for it. The time is ticking before AAA becomes the new dropshipping or SMMA, and I've a firm believer that those who set foot first and establish themselves in this field will come out top. And remember, while 100 thousand people may read this post, only 2 may actually take initiative and start.

I run an AI automation agency (AAA). My honest overview and review of this new business model
reddit
LLM Vibe Score0
Human Vibe Score1
AI_Scout_OfficialThis week

I run an AI automation agency (AAA). My honest overview and review of this new business model

I started an AI tools directory in February, and then branched off that to start an AI automation agency (AAA) in June. So far I've come across a lot of unsustainable "ideas" to make money with AI, but at the same time a few diamonds in the rough that aren't fully tapped into yet- especially the AAA model. Thought I'd share this post to shine light into this new business model and share some ways you could potentially start your own agency, or at the very least know who you are dealing with and how to pick and choose when you (inevitably) get bombarded with cold emails from them down the line. Foreword Running an AAA does NOT involve using AI tools directly to generate and sell content directly. That ship has sailed, and unless you are happy with $5 from Fiverr every month or so, it is not a real business model. Cry me a river but generating generic art with AI and slapping it onto a T-shirt to sell on Etsy won't make you a dime. At the same time, the AAA model will NOT require you to have a deep theoretical knowledge of AI, or any academic degree, as we are more so dealing with the practical applications of generative AI and how we can implement these into different workflows and tech-stacks, rather than building AI models from the ground up. Regardless of all that, common sense and a willingness to learn will help (a shit ton), as with anything. Keep in mind - this WILL involve work and motivation as well. The mindset that AI somehow means everything can be done for you on autopilot is not the right way to approach things. The common theme of businesses I've seen who have successfully implemented AI into their operations is the willingess to work with AI in a way that augments their existing operations, rather than flat out replace a worker or team. And this is exactly the train of thought you need when working with AI as a business model. However, as the field is relatively unsaturated and hype surrounding AI is still fresh for enterprises, right now is the prime time to start something new if generative AI interests you at all. With that being said, I'll be going over three of the most successful AI-adjacent businesses I've seen over this past year, in addition to some tips and resources to point you in the right direction. so.. WTF is an AI Automation Agency? The AI automation agency (or as some YouTubers have coined it, the AAA model) at its core involves creating custom AI solutions for businesses. I have over 1500 AI tools listed in my directory, however the feedback I've received from some enterprise users is that ready-made SaaS tools are too generic to meet their specific needs. Combine this with the fact virtually no smaller companies have the time or skills required to develop custom solutions right off the bat, and you have yourself real demand. I would say in practice, the AAA model is quite similar to Wordpress and even web dev agencies, with the major difference being all solutions you develop will incorporate key aspects of AI AND automation. Which brings me to my second point- JUST AI IS NOT ENOUGH. Rather than reducing the amount of time required to complete certain tasks, I've seen many AI agencies make the mistake of recommending and (trying to) sell solutions that more likely than not increase the workload of their clients. For example, if you were to make an internal tool that has AI answer questions based on their knowledge base, but this knowledge base has to be updated manually, this is creating unnecessary work. As such I think one of the key components of building successful AI solutions is incorporating the new (Generative AI/LLMs) with the old (programmtic automation- think Zapier, APIs, etc.). Finally, for this business model to be successful, ideally you should target a niche in which you have already worked and understand pain points and needs. Not only does this make it much easier to get calls booked with prospects, the solutions you build will have much greater value to your clients (meaning you get paid more). A mistake I've seen many AAA operators make (and I blame this on the "Get Rich Quick" YouTubers) is focusing too much on a specific productized service, rather than really understanding the needs of businesses. The former is much done via a SaaS model, but when going the agency route the only thing that makes sense is building custom solutions. This is why I always take a consultant-first approach. You can only build once you understand what they actually need and how certain solutions may impact their operations, workflows, and bottom-line. Basics of How to Get Started Pick a niche. As I mentioned previously, preferably one that you've worked in before. Niches I know of that are actively being bombarded with cold emails include real estate, e-commerce, auto-dealerships, lawyers, and medical offices. There is a reason for this, but I will tell you straight up this business model works well if you target any white-collar service business (internal tools approach) or high volume businesses (customer facing tools approach). Setup your toolbox. If you wanted to start a pressure washing business, you would need a pressure-washer. This is no different. For those without programming knowledge, I've seen two common ways AAA get setup to build- one is having a network of on-call web developers, whether its personal contacts or simply going to Upwork or any talent sourcing agency. The second is having an arsenal of no-code tools. I'll get to this more in a second, but this works beecause at its core, when we are dealing with the practical applications of AI, the code is quite simple, simply put. Start cold sales. Unless you have a network already, this is not a step you can skip. You've already picked a niche, so all you have to do is find the right message. Keep cold emails short, sweet, but enticing- and it will help a lot if you did step 1 correctly and intimately understand who your audience is. I'll be touching base later about how you can leverage AI yourself to help you with outreach and closing. The beauty of gen AI and the AAA model You don't need to be a seasoned web developer to make this business model work. The large majority of solutions that SME clients want is best done using an API for an LLM for the actual AI aspect. The value we create with the solutions we build comes with the conceptual framework and design that not only does what they need it to but integrates smoothly with their existing tech-stack and workflow. The actual implementation is quite straightforward once you understand the high level design and know which tools you are going to use. To give you a sense, even if you plan to build out these apps yourself (say in Python) the large majority of the nitty gritty technical work has already been done for you, especially if you leverage Python libraries and packages that offer high level abstraction for LLM-related functions. For instance, calling GPT can be as little as a single line of code. (And there are no-code tools where these functions are simply an icon on a GUI). Aside from understanding the capabilities and limitations of these tools and frameworks, the only thing that matters is being able to put them in a way that makes sense for what you want to build. Which is why outsourcing and no-code tools both work in our case. Okay... but how TF am I suppposed to actually build out these solutions? Now the fun part. I highly recommend getting familiar with Langchain and LlamaIndex. Both are Python libraires that help a lot with the high-level LLM abstraction I mentioned previously. The two most important aspects include being able to integrate internal data sources/knowledge bases with LLMs, and have LLMs perform autonomous actions. The two most common methods respectively are RAG and output parsing. RAG (retrieval augmented Generation) If you've ever seen a tool that seemingly "trains" GPT on your own data, and wonder how it all works- well I have an answer from you. At a high level, the user query is first being fed to what's called a vector database to run vector search. Vector search basically lets you do semantic search where you are searching data based on meaning. The vector databases then retrieves the most relevant sections of text as it relates to the user query, and this text gets APPENDED to your GPT prompt to provide extra context to the AI. Further, with prompt engineering, you can limit GPT to only generate an answer if it can be found within this extra context, greatly limiting the chance of hallucination (this is where AI makes random shit up). Aside from vector databases, we can also implement RAG with other data sources and retrieval methods, for example SQL databses (via parsing the outputs of LLM's- more on this later). Autonomous Agents via Output Parsing A common need of clients has been having AI actually perform tasks, rather than simply spitting out text. For example, with autonomous agents, we can have an e-commerce chatbot do the work of a basic customer service rep (i.e. look into orders, refunds, shipping). At a high level, what's going on is that the response of the LLM is being used programmtically to determine which API to call. Keeping on with the e-commerce example, if I wanted a chatbot to check shipping status, I could have a LLM response within my app (not shown to the user) with a prompt that outputs a random hash or string, and programmatically I can determine which API call to make based on this hash/string. And using the same fundamental concept as with RAG, I can append the the API response to a final prompt that would spit out the answer for the user. How No Code Tools Can Fit In (With some example solutions you can build) With that being said, you don't necessarily need to do all of the above by coding yourself, with Python libraries or otherwise. However, I will say that having that high level overview will help IMMENSELY when it comes to using no-code tools to do the actual work for you. Regardless, here are a few common solutions you might build for clients as well as some no-code tools you can use to build them out. Ex. Solution 1: AI Chatbots for SMEs (Small and Medium Enterprises) This involves creating chatbots that handle user queries, lead gen, and so forth with AI, and will use the principles of RAG at heart. After getting the required data from your client (i.e. product catalogues, previous support tickets, FAQ, internal documentation), you upload this into your knowledge base and write a prompt that makes sense for your use case. One no-code tool that does this well is MyAskAI. The beauty of it especially for building external chatbots is the ability to quickly ingest entire websites into your knowledge base via a sitemap, and bulk uploading files. Essentially, they've covered the entire grunt work required to do this manually. Finally, you can create a inline or chat widget on your client's website with a few lines of HTML, or altneratively integrate it with a Slack/Teams chatbot (if you are going for an internal Q&A chatbot approach). Other tools you could use include Botpress and Voiceflow, however these are less for RAG and more for building out complete chatbot flows that may or may not incorporate LLMs. Both apps are essentially GUIs that eliminate the pain and tears and trying to implement complex flows manually, and both natively incoporate AI intents and a knowledge base feature. Ex. Solution 2: Internal Apps Similar to the first example, except we go beyond making just chatbots but tools such as report generation and really any sort of internal tool or automations that may incorporate LLM's. For instance, you can have a tool that automatically generates replies to inbound emails based on your client's knowledge base. Or an automation that does the same thing but for replies to Instagram comments. Another example could be a tool that generates a description and screeenshot based on a URL (useful for directory sites, made one for my own :P). Getting into more advanced implementations of LLMs, we can have tools that can generate entire drafts of reports (think 80+ pages), based not only on data from a knowledge base but also the writing style, format, and author voice of previous reports. One good tool to create content generation panels for your clients would be MindStudio. You can train LLM's via prompt engineering in a structured way with your own data to essentially fine tune them for whatever text you need it to generate. Furthermore, it has a GUI where you can dictate the entire AI flow. You can also upload data sources via multiple formats, including PDF, CSV, and Docx. For automations that require interactions between multiple apps, I recommend the OG zapier/make.com if you want a no-code solution. For instance, for the automatic email reply generator, I can have a trigger such that when an email is received, a custom AI reply is generated by MyAskAI, and finally a draft is created in my email client. Or, for an automation where I can create a social media posts on multiple platforms based on a RSS feed (news feed), I can implement this directly in Zapier with their native GPT action (see screenshot) As for more complex LLM flows that may require multiple layers of LLMs, data sources, and APIs working together to generate a single response i.e. a long form 100 page report, I would recommend tools such as Stack AI or Flowise (open-source alternative) to build these solutions out. Essentially, you get most of the functions and features of Python packages such as Langchain and LlamaIndex in a GUI. See screenshot for an example of a flow How the hell are you supposed to find clients? With all that being said, none of this matters if you can't find anyone to sell to. You will have to do cold sales, one way or the other, especially if you are brand new to the game. And what better way to sell your AI services than with AI itself? If we want to integrate AI into the cold outreach process, first we must identify what it's good at doing, and that's obviously writing a bunch of text, in a short amount of time. Similar to the solutions that an AAA can build for its clients, we can take advantage of the same principles in our own sales processes. How to do outreach Once you've identified your niche and their pain points/opportunities for automation, you want to craft a compelling message in which you can send via cold email and cold calls to get prospects booked on demos/consultations. I won't get into too much detail in terms of exactly how to write emails or calling scripts, as there are millions of resources to help with this, but I will tell you a few key points you want to keep in mind when doing outreach for your AAA. First, you want to keep in mind that many businesses are still hesitant about AI and may not understand what it really is or how it can benefit their operations. However, we can take advantage of how mass media has been reporting on AI this past year- at the very least people are AWARE that sooner or later they may have to implement AI into their businesses to stay competitive. We want to frame our message in a way that introduces generative AI as a technology that can have a direct, tangible, and positive impact on their business. Although it may be hard to quantify, I like to include estimates of man-hours saved or costs saved at least in my final proposals to prospects. Times are TOUGH right now, and money is expensive, so you need to have a compelling reason for businesses to get on board. Once you've gotten your messaging down, you will want to create a list of prospects to contact. Tools you can use to find prospects include Apollo.io, reply.io, zoominfo (expensive af), and Linkedin Sales Navigator. What specific job titles, etc. to target will depend on your niche but for smaller companies this will tend to be the owner. For white collar niches, i.e. law, the professional that will be directly benefiting from the tool (i.e. partners) may be better to contact. And for larger organizations you may want to target business improvement and digital transformation leads/directors- these are the people directly in charge of projects like what you may be proposing. Okay- so you have your message, and your list, and now all it comes down to is getting the good word out. I won't be going into the details of how to send these out, a quick Google search will give you hundreds of resources for cold outreach methods. However, personalization is key and beyond simple dynamic variables you want to make sure you can either personalize your email campaigns directly with AI (SmartWriter.ai is an example of a tool that can do this), or at the very least have the ability to import email messages programmatically. Alternatively, ask ChatGPT to make you a Python Script that can take in a list of emails, scrape info based on their linkedin URL or website, and all pass this onto a GPT prompt that specifies your messaging to generate an email. From there, send away. How tf do I close? Once you've got some prospects booked in on your meetings, you will need to close deals with them to turn them into clients. Call #1: Consultation Tying back to when I mentioned you want to take a consultant-first appraoch, you will want to listen closely to their goals and needs and understand their pain points. This would be the first call, and typically I would provide a high level overview of different solutions we could build to tacke these. It really helps to have a presentation available, so you can graphically demonstrate key points and key technologies. I like to use Plus AI for this, it's basically a Google Slides add-on that can generate slide decks for you. I copy and paste my default company messaging, add some key points for the presentation, and it comes out with pretty decent slides. Call #2: Demo The second call would involve a demo of one of these solutions, and typically I'll quickly prototype it with boilerplate code I already have, otherwise I'll cook something up in a no-code tool. If you have a niche where one type of solution is commonly demanded, it helps to have a general demo set up to be able to handle a larger volume of calls, so you aren't burning yourself out. I'll also elaborate on how the final product would look like in comparison to the demo. Call #3 and Beyond: Once the initial consultation and demo is complete, you will want to alleviate any remaining concerns from your prospects and work with them to reach a final work proposal. It's crucial you lay out exactly what you will be building (in writing) and ensure the prospect understands this. Furthermore, be clear and transparent with timelines and communication methods for the project. In terms of pricing, you want to take this from a value-based approach. The same solution may be worth a lot more to client A than client B. Furthermore, you can create "add-ons" such as monthly maintenance/upgrade packages, training sessions for employeees, and so forth, separate from the initial setup fee you would charge. How you can incorporate AI into marketing your businesses Beyond cold sales, I highly recommend creating a funnel to capture warm leads. For instance, I do this currently with my AI tools directory, which links directly to my AI agency and has consistent branding throughout. Warm leads are much more likely to close (and honestly, much nicer to deal with). However, even without an AI-related website, at the very least you will want to create a presence on social media and the web in general. As with any agency, you will want basic a professional presence. A professional virtual address helps, in addition to a Google Business Profile (GBP) and TrustPilot. a GBP (especially for local SEO) and Trustpilot page also helps improve the looks of your search results immensely. For GBP, I recommend using ProfilePro, which is a chrome extension you can use to automate SEO work for your GBP. Aside from SEO optimzied business descriptions based on your business, it can handle Q/A answers, responses, updates, and service descriptions based on local keywords. Privacy and Legal Concerns of the AAA Model Aside from typical concerns for agencies relating to service contracts, there are a few issues (especially when using no-code tools) that will need to be addressed to run a successful AAA. Most of these surround privacy concerns when working with proprietary data. In your terms with your client, you will want to clearly define hosting providers and any third party tools you will be using to build their solution, and a DPA with these third parties listed as subprocessors if necessary. In addition, you will want to implement best practices like redacting private information from data being used for building solutions. In terms of addressing concerns directly from clients, it helps if you host your solutions on their own servers (not possible with AI tools), and address the fact only ChatGPT queries in the web app, not OpenAI API calls, will be used to train OpenAI's models (as reported by mainstream media). The key here is to be open and transparent with your clients about ALL the tools you are using, where there data will be going, and make sure to get this all in writing. have fun, and keep an open mind Before I finish this post, I just want to reiterate the fact that this is NOT an easy way to make money. Running an AI agency will require hours and hours of dedication and work, and constantly rearranging your schedule to meet prospect and client needs. However, if you are looking for a new business to run, and have a knack for understanding business operations and are genuinely interested in the pracitcal applications of generative AI, then I say go for it. The time is ticking before AAA becomes the new dropshipping or SMMA, and I've a firm believer that those who set foot first and establish themselves in this field will come out top. And remember, while 100 thousand people may read this post, only 2 may actually take initiative and start.

The 15 Best (Free to Use) AI Tools for Creating Websites, Presentations, Graphics, UIs, Photos, and more
reddit
LLM Vibe Score0
Human Vibe Score1
Tapedulema919This week

The 15 Best (Free to Use) AI Tools for Creating Websites, Presentations, Graphics, UIs, Photos, and more

While we wait for ChatGPT to roll out its own official image input+output tool, I wanted to put together a list of the best AI design tools I've seen so far. Obviously text-based tasks like writing and coding get the bulk of the attention, but I wanted to see how it’s being used in design and more visual tasks. From UI and full-on website design, to graphics and photo generation, there are a ton of interesting and free tools coming out that are worth trying and using as inspiration for your own projects. These tools cover a bunch of different use cases and can hopefully help some of you, whether you’re a professional designer looking to automate parts of your work or just someone who wants to find ways to speed up the design work for your business/side projects. All of them are free to try, but most have some kind of paid plan or limit on the number of free generations. Fair enough given it costs money to run the models, but I've tried to include notes on any that don't have permanent free plans. Let me know if you know of any tools I’ve missed so I can add them to the list! I’ve grouped them by categories, to make it easier to see what each tool is capable of, then given a bit more detail under each specific tool. AI Website, Graphic and UI Generators: Framer: Describe the website you want, and Framer will create it for you. Edit and instantly publish your site from their platform. Ironically my favorite thing about Framer isn’t its AI tool. Its real advantage is its website editor which is the best I’ve seen on any platform (and usable for free). It’s like Figma if Figma let you publish directly to the web. Microsoft Designer: Generates designs based on user input for social media posts, logos, and business graphics. It’s free to use with a Microsoft account, and fairly impressive if not always consistent. If you pay a lot or spend a ton of time on design/social media content, Designer is definitely worth checking out. UIzard: Transforms text and images into design mockups, wireframes, and full user interfaces. It’s an ambitious concept, but very cool. While Framer was better for generating websites from text prompts, UIZard offers something none of the others did: taking a sketch drawing and turning it into a UI and/or wireframing. Visualizations, Graphics and Illustrations: Taskade: AI powered productivity tool to visualize your notes, projects, and tasks. Taskade lets you easily generate mind maps and other visualizations of your work, and makes use of AI in a bunch of cool ways. For example, you can generate a mind map to help you brainstorm and then ask it to expand on a certain point or even research it for you with the internet. Bing Image Creator: Generate images from natural text descriptions, powered by DALL-E. Whether you’re looking for blog illustrations, images for your site’s pages or any other purpose, it’s worth trying. AutoDraw: Autodraw is a Google Project that lets you draw something freehand with your cursor, and AutoDraw uses AI to transform it into a refined image with icons and predrawn designs, all for free in your browser. AI Presentations and Slides: Plus AI for Google Slides: AI generated slides and full-on presentations, all within Google Slides. I liked how Plus AI worked within Google Slides and made it easy to make changes to the presentation (as lets be real, no AI tool is going to generate exactly* the content and formatting you need for a serious presentation). SlidesGo: Generate slides with illustrations, images, and icons chosen by AI. SlidesGo also has their own editor to let you edit and refine the AI generated presentation. Tome: Tell Tome what you want to say to your audience, and it will create a presentation that effectively communicates it clearly and effectively. Tome actually goes beyond just presentations and has a few cool formats worth checking out that I could see being useful for salespeople and anyone who needs to pitch an idea or product at work or to clients. Product Photography: These are all fairly similar so I’ve kept the descriptions short, but it’s genuinely a pretty useful category if you run any kind of business or side hustle that needs product photos. These photos establish the professionalism of your store/brand, and all the ones I tried had genuinely impressive results that seemed much better than what I could do myself. Pebblely: AI image generator for product images in various styles and settings. 40 free images, paid after that. Booth.ai: Generates professional-quality product photos using AI, focused on furniture, fashion, and packaged goods. Stylized.ai: Generates product photos integrated into ecommerce platforms like Shopify. Miscellaneous Tools: Fronty: Converts uploaded images or drawings into HTML and CSS code using AI. It’s a bit clunky, but a cool concept nonetheless. LetsEnhance: Uses AI to enhance the resolution of images and photographs. Generally works pretty well from my experience, and gives you 10 free credits with signup. Unfortunately beyond that it is a paid product. Remove.bg: Specializes in recognizing and removing image backgrounds effectively. Doesn’t promise much, but it does the job and doesn’t require you to sign up. TL;DR/Overall favorites: These are the ones I've found the most use for in my day-to-day work. Framer: responsive website design with a full-featured editor to edit and publish your site all in one place. Free + paid plans. Taskade: visualize and automate your workflows, projects, mind maps, and more with AI powered templates. Free + paid plans. Microsoft Designer: generate social media and other marketing graphics with AI. Free to use. Plus AI: plugin for Google Slides to generate slide content, designs, and make tweaks with AI. Free + paid plans. Pebblely: professional-quality product photos in various settings and backgrounds, free to generate up to 40 images* (through you can always sign up for another account…)

I run an AI automation agency (AAA). My honest overview and review of this new business model
reddit
LLM Vibe Score0
Human Vibe Score1
AI_Scout_OfficialThis week

I run an AI automation agency (AAA). My honest overview and review of this new business model

I started an AI tools directory in February, and then branched off that to start an AI automation agency (AAA) in June. So far I've come across a lot of unsustainable "ideas" to make money with AI, but at the same time a few diamonds in the rough that aren't fully tapped into yet- especially the AAA model. Thought I'd share this post to shine light into this new business model and share some ways you could potentially start your own agency, or at the very least know who you are dealing with and how to pick and choose when you (inevitably) get bombarded with cold emails from them down the line. Foreword Running an AAA does NOT involve using AI tools directly to generate and sell content directly. That ship has sailed, and unless you are happy with $5 from Fiverr every month or so, it is not a real business model. Cry me a river but generating generic art with AI and slapping it onto a T-shirt to sell on Etsy won't make you a dime. At the same time, the AAA model will NOT require you to have a deep theoretical knowledge of AI, or any academic degree, as we are more so dealing with the practical applications of generative AI and how we can implement these into different workflows and tech-stacks, rather than building AI models from the ground up. Regardless of all that, common sense and a willingness to learn will help (a shit ton), as with anything. Keep in mind - this WILL involve work and motivation as well. The mindset that AI somehow means everything can be done for you on autopilot is not the right way to approach things. The common theme of businesses I've seen who have successfully implemented AI into their operations is the willingess to work with AI in a way that augments their existing operations, rather than flat out replace a worker or team. And this is exactly the train of thought you need when working with AI as a business model. However, as the field is relatively unsaturated and hype surrounding AI is still fresh for enterprises, right now is the prime time to start something new if generative AI interests you at all. With that being said, I'll be going over three of the most successful AI-adjacent businesses I've seen over this past year, in addition to some tips and resources to point you in the right direction. so.. WTF is an AI Automation Agency? The AI automation agency (or as some YouTubers have coined it, the AAA model) at its core involves creating custom AI solutions for businesses. I have over 1500 AI tools listed in my directory, however the feedback I've received from some enterprise users is that ready-made SaaS tools are too generic to meet their specific needs. Combine this with the fact virtually no smaller companies have the time or skills required to develop custom solutions right off the bat, and you have yourself real demand. I would say in practice, the AAA model is quite similar to Wordpress and even web dev agencies, with the major difference being all solutions you develop will incorporate key aspects of AI AND automation. Which brings me to my second point- JUST AI IS NOT ENOUGH. Rather than reducing the amount of time required to complete certain tasks, I've seen many AI agencies make the mistake of recommending and (trying to) sell solutions that more likely than not increase the workload of their clients. For example, if you were to make an internal tool that has AI answer questions based on their knowledge base, but this knowledge base has to be updated manually, this is creating unnecessary work. As such I think one of the key components of building successful AI solutions is incorporating the new (Generative AI/LLMs) with the old (programmtic automation- think Zapier, APIs, etc.). Finally, for this business model to be successful, ideally you should target a niche in which you have already worked and understand pain points and needs. Not only does this make it much easier to get calls booked with prospects, the solutions you build will have much greater value to your clients (meaning you get paid more). A mistake I've seen many AAA operators make (and I blame this on the "Get Rich Quick" YouTubers) is focusing too much on a specific productized service, rather than really understanding the needs of businesses. The former is much done via a SaaS model, but when going the agency route the only thing that makes sense is building custom solutions. This is why I always take a consultant-first approach. You can only build once you understand what they actually need and how certain solutions may impact their operations, workflows, and bottom-line. Basics of How to Get Started Pick a niche. As I mentioned previously, preferably one that you've worked in before. Niches I know of that are actively being bombarded with cold emails include real estate, e-commerce, auto-dealerships, lawyers, and medical offices. There is a reason for this, but I will tell you straight up this business model works well if you target any white-collar service business (internal tools approach) or high volume businesses (customer facing tools approach). Setup your toolbox. If you wanted to start a pressure washing business, you would need a pressure-washer. This is no different. For those without programming knowledge, I've seen two common ways AAA get setup to build- one is having a network of on-call web developers, whether its personal contacts or simply going to Upwork or any talent sourcing agency. The second is having an arsenal of no-code tools. I'll get to this more in a second, but this works beecause at its core, when we are dealing with the practical applications of AI, the code is quite simple, simply put. Start cold sales. Unless you have a network already, this is not a step you can skip. You've already picked a niche, so all you have to do is find the right message. Keep cold emails short, sweet, but enticing- and it will help a lot if you did step 1 correctly and intimately understand who your audience is. I'll be touching base later about how you can leverage AI yourself to help you with outreach and closing. The beauty of gen AI and the AAA model You don't need to be a seasoned web developer to make this business model work. The large majority of solutions that SME clients want is best done using an API for an LLM for the actual AI aspect. The value we create with the solutions we build comes with the conceptual framework and design that not only does what they need it to but integrates smoothly with their existing tech-stack and workflow. The actual implementation is quite straightforward once you understand the high level design and know which tools you are going to use. To give you a sense, even if you plan to build out these apps yourself (say in Python) the large majority of the nitty gritty technical work has already been done for you, especially if you leverage Python libraries and packages that offer high level abstraction for LLM-related functions. For instance, calling GPT can be as little as a single line of code. (And there are no-code tools where these functions are simply an icon on a GUI). Aside from understanding the capabilities and limitations of these tools and frameworks, the only thing that matters is being able to put them in a way that makes sense for what you want to build. Which is why outsourcing and no-code tools both work in our case. Okay... but how TF am I suppposed to actually build out these solutions? Now the fun part. I highly recommend getting familiar with Langchain and LlamaIndex. Both are Python libraires that help a lot with the high-level LLM abstraction I mentioned previously. The two most important aspects include being able to integrate internal data sources/knowledge bases with LLMs, and have LLMs perform autonomous actions. The two most common methods respectively are RAG and output parsing. RAG (retrieval augmented Generation) If you've ever seen a tool that seemingly "trains" GPT on your own data, and wonder how it all works- well I have an answer from you. At a high level, the user query is first being fed to what's called a vector database to run vector search. Vector search basically lets you do semantic search where you are searching data based on meaning. The vector databases then retrieves the most relevant sections of text as it relates to the user query, and this text gets APPENDED to your GPT prompt to provide extra context to the AI. Further, with prompt engineering, you can limit GPT to only generate an answer if it can be found within this extra context, greatly limiting the chance of hallucination (this is where AI makes random shit up). Aside from vector databases, we can also implement RAG with other data sources and retrieval methods, for example SQL databses (via parsing the outputs of LLM's- more on this later). Autonomous Agents via Output Parsing A common need of clients has been having AI actually perform tasks, rather than simply spitting out text. For example, with autonomous agents, we can have an e-commerce chatbot do the work of a basic customer service rep (i.e. look into orders, refunds, shipping). At a high level, what's going on is that the response of the LLM is being used programmtically to determine which API to call. Keeping on with the e-commerce example, if I wanted a chatbot to check shipping status, I could have a LLM response within my app (not shown to the user) with a prompt that outputs a random hash or string, and programmatically I can determine which API call to make based on this hash/string. And using the same fundamental concept as with RAG, I can append the the API response to a final prompt that would spit out the answer for the user. How No Code Tools Can Fit In (With some example solutions you can build) With that being said, you don't necessarily need to do all of the above by coding yourself, with Python libraries or otherwise. However, I will say that having that high level overview will help IMMENSELY when it comes to using no-code tools to do the actual work for you. Regardless, here are a few common solutions you might build for clients as well as some no-code tools you can use to build them out. Ex. Solution 1: AI Chatbots for SMEs (Small and Medium Enterprises) This involves creating chatbots that handle user queries, lead gen, and so forth with AI, and will use the principles of RAG at heart. After getting the required data from your client (i.e. product catalogues, previous support tickets, FAQ, internal documentation), you upload this into your knowledge base and write a prompt that makes sense for your use case. One no-code tool that does this well is MyAskAI. The beauty of it especially for building external chatbots is the ability to quickly ingest entire websites into your knowledge base via a sitemap, and bulk uploading files. Essentially, they've covered the entire grunt work required to do this manually. Finally, you can create a inline or chat widget on your client's website with a few lines of HTML, or altneratively integrate it with a Slack/Teams chatbot (if you are going for an internal Q&A chatbot approach). Other tools you could use include Botpress and Voiceflow, however these are less for RAG and more for building out complete chatbot flows that may or may not incorporate LLMs. Both apps are essentially GUIs that eliminate the pain and tears and trying to implement complex flows manually, and both natively incoporate AI intents and a knowledge base feature. Ex. Solution 2: Internal Apps Similar to the first example, except we go beyond making just chatbots but tools such as report generation and really any sort of internal tool or automations that may incorporate LLM's. For instance, you can have a tool that automatically generates replies to inbound emails based on your client's knowledge base. Or an automation that does the same thing but for replies to Instagram comments. Another example could be a tool that generates a description and screeenshot based on a URL (useful for directory sites, made one for my own :P). Getting into more advanced implementations of LLMs, we can have tools that can generate entire drafts of reports (think 80+ pages), based not only on data from a knowledge base but also the writing style, format, and author voice of previous reports. One good tool to create content generation panels for your clients would be MindStudio. You can train LLM's via prompt engineering in a structured way with your own data to essentially fine tune them for whatever text you need it to generate. Furthermore, it has a GUI where you can dictate the entire AI flow. You can also upload data sources via multiple formats, including PDF, CSV, and Docx. For automations that require interactions between multiple apps, I recommend the OG zapier/make.com if you want a no-code solution. For instance, for the automatic email reply generator, I can have a trigger such that when an email is received, a custom AI reply is generated by MyAskAI, and finally a draft is created in my email client. Or, for an automation where I can create a social media posts on multiple platforms based on a RSS feed (news feed), I can implement this directly in Zapier with their native GPT action (see screenshot) As for more complex LLM flows that may require multiple layers of LLMs, data sources, and APIs working together to generate a single response i.e. a long form 100 page report, I would recommend tools such as Stack AI or Flowise (open-source alternative) to build these solutions out. Essentially, you get most of the functions and features of Python packages such as Langchain and LlamaIndex in a GUI. See screenshot for an example of a flow How the hell are you supposed to find clients? With all that being said, none of this matters if you can't find anyone to sell to. You will have to do cold sales, one way or the other, especially if you are brand new to the game. And what better way to sell your AI services than with AI itself? If we want to integrate AI into the cold outreach process, first we must identify what it's good at doing, and that's obviously writing a bunch of text, in a short amount of time. Similar to the solutions that an AAA can build for its clients, we can take advantage of the same principles in our own sales processes. How to do outreach Once you've identified your niche and their pain points/opportunities for automation, you want to craft a compelling message in which you can send via cold email and cold calls to get prospects booked on demos/consultations. I won't get into too much detail in terms of exactly how to write emails or calling scripts, as there are millions of resources to help with this, but I will tell you a few key points you want to keep in mind when doing outreach for your AAA. First, you want to keep in mind that many businesses are still hesitant about AI and may not understand what it really is or how it can benefit their operations. However, we can take advantage of how mass media has been reporting on AI this past year- at the very least people are AWARE that sooner or later they may have to implement AI into their businesses to stay competitive. We want to frame our message in a way that introduces generative AI as a technology that can have a direct, tangible, and positive impact on their business. Although it may be hard to quantify, I like to include estimates of man-hours saved or costs saved at least in my final proposals to prospects. Times are TOUGH right now, and money is expensive, so you need to have a compelling reason for businesses to get on board. Once you've gotten your messaging down, you will want to create a list of prospects to contact. Tools you can use to find prospects include Apollo.io, reply.io, zoominfo (expensive af), and Linkedin Sales Navigator. What specific job titles, etc. to target will depend on your niche but for smaller companies this will tend to be the owner. For white collar niches, i.e. law, the professional that will be directly benefiting from the tool (i.e. partners) may be better to contact. And for larger organizations you may want to target business improvement and digital transformation leads/directors- these are the people directly in charge of projects like what you may be proposing. Okay- so you have your message, and your list, and now all it comes down to is getting the good word out. I won't be going into the details of how to send these out, a quick Google search will give you hundreds of resources for cold outreach methods. However, personalization is key and beyond simple dynamic variables you want to make sure you can either personalize your email campaigns directly with AI (SmartWriter.ai is an example of a tool that can do this), or at the very least have the ability to import email messages programmatically. Alternatively, ask ChatGPT to make you a Python Script that can take in a list of emails, scrape info based on their linkedin URL or website, and all pass this onto a GPT prompt that specifies your messaging to generate an email. From there, send away. How tf do I close? Once you've got some prospects booked in on your meetings, you will need to close deals with them to turn them into clients. Call #1: Consultation Tying back to when I mentioned you want to take a consultant-first appraoch, you will want to listen closely to their goals and needs and understand their pain points. This would be the first call, and typically I would provide a high level overview of different solutions we could build to tacke these. It really helps to have a presentation available, so you can graphically demonstrate key points and key technologies. I like to use Plus AI for this, it's basically a Google Slides add-on that can generate slide decks for you. I copy and paste my default company messaging, add some key points for the presentation, and it comes out with pretty decent slides. Call #2: Demo The second call would involve a demo of one of these solutions, and typically I'll quickly prototype it with boilerplate code I already have, otherwise I'll cook something up in a no-code tool. If you have a niche where one type of solution is commonly demanded, it helps to have a general demo set up to be able to handle a larger volume of calls, so you aren't burning yourself out. I'll also elaborate on how the final product would look like in comparison to the demo. Call #3 and Beyond: Once the initial consultation and demo is complete, you will want to alleviate any remaining concerns from your prospects and work with them to reach a final work proposal. It's crucial you lay out exactly what you will be building (in writing) and ensure the prospect understands this. Furthermore, be clear and transparent with timelines and communication methods for the project. In terms of pricing, you want to take this from a value-based approach. The same solution may be worth a lot more to client A than client B. Furthermore, you can create "add-ons" such as monthly maintenance/upgrade packages, training sessions for employeees, and so forth, separate from the initial setup fee you would charge. How you can incorporate AI into marketing your businesses Beyond cold sales, I highly recommend creating a funnel to capture warm leads. For instance, I do this currently with my AI tools directory, which links directly to my AI agency and has consistent branding throughout. Warm leads are much more likely to close (and honestly, much nicer to deal with). However, even without an AI-related website, at the very least you will want to create a presence on social media and the web in general. As with any agency, you will want basic a professional presence. A professional virtual address helps, in addition to a Google Business Profile (GBP) and TrustPilot. a GBP (especially for local SEO) and Trustpilot page also helps improve the looks of your search results immensely. For GBP, I recommend using ProfilePro, which is a chrome extension you can use to automate SEO work for your GBP. Aside from SEO optimzied business descriptions based on your business, it can handle Q/A answers, responses, updates, and service descriptions based on local keywords. Privacy and Legal Concerns of the AAA Model Aside from typical concerns for agencies relating to service contracts, there are a few issues (especially when using no-code tools) that will need to be addressed to run a successful AAA. Most of these surround privacy concerns when working with proprietary data. In your terms with your client, you will want to clearly define hosting providers and any third party tools you will be using to build their solution, and a DPA with these third parties listed as subprocessors if necessary. In addition, you will want to implement best practices like redacting private information from data being used for building solutions. In terms of addressing concerns directly from clients, it helps if you host your solutions on their own servers (not possible with AI tools), and address the fact only ChatGPT queries in the web app, not OpenAI API calls, will be used to train OpenAI's models (as reported by mainstream media). The key here is to be open and transparent with your clients about ALL the tools you are using, where there data will be going, and make sure to get this all in writing. have fun, and keep an open mind Before I finish this post, I just want to reiterate the fact that this is NOT an easy way to make money. Running an AI agency will require hours and hours of dedication and work, and constantly rearranging your schedule to meet prospect and client needs. However, if you are looking for a new business to run, and have a knack for understanding business operations and are genuinely interested in the pracitcal applications of generative AI, then I say go for it. The time is ticking before AAA becomes the new dropshipping or SMMA, and I've a firm believer that those who set foot first and establish themselves in this field will come out top. And remember, while 100 thousand people may read this post, only 2 may actually take initiative and start.

Is the idea of simplifying long 10,000+ word research articles into under 100 words of key findings with a case study a good approach?
reddit
LLM Vibe Score0
Human Vibe Score1
PresentationHot3332This week

Is the idea of simplifying long 10,000+ word research articles into under 100 words of key findings with a case study a good approach?

During a visit to a top Indian university few year back, I noticed students creating extensive research papers that ended up in dusty, cobwebbed cupboards. Surprisingly, only 1% of this research was ever implemented. Most students moved on to higher education or high-paying jobs, leaving their work behind. Only a few received grants to continue their research. This experience highlighted how much valuable knowledge was being wasted, hidden away and unused. (To give you a context, there are many products in the world have already comes from research based finding - few examples are - VR headset, Zipper packages and etc) Problem: There are over 200 million research articles online, but many valuable ideas and solutions are overlooked. Finding, uploading, and summarizing these articles is difficult and time-consuming.(Even using AI - we need some kind of human intervention to simplifying in terms of data visualization) Solution: Create a simple platform, like a Twitter page, to share key findings from long research articles. Use AI tools to help summarize the articles, while humans curate and verify the information. This would make it easier for people to find existing solutions to problems without having to read through long papers. Users can still explore the full articles if they want more details. Opportunity - This can be great for people, teams or business that want to work on problem which is yet to executed or referenced in real world.

AI ChatBo Business System Digital - Software Bring Yours SALES UP + COSTS DOWN With Digital Systems
reddit
LLM Vibe Score0
Human Vibe Score0
Individual_Brain_513This week

AI ChatBo Business System Digital - Software Bring Yours SALES UP + COSTS DOWN With Digital Systems

Recommend the AI ​​ChatBo Business System from especially for coaches & consultants, e-commerce and retail and build a passive income in the mega-trend of AI & WhatsApp marketing. Your advantages: Lifetime 10 percent recurring commissions for the software licenses. One-off 10% for the service. ​No more losses due to changing browsers and devices thanks to the unique multi-device tracking using hash key technology from our partner Klick-Tip (commissions are 46 percent higher on average). One of the largest companies in the German-speaking region for digital payment processing. Software made and hosted in Germany. Click here to get it now: https://bit.ly/3TXNKm9 Start with a little and let it grow ChatboOne is THE all-in-one solution for marketing and sales and is available in three versions... Base \- reduces your manual effort, improves the overview of your sales campaigns and increases the conversion of your website. Expert \- Automates communication with customers and interested parties, offers campaigns via email and WhatsApp and makes planning your customer appointments easier. Professional \- The complete package including websites and landing pages, member area and affiliate marketing tool. Brilliant for you: no matter where you are with your business, start at the optimal level and let the system grow with you until you reach the professional level. ​ Click here to get it now: https://bit.ly/3TXNKm9 ​

TiCs -where innovation meets intelligence
reddit
LLM Vibe Score0
Human Vibe Score1
MohammadBaisThis week

TiCs -where innovation meets intelligence

Be Part of India’s AI Revolution – Join the TiCs Movement! We are TiCs (Tuba International Cooperative Society)—India’s first global AI powerhouse. We’re not just building a company; we’re launching a movement that will redefine AI-driven healthcare, fitness, and well-being. Through our brands WellNest (AI-powered health ecosystem) and Zenova (next-gen smart wearables), we are pioneering a future where technology truly understands and enhances human health. Why Are We Calling You? We’re assembling a community of passionate minds—AI enthusiasts, developers, designers, innovators, and problem-solvers—who want to be part of something bigger. This is NOT an internship. This is NOT a job. This is a mission to build the future of health-tech. What’s in It for You? ✅ Work on groundbreaking AI & LLM projects that solve real-world healthcare problems ✅ Hands-on experience in AI, ML, IoT, and smart wearables ✅ Mentorship & learning opportunities from top AI leaders ✅ Exclusive perks like health, wellness, and gym packages ✅ Recognition & growth opportunities—top contributors will be given leadership roles as we scale ✅ Certificates & endorsements to showcase your contributions ✅ Opportunity to be part of a global AI-led revolution in healthcare & fitness ✅ Network with like-minded innovators, entrepreneurs, and industry pioneers ✅ Early access to WellNest & Zenova products and AI-driven health plans ✅ Possibility of paid roles & equity-based opportunities for the most dedicated members Who Should Join? Students & fresh graduates eager to apply their skills AI & tech enthusiasts passionate about real-world innovation Developers, designers, and creators who want to build something impactful Anyone who believes in the power of AI for good and wants to contribute This is More Than Just a Tech Project We’re building an AI-powered health revolution. If you want to be part of something that changes lives, breaks barriers, and creates real impact, this is your chance. Movements aren’t built by employees—they are led by believers. If you believe in the power of AI to transform health, join us and let’s build the future together!

SUPIR
github
LLM Vibe Score0.599
Human Vibe Score0.8316614420062696
Fanghua-YuMar 28, 2025

SUPIR

(CVPR2024) Scaling Up to Excellence: Practicing Model Scaling for Photo-Realistic Image Restoration In the Wild [Paper]   [Project Page]   [[Online App]](https://supir.suppixel.ai/home) Fanghua, Yu, Jinjin Gu, Zheyuan Li, Jinfan Hu, Xiangtao Kong, Xintao Wang, Jingwen He, Yu Qiao, Chao Dong Shenzhen Institute of Advanced Technology; Shanghai AI Laboratory; University of Sydney; The Hong Kong Polytechnic University; ARC Lab, Tencent PCG; The Chinese University of Hong Kong 🚀 We're thrilled to announce the official launch of SupPixel AI! Experience the next level of image processing and upscaling with our cutting-edge AI technology based on SUPIR. Explore now at suppixel.ai. 🔧 Dependencies and Installation Clone repo Install dependent packages Download Checkpoints For users who can connect to huggingface, please setting LLAVACLIPPATH, SDXLCLIP1PATH, SDXLCLIP2CKPTPTH in CKPTPTH.py as None. These CLIPs will be downloaded automatically. Dependent Models SDXL CLIP Encoder-1 SDXL CLIP Encoder-2 SDXL base 1.00.9vae LLaVA CLIP LLaVA v1.5 13B (optional) Juggernaut-XLv9RunDiffusionPhotov2 Replacement of SDXL base 1.0_0.9vae for Photo Realistic (optional) JuggernautRunDiffusionPhoto2Lightning4Steps Distilling model used in SUPIRv0Juggernautv9_lightning.yaml Models we provided: SUPIR-v0Q: Baidu Netdisk, Google Drive Default training settings with paper. High generalization and high image quality in most cases. SUPIR-v0F: Baidu Netdisk, Google Drive Training with light degradation settings. Stage1 encoder of SUPIR-v0F remains more details when facing light degradations. Edit Custom Path for Checkpoints ⚡ Quick Inference Val Dataset RealPhoto60: Baidu Netdisk, Google Drive Usage of SUPIR Python Script Gradio Demo Online App We've just launched SupPixel AI, an easy-to-use tool designed to help with high-quality image processing and upscaling. It builds on SUPIR. Whether you’re into photography, digital art, or just love playing around with image enhancement, we’d love for you to check it out.~ BibTeX @misc{yu2024scaling, title={Scaling Up to Excellence: Practicing Model Scaling for Photo-Realistic Image Restoration In the Wild}, author={Fanghua Yu and Jinjin Gu and Zheyuan Li and Jinfan Hu and Xiangtao Kong and Xintao Wang and Jingwen He and Yu Qiao and Chao Dong}, year={2024}, eprint={2401.13627}, archivePrefix={arXiv}, primaryClass={cs.CV} } 📧 Contact If you have any question, please email fanghuayu96@gmail.com or jinjin.gu@suppixel.ai. Non-Commercial Use Only Declaration The SUPIR ("Software") is made available for use, reproduction, and distribution strictly for non-commercial purposes. For the purposes of this declaration, "non-commercial" is defined as not primarily intended for or directed towards commercial advantage or monetary compensation. By using, reproducing, or distributing the Software, you agree to abide by this restriction and not to use the Software for any commercial purposes without obtaining prior written permission from Dr. Jinjin Gu. This declaration does not in any way limit the rights under any open source license that may apply to the Software; it solely adds a condition that the Software shall not be used for commercial purposes. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE. For inquiries or to obtain permission for commercial use, please contact Dr. Jinjin Gu (jinjin.gu@suppixel.ai).

activepieces
github
LLM Vibe Score0.66
Human Vibe Score1
activepiecesMar 28, 2025

activepieces

An open source replacement for Zapier Documentation 🌪️ Create a Piece 🖉 Deploy 🔥 Join Discord 🤯 Welcome to Activepieces Your friendliest open source all-in-one automation tool, designed to be extensible through a type-safe pieces framework written in Typescript. 🔥 Why Activepieces is Different: 💖 Loved by Everyone: Intuitive interface and great experience for both technical and non-technical users with a quick learning curve. 🌐 Open Ecosystem: All pieces are open source and available on npmjs.com, 60% of the pieces are contributed by the community. 🛠️ Pieces are written in Typescript: Pieces are npm packages in TypeScript, offering full customization with the best developer experience, including hot reloading for local piece development on your machine. 😎 🤖 AI-Ready: Native AI pieces let you experiment with various providers, or create your own agents using our AI SDK, and there is Copilot to help you build flows inside the builder. 🏢 Enterprise-Ready: Developers set up the tools, and anyone in the organization can use the no-code builder. Full customization from branding to control. 🔒 Secure by Design: Self-hosted and network-gapped for maximum security and control over your data. 🧠 Human in the Loop: Delay execution for a period of time or require approval. These are just pieces built on top of the piece framework, and you can build many pieces like that. 🎨 💻 Human Input Interfaces: Built-in support for human input triggers like "Chat Interface" 💬 and "Form Interface" 📝 🛠️ Builder Features: [x] Loops [x] Branches [x] Auto Retries [x] HTTP [x] Code with NPM [x] ASK AI in Code Piece (Non technical user can clean data without knowing to code) [x] Flows are fully versioned. [x] Languages Translations [x] Customizable Templates [X] 200+ Pieces, check https://www.activepieces.com/pieces We release updates frequently. Check the product changelog for the latest features. 🔌 Create Your Own Piece Activepieces supports integrations with Google Sheets, OpenAI, Discord, RSS, and over 200 other services. Check out the full list of supported integrations, which is constantly expanding thanks to our community's contributions. As an open ecosystem, all integration source code is accessible in our repository. These integrations are versioned and published directly to npmjs.com upon contribution. You can easily create your own integration using our TypeScript framework. For detailed instructions, please refer to our Contributor's Guide. License Activepieces' Community Edition is released as open source under the MIT license and enterprise features are released under Commercial License Read more about the feature comparison here https://www.activepieces.com/docs/about/editions 💭 Join Our Community 🌐 Contributions We welcome contributions big or small and in different directions. The best way to do this is to check this document and we are always up to talk on our Discord Server. 📚 Translations Not into coding but still interested in contributing? Come join our Discord and visit https://www.activepieces.com/docs/about/i18n for more information. !fr translation].data.translationProgress&url=https%3A%2F%2Fbadges.awesome-crowdin.com%2Fstats-16093902-626364-update.json) !it translation].data.translationProgress&url=https%3A%2F%2Fbadges.awesome-crowdin.com%2Fstats-16093902-626364-update.json) !de translation].data.translationProgress&url=https%3A%2F%2Fbadges.awesome-crowdin.com%2Fstats-16093902-626364-update.json) !ja translation].data.translationProgress&url=https%3A%2F%2Fbadges.awesome-crowdin.com%2Fstats-16093902-626364-update.json) !pt-BR translation].data.translationProgress&url=https%3A%2F%2Fbadges.awesome-crowdin.com%2Fstats-16093902-626364-update.json) 🦫 Contributors ShahedAlMashni🔌 AbdulTheActivePiecer🚧 Khaled Mashaly🚧 Mohammed Abu Aboud🚧 Abdulrahman Zeineddin🔌 ahmad jaber🔌 ashrafsamhouri🔌 Mohammad Abu Musa📆 Mukewa Wekalao🔌 Osama Abdallah Essa Haikal🔌 Arman🛡️ Oskar Krämer📖 Thibaut Patel🤔 🔌 Applesaucesomer🤔 crazyTweek🤔 Muhammad Tabaza🔌 Shay Punter📖 🔌 abaza738🔌 Jona Boeddinghaus🔌 fomojola💻 Alexander Storozhevsky💻 J0LGER🛡️ Patrick Veverka🐛 Berk Sümbül📖 Willian Guedes🔌 Abdullah Ranginwala💻 Dennis Tychsen🔌 MyWay🔌 Bibhuti Bhusan Panda🔌 Tarun Samanta🐛 Herman Kudria🔌 [NULL] Dev🔌 Jan Bebendorf🔌 Nilesh🔌 Vraj Gohil🔌 BastienMe🔌 Stephen Foskett📖 Nathan📖 Marcin Natanek🔌 Mark van Bellen🔌 Olivier Guzzi🔌 Osama Zakarneh🔌 phestvik🤔 Rajdeep Pal📖 Camilo Usuga🔌 Kishan Parmar📖 🔌 BBND🔌 Haseeb Rehman🔌 Rita Gorokhod🔌 Fábio Ferreira🔌 Florin Buffet📖 Drew Lewis🔌 Benjamin André-Micolon🔌 Denis Gurskij🔌 Nefer Lopez📖 fardeenpanjwani-codeglo📖 Landon Moir🔌 Diego Nijboer🔌 Tân Một Nắng🔌 Gavin Foley📖 Dennis Trautwein🐛 Andrew Rosenblatt🐛 rika🔌 Cyril Selasi🔌 Franck Nijimbere🔌 Aleksandr Denisov🔌 Reuben Swartz📖 joselupianez🔌 Awais Manzoor🐛 💻 Andrei🐛 derbbre📖 Maor Rozenfeld💻 Michael Huynh📖 Filip Dunđer💻 Don Thorp📖 Joe Workman🔌 Aykut Akgün💻 Yann Petitjean🔌 🐛 pfernandez98🔌 Daniel O.🔌 Meng-Yuan Huang📖 Leyla🐛 i-nithin🔌 la3rence🔌 Dennis Rongo🐛 🔌 Kartik Mehta📖 💻 Zakher Masri📖 💻 AbdullahBitar🔌 Mario Meyer🔌 Karim Khaleel🔌 CPonchet🐛 Olivier Sambourg🔌 Ahmad(Ed)🔌 leenmashni🔌 M Abdul Rauf📖 Vincent Barrier🔌 John💻 🔌 Joost de Valk🔌 MJ🔌 ShravanShenoy💻 Jon Kristian📖 cr0fters🐛 Bibek Timsina🐛 Viktor Szépe💻 Rendy Tan📖 🔌 Islam Abdelfattah🐛 Yoonjae Choi💻 Javier HM🔌 Mohamed Hassan🐛 Christian Schab🔌 Pratik Kinage🔌 Abdelrahman Mostafa 🔌 Hamza Zagha🐛 Lasse Schuirmann🔌 Cyril Duchon-Doris🔌 Javiink🔌 Harshit Harchani🔌 MrAkber📖 marek-slavicek🔌 hugh-codes🔌 Alex Lewis🐛 Yuanlin Lin📖 Ala Shiban📖 hamsh💻 Anne Mariel Catapang🔌 Carlo Gino Catapang🔌 Aditya Rathore🔌 coderbob2🔌 Ramy Gamal🔌 Alexandru-Dan Pop💻 Frank Micheal 🔌 Emmanuel Ferdman📖 Sany A🔌 Niels Swimberghe🐛 lostinbug🔌 gushkool🔌 Omar Sayed🔌 rSnapkoOpenOps🐛 ahronshor🔌 Cezar🐛 Shawn Lim🔌 Shawn Lim🔌 pavloDeshko🐛 abc💻 manoj kumar d🔌 Feli🔌 Miguel🔌 Instasent DEV🔌 Matthieu Lombard🔌 beyondlevi🔌 Rafal Zawadzki🔌 Simon Courtois🔌 alegria-solutions🔌 D-Rowe-FS🔌 张晟杰🔌 Ashot🔌 Amr Abu Aza🔌 John Goodliff🔌 Diwash Dev🔌 André🔌 Lou | Digital Marketing🔌 Maarten Coppens🔌 Mahmoud Hamed🔌 Theo Dammaretz🔌 s31w4n📖 Abdul Rahman🔌 Kent Smith🔌 Arvind Ramesh💻 valentin-mourtialon🔌 psgpsg16🔌 Mariia Shyn🔌 Joshua Heslin🔌 Ahmad🔌 you💻 Daniel Poon💻 Kévin Yu🔌 노영은🔌 reemayoush🔌 Brice🛡️ Mg Wunna🔌 This project follows the all-contributors specification. Contributions of any kind are welcome!

n8n
github
LLM Vibe Score0.66
Human Vibe Score1
n8n-ioMar 28, 2025

n8n

!Banner image n8n - Secure Workflow Automation for Technical Teams n8n is a workflow automation platform that gives technical teams the flexibility of code with the speed of no-code. With 400+ integrations, native AI capabilities, and a fair-code license, n8n lets you build powerful automations while maintaining full control over your data and deployments. !n8n.io - Screenshot Key Capabilities Code When You Need It: Write JavaScript/Python, add npm packages, or use the visual interface AI-Native Platform: Build AI agent workflows based on LangChain with your own data and models Full Control: Self-host with our fair-code license or use our cloud offering Enterprise-Ready: Advanced permissions, SSO, and air-gapped deployments Active Community: 400+ integrations and 900+ ready-to-use templates Quick Start Try n8n instantly with npx (requires Node.js): Or deploy with Docker: Access the editor at http://localhost:5678 Resources 📚 Documentation 🔧 400+ Integrations 💡 Example Workflows 🤖 AI & LangChain Guide 👥 Community Forum 📖 Community Tutorials Support Need help? Our community forum is the place to get support and connect with other users: community.n8n.io License n8n is fair-code distributed under the Sustainable Use License and n8n Enterprise License. Source Available: Always visible source code Self-Hostable: Deploy anywhere Extensible: Add your own nodes and functionality Enterprise licenses available for additional features and support. Additional information about the license model can be found in the docs. Contributing Found a bug 🐛 or have a feature idea ✨? Check our Contributing Guide to get started. Join the Team Want to shape the future of automation? Check out our job posts and join our team! What does n8n mean? Short answer: It means "nodemation" and is pronounced as n-eight-n. Long answer: "I get that question quite often (more often than I expected) so I decided it is probably best to answer it here. While looking for a good name for the project with a free domain I realized very quickly that all the good ones I could think of were already taken. So, in the end, I chose nodemation. 'node-' in the sense that it uses a Node-View and that it uses Node.js and '-mation' for 'automation' which is what the project is supposed to help with. However, I did not like how long the name was and I could not imagine writing something that long every time in the CLI. That is when I then ended up on 'n8n'." - Jan Oberhauser, Founder and CEO, n8n.io

mentals-ai
github
LLM Vibe Score0.476
Human Vibe Score0.004852164397547106
turing-machinesMar 28, 2025

mentals-ai

Mentals AI is a tool designed for creating and operating agents that feature loops, memory, and various tools, all through straightforward markdown files with a .gen extension. Think of an agent file as an executable file. You focus entirely on the logic of the agent, eliminating the necessity to write scaffolding code in Python or any other language. Essentially, it redefines the foundational frameworks for future AI applications 🍓 [!NOTE] [work in progress] A local vector database to store your chats with the agents as well as your private information. See memory branch. [work in progress] Web UI with agents, tools, and vector storage Getting Started Differences from Other Frameworks Key Concepts Instruction (prompt) Working Memory (context) Short-Term Memory (experimental) Control flow: From strings to algorithms Roadmap The Idea 📌 Examples Word chain game in a self-loop controlled by LLM: !Word Chain game in a loop NLOP — Natural Language Operation Or more complex use cases: | 🔄 Any multi-agent interactions | 👾 Space Invaders generator agent | 🍄 2D platformer generator agent | |--------------------|-----------|--------------| |!react | !spaceinvaders.gen | !mario.gen | Or help with the content: Collect YouTube videos on a given topic and save them to a .csv file with the videos, views, channel name, and link; Get the transcription from the video and create a table of contents; Take top news from Hacker News, choose a topic and write an article on the topic with the participation of the critic, and save to a file. All of the above examples are located in the agents folder. [!NOTE] Llama3 support is available for providers using a compatible OpenAI API. 🚀 Getting Started Begin by securing an OpenAI API key through the creation of an OpenAI account. If you already have an API key, skip this step. 🏗️ Build and Run Prerequisites Before building the project, ensure the following dependencies are installed: libcurl: Used for making HTTP requests libfmt: Provides an API for formatting pgvector: Vector operations with PostgreSQL poppler: Required for PDF processing Depending on your operating system, you can install these using the following commands: Linux macOS Windows For Windows, it's recommended to use vcpkg or a similar package manager: pgvector installation [!NOTE] In the main branch you can skip this step Build from sources Docker, Homebrew, PGXN, APT, etc. Clone the repository Configuration Place your API key in the config.toml file: Build the project Run 🆚 Differences from Other Frameworks Mentals AI distinguishes itself from other frameworks in three significant ways: The Agent Executor 🧠 operates through a recursive loop. The LLM determines the next steps: selecting instructions (prompts) and managing data based on previous loops. This recursive decision-making process is integral to our system, outlined in mentalssystem.prompt Agents of any complexity can be created using Markdown, eliminating the need for traditional programming languages. However, Python can be integrated directly into the agent's Markdown script if necessary. Unlike platforms that include preset reasoning frameworks, Mentals AI serves as a blank canvas. It enables the creation and integration of your own reasoning frameworks, including existing ones: Tree of Thoughts, ReAct, Self-Discovery, Auto-CoT, and others. One can also link these frameworks together into more complex sequences, even creating a network of various reasoning frameworks. 🗝️ Key Concepts The agent file is a textual description of the agent instructions with a .gen extension. 📖 Instruction (prompt) Instruction is the basic component of an agent in Mentals. An agent can consist of one or more instructions, which can refer to each other. Instructions can be written in free form, but they always have a name that starts with the # symbol. The use: directive is used to specify a reference to other instructions. Multiple references are listed separated by commas. Below is an example with two instructions root and meme_explain with a reference: In this example, the root instruction calls the memeexplain instruction. The response from memeexplain is then returned to the instruction from which it was called, namely the root. An instruction can take an input parameter, which is automatically generated based on the context when the instruction is called. To specify the input data more precisely, you can use a free-form prompt in the input: directive, such as a JSON object or null. Using a document for input: Using a JSON object as input: [!NOTE] Instruction calls are implemented independently from function or tool calls at OpenAI, enabling the operation of agents with models like Llama3. The implementation of instruction calls is transparent and included in the mentals_system.prompt file. 🛠️ Tool Tool is a kind of instruction. Mentals has a set of native tools to handle message output, user input, file handling, Python interpreter, Bash commands, and Short-term memory. Ask user example: File handling example: The full list of native tools is listed in the file native_tools.toml. 🧠 Working Memory (context) Each instruction has its own working memory — context. When exiting an instruction and re-entering it, the context is kept by default. To clear the context when exiting an instruction, you can use the keep_context: false directive: By default, the size of the instruction context is not limited. To limit the context, there is a directive max_context: number which specifies that only the number of the most recent messages should be stored. Older messages will be pushed out of the context. This feature is useful when you want to keep the most recent data in context so that older data does not affect the chain of reasoning. ⏳ Short-Term Memory (experimental) Short-term memory allows for the storage of intermediate results from an agent's activities, which can then be used for further reasoning. The contents of this memory are accessible across all instruction contexts. The memory tool is used to store data. When data is stored, a keyword and a description of the content are generated. In the example below, the meme_recall instruction is aware of the meme because it was previously stored in memory. ⚙️ Control flow: From strings to algorithms The control flow, which includes conditions, instruction calls, and loops (such as ReAct, Auto-CoT, etc.), is fully expressed in natural language. This method enables the creation of semantic conditions that direct data stream branching. For instance, you can request an agent to autonomously play a word chain game in a loop or establish an ambiguous exit condition: exit the loop if you are satisfied with the result. Here, the language model and its context determine whether to continue or stop. All this is achieved without needing to define flow logic in Python or any other programming language. ⚖️ Reason Action (ReAct) example 🌳 Tree of Thoughts (ToT) example The idea behind ToT is to generate multiple ideas to solve a problem and then evaluate their value. Valuable ideas are kept and developed, other ideas are discarded. Let's take the example of the 24 game. The 24 puzzle is an arithmetical puzzle in which the objective is to find a way to manipulate four integers so that the end result is 24. First, we define the instruction that creates and manipulates the tree data structure. The model knows what a tree is and can represent it in any format, from plain text to XML/JSON or any custom format. In this example, we will use the plain text format: Next, we need to initialize the tree with initial data, let's start with the root instruction: Calling the root instruction will suggest 8 possible next steps to calculate with the first 2 numbers and store these steps as tree nodes. Further work by the agent results in the construction of a tree that is convenient for the model to understand and infer the final answer. A complete example is contained in the agents/treestructure.gen 🗺️ Roadmap [ ] Web UI -- WIP [ ] Vector database tools -- WIP [ ] Agent's experience (experimental) [ ] Tools: Image generation, Browser ✨ The Idea The concept originated from studies on psychoanalysis Executive functions, Exploring Central Executive, Alan Baddeley, 1996. He described a system that orchestrates cognitive processes and working memory, facilitating retrievals from long-term memory. The LLM functions as System 1, processing queries and executing instructions without inherent motivation or goal-setting. So, what then is System 2? Drawing from historical insights now reconsidered through a scientific lens: The central executive, or executive functions, is crucial for controlled processing in working memory. It manages tasks including directing attention, maintaining task objectives, decision-making, and memory retrieval. This sparks an intriguing possibility: constructing more sophisticated agents by integrating System 1 and System 2. The LLM, as the cognitive executor System 1, works in tandem with the Central Executive System 2, which governs and controls the LLM. This partnership forms the dual relationship foundational to Mentals AI.

xpert
github
LLM Vibe Score0.457
Human Vibe Score0.0831216059433162
xpert-aiMar 28, 2025

xpert

English | 中文 [uri_license]: https://www.gnu.org/licenses/agpl-3.0.html [urilicenseimage]: https://img.shields.io/badge/License-AGPL%20v3-blue.svg Xpert Cloud · Self-hosting · Documentation · Enterprise inquiry Open-Source AI Platform for Enterprise Data Analysis, Indicator Management and Agents Orchestration Xpert AI is an open-source enterprise-level AI system that perfectly integrates two major platforms: agent orchestration and data analysis. 💡 What's New Agent and Workflow Hybrid Architecture In today's rapidly evolving AI landscape, enterprises face a critical dilemma: how to balance the creativity of LLMs with the stability of processes? While purely agent-based architectures offer flexibility, they are difficult to control; traditional workflows, though reliable, lack adaptability. The Agent and Workflow Hybrid Architecture of the Xpert AI platform is designed to resolve this conflict — it allows AI to possess "free will" while adhering to "rules and order." !agent-workflow-hybrid-architecture Blog - Agent and Workflow Hybrid Architecture Agent Orchestration Platform By coordinating the collaboration of multiple agents, Xpert completes complex tasks. Xpert integrates different types of AI agents through an efficient management mechanism, utilizing their capabilities to solve multidimensional problems. Xpert Agents Data Analysis Platform An agile data analysis platform based on cloud computing for multidimensional modeling, indicator management, and BI display. It supports connecting to various data sources, achieving efficient and flexible data analysis and visualization, and provides multiple intelligent analysis functions and tools to help enterprises quickly and accurately discover business value and make operational decisions. ChatBI ChatBI is an innovative feature we are introducing, combining chat functionality with business intelligence (BI) analysis capabilities. It offers users a more intuitive and convenient data analysis experience through natural language interaction. ChatBI_Demo.mp4 🚀 Quick Start Before installing Xpert, make sure your machine meets the following minimum system requirements: CPU >= 2 Core RAM >= 4 GiB Node.js (ESM and CommonJS) - 18.x, 19.x, 20.x, 22.x The easiest way to start the Xpert server is through docker compose. Before running Xpert with the following commands, make sure that Docker and Docker Compose are installed on your machine: After running, you can access the Xpert dashboard in your browser at http://localhost/onboarding and start the initialization process. Please check our Wiki - Development to get started quickly. 🎯 Mission Empowering enterprises with intelligent collaboration and data-driven insights through innovative AI orchestration and agile analytics. 🌼 Screenshots Show / Hide Screenshots Pareto analysis open in new tab !Pareto analysis Screenshot Product profit analysis open in new tab !Product profit analysis Screenshot Reseller analysis open in new tab !Reseller analysis Screenshot Bigview dashboard open in new tab !Bigview dashboard Screenshot Indicator application open in new tab !Indicator application Screenshot Indicator mobile app open in new tab !Indicator mobile app Screenshot 💻 Demo, Downloads, Testing and Production Demo Xpert AI Platform Demo at . Notes: You can generate samples data in the home dashbaord page. Production (SaaS) Xpert AI Platform SaaS is available at . Note: it's currently in Alpha version / in testing mode, please use it with caution! 🧱 Technology Stack and Requirements TypeScript language NodeJs / NestJs Nx Angular RxJS TypeORM Langchain ECharts Java Mondrian For Production, we recommend: PostgreSQL PM2 See also README.md and CREDITS.md files in relevant folders for lists of libraries and software included in the Platform, information about licenses, and other details 📄 Documentation Please refer to our official Platform Documentation and to our Wiki (WIP). 💌 Contact Us For business inquiries: Xpert AI Platform @ Twitter 🛡️ License We support the open-source community. This software is available under the following licenses: Xpert AI Platform Community Edition Xpert AI Platform Small Business Xpert AI Platform Enterprise Please see LICENSE for more information on licenses. 💪 Thanks to our Contributors Contributors Please give us :star: on Github, it helps! You are more than welcome to submit feature requests in the Xpert AI repo Pull requests are always welcome! Please base pull requests against the develop branch and follow the contributing guide.

GenAI_Agents
github
LLM Vibe Score0.563
Human Vibe Score0.24210481455988786
NirDiamantMar 28, 2025

GenAI_Agents

🌟 Support This Project: Your sponsorship fuels innovation in GenAI agent development. Become a sponsor to help maintain and expand this valuable resource! GenAI Agents: Comprehensive Repository for Development and Implementation 🚀 Welcome to one of the most extensive and dynamic collections of Generative AI (GenAI) agent tutorials and implementations available today. This repository serves as a comprehensive resource for learning, building, and sharing GenAI agents, ranging from simple conversational bots to complex, multi-agent systems. 📫 Stay Updated! 🚀Cutting-edgeUpdates 💡ExpertInsights 🎯Top 0.1%Content Join over 15,000 of AI enthusiasts getting unique cutting-edge insights and free tutorials! Plus, subscribers get exclusive early access and special 33% discounts to my book and the upcoming RAG Techniques course! Introduction Generative AI agents are at the forefront of artificial intelligence, revolutionizing the way we interact with and leverage AI technologies. This repository is designed to guide you through the development journey, from basic agent implementations to advanced, cutting-edge systems. 📚 Learn to Build Your First AI Agent Your First AI Agent: Simpler Than You Think This detailed blog post complements the repository by providing a complete A-Z walkthrough with in-depth explanations of core concepts, step-by-step implementation, and the theory behind AI agents. It's designed to be incredibly simple to follow while covering everything you need to know to build your first working agent from scratch. 💡 Plus: Subscribe to the newsletter for exclusive early access to tutorials and special discounts on upcoming courses and books! Our goal is to provide a valuable resource for everyone - from beginners taking their first steps in AI to seasoned practitioners pushing the boundaries of what's possible. By offering a range of examples from foundational to complex, we aim to facilitate learning, experimentation, and innovation in the rapidly evolving field of GenAI agents. Furthermore, this repository serves as a platform for showcasing innovative agent creations. Whether you've developed a novel agent architecture or found an innovative application for existing techniques, we encourage you to share your work with the community. Related Projects 📚 Dive into my comprehensive guide on RAG techniques to learn about integrating external knowledge into AI systems, enhancing their capabilities with up-to-date and relevant information retrieval. 🖋️ Explore my Prompt Engineering Techniques guide for an extensive collection of prompting strategies, from fundamental concepts to advanced methods, improving your ability to communicate effectively with AI language models. A Community-Driven Knowledge Hub This repository grows stronger with your contributions! Join our vibrant Discord community — the central hub for shaping and advancing this project together 🤝 GenAI Agents Discord Community Whether you're a novice eager to learn or an expert ready to share your knowledge, your insights can shape the future of GenAI agents. Join us to propose ideas, get feedback, and collaborate on innovative implementations. For contribution guidelines, please refer to our CONTRIBUTING.md file. Let's advance GenAI agent technology together! 🔗 For discussions on GenAI, agents, or to explore knowledge-sharing opportunities, feel free to connect on LinkedIn. Key Features 🎓 Learn to build GenAI agents from beginner to advanced levels 🧠 Explore a wide range of agent architectures and applications 📚 Step-by-step tutorials and comprehensive documentation 🛠️ Practical, ready-to-use agent implementations 🌟 Regular updates with the latest advancements in GenAI 🤝 Share your own agent creations with the community GenAI Agent Implementations Explore our extensive list of GenAI agent implementations, sorted by categories: 🌱 Beginner-Friendly Agents Simple Conversational Agent LangChain PydanticAI Overview 🔎 A context-aware conversational AI maintains information across interactions, enabling more natural dialogues. Implementation 🛠️ Integrates a language model, prompt template, and history manager to generate contextual responses and track conversation sessions. Simple Question Answering Agent Overview 🔎 Answering (QA) agent using LangChain and OpenAI's language model understands user queries and provides relevant, concise answers. Implementation 🛠️ Combines OpenAI's GPT model, a prompt template, and an LLMChain to process user questions and generate AI-driven responses in a streamlined manner. Simple Data Analysis Agent LangChain PydanticAI Overview 🔎 An AI-powered data analysis agent interprets and answers questions about datasets using natural language, combining language models with data manipulation tools for intuitive data exploration. Implementation 🛠️ Integrates a language model, data manipulation framework, and agent framework to process natural language queries and perform data analysis on a synthetic dataset, enabling accessible insights for non-technical users. 🔧 Framework Tutorial: LangGraph Introduction to LangGraph: Building Modular AI Workflows Overview 🔎 This tutorial introduces LangGraph, a powerful framework for creating modular, graph-based AI workflows. Learn how to leverage LangGraph to build more complex and flexible AI agents that can handle multi-step processes efficiently. Implementation 🛠️ Step-by-step guide on using LangGraph to create a StateGraph workflow. The tutorial covers key concepts such as state management, node creation, and graph compilation. It demonstrates these principles by constructing a simple text analysis pipeline, serving as a foundation for more advanced agent architectures. Additional Resources 📚 Blog Post 🎓 Educational and Research Agents ATLAS: Academic Task and Learning Agent System Overview 🔎 ATLAS demonstrates how to build an intelligent multi-agent system that transforms academic support through AI-powered assistance. The system leverages LangGraph's workflow framework to coordinate multiple specialized agents that provide personalized academic planning, note-taking, and advisory support. Implementation 🛠️ Implements a state-managed multi-agent architecture using four specialized agents (Coordinator, Planner, Notewriter, and Advisor) working in concert through LangGraph's workflow framework. The system features sophisticated workflows for profile analysis and academic support, with continuous adaptation based on student performance and feedback. Additional Resources 📚 YouTube Explanation Blog Post Scientific Paper Agent - Literature Review Overview 🔎 An intelligent research assistant that helps users navigate, understand, and analyze scientific literature through an orchestrated workflow. The system combines academic APIs with sophisticated paper processing techniques to automate literature review tasks, enabling researchers to efficiently extract insights from academic papers while maintaining research rigor and quality control. Implementation 🛠️ Leverages LangGraph to create a five-node workflow system including decision making, planning, tool execution, and quality validation nodes. The system integrates the CORE API for paper access, PDFplumber for document processing, and advanced language models for analysis. Key features include a retry mechanism for robust paper downloads, structured data handling through Pydantic models, and quality-focused improvement cycles with human-in-the-loop validation options. Additional Resources 📚 YouTube Explanation Blog Post Chiron - A Feynman-Enhanced Learning Agent Overview 🔎 An adaptive learning agent that guides users through educational content using a structured checkpoint system and Feynman-style teaching. The system processes learning materials (either user-provided or web-retrieved), verifies understanding through interactive checkpoints, and provides simplified explanations when needed, creating a personalized learning experience that mimics one-on-one tutoring. Implementation 🛠️ Uses LangGraph to orchestrate a learning workflow that includes checkpoint definition, context building, understanding verification, and Feynman teaching nodes. The system integrates web search for dynamic content retrieval, employs semantic chunking for context processing, and manages embeddings for relevant information retrieval. Key features include a 70% understanding threshold for progression, interactive human-in-the-loop validation, and structured output through Pydantic models for consistent data handling. Additional Resources 📚 YouTube Explanation 💼 Business and Professional Agents Customer Support Agent (LangGraph) Overview 🔎 An intelligent customer support agent using LangGraph categorizes queries, analyzes sentiment, and provides appropriate responses or escalates issues. Implementation 🛠️ Utilizes LangGraph to create a workflow combining state management, query categorization, sentiment analysis, and response generation. Essay Grading Agent (LangGraph) Overview 🔎 An automated essay grading system using LangGraph and an LLM model evaluates essays based on relevance, grammar, structure, and depth of analysis. Implementation 🛠️ Utilizes a state graph to define the grading workflow, incorporating separate grading functions for each criterion. Travel Planning Agent (LangGraph) Overview 🔎 A Travel Planner using LangGraph demonstrates how to build a stateful, multi-step conversational AI application that collects user input and generates personalized travel itineraries. Implementation 🛠️ Utilizes StateGraph to define the application flow, incorporates custom PlannerState for process management. GenAI Career Assistant Agent Overview 🔎 The GenAI Career Assistant demonstrates how to create a multi-agent system that provides personalized guidance for careers in Generative AI. Using LangGraph and Gemini LLM, the system delivers customized learning paths, resume assistance, interview preparation, and job search support. Implementation 🛠️ Leverages a multi-agent architecture using LangGraph to coordinate specialized agents (Learning, Resume, Interview, Job Search) through TypedDict-based state management. The system employs sophisticated query categorization and routing while integrating with external tools like DuckDuckGo for job searches and dynamic content generation. Additional Resources 📚 YouTube Explanation Project Manager Assistant Agent Overview 🔎 An AI agent designed to assist in project management tasks by automating the process of creating actionable tasks from project descriptions, identifying dependencies, scheduling work, and assigning tasks to team members based on expertise. The system includes risk assessment and self-reflection capabilities to optimize project plans through multiple iterations, aiming to minimize overall project risk. Implementation 🛠️ Leverages LangGraph to orchestrate a workflow of specialized nodes including task generation, dependency mapping, scheduling, allocation, and risk assessment. Each node uses GPT-4o-mini for structured outputs following Pydantic models. The system implements a feedback loop for self-improvement, where risk scores trigger reflection cycles that generate insights to optimize the project plan. Visualization tools display Gantt charts of the generated schedules across iterations. Additional Resources 📚 YouTube Explanation Contract Analysis Assistant (ClauseAI) Overview 🔎 ClauseAI demonstrates how to build an AI-powered contract analysis system using a multi-agent approach. The system employs specialized AI agents for different aspects of contract review, from clause analysis to compliance checking, and leverages LangGraph for workflow orchestration and Pinecone for efficient clause retrieval and comparison. Implementation 🛠️ Implements a sophisticated state-based workflow using LangGraph to coordinate multiple AI agents through contract analysis stages. The system features Pydantic models for data validation, vector storage with Pinecone for clause comparison, and LLM-based analysis for generating comprehensive contract reports. The implementation includes parallel processing capabilities and customizable report generation based on user requirements. Additional Resources 📚 YouTube Explanation E2E Testing Agent Overview 🔎 The E2E Testing Agent demonstrates how to build an AI-powered system that converts natural language test instructions into executable end-to-end web tests. Using LangGraph for workflow orchestration and Playwright for browser automation, the system enables users to specify test cases in plain English while handling the complexity of test generation and execution. Implementation 🛠️ Implements a structured workflow using LangGraph to coordinate test generation, validation, and execution. The system features TypedDict state management, integration with Playwright for browser automation, and LLM-based code generation for converting natural language instructions into executable test scripts. The implementation includes DOM state analysis, error handling, and comprehensive test reporting. Additional Resources 📚 YouTube Explanation 🎨 Creative and Content Generation Agents GIF Animation Generator Agent (LangGraph) Overview 🔎 A GIF animation generator that integrates LangGraph for workflow management, GPT-4 for text generation, and DALL-E for image creation, producing custom animations from user prompts. Implementation 🛠️ Utilizes LangGraph to orchestrate a workflow that generates character descriptions, plots, and image prompts using GPT-4, creates images with DALL-E 3, and assembles them into GIFs using PIL. Employs asynchronous programming for efficient parallel processing. TTS Poem Generator Agent (LangGraph) Overview 🔎 An advanced text-to-speech (TTS) agent using LangGraph and OpenAI's APIs classifies input text, processes it based on content type, and generates corresponding speech output. Implementation 🛠️ Utilizes LangGraph to orchestrate a workflow that classifies input text using GPT models, applies content-specific processing, and converts the processed text to speech using OpenAI's TTS API. The system adapts its output based on the identified content type (general, poem, news, or joke). Music Compositor Agent (LangGraph) Overview 🔎 An AI Music Compositor using LangGraph and OpenAI's language models generates custom musical compositions based on user input. The system processes the input through specialized components, each contributing to the final musical piece, which is then converted to a playable MIDI file. Implementation 🛠️ LangGraph orchestrates a workflow that transforms user input into a musical composition, using ChatOpenAI (GPT-4) to generate melody, harmony, and rhythm, which are then style-adapted. The final AI-generated composition is converted to a MIDI file using music21 and can be played back using pygame. Content Intelligence: Multi-Platform Content Generation Agent Overview 🔎 Content Intelligence demonstrates how to build an advanced content generation system that transforms input text into platform-optimized content across multiple social media channels. The system employs LangGraph for workflow orchestration to analyze content, conduct research, and generate tailored content while maintaining brand consistency across different platforms. Implementation 🛠️ Implements a sophisticated workflow using LangGraph to coordinate multiple specialized nodes (Summary, Research, Platform-Specific) through the content generation process. The system features TypedDict and Pydantic models for state management, integration with Tavily Search for research enhancement, and platform-specific content generation using GPT-4. The implementation includes parallel processing for multiple platforms and customizable content templates. Additional Resources 📚 YouTube Explanation Business Meme Generator Using LangGraph and Memegen.link Overview 🔎 The Business Meme Generator demonstrates how to create an AI-powered system that generates contextually relevant memes based on company website analysis. Using LangGraph for workflow orchestration, the system combines Groq's Llama model for text analysis and the Memegen.link API to automatically produce brand-aligned memes for digital marketing. Implementation 🛠️ Implements a state-managed workflow using LangGraph to coordinate website content analysis, meme concept generation, and image creation. The system features Pydantic models for data validation, asynchronous processing with aiohttp, and integration with external APIs (Groq, Memegen.link) to create a complete meme generation pipeline with customizable templates. Additional Resources 📚 YouTube Explanation Murder Mystery Game with LLM Agents Overview 🔎 A text-based detective game that utilizes autonomous LLM agents as interactive characters in a procedurally generated murder mystery. Drawing inspiration from the UNBOUNDED paper, the system creates unique scenarios each time, with players taking on the role of Sherlock Holmes to solve the case through character interviews and deductive reasoning. Implementation 🛠️ Leverages two LangGraph workflows - a main game loop for story/character generation and game progression, and a conversation sub-graph for character interactions. The system uses a combination of LLM-powered narrative generation, character AI, and structured game mechanics to create an immersive investigative experience with replayable storylines. Additional Resources 📚 YouTube Explanation 📊 Analysis and Information Processing Agents Memory-Enhanced Conversational Agent Overview 🔎 A memory-enhanced conversational AI agent incorporates short-term and long-term memory systems to maintain context within conversations and across multiple sessions, improving interaction quality and personalization. Implementation 🛠️ Integrates a language model with separate short-term and long-term memory stores, utilizes a prompt template incorporating both memory types, and employs a memory manager for storage and retrieval. The system includes an interaction loop that updates and utilizes memories for each response. Multi-Agent Collaboration System Overview 🔎 A multi-agent collaboration system combining historical research with data analysis, leveraging large language models to simulate specialized agents working together to answer complex historical questions. Implementation 🛠️ Utilizes a base Agent class to create specialized HistoryResearchAgent and DataAnalysisAgent, orchestrated by a HistoryDataCollaborationSystem. The system follows a five-step process: historical context provision, data needs identification, historical data provision, data analysis, and final synthesis. Self-Improving Agent Overview 🔎 A Self-Improving Agent using LangChain engages in conversations, learns from interactions, and continuously improves its performance over time through reflection and adaptation. Implementation 🛠️ Integrates a language model with chat history management, response generation, and a reflection mechanism. The system employs a learning system that incorporates insights from reflection to enhance future performance, creating a continuous improvement loop. Task-Oriented Agent Overview 🔎 A language model application using LangChain that summarizes text and translates the summary to Spanish, combining custom functions, structured tools, and an agent for efficient text processing. Implementation 🛠️ Utilizes custom functions for summarization and translation, wrapped as structured tools. Employs a prompt template to guide the agent, which orchestrates the use of tools. An agent executor manages the process, taking input text and producing both an English summary and its Spanish translation. Internet Search and Summarize Agent Overview 🔎 An intelligent web research assistant that combines web search capabilities with AI-powered summarization, automating the process of gathering information from the internet and distilling it into concise, relevant summaries. Implementation 🛠️ Integrates a web search module using DuckDuckGo's API, a result parser, and a text summarization engine leveraging OpenAI's language models. The system performs site-specific or general searches, extracts relevant content, generates concise summaries, and compiles attributed results for efficient information retrieval and synthesis. Multi agent research team - Autogen Overview 🔎 This technique explores a multi-agent system for collaborative research using the AutoGen library. It employs agents to solve tasks collaboratively, focusing on efficient execution and quality assurance. The system enhances research by distributing tasks among specialized agents. Implementation 🛠️ Agents are configured with specific roles using the GPT-4 model, including admin, developer, planner, executor, and quality assurance. Interaction management ensures orderly communication with defined transitions. Task execution involves collaborative planning, coding, execution, and quality checking, demonstrating a scalable framework for various domains. Additional Resources 📚 comprehensive solution with UI Blogpost Sales Call Analyzer Overview 🔎 An intelligent system that automates the analysis of sales call recordings by combining audio transcription with advanced natural language processing. The analyzer transcribes audio using OpenAI's Whisper, processes the text using NLP techniques, and generates comprehensive reports including sentiment analysis, key phrases, pain points, and actionable recommendations to improve sales performance. Implementation 🛠️ Utilizes multiple components in a structured workflow: OpenAI Whisper for audio transcription, CrewAI for task automation and agent management, and LangChain for orchestrating the analysis pipeline. The system processes audio through a series of steps from transcription to detailed analysis, leveraging custom agents and tasks to generate structured JSON reports containing insights about customer sentiment, sales opportunities, and recommended improvements. Additional Resources 📚 YouTube Explanation Weather Emergency & Response System Overview 🔎 A comprehensive system demonstrating two agent graph implementations for weather emergency response: a real-time graph processing live weather data, and a hybrid graph combining real and simulated data for testing high-severity scenarios. The system handles complete workflow from data gathering through emergency plan generation, with automated notifications and human verification steps. Implementation 🛠️ Utilizes LangGraph for orchestrating complex workflows with state management, integrating OpenWeatherMap API for real-time data, and Gemini for analysis and response generation. The system incorporates email notifications, social media monitoring simulation, and severity-based routing with configurable human verification for low/medium severity events. Additional Resources 📚 YouTube Explanation Self-Healing Codebase System Overview 🔎 An intelligent system that automatically detects, diagnoses, and fixes runtime code errors using LangGraph workflow orchestration and ChromaDB vector storage. The system maintains a memory of encountered bugs and their fixes through vector embeddings, enabling pattern recognition for similar errors across the codebase. Implementation 🛠️ Utilizes a state-based graph workflow that processes function definitions and runtime arguments through specialized nodes for error detection, code analysis, and fix generation. Incorporates ChromaDB for vector-based storage of bug patterns and fixes, with automated search and retrieval capabilities for similar error patterns, while maintaining code execution safety through structured validation steps. Additional Resources 📚 YouTube Explanation DataScribe: AI-Powered Schema Explorer Overview 🔎 An intelligent agent system that enables intuitive exploration and querying of relational databases through natural language interactions. The system utilizes a fleet of specialized agents, coordinated by a stateful Supervisor, to handle schema discovery, query planning, and data analysis tasks while maintaining contextual understanding through vector-based relationship graphs. Implementation 🛠️ Leverages LangGraph for orchestrating a multi-agent workflow including discovery, inference, and planning agents, with NetworkX for relationship graph visualization and management. The system incorporates dynamic state management through TypedDict classes, maintains database context between sessions using a db_graph attribute, and includes safety measures to prevent unauthorized database modifications. Memory-Enhanced Email Agent (LangGraph & LangMem) Overview 🔎 An intelligent email assistant that combines three types of memory (semantic, episodic, and procedural) to create a system that improves over time. The agent can triage incoming emails, draft contextually appropriate responses using stored knowledge, and enhance its performance based on user feedback. Implementation 🛠️ Leverages LangGraph for workflow orchestration and LangMem for sophisticated memory management across multiple memory types. The system implements a triage workflow with memory-enhanced decision making, specialized tools for email composition and calendar management, and a self-improvement mechanism that updates its own prompts based on feedback and past performance. Additional Resources 📚 Blog Post 📰 News and Information Agents News TL;DR using LangGraph Overview 🔎 A news summarization system that generates concise TL;DR summaries of current events based on user queries. The system leverages large language models for decision making and summarization while integrating with news APIs to access up-to-date content, allowing users to quickly catch up on topics of interest through generated bullet-point summaries. Implementation 🛠️ Utilizes LangGraph to orchestrate a workflow combining multiple components: GPT-4o-mini for generating search terms and article summaries, NewsAPI for retrieving article metadata, BeautifulSoup for web scraping article content, and Asyncio for concurrent processing. The system follows a structured pipeline from query processing through article selection and summarization, managing the flow between components to produce relevant TL;DRs of current news articles. Additional Resources 📚 YouTube Explanation Blog Post AInsight: AI/ML Weekly News Reporter Overview 🔎 AInsight demonstrates how to build an intelligent news aggregation and summarization system using a multi-agent architecture. The system employs three specialized agents (NewsSearcher, Summarizer, Publisher) to automatically collect, process and summarize AI/ML news for general audiences through LangGraph-based workflow orchestration. Implementation 🛠️ Implements a state-managed multi-agent system using LangGraph to coordinate the news collection (Tavily API), technical content summarization (GPT-4), and report generation processes. The system features modular architecture with TypedDict-based state management, external API integration, and markdown report generation with customizable templates. Additional Resources 📚 YouTube Explanation Journalism-Focused AI Assistant Overview 🔎 A specialized AI assistant that helps journalists tackle modern journalistic challenges like misinformation, bias, and information overload. The system integrates fact-checking, tone analysis, summarization, and grammar review tools to enhance the accuracy and efficiency of journalistic work while maintaining ethical reporting standards. Implementation 🛠️ Leverages LangGraph to orchestrate a workflow of specialized components including language models for analysis and generation, web search integration via DuckDuckGo's API, document parsing tools like PyMuPDFLoader and WebBaseLoader, text splitting with RecursiveCharacterTextSplitter, and structured JSON outputs. Each component works together through a unified workflow to analyze content, verify facts, detect bias, extract quotes, and generate comprehensive reports. Blog Writer (Open AI Swarm) Overview 🔎 A multi-agent system for collaborative blog post creation using OpenAI's Swarm package. It leverages specialized agents to perform research, planning, writing, and editing tasks efficiently. Implementation 🛠️ Utilizes OpenAI's Swarm Package to manage agent interactions. Includes an admin, researcher, planner, writer, and editor, each with specific roles. The system follows a structured workflow: topic setting, outlining, research, drafting, and editing. This approach enhances content creation through task distribution, specialization, and collaborative problem-solving. Additional Resources 📚 Swarm Repo Podcast Internet Search and Generate Agent 🎙️ Overview 🔎 A two step agent that first searches the internet for a given topic and then generates a podcast on the topic found. The search step uses a search agent and search function to find the most relevant information. The second step uses a podcast generation agent and generation function to create a podcast on the topic found. Implementation 🛠️ Utilizes LangGraph to orchestrate a two-step workflow. The first step involves a search agent and function to gather information from the internet. The second step uses a podcast generation agent and function to create a podcast based on the gathered information. 🛍️ Shopping and Product Analysis Agents ShopGenie - Redefining Online Shopping Customer Experience Overview 🔎 An AI-powered shopping assistant that helps customers make informed purchasing decisions even without domain expertise. The system analyzes product information from multiple sources, compares specifications and reviews, identifies the best option based on user needs, and delivers recommendations through email with supporting video reviews, creating a comprehensive shopping experience. Implementation 🛠️ Uses LangGraph to orchestrate a workflow combining Tavily for web search, Llama-3.1-70B for structured data analysis and product comparison, and YouTube API for review video retrieval. The system processes search results through multiple nodes including schema mapping, product comparison, review identification, and email generation. Key features include structured Pydantic models for consistent data handling, retry mechanisms for robust API interactions, and email delivery through SMTP for sharing recommendations. Additional Resources 📚 YouTube Explanation Car Buyer AI Agent Overview 🔎 The Smart Product Buyer AI Agent demonstrates how to build an intelligent system that assists users in making informed purchasing decisions. Using LangGraph and LLM-based intelligence, the system processes user requirements, scrapes product listings from websites like AutoTrader, and provides detailed analysis and recommendations for car purchases. Implementation 🛠️ Implements a state-based workflow using LangGraph to coordinate user interaction, web scraping, and decision support. The system features TypedDict state management, async web scraping with Playwright, and integrates with external APIs for comprehensive product analysis. The implementation includes a Gradio interface for real-time chat interaction and modular scraper architecture for easy extension to additional product categories. Additional Resources 📚 YouTube Explanation 🎯 Task Management and Productivity Agents Taskifier - Intelligent Task Allocation & Management Overview 🔎 An intelligent task management system that analyzes user work styles and creates personalized task breakdown strategies, born from the observation that procrastination often stems from task ambiguity among students and early-career professionals. The system evaluates historical work patterns, gathers relevant task information through web search, and generates customized step-by-step approaches to optimize productivity and reduce workflow paralysis. Implementation 🛠️ Leverages LangGraph for orchestrating a multi-step workflow including work style analysis, information gathering via Tavily API, and customized plan generation. The system maintains state through the process, integrating historical work pattern data with fresh task research to output detailed, personalized task execution plans aligned with the user's natural working style. Additional Resources 📚 YouTube Explanation Grocery Management Agents System Overview 🔎 A multi-agent system built with CrewAI that automates grocery management tasks including receipt interpretation, expiration date tracking, inventory management, and recipe recommendations. The system uses specialized agents to extract data from receipts, estimate product shelf life, track consumption, and suggest recipes to minimize food waste. Implementation 🛠️ Implements four specialized agents using CrewAI - a Receipt Interpreter that extracts item details from receipts, an Expiration Date Estimator that determines shelf life using online sources, a Grocery Tracker that maintains inventory based on consumption, and a Recipe Recommender that suggests meals using available ingredients. Each agent has specific tools and tasks orchestrated through a crew workflow. Additional Resources 📚 YouTube Explanation 🔍 Quality Assurance and Testing Agents LangGraph-Based Systems Inspector Overview 🔎 A comprehensive testing and validation tool for LangGraph-based applications that automatically analyzes system architecture, generates test cases, and identifies potential vulnerabilities through multi-agent inspection. The inspector employs specialized AI testers to evaluate different aspects of the system, from basic functionality to security concerns and edge cases. Implementation 🛠️ Integrates LangGraph for workflow orchestration, multiple LLM-powered testing agents, and a structured evaluation pipeline that includes static analysis, test case generation, and results verification. The system uses Pydantic for data validation, NetworkX for graph representation, and implements a modular architecture that allows for parallel test execution and comprehensive result analysis. Additional Resources 📚 YouTube Explanation Blog Post EU Green Deal FAQ Bot Overview 🔎 The EU Green Deal FAQ Bot demonstrates how to build a RAG-based AI agent that helps businesses understand EU green deal policies. The system processes complex regulatory documents into manageable chunks and provides instant, accurate answers to common questions about environmental compliance, emissions reporting, and waste management requirements. Implementation 🛠️ Implements a sophisticated RAG pipeline using FAISS vectorstore for document storage, semantic chunking for preprocessing, and multiple specialized agents (Retriever, Summarizer, Evaluator) for query processing. The system features query rephrasing for improved accuracy, cross-reference with gold Q&A datasets for answer validation, and comprehensive evaluation metrics to ensure response quality and relevance. Additional Resources 📚 YouTube Explanation Systematic Review Automation System + Paper Draft Creation Overview 🔎 A comprehensive system for automating academic systematic reviews using a directed graph architecture and LangChain components. The system generates complete, publication-ready systematic review papers, automatically processing everything from literature search through final draft generation with multiple revision cycles. Implementation 🛠️ Utilizes a state-based graph workflow that handles paper search and selection (up to 3 papers), PDF processing, and generates a complete academic paper with all standard sections (abstract, introduction, methods, results, conclusions, references). The system incorporates multiple revision cycles with automated critique and improvement phases, all orchestrated through LangGraph state management. Additional Resources 📚 YouTube Explanation 🌟 Special Advanced Technique 🌟 Sophisticated Controllable Agent for Complex RAG Tasks 🤖 Overview 🔎 An advanced RAG solution designed to tackle complex questions that simple semantic similarity-based retrieval cannot solve. This approach uses a sophisticated deterministic graph as the "brain" 🧠 of a highly controllable autonomous agent, capable of answering non-trivial questions from your own data. Implementation 🛠️ • Implement a multi-step process involving question anonymization, high-level planning, task breakdown, adaptive information retrieval and question answering, continuous re-planning, and rigorous answer verification to ensure grounded and accurate responses. Getting Started To begin exploring and building GenAI agents: Clone this repository: Navigate to the technique you're interested in: Follow the detailed implementation guide in each technique's notebook. Contributing We welcome contributions from the community! If you have a new technique or improvement to suggest: Fork the repository Create your feature branch: git checkout -b feature/AmazingFeature Commit your changes: git commit -m 'Add some AmazingFeature' Push to the branch: git push origin feature/AmazingFeature Open a pull request Contributors License This project is licensed under a custom non-commercial license - see the LICENSE file for details. ⭐️ If you find this repository helpful, please consider giving it a star! Keywords: GenAI, Generative AI, Agents, NLP, AI, Machine Learning, Natural Language Processing, LLM, Conversational AI, Task-Oriented AI

LLMs-from-scratch
github
LLM Vibe Score0.62
Human Vibe Score1
rasbtMar 28, 2025

LLMs-from-scratch

Build a Large Language Model (From Scratch) This repository contains the code for developing, pretraining, and finetuning a GPT-like LLM and is the official code repository for the book Build a Large Language Model (From Scratch). In Build a Large Language Model (From Scratch), you'll learn and understand how large language models (LLMs) work from the inside out by coding them from the ground up, step by step. In this book, I'll guide you through creating your own LLM, explaining each stage with clear text, diagrams, and examples. The method described in this book for training and developing your own small-but-functional model for educational purposes mirrors the approach used in creating large-scale foundational models such as those behind ChatGPT. In addition, this book includes code for loading the weights of larger pretrained models for finetuning. Link to the official source code repository Link to the book at Manning (the publisher's website) Link to the book page on Amazon.com ISBN 9781633437166 To download a copy of this repository, click on the Download ZIP button or execute the following command in your terminal: (If you downloaded the code bundle from the Manning website, please consider visiting the official code repository on GitHub at https://github.com/rasbt/LLMs-from-scratch for the latest updates.) Table of Contents Please note that this README.md file is a Markdown (.md) file. If you have downloaded this code bundle from the Manning website and are viewing it on your local computer, I recommend using a Markdown editor or previewer for proper viewing. If you haven't installed a Markdown editor yet, MarkText is a good free option. You can alternatively view this and other files on GitHub at https://github.com/rasbt/LLMs-from-scratch in your browser, which renders Markdown automatically. Tip: If you're seeking guidance on installing Python and Python packages and setting up your code environment, I suggest reading the README.md file located in the setup directory. | Chapter Title | Main Code (for Quick Access) | All Code + Supplementary | |------------------------------------------------------------|---------------------------------------------------------------------------------------------------------------------------------|-------------------------------| | Setup recommendations | - | - | | Ch 1: Understanding Large Language Models | No code | - | | Ch 2: Working with Text Data | - ch02.ipynb- dataloader.ipynb (summary)- exercise-solutions.ipynb | ./ch02 | | Ch 3: Coding Attention Mechanisms | - ch03.ipynb- multihead-attention.ipynb (summary) - exercise-solutions.ipynb| ./ch03 | | Ch 4: Implementing a GPT Model from Scratch | - ch04.ipynb- gpt.py (summary)- exercise-solutions.ipynb | ./ch04 | | Ch 5: Pretraining on Unlabeled Data | - ch05.ipynb- gpttrain.py (summary) - gptgenerate.py (summary) - exercise-solutions.ipynb | ./ch05 | | Ch 6: Finetuning for Text Classification | - ch06.ipynb - gptclassfinetune.py - exercise-solutions.ipynb | ./ch06 | | Ch 7: Finetuning to Follow Instructions | - ch07.ipynb- gptinstructionfinetuning.py (summary)- ollamaevaluate.py (summary)- exercise-solutions.ipynb | ./ch07 | | Appendix A: Introduction to PyTorch | - code-part1.ipynb- code-part2.ipynb- DDP-script.py- exercise-solutions.ipynb | ./appendix-A | | Appendix B: References and Further Reading | No code | - | | Appendix C: Exercise Solutions | No code | - | | Appendix D: Adding Bells and Whistles to the Training Loop | - appendix-D.ipynb | ./appendix-D | | Appendix E: Parameter-efficient Finetuning with LoRA | - appendix-E.ipynb | ./appendix-E | The mental model below summarizes the contents covered in this book. Hardware Requirements The code in the main chapters of this book is designed to run on conventional laptops within a reasonable timeframe and does not require specialized hardware. This approach ensures that a wide audience can engage with the material. Additionally, the code automatically utilizes GPUs if they are available. (Please see the setup doc for additional recommendations.) Bonus Material Several folders contain optional materials as a bonus for interested readers: Setup Python Setup Tips Installing Python Packages and Libraries Used In This Book Docker Environment Setup Guide Chapter 2: Working with text data Byte Pair Encoding (BPE) Tokenizer From Scratch Comparing Various Byte Pair Encoding (BPE) Implementations Understanding the Difference Between Embedding Layers and Linear Layers Dataloader Intuition with Simple Numbers Chapter 3: Coding attention mechanisms Comparing Efficient Multi-Head Attention Implementations Understanding PyTorch Buffers Chapter 4: Implementing a GPT model from scratch FLOPS Analysis Chapter 5: Pretraining on unlabeled data: Alternative Weight Loading Methods Pretraining GPT on the Project Gutenberg Dataset Adding Bells and Whistles to the Training Loop Optimizing Hyperparameters for Pretraining Building a User Interface to Interact With the Pretrained LLM Converting GPT to Llama Llama 3.2 From Scratch Memory-efficient Model Weight Loading Extending the Tiktoken BPE Tokenizer with New Tokens PyTorch Performance Tips for Faster LLM Training Chapter 6: Finetuning for classification Additional experiments finetuning different layers and using larger models Finetuning different models on 50k IMDB movie review dataset Building a User Interface to Interact With the GPT-based Spam Classifier Chapter 7: Finetuning to follow instructions Dataset Utilities for Finding Near Duplicates and Creating Passive Voice Entries Evaluating Instruction Responses Using the OpenAI API and Ollama Generating a Dataset for Instruction Finetuning Improving a Dataset for Instruction Finetuning Generating a Preference Dataset with Llama 3.1 70B and Ollama Direct Preference Optimization (DPO) for LLM Alignment Building a User Interface to Interact With the Instruction Finetuned GPT Model Questions, Feedback, and Contributing to This Repository I welcome all sorts of feedback, best shared via the Manning Forum or GitHub Discussions. Likewise, if you have any questions or just want to bounce ideas off others, please don't hesitate to post these in the forum as well. Please note that since this repository contains the code corresponding to a print book, I currently cannot accept contributions that would extend the contents of the main chapter code, as it would introduce deviations from the physical book. Keeping it consistent helps ensure a smooth experience for everyone. Citation If you find this book or code useful for your research, please consider citing it. Chicago-style citation: Raschka, Sebastian. Build A Large Language Model (From Scratch). Manning, 2024. ISBN: 978-1633437166. BibTeX entry:

AITreasureBox
github
LLM Vibe Score0.447
Human Vibe Score0.1014145151561518
superiorluMar 28, 2025

AITreasureBox

AI TreasureBox English | 中文 Collect practical AI repos, tools, websites, papers and tutorials on AI. Translated from ChatGPT, picture from Midjourney. Catalog Repos Tools Websites Report&Paper Tutorials Repos updated repos and stars every 2 hours and re-ranking automatically. | No. | Repos | Description | | ----:|:-----------------------------------------|:------------------------------------------------------------------------------------------------------| | 1|🔥codecrafters-io/build-your-own-x !2025-03-28364681428|Master programming by recreating your favorite technologies from scratch.| | 2|sindresorhus/awesome !2025-03-28353614145|😎 Awesome lists about all kinds of interesting topics| | 3|public-apis/public-apis !2025-03-28334299125|A collective list of free APIs| | 4|kamranahmedse/developer-roadmap !2025-03-2831269540|Interactive roadmaps, guides and other educational content to help developers grow in their careers.| | 5|vinta/awesome-python !2025-03-28238581114|A curated list of awesome Python frameworks, libraries, software and resources| | 6|practical-tutorials/project-based-learning !2025-03-28222661124|Curated list of project-based tutorials| | 7|tensorflow/tensorflow !2025-03-281888714|An Open Source Machine Learning Framework for Everyone| | 8|Significant-Gravitas/AutoGPT !2025-03-2817391338|An experimental open-source attempt to make GPT-4 fully autonomous.| | 9|jackfrued/Python-100-Days !2025-03-2816305141|Python - 100天从新手到大师| | 10|AUTOMATIC1111/stable-diffusion-webui !2025-03-2815011553|Stable Diffusion web UI| | 11|huggingface/transformers !2025-03-2814207850|🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.| | 12|ollama/ollama !2025-03-28135166151|Get up and running with Llama 2, Mistral, Gemma, and other large language models.| | 13|f/awesome-chatgpt-prompts !2025-03-2812212738 |This repo includes ChatGPT prompt curation to use ChatGPT better.| | 14|justjavac/free-programming-books-zhCN !2025-03-2811316119|📚 免费的计算机编程类中文书籍,欢迎投稿| | 15|krahets/hello-algo !2025-03-2811107930|《Hello 算法》:动画图解、一键运行的数据结构与算法教程。支持 Python, Java, C++, C, C#, JS, Go, Swift, Rust, Ruby, Kotlin, TS, Dart 代码。简体版和繁体版同步更新,English version ongoing| | 16|yt-dlp/yt-dlp !2025-03-28105801114|A feature-rich command-line audio/video downloader| | 17|langchain-ai/langchain !2025-03-2810449479|⚡ Building applications with LLMs through composability ⚡| | 18|goldbergyoni/nodebestpractices !2025-03-281021629|✅ The Node.js best practices list (July 2024)| | 19|puppeteer/puppeteer !2025-03-289018212|JavaScript API for Chrome and Firefox| | 20|pytorch/pytorch !2025-03-288833938|Tensors and Dynamic neural networks in Python with strong GPU acceleration| | 21|neovim/neovim !2025-03-288781482|Vim-fork focused on extensibility and usability| | 22|🔥🔥langgenius/dify !2025-03-2887342639 |One API for plugins and datasets, one interface for prompt engineering and visual operation, all for creating powerful AI applications.| | 23|mtdvio/every-programmer-should-know !2025-03-28867069|A collection of (mostly) technical things every software developer should know about| | 24|open-webui/open-webui !2025-03-2886025159|User-friendly WebUI for LLMs (Formerly Ollama WebUI)| | 25|ChatGPTNextWeb/NextChat !2025-03-288231521|✨ Light and Fast AI Assistant. Support: Web | | 26|supabase/supabase !2025-03-287990956|The open source Firebase alternative.| | 27|openai/whisper !2025-03-287905542|Robust Speech Recognition via Large-Scale Weak Supervision| | 28|home-assistant/core !2025-03-287773219|🏡 Open source home automation that puts local control and privacy first.| | 29|tensorflow/models !2025-03-28774694|Models and examples built with TensorFlow| | 30| ggerganov/llama.cpp !2025-03-287731836 | Port of Facebook's LLaMA model in C/C++ | | 31|3b1b/manim !2025-03-287641918|Animation engine for explanatory math videos| | 32|microsoft/generative-ai-for-beginners !2025-03-287623860|12 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/| | 33|nomic-ai/gpt4all !2025-03-28729285 |gpt4all: an ecosystem of open-source chatbots trained on a massive collection of clean assistant data including code, stories and dialogue| | 34|comfyanonymous/ComfyUI !2025-03-2872635111|The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.| | 35|bregman-arie/devops-exercises !2025-03-2872225209|Linux, Jenkins, AWS, SRE, Prometheus, Docker, Python, Ansible, Git, Kubernetes, Terraform, OpenStack, SQL, NoSQL, Azure, GCP, DNS, Elastic, Network, Virtualization. DevOps Interview Questions| | 36|elastic/elasticsearch !2025-03-28721419|Free and Open, Distributed, RESTful Search Engine| | 37|🔥n8n-io/n8n !2025-03-2872093495|Free and source-available fair-code licensed workflow automation tool. Easily automate tasks across different services.| | 38|fighting41love/funNLP !2025-03-287200422|The Most Powerful NLP-Weapon Arsenal| | 39|hoppscotch/hoppscotch !2025-03-287060134|Open source API development ecosystem - https://hoppscotch.io (open-source alternative to Postman, Insomnia)| | 40|abi/screenshot-to-code !2025-03-286932817|Drop in a screenshot and convert it to clean HTML/Tailwind/JS code| | 41|binary-husky/gptacademic !2025-03-28680374|Academic Optimization of GPT| | 42|d2l-ai/d2l-zh !2025-03-286774142|Targeting Chinese readers, functional and open for discussion. The Chinese and English versions are used for teaching in over 400 universities across more than 60 countries| | 43|josephmisiti/awesome-machine-learning !2025-03-286739215|A curated list of awesome Machine Learning frameworks, libraries and software.| | 44|grafana/grafana !2025-03-286725414|The open and composable observability and data visualization platform. Visualize metrics, logs, and traces from multiple sources like Prometheus, Loki, Elasticsearch, InfluxDB, Postgres and many more.| | 45|python/cpython !2025-03-286602218|The Python programming language| | 46|apache/superset !2025-03-286519020|Apache Superset is a Data Visualization and Data Exploration Platform| | 47|xtekky/gpt4free !2025-03-28639391 |decentralizing the Ai Industry, free gpt-4/3.5 scripts through several reverse engineered API's ( poe.com, phind.com, chat.openai.com etc...)| | 48|sherlock-project/sherlock !2025-03-286332536|Hunt down social media accounts by username across social networks| | 49|twitter/the-algorithm !2025-03-28630586 |Source code for Twitter's Recommendation Algorithm| | 50|keras-team/keras !2025-03-28627835|Deep Learning for humans| | 51|openai/openai-cookbook !2025-03-28625136 |Examples and guides for using the OpenAI API| | 52|immich-app/immich !2025-03-286238670|High performance self-hosted photo and video management solution.| | 53|AppFlowy-IO/AppFlowy !2025-03-286173528|Bring projects, wikis, and teams together with AI. AppFlowy is an AI collaborative workspace where you achieve more without losing control of your data. The best open source alternative to Notion.| | 54|scikit-learn/scikit-learn !2025-03-286158212|scikit-learn: machine learning in Python| | 55|binhnguyennus/awesome-scalability !2025-03-286117021|The Patterns of Scalable, Reliable, and Performant Large-Scale Systems| | 56|labmlai/annotateddeeplearningpaperimplementations !2025-03-285951726|🧑‍🏫 59 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠| | 57|OpenInterpreter/open-interpreter !2025-03-285894710|A natural language interface for computers| | 58|lobehub/lobe-chat !2025-03-285832054|🤖 Lobe Chat - an open-source, extensible (Function Calling), high-performance chatbot framework. It supports one-click free deployment of your private ChatGPT/LLM web application.| | 59|meta-llama/llama !2025-03-28579536|Inference code for Llama models| | 60|nuxt/nuxt !2025-03-28566437|The Intuitive Vue Framework.| | 61|imartinez/privateGPT !2025-03-28555192|Interact with your documents using the power of GPT, 100% privately, no data leaks| | 62|Stirling-Tools/Stirling-PDF !2025-03-285500846|#1 Locally hosted web application that allows you to perform various operations on PDF files| | 63|PlexPt/awesome-chatgpt-prompts-zh !2025-03-285459720|ChatGPT Chinese Training Guide. Guidelines for various scenarios. Learn how to make it listen to you| | 64|dair-ai/Prompt-Engineering-Guide !2025-03-285451025 |🐙 Guides, papers, lecture, notebooks and resources for prompt engineering| | 65|ageitgey/facerecognition !2025-03-28544382|The world's simplest facial recognition api for Python and the command line| | 66|CorentinJ/Real-Time-Voice-Cloning !2025-03-285384814|Clone a voice in 5 seconds to generate arbitrary speech in real-time| | 67|geekan/MetaGPT !2025-03-285375376|The Multi-Agent Meta Programming Framework: Given one line Requirement, return PRD, Design, Tasks, Repo | | 68|gpt-engineer-org/gpt-engineer !2025-03-285367419|Specify what you want it to build, the AI asks for clarification, and then builds it.| | 69|lencx/ChatGPT !2025-03-2853653-3|🔮 ChatGPT Desktop Application (Mac, Windows and Linux)| | 70|deepfakes/faceswap !2025-03-28535672|Deepfakes Software For All| | 71|langflow-ai/langflow !2025-03-285319584|Langflow is a low-code app builder for RAG and multi-agent AI applications. It’s Python-based and agnostic to any model, API, or database.| | 72|commaai/openpilot !2025-03-28529759|openpilot is an operating system for robotics. Currently, it upgrades the driver assistance system on 275+ supported cars.| | 73|clash-verge-rev/clash-verge-rev !2025-03-2852848124|Continuation of Clash Verge - A Clash Meta GUI based on Tauri (Windows, MacOS, Linux)| | 74|All-Hands-AI/OpenHands !2025-03-285150675|🙌 OpenHands: Code Less, Make More| | 75|xai-org/grok-1 !2025-03-28502504|Grok open release| | 76|meilisearch/meilisearch !2025-03-284999122|A lightning-fast search API that fits effortlessly into your apps, websites, and workflow| | 77|🔥browser-use/browser-use !2025-03-2849910294|Make websites accessible for AI agents| | 78|jgthms/bulma !2025-03-28496783|Modern CSS framework based on Flexbox| | 79|facebookresearch/segment-anything !2025-03-284947116|The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.| |!green-up-arrow.svg 80|hacksider/Deep-Live-Cam !2025-03-2848612146|real time face swap and one-click video deepfake with only a single image (uncensored)| |!red-down-arrow 81|mlabonne/llm-course !2025-03-284860934|Course with a roadmap and notebooks to get into Large Language Models (LLMs).| | 82|PaddlePaddle/PaddleOCR !2025-03-284785530|Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)| | 83|alist-org/alist !2025-03-284732618|🗂️A file list/WebDAV program that supports multiple storages, powered by Gin and Solidjs. / 一个支持多存储的文件列表/WebDAV程序,使用 Gin 和 Solidjs。| | 84|infiniflow/ragflow !2025-03-2847027129|RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.| | 85|Avik-Jain/100-Days-Of-ML-Code !2025-03-284679312|100 Days of ML Coding| | 86|v2ray/v2ray-core !2025-03-28458706|A platform for building proxies to bypass network restrictions.| | 87|hiyouga/LLaMA-Factory !2025-03-284555881|Easy-to-use LLM fine-tuning framework (LLaMA, BLOOM, Mistral, Baichuan, Qwen, ChatGLM)| | 88|Asabeneh/30-Days-Of-Python !2025-03-284544930|30 days of Python programming challenge is a step-by-step guide to learn the Python programming language in 30 days. This challenge may take more than100 days, follow your own pace. These videos may help too: https://www.youtube.com/channel/UC7PNRuno1rzYPb1xLa4yktw| | 89|type-challenges/type-challenges !2025-03-284488511|Collection of TypeScript type challenges with online judge| | 90|lllyasviel/Fooocus !2025-03-284402716|Focus on prompting and generating| | 91|RVC-Boss/GPT-SoVITS !2025-03-284327738|1 min voice data can also be used to train a good TTS model! (few shot voice cloning)| | 92|rasbt/LLMs-from-scratch !2025-03-284320667|Implementing a ChatGPT-like LLM from scratch, step by step| | 93|oobabooga/text-generation-webui !2025-03-284302012 |A gradio web UI for running Large Language Models like LLaMA, llama.cpp, GPT-J, OPT, and GALACTICA.| | 94|vllm-project/vllm !2025-03-2842982102|A high-throughput and memory-efficient inference and serving engine for LLMs| | 95|dani-garcia/vaultwarden !2025-03-284297121|Unofficial Bitwarden compatible server written in Rust, formerly known as bitwarden_rs| | 96|microsoft/autogen !2025-03-284233049|Enable Next-Gen Large Language Model Applications. Join our Discord: https://discord.gg/pAbnFJrkgZ| | 97|jeecgboot/JeecgBoot !2025-03-284205920|🔥「企业级低代码平台」前后端分离架构SpringBoot 2.x/3.x,SpringCloud,Ant Design&Vue3,Mybatis,Shiro,JWT。强大的代码生成器让前后端代码一键生成,无需写任何代码! 引领新的开发模式OnlineCoding->代码生成->手工MERGE,帮助Java项目解决70%重复工作,让开发更关注业务,既能快速提高效率,帮助公司节省成本,同时又不失灵活性。| | 98|Mintplex-Labs/anything-llm !2025-03-284186955|A full-stack application that turns any documents into an intelligent chatbot with a sleek UI and easier way to manage your workspaces.| | 99|THUDM/ChatGLM-6B !2025-03-28410192 |ChatGLM-6B: An Open Bilingual Dialogue Language Model| | 100|hpcaitech/ColossalAI !2025-03-28406902|Making large AI models cheaper, faster and more accessible| | 101|Stability-AI/stablediffusion !2025-03-28406337|High-Resolution Image Synthesis with Latent Diffusion Models| | 102|mingrammer/diagrams !2025-03-28405063|🎨 Diagram as Code for prototyping cloud system architectures| | 103|Kong/kong !2025-03-28404616|🦍 The Cloud-Native API Gateway and AI Gateway.| | 104|getsentry/sentry !2025-03-284040913|Developer-first error tracking and performance monitoring| | 105| karpathy/nanoGPT !2025-03-284034613 |The simplest, fastest repository for training/finetuning medium-sized GPTs| | 106|fastlane/fastlane !2025-03-2840014-1|🚀 The easiest way to automate building and releasing your iOS and Android apps| | 107|psf/black !2025-03-28399765|The uncompromising Python code formatter| | 108|OpenBB-finance/OpenBBTerminal !2025-03-283972074 |Investment Research for Everyone, Anywhere.| | 109|2dust/v2rayNG !2025-03-283943415|A V2Ray client for Android, support Xray core and v2fly core| | 110|apache/airflow !2025-03-283937314|Apache Airflow - A platform to programmatically author, schedule, and monitor workflows| | 111|KRTirtho/spotube !2025-03-283902746|🎧 Open source Spotify client that doesn't require Premium nor uses Electron! Available for both desktop & mobile!| | 112|coqui-ai/TTS !2025-03-283889719 |🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production| | 113|ggerganov/whisper.cpp !2025-03-283882116|Port of OpenAI's Whisper model in C/C++| | 114|ultralytics/ultralytics !2025-03-283866951|NEW - YOLOv8 🚀 in PyTorch > ONNX > OpenVINO > CoreML > TFLite| | 115|typst/typst !2025-03-283863914|A new markup-based typesetting system that is powerful and easy to learn.| | 116|streamlit/streamlit !2025-03-283845828|Streamlit — A faster way to build and share data apps.| | 117|LC044/WeChatMsg !2025-03-283836931|提取微信聊天记录,将其导出成HTML、Word、Excel文档永久保存,对聊天记录进行分析生成年度聊天报告,用聊天数据训练专属于个人的AI聊天助手| | 118|lm-sys/FastChat !2025-03-283822112 |An open platform for training, serving, and evaluating large languages. Release repo for Vicuna and FastChat-T5.| | 119|NaiboWang/EasySpider !2025-03-283819013|A visual no-code/code-free web crawler/spider易采集:一个可视化浏览器自动化测试/数据采集/爬虫软件,可以无代码图形化的设计和执行爬虫任务。别名:ServiceWrapper面向Web应用的智能化服务封装系统。| | 120|microsoft/DeepSpeed !2025-03-283765816 |A deep learning optimization library that makes distributed training and inference easy, efficient, and effective| | 121|QuivrHQ/quivr !2025-03-28376067|Your GenAI Second Brain 🧠 A personal productivity assistant (RAG) ⚡️🤖 Chat with your docs (PDF, CSV, ...) & apps using Langchain, GPT 3.5 / 4 turbo, Private, Anthropic, VertexAI, Ollama, LLMs, that you can share with users ! Local & Private alternative to OpenAI GPTs & ChatGPT powered by retrieval-augmented generation.| | 122|freqtrade/freqtrade !2025-03-283757817 |Free, open source crypto trading bot| | 123|suno-ai/bark !2025-03-28373178 |🔊 Text-Prompted Generative Audio Model| | 124|🔥cline/cline !2025-03-2837307282|Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, and more with your permission every step of the way.| | 125|LAION-AI/Open-Assistant !2025-03-28372712 |OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.| | 126|penpot/penpot !2025-03-283716217|Penpot: The open-source design tool for design and code collaboration| | 127|gradio-app/gradio !2025-03-283713320|Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!| | 128|FlowiseAI/Flowise !2025-03-283667135 |Drag & drop UI to build your customized LLM flow using LangchainJS| | 129|SimplifyJobs/Summer2025-Internships !2025-03-28366506|Collection of Summer 2025 tech internships!| | 130|TencentARC/GFPGAN !2025-03-28365027 |GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.| | 131|ray-project/ray !2025-03-283626819|Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a toolkit of libraries (Ray AIR) for accelerating ML workloads.| | 132|babysor/MockingBird !2025-03-28360498|🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time| | 133|unslothai/unsloth !2025-03-283603691|5X faster 50% less memory LLM finetuning| | 134|zhayujie/chatgpt-on-wechat !2025-03-283600124 |Wechat robot based on ChatGPT, which uses OpenAI api and itchat library| | 135|upscayl/upscayl !2025-03-283599824|🆙 Upscayl - Free and Open Source AI Image Upscaler for Linux, MacOS and Windows built with Linux-First philosophy.| | 136|freeCodeCamp/devdocs !2025-03-28359738|API Documentation Browser| | 137|XingangPan/DragGAN !2025-03-28359043 |Code for DragGAN (SIGGRAPH 2023)| | 138|2noise/ChatTTS !2025-03-283543922|ChatTTS is a generative speech model for daily dialogue.| | 139|google-research/google-research !2025-03-28352207 |Google Research| | 140|karanpratapsingh/system-design !2025-03-28351003|Learn how to design systems at scale and prepare for system design interviews| | 141|lapce/lapce !2025-03-28350855|Lightning-fast and Powerful Code Editor written in Rust| | 142| microsoft/TaskMatrix !2025-03-2834500-3 | Talking, Drawing and Editing with Visual Foundation Models| | 143|chatchat-space/Langchain-Chatchat !2025-03-283442020|Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM) QA app with langchain| | 144|unclecode/crawl4ai !2025-03-283434163|🔥🕷️ Crawl4AI: Open-source LLM Friendly Web Crawler & Scrapper| | 145|Bin-Huang/chatbox !2025-03-283374733 |A desktop app for GPT-4 / GPT-3.5 (OpenAI API) that supports Windows, Mac & Linux| | 146|milvus-io/milvus !2025-03-283366525 |A cloud-native vector database, storage for next generation AI applications| | 147|mendableai/firecrawl !2025-03-2833297128|🔥 Turn entire websites into LLM-ready markdown| | 148|pola-rs/polars !2025-03-283269320|Fast multi-threaded, hybrid-out-of-core query engine focussing on DataFrame front-ends| | 149|Pythagora-io/gpt-pilot !2025-03-28325321|PoC for a scalable dev tool that writes entire apps from scratch while the developer oversees the implementation| | 150|hashicorp/vault !2025-03-28320797|A tool for secrets management, encryption as a service, and privileged access management| | 151|shardeum/shardeum !2025-03-28319580|Shardeum is an EVM based autoscaling blockchain| | 152|Chanzhaoyu/chatgpt-web !2025-03-28319242 |A demonstration website built with Express and Vue3 called ChatGPT| | 153|lllyasviel/ControlNet !2025-03-283186413 |Let us control diffusion models!| | 154|google/jax !2025-03-28317727|Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more| | 155|facebookresearch/detectron2 !2025-03-28315987|Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.| | 156|myshell-ai/OpenVoice !2025-03-28315233|Instant voice cloning by MyShell| | 157|TheAlgorithms/C-Plus-Plus !2025-03-283151411|Collection of various algorithms in mathematics, machine learning, computer science and physics implemented in C++ for educational purposes.| | 158|hiroi-sora/Umi-OCR !2025-03-283138129|OCR图片转文字识别软件,完全离线。截屏/批量导入图片,支持多国语言、合并段落、竖排文字。可排除水印区域,提取干净的文本。基于 PaddleOCR 。| | 159|mudler/LocalAI !2025-03-283127815|🤖 The free, Open Source OpenAI alternative. Self-hosted, community-driven and local-first. Drop-in replacement for OpenAI running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more models architectures. It allows to generate Text, Audio, Video, Images. Also with voice cloning capabilities.| | 160|facebookresearch/fairseq !2025-03-28312124 |Facebook AI Research Sequence-to-Sequence Toolkit written in Python.| | 161|alibaba/nacos !2025-03-28310559|an easy-to-use dynamic service discovery, configuration and service management platform for building cloud native applications.| | 162|yunjey/pytorch-tutorial !2025-03-28310326|PyTorch Tutorial for Deep Learning Researchers| | 163|v2fly/v2ray-core !2025-03-28307448|A platform for building proxies to bypass network restrictions.| | 164|mckaywrigley/chatbot-ui !2025-03-283067714|The open-source AI chat interface for everyone.| | 165|TabbyML/tabby !2025-03-28305949 |Self-hosted AI coding assistant| | 166|deepseek-ai/awesome-deepseek-integration !2025-03-283053193|| | 167|danielmiessler/fabric !2025-03-283028914|fabric is an open-source framework for augmenting humans using AI.| | 168|xinntao/Real-ESRGAN !2025-03-283026623 |Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.| | 169|paul-gauthier/aider !2025-03-283014642|aider is GPT powered coding in your terminal| | 170|tatsu-lab/stanfordalpaca !2025-03-28299022 |Code and documentation to train Stanford's Alpaca models, and generate the data.| | 171|DataTalksClub/data-engineering-zoomcamp !2025-03-282971817|Free Data Engineering course!| | 172|HeyPuter/puter !2025-03-282967014|🌐 The Internet OS! Free, Open-Source, and Self-Hostable.| | 173|mli/paper-reading !2025-03-282962314|Classic Deep Learning and In-Depth Reading of New Papers Paragraph by Paragraph| | 174|linexjlin/GPTs !2025-03-28295568|leaked prompts of GPTs| | 175|s0md3v/roop !2025-03-28295286 |one-click deepfake (face swap)| | 176|JushBJJ/Mr.-Ranedeer-AI-Tutor !2025-03-2829465-1 |A GPT-4 AI Tutor Prompt for customizable personalized learning experiences.| | 177|opendatalab/MinerU !2025-03-282927074|A one-stop, open-source, high-quality data extraction tool, supports PDF/webpage/e-book extraction.一站式开源高质量数据提取工具,支持PDF/网页/多格式电子书提取。| | 178|mouredev/Hello-Python !2025-03-282920720|Curso para aprender el lenguaje de programación Python desde cero y para principiantes. 75 clases, 37 horas en vídeo, código, proyectos y grupo de chat. Fundamentos, frontend, backend, testing, IA...| | 179|Lightning-AI/pytorch-lightning !2025-03-28292039|Pretrain, finetune and deploy AI models on multiple GPUs, TPUs with zero code changes.| | 180|crewAIInc/crewAI !2025-03-282919344|Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.| | 181|facebook/folly !2025-03-282916612|An open-source C++ library developed and used at Facebook.| | 182|google-ai-edge/mediapipe !2025-03-28291519|Cross-platform, customizable ML solutions for live and streaming media.| | 183| getcursor/cursor !2025-03-282892025 | An editor made for programming with AI| | 184|chatanywhere/GPTAPIfree !2025-03-282856424|Free ChatGPT API Key, Free ChatGPT API, supports GPT-4 API (free), ChatGPT offers a free domestic forwarding API that allows direct connections without the need for a proxy. It can be used in conjunction with software/plugins like ChatBox, significantly reducing interface usage costs. Enjoy unlimited and unrestricted chatting within China| | 185|meta-llama/llama3 !2025-03-28285552|The official Meta Llama 3 GitHub site| | 186|tinygrad/tinygrad !2025-03-282845811|You like pytorch? You like micrograd? You love tinygrad! ❤️| | 187|google-research/tuningplaybook !2025-03-282841514|A playbook for systematically maximizing the performance of deep learning models.| | 188|huggingface/diffusers !2025-03-282830222|🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.| | 189|tokio-rs/tokio !2025-03-28282408|A runtime for writing reliable asynchronous applications with Rust. Provides I/O, networking, scheduling, timers, ...| | 190|RVC-Project/Retrieval-based-Voice-Conversion-WebUI !2025-03-282823817|Voice data !2025-03-282822612|Jan is an open source alternative to ChatGPT that runs 100% offline on your computer| | 192|openai/CLIP !2025-03-282814720|CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image| | 193|🔥khoj-ai/khoj !2025-03-2828112313|Your AI second brain. A copilot to get answers to your questions, whether they be from your own notes or from the internet. Use powerful, online (e.g gpt4) or private, local (e.g mistral) LLMs. Self-host locally or use our web app. Access from Obsidian, Emacs, Desktop app, Web or Whatsapp.| | 194| acheong08/ChatGPT !2025-03-2828054-2 | Reverse engineered ChatGPT API | | 195|iperov/DeepFaceLive !2025-03-28279345 |Real-time face swap for PC streaming or video calls| | 196|eugeneyan/applied-ml !2025-03-28278471|📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.| | 197|XTLS/Xray-core !2025-03-282778213|Xray, Penetrates Everything. Also the best v2ray-core, with XTLS support. Fully compatible configuration.| | 198|feder-cr/JobsApplierAIAgent !2025-03-282776410|AutoJobsApplierAI_Agent aims to easy job hunt process by automating the job application process. Utilizing artificial intelligence, it enables users to apply for multiple jobs in an automated and personalized way.| | 199|mindsdb/mindsdb !2025-03-282750631|The platform for customizing AI from enterprise data| | 200|DataExpert-io/data-engineer-handbook !2025-03-282721611|This is a repo with links to everything you'd ever want to learn about data engineering| | 201|exo-explore/exo !2025-03-282721633|Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚| | 202|taichi-dev/taichi !2025-03-2826926-1|Productive, portable, and performant GPU programming in Python.| | 203|mem0ai/mem0 !2025-03-282689134|The memory layer for Personalized AI| | 204|svc-develop-team/so-vits-svc !2025-03-28268096 |SoftVC VITS Singing Voice Conversion| | 205|OpenBMB/ChatDev !2025-03-28265624|Create Customized Software using Natural Language Idea (through Multi-Agent Collaboration)| | 206|roboflow/supervision !2025-03-282632010|We write your reusable computer vision tools. 💜| | 207|drawdb-io/drawdb !2025-03-282626913|Free, simple, and intuitive online database design tool and SQL generator.| | 208|karpathy/llm.c !2025-03-28261633|LLM training in simple, raw C/CUDA| | 209|airbnb/lottie-ios !2025-03-28261431|An iOS library to natively render After Effects vector animations| | 210|openai/openai-python !2025-03-282607713|The OpenAI Python library provides convenient access to the OpenAI API from applications written in the Python language.| | 211|academic/awesome-datascience !2025-03-28259876|📝 An awesome Data Science repository to learn and apply for real world problems.| | 212|harry0703/MoneyPrinterTurbo !2025-03-282576618|Generate short videos with one click using a large model| | 213|gabime/spdlog !2025-03-282571511|Fast C++ logging library.| | 214|ocrmypdf/OCRmyPDF !2025-03-2825674217|OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched| | 215|Vision-CAIR/MiniGPT-4 !2025-03-28256170 |Enhancing Vision-language Understanding with Advanced Large Language Models| | 216|Stability-AI/generative-models !2025-03-28255936|Generative Models by Stability AI| | 217|DS4SD/docling !2025-03-282555662|Get your docs ready for gen AI| | 218|PostHog/posthog !2025-03-282533227|🦔 PostHog provides open-source product analytics, session recording, feature flagging and A/B testing that you can self-host.| | 219|nrwl/nx !2025-03-282509612|Smart Monorepos · Fast CI| | 220|continuedev/continue !2025-03-282500737|⏩ the open-source copilot chat for software development—bring the power of ChatGPT to VS Code| | 221|opentofu/opentofu !2025-03-28247968|OpenTofu lets you declaratively manage your cloud infrastructure.| | 222|invoke-ai/InvokeAI !2025-03-28247293|InvokeAI is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and create visual media using the latest AI-driven technologies. The solution offers an industry leading WebUI, supports terminal use through a CLI, and serves as the foundation for multiple commercial products.| | 223|deepinsight/insightface !2025-03-282471615 |State-of-the-art 2D and 3D Face Analysis Project| | 224|apache/flink !2025-03-28246865|Apache Flink| | 225|ComposioHQ/composio !2025-03-28246436|Composio equips agents with well-crafted tools empowering them to tackle complex tasks| | 226|Genesis-Embodied-AI/Genesis !2025-03-282458314|A generative world for general-purpose robotics & embodied AI learning.| | 227|stretchr/testify !2025-03-28243184|A toolkit with common assertions and mocks that plays nicely with the standard library| | 228| yetone/openai-translator !2025-03-28242921 | Browser extension and cross-platform desktop application for translation based on ChatGPT API | | 229|frappe/erpnext !2025-03-282425211|Free and Open Source Enterprise Resource Planning (ERP)| | 230|songquanpeng/one-api !2025-03-282410034|OpenAI 接口管理 & 分发系统,支持 Azure、Anthropic Claude、Google PaLM 2 & Gemini、智谱 ChatGLM、百度文心一言、讯飞星火认知、阿里通义千问、360 智脑以及腾讯混元,可用于二次分发管理 key,仅单可执行文件,已打包好 Docker 镜像,一键部署,开箱即用. OpenAI key management & redistribution system, using a single API for all LLMs, and features an English UI.| | 231| microsoft/JARVIS !2025-03-28240604 | a system to connect LLMs with ML community | | 232|google/flatbuffers !2025-03-28239965|FlatBuffers: Memory Efficient Serialization Library| | 233|microsoft/graphrag !2025-03-282398928|A modular graph-based Retrieval-Augmented Generation (RAG) system| | 234|rancher/rancher !2025-03-28239675|Complete container management platform| | 235|bazelbuild/bazel !2025-03-282384618|a fast, scalable, multi-language and extensible build system| | 236|modularml/mojo !2025-03-28238236 |The Mojo Programming Language| | 237|danny-avila/LibreChat !2025-03-282378753|Enhanced ChatGPT Clone: Features OpenAI, GPT-4 Vision, Bing, Anthropic, OpenRouter, Google Gemini, AI model switching, message search, langchain, DALL-E-3, ChatGPT Plugins, OpenAI Functions, Secure Multi-User System, Presets, completely open-source for self-hosting. More features in development| |!green-up-arrow.svg 238|🔥🔥🔥Shubhamsaboo/awesome-llm-apps !2025-03-28237391211|Collection of awesome LLM apps with RAG using OpenAI, Anthropic, Gemini and opensource models.| |!red-down-arrow 239|microsoft/semantic-kernel !2025-03-282373611|Integrate cutting-edge LLM technology quickly and easily into your apps| |!red-down-arrow 240|TheAlgorithms/Rust !2025-03-28236995|All Algorithms implemented in Rust| | 241|stanford-oval/storm !2025-03-28236326|An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.| | 242|openai/gpt-2 !2025-03-28232483|Code for the paper "Language Models are Unsupervised Multitask Learners"| | 243|labring/FastGPT !2025-03-282319445|A platform that uses the OpenAI API to quickly build an AI knowledge base, supporting many-to-many relationships.| | 244|pathwaycom/llm-app !2025-03-2822928-10|Ready-to-run cloud templates for RAG, AI pipelines, and enterprise search with live data. 🐳Docker-friendly.⚡Always in sync with Sharepoint, Google Drive, S3, Kafka, PostgreSQL, real-time data APIs, and more.| | 245|warpdotdev/Warp !2025-03-282286825|Warp is a modern, Rust-based terminal with AI built in so you and your team can build great software, faster.| | 246|🔥agno-agi/agno !2025-03-2822833298|Agno is a lightweight library for building Multimodal Agents. It exposes LLMs as a unified API and gives them superpowers like memory, knowledge, tools and reasoning.| | 247|qdrant/qdrant !2025-03-282275214 |Qdrant - Vector Database for the next generation of AI applications. Also available in the cloud https://cloud.qdrant.io/| | 248|ashishpatel26/500-AI-Machine-learning-Deep-learning-Computer-vision-NLP-Projects-with-code !2025-03-282271815|500 AI Machine learning Deep learning Computer vision NLP Projects with code| | 249|stanfordnlp/dspy !2025-03-282268321|Stanford DSPy: The framework for programming—not prompting—foundation models| | 250|PaddlePaddle/Paddle !2025-03-28226246|PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (『飞桨』核心框架,深度学习&机器学习高性能单机、分布式训练和跨平台部署)| | 251|zulip/zulip !2025-03-28225464|Zulip server and web application. Open-source team chat that helps teams stay productive and focused.| | 252|Hannibal046/Awesome-LLM !2025-03-282240721|Awesome-LLM: a curated list of Large Language Model| | 253|facefusion/facefusion !2025-03-282218812|Next generation face swapper and enhancer| | 254|Mozilla-Ocho/llamafile !2025-03-28220624|Distribute and run LLMs with a single file.| | 255|yuliskov/SmartTube !2025-03-282201614|SmartTube - an advanced player for set-top boxes and tvs running Android OS| | 256|haotian-liu/LLaVA !2025-03-282201316 |Large Language-and-Vision Assistant built towards multimodal GPT-4 level capabilities.| | 257|ashishps1/awesome-system-design-resources !2025-03-282189367|This repository contains System Design resources which are useful while preparing for interviews and learning Distributed Systems| | 258|Cinnamon/kotaemon !2025-03-28218248|An open-source RAG-based tool for chatting with your documents.| | 259|CodePhiliaX/Chat2DB !2025-03-282179757|🔥🔥🔥AI-driven database tool and SQL client, The hottest GUI client, supporting MySQL, Oracle, PostgreSQL, DB2, SQL Server, DB2, SQLite, H2, ClickHouse, and more.| | 260|blakeblackshear/frigate !2025-03-282177113|NVR with realtime local object detection for IP cameras| | 261|facebookresearch/audiocraft !2025-03-28217111|Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.| | 262|karpathy/minGPT !2025-03-28216567|A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training| | 263|grpc/grpc-go !2025-03-282159510|The Go language implementation of gRPC. HTTP/2 based RPC| | 264|HumanSignal/label-studio !2025-03-282137618|Label Studio is a multi-type data labeling and annotation tool with standardized output format| | 265|yoheinakajima/babyagi !2025-03-28212764 |uses OpenAI and Pinecone APIs to create, prioritize, and execute tasks, This is a pared-down version of the original Task-Driven Autonomous Agent| | 266|deepseek-ai/DeepSeek-Coder !2025-03-282118210|DeepSeek Coder: Let the Code Write Itself| | 267|BuilderIO/gpt-crawler !2025-03-282118010|Crawl a site to generate knowledge files to create your own custom GPT from a URL| | 268| openai/chatgpt-retrieval-plugin !2025-03-2821152-1 | Plugins are chat extensions designed specifically for language models like ChatGPT, enabling them to access up-to-date information, run computations, or interact with third-party services in response to a user's request.| | 269|microsoft/OmniParser !2025-03-282113123|A simple screen parsing tool towards pure vision based GUI agent| | 270|black-forest-labs/flux !2025-03-282107219|Official inference repo for FLUX.1 models| | 271|ItzCrazyKns/Perplexica !2025-03-282099154|Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI| | 272|microsoft/unilm !2025-03-28209876|Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities| | 273|Sanster/lama-cleaner !2025-03-282077614|Image inpainting tool powered by SOTA AI Model. Remove any unwanted object, defect, people from your pictures or erase and replace(powered by stable diffusion) any thing on your pictures.| | 274|assafelovic/gpt-researcher !2025-03-282057222|GPT based autonomous agent that does online comprehensive research on any given topic| | 275|PromtEngineer/localGPT !2025-03-28204230 |Chat with your documents on your local device using GPT models. No data leaves your device and 100% private.| | 276|elastic/kibana !2025-03-28203482|Your window into the Elastic Stack| | 277|fishaudio/fish-speech !2025-03-282033222|Brand new TTS solution| | 278|mlc-ai/mlc-llm !2025-03-282028110 |Enable everyone to develop, optimize and deploy AI models natively on everyone's devices.| | 279|deepset-ai/haystack !2025-03-282005320|🔍 Haystack is an open source NLP framework to interact with your data using Transformer models and LLMs (GPT-4, ChatGPT and alike). Haystack offers production-ready tools to quickly build complex question answering, semantic search, text generation applications, and more.| | 280|tree-sitter/tree-sitter !2025-03-28200487|An incremental parsing system for programming tools| | 281|Anjok07/ultimatevocalremovergui !2025-03-281999811|GUI for a Vocal Remover that uses Deep Neural Networks.| | 282|guidance-ai/guidance !2025-03-28199622|A guidance language for controlling large language models.| | 283|ml-explore/mlx !2025-03-28199619|MLX: An array framework for Apple silicon| | 284|mlflow/mlflow !2025-03-281995314|Open source platform for the machine learning lifecycle| | 285|ml-tooling/best-of-ml-python !2025-03-28198631|🏆 A ranked list of awesome machine learning Python libraries. Updated weekly.| | 286|BerriAI/litellm !2025-03-281981862|Call all LLM APIs using the OpenAI format. Use Bedrock, Azure, OpenAI, Cohere, Anthropic, Ollama, Sagemaker, HuggingFace, Replicate (100+ LLMs)| | 287|LazyVim/LazyVim !2025-03-281981320|Neovim config for the lazy| | 288|wez/wezterm !2025-03-281976018|A GPU-accelerated cross-platform terminal emulator and multiplexer written by @wez and implemented in Rust| | 289|valkey-io/valkey !2025-03-281970416|A flexible distributed key-value datastore that supports both caching and beyond caching workloads.| | 290|LiLittleCat/awesome-free-chatgpt !2025-03-28196185|🆓免费的 ChatGPT 镜像网站列表,持续更新。List of free ChatGPT mirror sites, continuously updated.| | 291|Byaidu/PDFMathTranslate !2025-03-281947645|PDF scientific paper translation with preserved formats - 基于 AI 完整保留排版的 PDF 文档全文双语翻译,支持 Google/DeepL/Ollama/OpenAI 等服务,提供 CLI/GUI/Docker| | 292|openai/swarm !2025-03-281947111|Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.| | 293|HqWu-HITCS/Awesome-Chinese-LLM !2025-03-281921423|Organizing smaller, cost-effective, privately deployable open-source Chinese language models, including related datasets and tutorials| | 294|stitionai/devika !2025-03-28190903|Devika is an Agentic AI Software Engineer that can understand high-level human instructions, break them down into steps, research relevant information, and write code to achieve the given objective. Devika aims to be a competitive open-source alternative to Devin by Cognition AI.| | 295|OpenBMB/MiniCPM-o !2025-03-28190887|MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone| | 296|samber/lo !2025-03-281904815|💥 A Lodash-style Go library based on Go 1.18+ Generics (map, filter, contains, find...)| | 297|chroma-core/chroma !2025-03-281895221 |the AI-native open-source embedding database| | 298|DarkFlippers/unleashed-firmware !2025-03-28189278|Flipper Zero Unleashed Firmware| | 299|brave/brave-browser !2025-03-281892710|Brave browser for Android, iOS, Linux, macOS, Windows.| | 300| tloen/alpaca-lora !2025-03-28188641 | Instruct-tune LLaMA on consumer hardware| | 301|VinciGit00/Scrapegraph-ai !2025-03-281884618|Python scraper based on AI| | 302|gitroomhq/postiz-app !2025-03-281879110|📨 Schedule social posts, measure them, exchange with other members and get a lot of help from AI 🚀| | 303|PrefectHQ/prefect !2025-03-281878715|Prefect is a workflow orchestration tool empowering developers to build, observe, and react to data pipelines| | 304|ymcui/Chinese-LLaMA-Alpaca !2025-03-28187723 |Chinese LLaMA & Alpaca LLMs| | 305|kenjihiranabe/The-Art-of-Linear-Algebra !2025-03-28187335|Graphic notes on Gilbert Strang's "Linear Algebra for Everyone"| | 306|joonspk-research/generativeagents !2025-03-28187288|Generative Agents: Interactive Simulacra of Human Behavior| | 307|renovatebot/renovate !2025-03-28186820|Universal dependency update tool that fits into your workflows.| | 308|gventuri/pandas-ai !2025-03-28186109 |Pandas AI is a Python library that integrates generative artificial intelligence capabilities into Pandas, making dataframes conversational| | 309|thingsboard/thingsboard !2025-03-28185184|Open-source IoT Platform - Device management, data collection, processing and visualization.| | 310|ente-io/ente !2025-03-28184722|Fully open source, End to End Encrypted alternative to Google Photos and Apple Photos| | 311|serengil/deepface !2025-03-281840113|A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python| | 312|Raphire/Win11Debloat !2025-03-281840132|A simple, easy to use PowerShell script to remove pre-installed apps from windows, disable telemetry, remove Bing from windows search as well as perform various other changes to declutter and improve your windows experience. This script works for both windows 10 and windows 11.| | 313|Avaiga/taipy !2025-03-28179235|Turns Data and AI algorithms into production-ready web applications in no time.| | 314|microsoft/qlib !2025-03-281784231|Qlib is an AI-oriented quantitative investment platform that aims to realize the potential, empower research, and create value using AI technologies in quantitative investment, from exploring ideas to implementing productions. Qlib supports diverse machine learning modeling paradigms. including supervised learning, market dynamics modeling, and RL.| | 315|CopilotKit/CopilotKit !2025-03-281778571|Build in-app AI chatbots 🤖, and AI-powered Textareas ✨, into react web apps.| | 316|QwenLM/Qwen-7B !2025-03-281766017|The official repo of Qwen-7B (通义千问-7B) chat & pretrained large language model proposed by Alibaba Cloud.| | 317|w-okada/voice-changer !2025-03-28176078 |リアルタイムボイスチェンジャー Realtime Voice Changer| | 318|rlabbe/Kalman-and-Bayesian-Filters-in-Python !2025-03-281756011|Kalman Filter book using Jupyter Notebook. Focuses on building intuition and experience, not formal proofs. Includes Kalman filters,extended Kalman filters, unscented Kalman filters, particle filters, and more. All exercises include solutions.| | 319|Mikubill/sd-webui-controlnet !2025-03-28174794 |WebUI extension for ControlNet| | 320|jingyaogong/minimind !2025-03-2817380116|「大模型」3小时完全从0训练26M的小参数GPT,个人显卡即可推理训练!| | 321|apify/crawlee !2025-03-28172696|Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Both headful and headless mode. With proxy rotation.| | 322|apple/ml-stable-diffusion !2025-03-28172395|Stable Diffusion with Core ML on Apple Silicon| | 323| transitive-bullshit/chatgpt-api !2025-03-28172095 | Node.js client for the official ChatGPT API. | | 324|teableio/teable !2025-03-281719222|✨ The Next Gen Airtable Alternative: No-Code Postgres| | 325| xx025/carrot !2025-03-28170900 | Free ChatGPT Site List | | 326|microsoft/LightGBM !2025-03-28170723|A fast, distributed, high-performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning tasks.| | 327|VikParuchuri/surya !2025-03-28169827|Accurate line-level text detection and recognition (OCR) in any language| | 328|deepseek-ai/Janus !2025-03-281692825|Janus-Series: Unified Multimodal Understanding and Generation Models| | 329|ardalis/CleanArchitecture !2025-03-28168823|Clean Architecture Solution Template: A starting point for Clean Architecture with ASP.NET Core| | 330|neondatabase/neon !2025-03-28166466|Neon: Serverless Postgres. We separated storage and compute to offer autoscaling, code-like database branching, and scale to zero.| | 331|kestra-io/kestra !2025-03-281661313|⚡ Workflow Automation Platform. Orchestrate & Schedule code in any language, run anywhere, 500+ plugins. Alternative to Zapier, Rundeck, Camunda, Airflow...| | 332|Dao-AILab/flash-attention !2025-03-281659720|Fast and memory-efficient exact attention| | 333|RPCS3/rpcs3 !2025-03-281655712|PS3 emulator/debugger| | 334|meta-llama/llama-recipes !2025-03-28165486|Scripts for fine-tuning Llama2 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization & question answering. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment.Demo apps to showcase Llama2 for WhatsApp & Messenger| | 335|emilwallner/Screenshot-to-code !2025-03-28165180|A neural network that transforms a design mock-up into a static website.| | 336|datawhalechina/llm-cookbook !2025-03-281650922|面向开发者的 LLM 入门教程,吴恩达大模型系列课程中文版| | 337|e2b-dev/awesome-ai-agents !2025-03-281643923|A list of AI autonomous agents| | 338|QwenLM/Qwen2.5 !2025-03-281641114|Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.| | 339|dair-ai/ML-YouTube-Courses !2025-03-28164114|📺 Discover the latest machine learning / AI courses on YouTube.| | 340|pybind/pybind11 !2025-03-28163620|Seamless operability between C++11 and Python| | 341|graphdeco-inria/gaussian-splatting !2025-03-281627116|Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"| | 342|meta-llama/codellama !2025-03-28162531|Inference code for CodeLlama models| | 343|TransformerOptimus/SuperAGI !2025-03-28161292 | SuperAGI - A dev-first open source autonomous AI agent framework. Enabling developers to build, manage & run useful autonomous agents quickly and reliably.| | 344|microsoft/onnxruntime !2025-03-28161169|ONNX Runtime: cross-platform, high-performance ML inferencing and training accelerator| | 345|IDEA-Research/Grounded-Segment-Anything !2025-03-281601411 |Marrying Grounding DINO with Segment Anything & Stable Diffusion & BLIP - Automatically Detect, Segment and Generate Anything with Image and Text Inputs| | 346|ddbourgin/numpy-ml !2025-03-28160054|Machine learning, in numpy| | 347|eosphoros-ai/DB-GPT !2025-03-281585225|Revolutionizing Database Interactions with Private LLM Technology| | 348|Stability-AI/StableLM !2025-03-28158310 |Stability AI Language Models| | 349|openai/evals !2025-03-28157935 |Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.| | 350|THUDM/ChatGLM2-6B !2025-03-28157500|ChatGLM2-6B: An Open Bilingual Chat LLM | | 351|sunner/ChatALL !2025-03-28156761 |Concurrently chat with ChatGPT, Bing Chat, Bard, Alpaca, Vincuna, Claude, ChatGLM, MOSS, iFlytek Spark, ERNIE and more, discover the best answers| | 352|abseil/abseil-cpp !2025-03-28156656|Abseil Common Libraries (C++)| | 353|NVIDIA/open-gpu-kernel-modules !2025-03-28156531|NVIDIA Linux open GPU kernel module source| | 354|letta-ai/letta !2025-03-281563718|Letta (formerly MemGPT) is a framework for creating LLM services with memory.| | 355|typescript-eslint/typescript-eslint !2025-03-28156211|✨ Monorepo for all the tooling which enables ESLint to support TypeScript| | 356|umijs/umi !2025-03-28156211|A framework in react community ✨| | 357|AI4Finance-Foundation/FinGPT !2025-03-281561215|Data-Centric FinGPT. Open-source for open finance! Revolutionize 🔥 We'll soon release the trained model.| | 358|amplication/amplication !2025-03-28156022|🔥🔥🔥 The Only Production-Ready AI-Powered Backend Code Generation| | 359|KindXiaoming/pykan !2025-03-28155477|Kolmogorov Arnold Networks| | 360|arc53/DocsGPT !2025-03-28154900|GPT-powered chat for documentation, chat with your documents| | 361|influxdata/telegraf !2025-03-28154502|Agent for collecting, processing, aggregating, and writing metrics, logs, and other arbitrary data.| | 362|microsoft/Bringing-Old-Photos-Back-to-Life !2025-03-28154084|Bringing Old Photo Back to Life (CVPR 2020 oral)| | 363|GaiZhenbiao/ChuanhuChatGPT !2025-03-2815394-2|GUI for ChatGPT API and many LLMs. Supports agents, file-based QA, GPT finetuning and query with web search. All with a neat UI.| | 364|Zeyi-Lin/HivisionIDPhotos !2025-03-281529710|⚡️HivisionIDPhotos: a lightweight and efficient AI ID photos tools. 一个轻量级的AI证件照制作算法。| | 365| mayooear/gpt4-pdf-chatbot-langchain !2025-03-281529518 | GPT4 & LangChain Chatbot for large PDF docs | | 366|1Panel-dev/MaxKB !2025-03-2815277148|? Based on LLM large language model knowledge base Q&A system. Ready to use out of the box, supports quick integration into third-party business systems. Officially produced by 1Panel| | 367|ai16z/eliza !2025-03-281526811|Conversational Agent for Twitter and Discord| | 368|apache/arrow !2025-03-28151684|Apache Arrow is a multi-language toolbox for accelerated data interchange and in-memory processing| | 369|princeton-nlp/SWE-agent !2025-03-281516119|SWE-agent: Agent Computer Interfaces Enable Software Engineering Language Models| | 370|mlc-ai/web-llm !2025-03-281509311 |Bringing large-language models and chat to web browsers. Everything runs inside the browser with no server support.| | 371|guillaumekln/faster-whisper !2025-03-281507117 |Faster Whisper transcription with CTranslate2| | 372|overleaf/overleaf !2025-03-28150316|A web-based collaborative LaTeX editor| | 373|triton-lang/triton !2025-03-28150169|Development repository for the Triton language and compiler| | 374|soxoj/maigret !2025-03-281500410|🕵️‍♂️ Collect a dossier on a person by username from thousands of sites| | 375|alibaba/lowcode-engine !2025-03-28149841|An enterprise-class low-code technology stack with scale-out design / 一套面向扩展设计的企业级低代码技术体系| | 376|espressif/esp-idf !2025-03-28148545|Espressif IoT Development Framework. Official development framework for Espressif SoCs.| | 377|pgvector/pgvector !2025-03-281484913|Open-source vector similarity search for Postgres| | 378|datawhalechina/leedl-tutorial !2025-03-28148246|《李宏毅深度学习教程》(李宏毅老师推荐👍),PDF下载地址:https://github.com/datawhalechina/leedl-tutorial/releases| | 379|xcanwin/KeepChatGPT !2025-03-28147972 |Using ChatGPT is more efficient and smoother, perfectly solving ChatGPT network errors. No longer do you need to frequently refresh the webpage, saving over 10 unnecessary steps| | 380|m-bain/whisperX !2025-03-281471313|WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)| | 381|HumanAIGC/AnimateAnyone !2025-03-2814706-1|Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation| |!green-up-arrow.svg 382|naklecha/llama3-from-scratch !2025-03-281469024|llama3 implementation one matrix multiplication at a time| |!red-down-arrow 383| fauxpilot/fauxpilot !2025-03-28146871 | An open-source GitHub Copilot server | | 384|LlamaFamily/Llama-Chinese !2025-03-28145111|Llama Chinese Community, the best Chinese Llama large model, fully open source and commercially available| | 385|BradyFU/Awesome-Multimodal-Large-Language-Models !2025-03-281450121|Latest Papers and Datasets on Multimodal Large Language Models| | 386|vanna-ai/vanna !2025-03-281449819|🤖 Chat with your SQL database 📊. Accurate Text-to-SQL Generation via LLMs using RAG 🔄.| | 387|bleedline/aimoneyhunter !2025-03-28144845|AI Side Hustle Money Mega Collection: Teaching You How to Utilize AI for Various Side Projects to Earn Extra Income.| | 388|stefan-jansen/machine-learning-for-trading !2025-03-28144629|Code for Machine Learning for Algorithmic Trading, 2nd edition.| | 389|state-spaces/mamba !2025-03-28144139|Mamba: Linear-Time Sequence Modeling with Selective State Spaces| | 390|vercel/ai-chatbot !2025-03-281434614|A full-featured, hackable Next.js AI chatbot built by Vercel| | 391|steven-tey/novel !2025-03-281428410|Notion-style WYSIWYG editor with AI-powered autocompletions| | 392|unifyai/ivy !2025-03-281409348|Unified AI| | 393|chidiwilliams/buzz !2025-03-281402411 |Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.| | 394|lukas-blecher/LaTeX-OCR !2025-03-28139769|pix2tex: Using a ViT to convert images of equations into LaTeX code.| | 395|openai/tiktoken !2025-03-28139599|tiktoken is a fast BPE tokeniser for use with OpenAI's models.| | 396|nocobase/nocobase !2025-03-281391522|NocoBase is a scalability-first, open-source no-code/low-code platform for building business applications and enterprise solutions.| | 397|neonbjb/tortoise-tts !2025-03-28139010 |A multi-voice TTS system trained with an emphasis on quality| | 398|yamadashy/repomix !2025-03-281382036|📦 Repomix (formerly Repopack) is a powerful tool that packs your entire repository into a single, AI-friendly file. Perfect for when you need to feed your codebase to Large Language Models (LLMs) or other AI tools like Claude, ChatGPT, and Gemini.| | 399|adobe/react-spectrum !2025-03-28136766|A collection of libraries and tools that help you build adaptive, accessible, and robust user experiences.| | 400|THUDM/ChatGLM3 !2025-03-28136684|ChatGLM3 series: Open Bilingual Chat LLMs | | 401|NVIDIA/NeMo !2025-03-28134837|A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)| | 402|BlinkDL/RWKV-LM !2025-03-28134346 |RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it combines the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding.| | 403| fuergaosi233/wechat-chatgpt !2025-03-28133330 | Use ChatGPT On Wechat via wechaty | | 404|udecode/plate !2025-03-28133325|A rich-text editor powered by AI| | 405|xenova/transformers.js !2025-03-281331219|State-of-the-art Machine Learning for the web. Run 🤗 Transformers directly in your browser, with no need for a server!| | 406|stas00/ml-engineering !2025-03-281325615|Machine Learning Engineering Guides and Tools| | 407| wong2/chatgpt-google-extension !2025-03-2813241-1 | A browser extension that enhances search engines with ChatGPT, this repos will not be updated from 2023-02-20| | 408|mrdbourke/pytorch-deep-learning !2025-03-281317520|Materials for the Learn PyTorch for Deep Learning: Zero to Mastery course.| | 409|Koenkk/zigbee2mqtt !2025-03-28131544|Zigbee 🐝 to MQTT bridge 🌉, get rid of your proprietary Zigbee bridges 🔨| | 410|vercel-labs/ai !2025-03-281298528|Build AI-powered applications with React, Svelte, and Vue| | 411|netease-youdao/QAnything !2025-03-28129318|Question and Answer based on Anything.| | 412|huggingface/trl !2025-03-281289622|Train transformer language models with reinforcement learning.| | 413|microsoft/BitNet !2025-03-28128503|Official inference framework for 1-bit LLMs| | 414|mediar-ai/screenpipe !2025-03-281283915|24/7 local AI screen & mic recording. Build AI apps that have the full context. Works with Ollama. Alternative to Rewind.ai. Open. Secure. You own your data. Rust.| | 415|Skyvern-AI/skyvern !2025-03-281277612|Automate browser-based workflows with LLMs and Computer Vision| | 416|pytube/pytube !2025-03-28126591|A lightweight, dependency-free Python library (and command-line utility) for downloading YouTube Videos.| | 417|official-stockfish/Stockfish !2025-03-28126574|UCI chess engine| | 418|sgl-project/sglang !2025-03-281260143|SGLang is a structured generation language designed for large language models (LLMs). It makes your interaction with LLMs faster and more controllable.| | 419|plasma-umass/scalene !2025-03-28125535|Scalene: a high-performance, high-precision CPU, GPU, and memory profiler for Python with AI-powered optimization proposals| | 420|danswer-ai/danswer !2025-03-28125503|Ask Questions in natural language and get Answers backed by private sources. Connects to tools like Slack, GitHub, Confluence, etc.| | 421|OpenTalker/SadTalker !2025-03-28125226|[CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation| | 422|facebookresearch/AnimatedDrawings !2025-03-28123693 |Code to accompany "A Method for Animating Children's Drawings of the Human Figure"| | 423|activepieces/activepieces !2025-03-28123609|Your friendliest open source all-in-one automation tool ✨ Workflow automation tool 100+ integration / Enterprise automation tool / Zapier Alternative| | 424|ggerganov/ggml !2025-03-28121992 |Tensor library for machine learning| | 425|bytebase/bytebase !2025-03-28121694|World's most advanced database DevOps and CI/CD for Developer, DBA and Platform Engineering teams. The GitLab/GitHub for database DevOps.| | 426| willwulfken/MidJourney-Styles-and-Keywords-Reference !2025-03-28120971 | A reference containing Styles and Keywords that you can use with MidJourney AI| | 427|Huanshere/VideoLingo !2025-03-281207013|Netflix-level subtitle cutting, translation, alignment, and even dubbing - one-click fully automated AI video subtitle team | | 428|OpenLMLab/MOSS !2025-03-28120330 |An open-source tool-augmented conversational language model from Fudan University| | 429|llmware-ai/llmware !2025-03-281200727|Providing enterprise-grade LLM-based development framework, tools, and fine-tuned models.| | 430|PKU-YuanGroup/Open-Sora-Plan !2025-03-28119362|This project aim to reproduce Sora (Open AI T2V model), but we only have limited resource. We deeply wish the all open source community can contribute to this project.| | 431|ShishirPatil/gorilla !2025-03-28119332 |Gorilla: An API store for LLMs| | 432|NVIDIA/Megatron-LM !2025-03-281192716|Ongoing research training transformer models at scale| | 433|illacloud/illa-builder !2025-03-28119192|Create AI-Driven Apps like Assembling Blocks| | 434|marimo-team/marimo !2025-03-281191521|A reactive notebook for Python — run reproducible experiments, execute as a script, deploy as an app, and version with git.| | 435|smol-ai/developer !2025-03-28119111 | With 100k context windows on the way, it's now feasible for every dev to have their own smol developer| | 436|Lightning-AI/litgpt !2025-03-28118878|Pretrain, finetune, deploy 20+ LLMs on your own data. Uses state-of-the-art techniques: flash attention, FSDP, 4-bit, LoRA, and more.| | 437|openai/shap-e !2025-03-28118474 |Generate 3D objects conditioned on text or images| | 438|eugeneyan/open-llms !2025-03-28118451 |A list of open LLMs available for commercial use.| | 439|andrewyng/aisuite !2025-03-28118124|Simple, unified interface to multiple Generative AI providers| | 440|hajimehoshi/ebiten !2025-03-28117816|Ebitengine - A dead simple 2D game engine for Go| | 441|kgrzybek/modular-monolith-with-ddd !2025-03-28117493|Full Modular Monolith application with Domain-Driven Design approach.| | 442|h2oai/h2ogpt !2025-03-2811736-1 |Come join the movement to make the world's best open source GPT led by H2O.ai - 100% private chat and document search, no data leaks, Apache 2.0| | 443|owainlewis/awesome-artificial-intelligence !2025-03-28117332|A curated list of Artificial Intelligence (AI) courses, books, video lectures and papers.| | 444|DataTalksClub/mlops-zoomcamp !2025-03-28116643|Free MLOps course from DataTalks.Club| | 445|Rudrabha/Wav2Lip !2025-03-281163410|This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020.| | 446|aishwaryanr/awesome-generative-ai-guide !2025-03-281152810|A one stop repository for generative AI research updates, interview resources, notebooks and much more!| | 447|karpathy/micrograd !2025-03-28115146|A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API| | 448|InstantID/InstantID !2025-03-28115111|InstantID : Zero-shot Identity-Preserving Generation in Seconds 🔥| | 449|facebookresearch/seamlesscommunication !2025-03-28114434|Foundational Models for State-of-the-Art Speech and Text Translation| | 450|anthropics/anthropic-cookbook !2025-03-281140112|A collection of notebooks/recipes showcasing some fun and effective ways of using Claude.| | 451|mastra-ai/mastra !2025-03-281139240|the TypeScript AI agent framework| | 452|NVIDIA/TensorRT !2025-03-28113864|NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.| | 453|plandex-ai/plandex !2025-03-28113645|An AI coding engine for complex tasks| | 454|RUCAIBox/LLMSurvey !2025-03-28112735 |A collection of papers and resources related to Large Language Models.| | 455|kubeshark/kubeshark !2025-03-28112711|The API traffic analyzer for Kubernetes providing real-time K8s protocol-level visibility, capturing and monitoring all traffic and payloads going in, out and across containers, pods, nodes and clusters. Inspired by Wireshark, purposely built for Kubernetes| | 456|electric-sql/pglite !2025-03-28112617|Lightweight Postgres packaged as WASM into a TypeScript library for the browser, Node.js, Bun and Deno from https://electric-sql.com| | 457|lightaime/camel !2025-03-281124441 |🐫 CAMEL: Communicative Agents for “Mind” Exploration of Large Scale Language Model Society| | 458|huggingface/lerobot !2025-03-281120184|🤗 LeRobot: State-of-the-art Machine Learning for Real-World Robotics in Pytorch| | 459|normal-computing/outlines !2025-03-28111657|Generative Model Programming| | 460|libretro/RetroArch !2025-03-28110701|Cross-platform, sophisticated frontend for the libretro API. Licensed GPLv3.| | 461|THUDM/CogVideo !2025-03-28110599|Text-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)| | 462|bentoml/OpenLLM !2025-03-28110495|An open platform for operating large language models (LLMs) in production. Fine-tune, serve, deploy, and monitor any LLMs with ease.| | 463|vosen/ZLUDA !2025-03-28110429|CUDA on AMD GPUs| | 464|dair-ai/ML-Papers-of-the-Week !2025-03-28110304 |🔥Highlighting the top ML papers every week.| | 465|WordPress/gutenberg !2025-03-28110212|The Block Editor project for WordPress and beyond. Plugin is available from the official repository.| | 466|microsoft/data-formulator !2025-03-281099827|🪄 Create rich visualizations with AI| | 467|LibreTranslate/LibreTranslate !2025-03-28109887|Free and Open Source Machine Translation API. Self-hosted, offline capable and easy to setup.| | 468|block/goose !2025-03-281097737|an open-source, extensible AI agent that goes beyond code suggestions - install, execute, edit, and test with any LLM| | 469|getumbrel/llama-gpt !2025-03-28109553|A self-hosted, offline, ChatGPT-like chatbot. Powered by Llama 2. 100% private, with no data leaving your device.| | 470|HigherOrderCO/HVM !2025-03-28109182|A massively parallel, optimal functional runtime in Rust| | 471|databrickslabs/dolly !2025-03-2810812-3 | A large language model trained on the Databricks Machine Learning Platform| | 472|srush/GPU-Puzzles !2025-03-28108014|Solve puzzles. Learn CUDA.| | 473|Z3Prover/z3 !2025-03-28107952|The Z3 Theorem Prover| | 474|UFund-Me/Qbot !2025-03-281079313 |Qbot is an AI-oriented quantitative investment platform, which aims to realize the potential, empower AI technologies in quantitative investment| | 475|langchain-ai/langgraph !2025-03-281077336|| | 476|lz4/lz4 !2025-03-28107647|Extremely Fast Compression algorithm| | 477|magic-research/magic-animate !2025-03-28107160|MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model| | 478|PaperMC/Paper !2025-03-281071410|The most widely used, high performance Minecraft server that aims to fix gameplay and mechanics inconsistencies| | 479|getomni-ai/zerox !2025-03-281071015|Zero shot pdf OCR with gpt-4o-mini| |!green-up-arrow.svg 480|🔥NirDiamant/GenAIAgents !2025-03-2810693318|This repository provides tutorials and implementations for various Generative AI Agent techniques, from basic to advanced. It serves as a comprehensive guide for building intelligent, interactive AI systems.| |!red-down-arrow 481|Unstructured-IO/unstructured !2025-03-28106889|Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.| | 482|apache/thrift !2025-03-28106610|Apache Thrift| | 483| TheR1D/shellgpt !2025-03-28106097 | A command-line productivity tool powered by ChatGPT, will help you accomplish your tasks faster and more efficiently | | 484|TheRamU/Fay !2025-03-281060312 |Fay is a complete open source project that includes Fay controller and numeral models, which can be used in different applications such as virtual hosts, live promotion, numeral human interaction and so on| | 485|zyronon/douyin !2025-03-28105566|Vue3 + Pinia + Vite5 仿抖音,Vue 在移动端的最佳实践 . Imitate TikTok ,Vue Best practices on Mobile| | 486|THU-MIG/yolov10 !2025-03-28105485|YOLOv10: Real-Time End-to-End Object Detection| | 487|idootop/mi-gpt !2025-03-281052522|? Transform XiaoAi speaker into a personal voice assistant with ChatGPT and DouBao integration.| | 488|SakanaAI/AI-Scientist !2025-03-281051310|The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬| | 489|szimek/sharedrop !2025-03-28105101|Easy P2P file transfer powered by WebRTC - inspired by Apple AirDrop| | 490|salesforce/LAVIS !2025-03-28103942 |LAVIS - A One-stop Library for Language-Vision Intelligence| | 491|aws/amazon-sagemaker-examples !2025-03-28103654|Example 📓 Jupyter notebooks that demonstrate how to build, train, and deploy machine learning models using 🧠 Amazon SageMaker.| | 492|artidoro/qlora !2025-03-28103402 |QLoRA: Efficient Finetuning of Quantized LLMs| | 493|lllyasviel/stable-diffusion-webui-forge !2025-03-281029314| a platform on top of Stable Diffusion WebUI (based on Gradio) to make development easier, optimize resource management, and speed up inference| | 494|NielsRogge/Transformers-Tutorials !2025-03-28102487|This repository contains demos I made with the Transformers library by HuggingFace.| | 495|kedro-org/kedro !2025-03-28102371|Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and data science pipelines that are reproducible, maintainable, and modular.| | 496| chathub-dev/chathub !2025-03-28102301 | All-in-one chatbot client | | 497|microsoft/promptflow !2025-03-28101612|Build high-quality LLM apps - from prototyping, testing to production deployment and monitoring.| | 498|mistralai/mistral-src !2025-03-28101372|Reference implementation of Mistral AI 7B v0.1 model.| | 499|burn-rs/burn !2025-03-28101183|Burn - A Flexible and Comprehensive Deep Learning Framework in Rust| | 500|AIGC-Audio/AudioGPT !2025-03-28101150 |AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head| | 501|facebookresearch/dinov2 !2025-03-281011210 |PyTorch code and models for the DINOv2 self-supervised learning method.| | 502|RockChinQ/LangBot !2025-03-281008455|😎丰富生态、🧩支持扩展、🦄多模态 - 大模型原生即时通信机器人平台 🤖 | | 503|78/xiaozhi-esp32 !2025-03-281008180|Build your own AI friend| | 504|cumulo-autumn/StreamDiffusion !2025-03-28100761|StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation| | 505|DataTalksClub/machine-learning-zoomcamp !2025-03-28100664|The code from the Machine Learning Bookcamp book and a free course based on the book| | 506|nerfstudio-project/nerfstudio !2025-03-28100343|A collaboration friendly studio for NeRFs| | 507|cupy/cupy !2025-03-28100344|NumPy & SciPy for GPU| | 508|NVIDIA/TensorRT-LLM !2025-03-281000823|TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.| | 509|wasp-lang/open-saas !2025-03-2899665|A free, open-source SaaS app starter for React & Node.js with superpowers. Production-ready. Community-driven.| | 510|huggingface/text-generation-inference !2025-03-2899383|Large Language Model Text Generation Inference| | 511|jxnl/instructor !2025-03-2899224|structured outputs for llms| | 512|GoogleCloudPlatform/generative-ai !2025-03-2899086|Sample code and notebooks for Generative AI on Google Cloud| | 513|manticoresoftware/manticoresearch !2025-03-2898799|Easy to use open source fast database for search | | 514|langfuse/langfuse !2025-03-28985134|🪢 Open source LLM engineering platform. Observability, metrics, evals, prompt management, testing, prompt playground, datasets, LLM evaluations -- 🍊YC W23 🤖 integrate via Typescript, Python / Decorators, OpenAI, Langchain, LlamaIndex, Litellm, Instructor, Mistral, Perplexity, Claude, Gemini, Vertex| | 515|keephq/keep !2025-03-2897949|The open-source alert management and AIOps platform| | 516|sashabaranov/go-openai !2025-03-2897843|OpenAI ChatGPT, GPT-3, GPT-4, DALL·E, Whisper API wrapper for Go| | 517|autowarefoundation/autoware !2025-03-2897766|Autoware - the world's leading open-source software project for autonomous driving| | 518|anthropics/courses !2025-03-2897269|Anthropic's educational courses| | 519|popcorn-official/popcorn-desktop !2025-03-2896853|Popcorn Time is a multi-platform, free software BitTorrent client that includes an integrated media player ( Windows / Mac / Linux ) A Butter-Project Fork| | 520|getmaxun/maxun !2025-03-28968515|🔥 Open-source no-code web data extraction platform. Turn websites to APIs and spreadsheets with no-code robots in minutes! [In Beta]| | 521|wandb/wandb !2025-03-2896763|🔥 A tool for visualizing and tracking your machine learning experiments. This repo contains the CLI and Python API.| | 522|karpathy/minbpe !2025-03-2895353|Minimal, clean, code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.| | 523|bigscience-workshop/petals !2025-03-2895142|🌸 Run large language models at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading| | 524|OthersideAI/self-operating-computer !2025-03-2894931|A framework to enable multimodal models to operate a computer.| | 525|mshumer/gpt-prompt-engineer !2025-03-2894911|| | 526| BloopAI/bloop !2025-03-2894710 | A fast code search engine written in Rust| | 527|BlinkDL/ChatRWKV !2025-03-289467-1 |ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source.| | 528|timlrx/tailwind-nextjs-starter-blog !2025-03-2894677|This is a Next.js, Tailwind CSS blogging starter template. Comes out of the box configured with the latest technologies to make technical writing a breeze. Easily configurable and customizable. Perfect as a replacement to existing Jekyll and Hugo individual blogs.| | 529|google/benchmark !2025-03-2893634|A microbenchmark support library| | 530|facebookresearch/nougat !2025-03-2893603|Implementation of Nougat Neural Optical Understanding for Academic Documents| | 531|modelscope/facechain !2025-03-2893536|FaceChain is a deep-learning toolchain for generating your Digital-Twin.| | 532|DrewThomasson/ebook2audiobook !2025-03-2893388|Convert ebooks to audiobooks with chapters and metadata using dynamic AI models and voice cloning. Supports 1,107+ languages!| | 533|RayTracing/raytracing.github.io !2025-03-2893035|Main Web Site (Online Books)| | 534|QwenLM/Qwen2.5-VL !2025-03-28930249|Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.| | 535|WongKinYiu/yolov9 !2025-03-2892201|Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information| | 536|alibaba-damo-academy/FunASR !2025-03-28920222|A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models.| | 537|Visualize-ML/Book4Power-of-Matrix !2025-03-2891931|Book4 'Power of Matrix' | | 538|dice2o/BingGPT !2025-03-289185-1 |Desktop application of new Bing's AI-powered chat (Windows, macOS and Linux)| | 539|browserbase/stagehand !2025-03-28917621|An AI web browsing framework focused on simplicity and extensibility.| | 540|FlagOpen/FlagEmbedding !2025-03-28914111|Dense Retrieval and Retrieval-augmented LLMs| | 541|Const-me/Whisper !2025-03-2890979|High-performance GPGPU inference of OpenAI's Whisper automatic speech recognition (ASR) model| | 542|lucidrains/denoising-diffusion-pytorch !2025-03-2890942|Implementation of Denoising Diffusion Probabilistic Model in Pytorch| | 543|Chainlit/chainlit !2025-03-28904422|Build Conversational AI in minutes ⚡️| | 544|togethercomputer/OpenChatKit !2025-03-2890160 |OpenChatKit provides a powerful, open-source base to create both specialized and general purpose chatbots for various applications| | 545|Stability-AI/StableStudio !2025-03-2889631 |Community interface for generative AI| | 546|voicepaw/so-vits-svc-fork !2025-03-2889482 |so-vits-svc fork with realtime support, improved interface and more features.| | 547|pymc-devs/pymc !2025-03-2889413|Bayesian Modeling and Probabilistic Programming in Python| | 548|espnet/espnet !2025-03-2889302|End-to-End Speech Processing Toolkit| | 549|kedacore/keda !2025-03-2888991|KEDA is a Kubernetes-based Event Driven Autoscaling component. It provides event driven scale for any container running in Kubernetes| | 550|open-mmlab/Amphion !2025-03-28886911|Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.| | 551|gorse-io/gorse !2025-03-2888451|Gorse open source recommender system engine| | 552|adams549659584/go-proxy-bingai !2025-03-288768-1 |A Microsoft New Bing demo site built with Vue3 and Go, providing a consistent UI experience, supporting ChatGPT prompts, and accessible within China| | 553|open-mmlab/mmsegmentation !2025-03-2887513|OpenMMLab Semantic Segmentation Toolbox and Benchmark.| | 554|bytedance/monolith !2025-03-2887223|ByteDance's Recommendation System| | 555|LouisShark/chatgptsystemprompt !2025-03-2887216|store all agent's system prompt| | 556|brexhq/prompt-engineering !2025-03-2887080 |Tips and tricks for working with Large Language Models like OpenAI's GPT-4.| | 557|erincatto/box2d !2025-03-2886841|Box2D is a 2D physics engine for games| | 558|🔥microsoft/ai-agents-for-beginners !2025-03-288669323|10 Lessons to Get Started Building AI Agents| | 559|nashsu/FreeAskInternet !2025-03-2886102|FreeAskInternet is a completely free, private and locally running search aggregator & answer generate using LLM, without GPU needed. The user can ask a question and the system will make a multi engine search and combine the search result to the ChatGPT3.5 LLM and generate the answer based on search results.| | 560|goldmansachs/gs-quant !2025-03-2885981|Python toolkit for quantitative finance| | 561|srbhr/Resume-Matcher !2025-03-2885800|Open Source Free ATS Tool to compare Resumes with Job Descriptions and create a score to rank them.| | 562|facebookresearch/ImageBind !2025-03-2885681 |ImageBind One Embedding Space to Bind Them All| | 563|ashawkey/stable-dreamfusion !2025-03-2885481 |A pytorch implementation of text-to-3D dreamfusion, powered by stable diffusion.| | 564|meetecho/janus-gateway !2025-03-2885232|Janus WebRTC Server| | 565|google/magika !2025-03-2885003|Detect file content types with deep learning| | 566|huggingface/chat-ui !2025-03-2884871 |Open source codebase powering the HuggingChat app| | 567|EleutherAI/lm-evaluation-harness !2025-03-28843012|A framework for few-shot evaluation of autoregressive language models.| | 568|jina-ai/reader !2025-03-2884089|Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/| | 569|microsoft/TypeChat !2025-03-288406-1|TypeChat is a library that makes it easy to build natural language interfaces using types.| | 570|thuml/Time-Series-Library !2025-03-28839715|A Library for Advanced Deep Time Series Models.| | 571|OptimalScale/LMFlow !2025-03-2883882|An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Model for All.| | 572|baptisteArno/typebot.io !2025-03-2883845|💬 Typebot is a powerful chatbot builder that you can self-host.| | 573|jzhang38/TinyLlama !2025-03-2883504|The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.| | 574|fishaudio/Bert-VITS2 !2025-03-2883472|vits2 backbone with multilingual-bert| | 575|OpenBMB/XAgent !2025-03-2882683|An Autonomous LLM Agent for Complex Task Solving| | 576|Acly/krita-ai-diffusion !2025-03-2882387|Streamlined interface for generating images with AI in Krita. Inpaint and outpaint with optional text prompt, no tweaking required.| | 577|jasonppy/VoiceCraft !2025-03-2882151|Zero-Shot Speech Editing and Text-to-Speech in the Wild| | 578|SJTU-IPADS/PowerInfer !2025-03-2881693|High-speed Large Language Model Serving on PCs with Consumer-grade GPUs| | 579|modelscope/DiffSynth-Studio !2025-03-28814713|Enjoy the magic of Diffusion models!| | 580|o3de/o3de !2025-03-2881443|Open 3D Engine (O3DE) is an Apache 2.0-licensed multi-platform 3D engine that enables developers and content creators to build AAA games, cinema-quality 3D worlds, and high-fidelity simulations without any fees or commercial obligations.| | 581|zmh-program/chatnio !2025-03-2881325|🚀 Next Generation AI One-Stop Internationalization Solution. 🚀 下一代 AI 一站式 B/C 端解决方案,支持 OpenAI,Midjourney,Claude,讯飞星火,Stable Diffusion,DALL·E,ChatGLM,通义千问,腾讯混元,360 智脑,百川 AI,火山方舟,新必应,Gemini,Moonshot 等模型,支持对话分享,自定义预设,云端同步,模型市场,支持弹性计费和订阅计划模式,支持图片解析,支持联网搜索,支持模型缓存,丰富美观的后台管理与仪表盘数据统计。| | 582|leptonai/searchwithlepton !2025-03-2880632|Building a quick conversation-based search demo with Lepton AI.| | 583|sebastianstarke/AI4Animation !2025-03-2880620|Bringing Characters to Life with Computer Brains in Unity| | 584|wangrongding/wechat-bot !2025-03-2880528|🤖一个基于 WeChaty 结合 DeepSeek / ChatGPT / Kimi / 讯飞等Ai服务实现的微信机器人 ,可以用来帮助你自动回复微信消息,或者管理微信群/好友,检测僵尸粉等...| | 585|openvinotoolkit/openvino !2025-03-2880528|OpenVINO™ is an open-source toolkit for optimizing and deploying AI inference| | 586|steven2358/awesome-generative-ai !2025-03-28802610|A curated list of modern Generative Artificial Intelligence projects and services| | 587|adam-maj/tiny-gpu !2025-03-2880234|A minimal GPU design in Verilog to learn how GPUs work from the ground up| | 588| anse-app/chatgpt-demo !2025-03-2880180 | A demo repo based on OpenAI API (gpt-3.5-turbo) | | 589| acheong08/EdgeGPT !2025-03-288015-1 |Reverse engineered API of Microsoft's Bing Chat | | 590|ai-collection/ai-collection !2025-03-2879994 |The Generative AI Landscape - A Collection of Awesome Generative AI Applications| | 591|GreyDGL/PentestGPT !2025-03-2879953 |A GPT-empowered penetration testing tool| | 592|delta-io/delta !2025-03-2879112|An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs| | 593|dataelement/bisheng !2025-03-2879085|Bisheng is an open LLM devops platform for next generation AI applications.| | 594|e2b-dev/e2b !2025-03-2878447 |Vercel for AI agents. We help developers to build, deploy, and monitor AI agents. Focusing on specialized AI agents that build software for you - your personal software developers.| | 595|01-ai/Yi !2025-03-2878311|A series of large language models trained from scratch by developers @01-ai| | 596|Plachtaa/VALL-E-X !2025-03-287830-1|An open source implementation of Microsoft's VALL-E X zero-shot TTS model. The demo is available at https://plachtaa.github.io| | 597|abhishekkrthakur/approachingalmost !2025-03-2878204|Approaching (Almost) Any Machine Learning Problem| | 598|pydantic/pydantic-ai !2025-03-28781041|Agent Framework / shim to use Pydantic with LLMs| | 599|rany2/edge-tts !2025-03-2877901|Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key| | 600|CASIA-IVA-Lab/FastSAM !2025-03-2877881|Fast Segment Anything| | 601|netease-youdao/EmotiVoice !2025-03-2877817|EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine| | 602|lllyasviel/IC-Light !2025-03-2877804|More relighting!| | 603|kroma-network/tachyon !2025-03-287774-1|Modular ZK(Zero Knowledge) backend accelerated by GPU| | 604|deep-floyd/IF !2025-03-2877731 |A novel state-of-the-art open-source text-to-image model with a high degree of photorealism and language understanding| | 605|oumi-ai/oumi !2025-03-2877705|Everything you need to build state-of-the-art foundation models, end-to-end.| | 606|reorproject/reor !2025-03-2877681|AI note-taking app that runs models locally.| | 607|lightpanda-io/browser !2025-03-28775813|Lightpanda: the headless browser designed for AI and automation| | 608|xiangsx/gpt4free-ts !2025-03-287755-1|Providing a free OpenAI GPT-4 API ! This is a replication project for the typescript version of xtekky/gpt4free| | 609|IDEA-Research/GroundingDINO !2025-03-28773311|Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"| | 610|bunkerity/bunkerweb !2025-03-2877326|🛡️ Make your web services secure by default !| | 611|vikhyat/moondream !2025-03-2877057|tiny vision language model| | 612|firmai/financial-machine-learning !2025-03-287703-1|A curated list of practical financial machine learning tools and applications.| | 613|n8n-io/self-hosted-ai-starter-kit !2025-03-28765121|The Self-hosted AI Starter Kit is an open-source template that quickly sets up a local AI environment. Curated by n8n, it provides essential tools for creating secure, self-hosted AI workflows.| | 614|intel-analytics/ipex-llm !2025-03-2876507|Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, Baichuan, Mixtral, Gemma, etc.) on Intel CPU and GPU (e.g., local PC with iGPU, discrete GPU such as Arc, Flex and Max). A PyTorch LLM library that seamlessly integrates with llama.cpp, HuggingFace, LangChain, LlamaIndex, DeepSpeed, vLLM, FastChat, ModelScope, etc.| | 615|jrouwe/JoltPhysics !2025-03-28764510|A multi core friendly rigid body physics and collision detection library. Written in C++. Suitable for games and VR applications. Used by Horizon Forbidden West.| | 616|THUDM/CodeGeeX2 !2025-03-2876270|CodeGeeX2: A More Powerful Multilingual Code Generation Model| | 617|meta-llama/llama-stack !2025-03-2875866|Composable building blocks to build Llama Apps| | 618|sweepai/sweep !2025-03-287530-1|Sweep is an AI junior developer| | 619|lllyasviel/Omost !2025-03-2875301|Your image is almost there!| | 620|ahmedbahaaeldin/From-0-to-Research-Scientist-resources-guide !2025-03-2875050|Detailed and tailored guide for undergraduate students or anybody want to dig deep into the field of AI with solid foundation.| | 621|dair-ai/ML-Papers-Explained !2025-03-2875050|Explanation to key concepts in ML| | 622|zaidmukaddam/scira !2025-03-28750110|Scira (Formerly MiniPerplx) is a minimalistic AI-powered search engine that helps you find information on the internet. Powered by Vercel AI SDK! Search with models like Grok 2.0.| | 623|Portkey-AI/gateway !2025-03-28749416|A Blazing Fast AI Gateway. Route to 100+ LLMs with 1 fast & friendly API.| | 624|web-infra-dev/midscene !2025-03-28748729|An AI-powered automation SDK can control the page, perform assertions, and extract data in JSON format using natural language.| | 625|zilliztech/GPTCache !2025-03-2874801 |GPTCache is a library for creating semantic cache to store responses from LLM queries.| | 626|niedev/RTranslator !2025-03-2874742|RTranslator is the world's first open source real-time translation app.| |!green-up-arrow.svg 627|roboflow/notebooks !2025-03-2874666|Examples and tutorials on using SOTA computer vision models and techniques. Learn everything from old-school ResNet, through YOLO and object-detection transformers like DETR, to the latest models like Grounding DINO and SAM.| |!red-down-arrow 628|openlm-research/openllama !2025-03-2874652|OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset| | 629|LiheYoung/Depth-Anything !2025-03-2874155|Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data| | 630|enso-org/enso !2025-03-2874040|Hybrid visual and textual functional programming.| | 631|bigcode-project/starcoder !2025-03-287401-1 |Home of StarCoder: fine-tuning & inference!| | 632|git-ecosystem/git-credential-manager !2025-03-2873975|Secure, cross-platform Git credential storage with authentication to GitHub, Azure Repos, and other popular Git hosting services.| | 633|OpenGVLab/InternVL !2025-03-2873634|[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4V. 接近GPT-4V表现的可商用开源模型| | 634|WooooDyy/LLM-Agent-Paper-List !2025-03-2873551|The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.| | 635|lencx/Noi !2025-03-2873157|🦄 AI + Tools + Plugins + Community| | 636|udlbook/udlbook !2025-03-2873075|Understanding Deep Learning - Simon J.D. Prince| | 637|OpenBMB/MiniCPM !2025-03-2872841|MiniCPM-2B: An end-side LLM outperforms Llama2-13B.| | 638|jaywalnut310/vits !2025-03-2872815 |VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech| | 639|xorbitsai/inference !2025-03-28727528|Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop.| | 640|PWhiddy/PokemonRedExperiments !2025-03-2872492|Playing Pokemon Red with Reinforcement Learning| | 641|Canner/WrenAI !2025-03-28723213|🤖 Open-source AI Agent that empowers data-driven teams to chat with their data to generate Text-to-SQL, charts, spreadsheets, reports, and BI. 📈📊📋🧑‍💻| | 642|miurla/morphic !2025-03-2872258|An AI-powered answer engine with a generative UI| | 643|ml-explore/mlx-examples !2025-03-2872168|Examples in the MLX framework| | 644|PKU-YuanGroup/ChatLaw !2025-03-2872010|Chinese Legal Large Model| | 645|NVIDIA/cutlass !2025-03-2871883|CUDA Templates for Linear Algebra Subroutines| | 646|FoundationVision/VAR !2025-03-28717444|[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction"| | 647|ymcui/Chinese-LLaMA-Alpaca-2 !2025-03-2871561|Chinese LLaMA-2 & Alpaca-2 LLMs| | 648|nadermx/backgroundremover !2025-03-2871514 |Background Remover lets you Remove Background from images and video using AI with a simple command line interface that is free and open source.| | 649|onuratakan/gpt-computer-assistant !2025-03-28714514|gpt-4o for windows, macos and ubuntu| | 650|graviraja/MLOps-Basics !2025-03-2871326|| | 651|Future-House/paper-qa !2025-03-287118-1|High accuracy RAG for answering questions from scientific documents with citations| | 652|open-mmlab/mmagic !2025-03-2871102 |OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox| | 653|bhaskatripathi/pdfGPT !2025-03-2870941 |PDF GPT allows you to chat with the contents of your PDF file by using GPT capabilities. The only open source solution to turn your pdf files in a chatbot!| | 654|ollama/ollama-python !2025-03-28709117|Ollama Python library| | 655|facebookresearch/DiT !2025-03-2870376|Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"| | 656|geekyutao/Inpaint-Anything !2025-03-2870262 |Inpaint anything using Segment Anything and inpainting models.| | 657|AbdullahAlfaraj/Auto-Photoshop-StableDiffusion-Plugin !2025-03-2870160 |A user-friendly plug-in that makes it easy to generate stable diffusion images inside Photoshop using Automatic1111-sd-webui as a backend.| | 658|apple/corenet !2025-03-2869990|CoreNet: A library for training deep neural networks| | 659|openstatusHQ/openstatus !2025-03-2869926|🏓 The open-source synthetic monitoring platform 🏓| | 660|weaviate/Verba !2025-03-2869772|Retrieval Augmented Generation (RAG) chatbot powered by Weaviate| | 661|meshery/meshery !2025-03-2869630|Meshery, the cloud native manager| | 662|OpenTalker/video-retalking !2025-03-2869530|[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild| | 663|digitalinnovationone/dio-lab-open-source !2025-03-28689013|Repositório do lab "Contribuindo em um Projeto Open Source no GitHub" da Digital Innovation One.| | 664|jianchang512/ChatTTS-ui !2025-03-2868842|一个简单的本地网页界面,直接使用ChatTTS将文字合成为语音,同时支持对外提供API接口。| | 665|patchy631/ai-engineering-hub !2025-03-28686434|In-depth tutorials on LLMs, RAGs and real-world AI agent applications.| | 666|gunnarmorling/1brc !2025-03-2868512|1️⃣🐝🏎️ The One Billion Row Challenge -- A fun exploration of how quickly 1B rows from a text file can be aggregated with Java| | 667|Azure-Samples/azure-search-openai-demo !2025-03-2868482 |A sample app for the Retrieval-Augmented Generation pattern running in Azure, using Azure Cognitive Search for retrieval and Azure OpenAI large language models to power ChatGPT-style and Q&A experiences.| | 668|mit-han-lab/streaming-llm !2025-03-2868382|Efficient Streaming Language Models with Attention Sinks| | 669|InternLM/InternLM !2025-03-2868352|InternLM has open-sourced a 7 billion parameter base model, a chat model tailored for practical scenarios and the training system.| | 670|dependency-check/DependencyCheck !2025-03-2868191|OWASP dependency-check is a software composition analysis utility that detects publicly disclosed vulnerabilities in application dependencies.| | 671|Soulter/AstrBot !2025-03-28678643|✨易上手的多平台 LLM 聊天机器人及开发框架✨。支持 QQ、QQ频道、Telegram、微信平台(Gewechat, 企业微信)、内置 Web Chat,OpenAI GPT、DeepSeek、Ollama、Llama、GLM、Gemini、OneAPI、LLMTuner,支持 LLM Agent 插件开发,可视化面板。一键部署。支持 Dify 工作流、代码执行器、Whisper 语音转文字。| | 672|react-native-webview/react-native-webview !2025-03-2867792|React Native Cross-Platform WebView| | 673|modelscope/agentscope !2025-03-28676916|Start building LLM-empowered multi-agent applications in an easier way.| | 674|mylxsw/aidea !2025-03-2867381|AIdea is a versatile app that supports GPT and domestic large language models,also supports "Stable Diffusion" text-to-image generation, image-to-image generation, SDXL 1.0, super-resolution, and image colorization| | 675|langchain-ai/ollama-deep-researcher !2025-03-28668635|Fully local web research and report writing assistant| | 676|threestudio-project/threestudio !2025-03-2866653|A unified framework for 3D content generation.| | 677|gaomingqi/Track-Anything !2025-03-2866631 |A flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.| | 678|spdustin/ChatGPT-AutoExpert !2025-03-2866570|🚀🧠💬 Supercharged Custom Instructions for ChatGPT (non-coding) and ChatGPT Advanced Data Analysis (coding).| | 679|HariSekhon/DevOps-Bash-tools !2025-03-2866463|1000+ DevOps Bash Scripts - AWS, GCP, Kubernetes, Docker, CI/CD, APIs, SQL, PostgreSQL, MySQL, Hive, Impala, Kafka, Hadoop, Jenkins, GitHub, GitLab, BitBucket, Azure DevOps, TeamCity, Spotify, MP3, LDAP, Code/Build Linting, pkg mgmt for Linux, Mac, Python, Perl, Ruby, NodeJS, Golang, Advanced dotfiles: .bashrc, .vimrc, .gitconfig, .screenrc, tmux..| | 680|modelscope/swift !2025-03-28661530|ms-swift: Use PEFT or Full-parameter to finetune 200+ LLMs or 15+ MLLMs| | 681|langchain-ai/opengpts !2025-03-2866080|This is an open source effort to create a similar experience to OpenAI's GPTs and Assistants API| | 682| yihong0618/xiaogpt !2025-03-2865131 | Play ChatGPT with xiaomi ai speaker | | 683| civitai/civitai !2025-03-2865111 | Build a platform where people can share their stable diffusion models | | 684|KoljaB/RealtimeSTT !2025-03-28649513|A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.| | 685|qunash/chatgpt-advanced !2025-03-2864910 | A browser extension that augments your ChatGPT prompts with web results.| | 686|Licoy/ChatGPT-Midjourney !2025-03-2864850|🎨 Own your own ChatGPT+Midjourney web service with one click| | 687|friuns2/BlackFriday-GPTs-Prompts !2025-03-2864744|List of free GPTs that doesn't require plus subscription| | 688|PixarAnimationStudios/OpenUSD !2025-03-2864700|Universal Scene Description| | 689|linyiLYi/street-fighter-ai !2025-03-2864630 |This is an AI agent for Street Fighter II Champion Edition.| | 690|run-llama/rags !2025-03-2864380|Build ChatGPT over your data, all with natural language| | 691|frdel/agent-zero !2025-03-2864154|Agent Zero AI framework| | 692|microsoft/DeepSpeedExamples !2025-03-2863911 |Example models using DeepSpeed| | 693|k8sgpt-ai/k8sgpt !2025-03-2863882|Giving Kubernetes Superpowers to everyone| | 694|open-metadata/OpenMetadata !2025-03-2863514|OpenMetadata is a unified platform for discovery, observability, and governance powered by a central metadata repository, in-depth lineage, and seamless team collaboration.| | 695|google/gemma.cpp !2025-03-2863163|lightweight, standalone C++ inference engine for Google's Gemma models.| | 696|RayVentura/ShortGPT !2025-03-286314-1|🚀🎬 ShortGPT - An experimental AI framework for automated short/video content creation. Enables creators to rapidly produce, manage, and deliver content using AI and automation.| | 697|openai/consistencymodels !2025-03-2862940 |Official repo for consistency models.| | 698|yangjianxin1/Firefly !2025-03-2862924|Firefly: Chinese conversational large language model (full-scale fine-tuning + QLoRA), supporting fine-tuning of Llma2, Llama, Baichuan, InternLM, Ziya, Bloom, and other large models| | 699|enricoros/big-AGI !2025-03-2862665|Generative AI suite powered by state-of-the-art models and providing advanced AI/AGI functions. It features AI personas, AGI functions, multi-model chats, text-to-image, voice, response streaming, code highlighting and execution, PDF import, presets for developers, much more. Deploy on-prem or in the cloud.| | 700|aptos-labs/aptos-core !2025-03-2862633|Aptos is a layer 1 blockchain built to support the widespread use of blockchain through better technology and user experience.| | 701|wenda-LLM/wenda !2025-03-286262-1 |Wenda: An LLM invocation platform. Its objective is to achieve efficient content generation tailored to specific environments while considering the limited computing resources of individuals and small businesses, as well as knowledge security and privacy concerns| | 702|Project-MONAI/MONAI !2025-03-2862603|AI Toolkit for Healthcare Imaging| | 703|HVision-NKU/StoryDiffusion !2025-03-2862470|Create Magic Story!| | 704|deepseek-ai/DeepSeek-LLM !2025-03-2862463|DeepSeek LLM: Let there be answers| | 705|Tohrusky/Final2x !2025-03-2862393|2^x Image Super-Resolution| | 706|OpenSPG/KAG !2025-03-28619611|KAG is a logical form-guided reasoning and retrieval framework based on OpenSPG engine and LLMs. It is used to build logical reasoning and factual Q&A solutions for professional domain knowledge bases. It can effectively overcome the shortcomings of the traditional RAG vector similarity calculation model.| | 707|Moonvy/OpenPromptStudio !2025-03-2861861 |AIGC Hint Word Visualization Editor| | 708|levihsu/OOTDiffusion !2025-03-2861761|Official implementation of OOTDiffusion| | 709|tmc/langchaingo !2025-03-2861729|LangChain for Go, the easiest way to write LLM-based programs in Go| | 710|vladmandic/automatic !2025-03-2861374|SD.Next: Advanced Implementation of Stable Diffusion and other Diffusion-based generative image models| | 711|clovaai/donut !2025-03-2861231 |Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022| | 712|Shaunwei/RealChar !2025-03-286121-1|🎙️🤖Create, Customize and Talk to your AI Character/Companion in Realtime(All in One Codebase!). Have a natural seamless conversation with AI everywhere(mobile, web and terminal) using LLM OpenAI GPT3.5/4, Anthropic Claude2, Chroma Vector DB, Whisper Speech2Text, ElevenLabs Text2Speech🎙️🤖| | 713|microsoft/TinyTroupe !2025-03-2861142|LLM-powered multiagent persona simulation for imagination enhancement and business insights.| | 714| rustformers/llm !2025-03-2861010 | Run inference for Large Language Models on CPU, with Rust| | 715|firebase/firebase-ios-sdk !2025-03-2860950|Firebase SDK for Apple App Development| | 716|vespa-engine/vespa !2025-03-2860824|The open big data serving engine. https://vespa.ai| | 717|n4ze3m/page-assist !2025-03-28607610|Use your locally running AI models to assist you in your web browsing| | 718|Dooy/chatgpt-web-midjourney-proxy !2025-03-2860646|chatgpt web, midjourney, gpts,tts, whisper 一套ui全搞定| | 719|ethereum-optimism/optimism !2025-03-2860213|Optimism is Ethereum, scaled.| | 720|sczhou/ProPainter !2025-03-2859971|[ICCV 2023] ProPainter: Improving Propagation and Transformer for Video Inpainting| | 721|MineDojo/Voyager !2025-03-2859951 |An Open-Ended Embodied Agent with Large Language Models| | 722|lavague-ai/LaVague !2025-03-2859800|Automate automation with Large Action Model framework| | 723|SevaSk/ecoute !2025-03-2859770 |Ecoute is a live transcription tool that provides real-time transcripts for both the user's microphone input (You) and the user's speakers output (Speaker) in a textbox. It also generates a suggested response using OpenAI's GPT-3.5 for the user to say based on the live transcription of the conversation.| | 724|google/mesop !2025-03-2859661|| | 725|pengxiao-song/LaWGPT !2025-03-2859542 |Repo for LaWGPT, Chinese-Llama tuned with Chinese Legal knowledge| | 726|fr0gger/Awesome-GPT-Agents !2025-03-2859434|A curated list of GPT agents for cybersecurity| | 727|google-deepmind/graphcast !2025-03-2859412|| | 728|comet-ml/opik !2025-03-28594126|Open-source end-to-end LLM Development Platform| | 729|SciPhi-AI/R2R !2025-03-28594033|A framework for rapid development and deployment of production-ready RAG systems| | 730|SkalskiP/courses !2025-03-2859272 |This repository is a curated collection of links to various courses and resources about Artificial Intelligence (AI)| | 731|QuivrHQ/MegaParse !2025-03-2859122|File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.| | 732|pytorch-labs/gpt-fast !2025-03-2858971|Simple and efficient pytorch-native transformer text generation in !2025-03-2858886|Curated list of chatgpt prompts from the top-rated GPTs in the GPTs Store. Prompt Engineering, prompt attack & prompt protect. Advanced Prompt Engineering papers.| | 734|nilsherzig/LLocalSearch !2025-03-2858852|LLocalSearch is a completely locally running search aggregator using LLM Agents. The user can ask a question and the system will use a chain of LLMs to find the answer. The user can see the progress of the agents and the final answer. No OpenAI or Google API keys are needed.| | 735|kuafuai/DevOpsGPT !2025-03-285874-2|Multi agent system for AI-driven software development. Convert natural language requirements into working software. Supports any development language and extends the existing base code.| | 736|myshell-ai/MeloTTS !2025-03-2858486|High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.| | 737|OpenGVLab/LLaMA-Adapter !2025-03-2858421 |Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters| | 738|volcengine/verl !2025-03-28582563|veRL: Volcano Engine Reinforcement Learning for LLM| | 739|a16z-infra/companion-app !2025-03-2858171|AI companions with memory: a lightweight stack to create and host your own AI companions| | 740|HumanAIGC/OutfitAnyone !2025-03-285816-1|Outfit Anyone: Ultra-high quality virtual try-on for Any Clothing and Any Person| | 741|josStorer/RWKV-Runner !2025-03-2857472|A RWKV management and startup tool, full automation, only 8MB. And provides an interface compatible with the OpenAI API. RWKV is a large language model that is fully open source and available for commercial use.| | 742|648540858/wvp-GB28181-pro !2025-03-2857414|WEB VIDEO PLATFORM是一个基于GB28181-2016标准实现的网络视频平台,支持NAT穿透,支持海康、大华、宇视等品牌的IPC、NVR、DVR接入。支持国标级联,支持rtsp/rtmp等视频流转发到国标平台,支持rtsp/rtmp等推流转发到国标平台。| | 743|ToonCrafter/ToonCrafter !2025-03-2857345|a research paper for generative cartoon interpolation| | 744|PawanOsman/ChatGPT !2025-03-2857191|OpenAI API Free Reverse Proxy| | 745|apache/hudi !2025-03-2857091|Upserts, Deletes And Incremental Processing on Big Data.| | 746| nsarrazin/serge !2025-03-2857081 | A web interface for chatting with Alpaca through llama.cpp. Fully dockerized, with an easy to use API| | 747|homanp/superagent !2025-03-2857021|🥷 Superagent - Build, deploy, and manage LLM-powered agents| | 748|ramonvc/freegpt-webui !2025-03-2856910|GPT 3.5/4 with a Chat Web UI. No API key is required.| | 749|baichuan-inc/baichuan-7B !2025-03-2856901|A large-scale 7B pretraining language model developed by BaiChuan-Inc.| | 750|Azure/azure-sdk-for-net !2025-03-2856792|This repository is for active development of the Azure SDK for .NET. For consumers of the SDK we recommend visiting our public developer docs at https://learn.microsoft.com/dotnet/azure/ or our versioned developer docs at https://azure.github.io/azure-sdk-for-net.| | 751|mnotgod96/AppAgent !2025-03-2856643|AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.| | 752|microsoft/TaskWeaver !2025-03-2856243|A code-first agent framework for seamlessly planning and executing data analytics tasks.| | 753| yetone/bob-plugin-openai-translator !2025-03-285600-1 | A Bob Plugin base ChatGPT API | | 754|PrefectHQ/marvin !2025-03-2855840 |A batteries-included library for building AI-powered software| | 755|microsoft/promptbase !2025-03-2855832|All things prompt engineering| | 756|fullstackhero/dotnet-starter-kit !2025-03-2855560|Production Grade Cloud-Ready .NET 8 Starter Kit (Web API + Blazor Client) with Multitenancy Support, and Clean/Modular Architecture that saves roughly 200+ Development Hours! All Batteries Included.| | 757|deepseek-ai/DeepSeek-Coder-V2 !2025-03-2855435|DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence| | 758|aiwaves-cn/agents !2025-03-2855391|An Open-source Framework for Autonomous Language Agents| | 759|microsoft/Mastering-GitHub-Copilot-for-Paired-Programming !2025-03-2855158|A 6 Lesson course teaching everything you need to know about harnessing GitHub Copilot and an AI Paired Programing resource.| | 760|allenai/OLMo !2025-03-2854506|Modeling, training, eval, and inference code for OLMo| | 761|apify/crawlee-python !2025-03-2854493|Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with BeautifulSoup, Playwright, and raw HTTP. Both headful and headless mode. With proxy rotation.| | 762|k2-fsa/sherpa-onnx !2025-03-28541520|Speech-to-text, text-to-speech, and speaker recongition using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go, NodeJS, Java, Swift| | 763|TEN-framework/TEN-Agent !2025-03-28541411|TEN Agent is a realtime conversational AI agent powered by TEN. It seamlessly integrates the OpenAI Realtime API, RTC capabilities, and advanced features like weather updates, web search, computer vision, and Retrieval-Augmented Generation (RAG).| | 764|google/gemmapytorch !2025-03-2854010|The official PyTorch implementation of Google's Gemma models| | 765|snakers4/silero-vad !2025-03-2853858|Silero VAD: pre-trained enterprise-grade Voice Activity Detector| | 766|livekit/agents !2025-03-2853836|Build real-time multimodal AI applications 🤖🎙️📹| | 767|pipecat-ai/pipecat !2025-03-28537811|Open Source framework for voice and multimodal conversational AI| | 768|EricLBuehler/mistral.rs !2025-03-28536324|Blazingly fast LLM inference.| | 769|asg017/sqlite-vec !2025-03-28535810|Work-in-progress vector search SQLite extension that runs anywhere.| | 770|albertan017/LLM4Decompile !2025-03-2853563|Reverse Engineering: Decompiling Binary Code with Large Language Models| | 771|Permify/permify !2025-03-2853235|An open-source authorization as a service inspired by Google Zanzibar, designed to build and manage fine-grained and scalable authorization systems for any application.| | 772|imoneoi/openchat !2025-03-2853171|OpenChat: Advancing Open-source Language Models with Imperfect Data| | 773|mosaicml/composer !2025-03-2853140|Train neural networks up to 7x faster| | 774|dsdanielpark/Bard-API !2025-03-285277-1 |The python package that returns a response of Google Bard through API.| | 775|lxfater/inpaint-web !2025-03-2852552|A free and open-source inpainting & image-upscaling tool powered by webgpu and wasm on the browser。| | 776|leanprover/lean4 !2025-03-2852441|Lean 4 programming language and theorem prover| | 777|AILab-CVC/YOLO-World !2025-03-2852415|Real-Time Open-Vocabulary Object Detection| | 778|openchatai/OpenChat !2025-03-2852260 |Run and create custom ChatGPT-like bots with OpenChat, embed and share these bots anywhere, the open-source chatbot console.| | 779|mufeedvh/code2prompt !2025-03-28519414|A CLI tool to convert your codebase into a single LLM prompt with source tree, prompt templating, and token counting.| | 780|biobootloader/wolverine !2025-03-2851700 |Automatically repair python scripts through GPT-4 to give them regenerative abilities.| | 781|huggingface/parler-tts !2025-03-2851671|Inference and training library for high-quality TTS models.| | 782|Akegarasu/lora-scripts !2025-03-2851308 |LoRA training scripts use kohya-ss's trainer, for diffusion model.| | 783|openchatai/OpenCopilot !2025-03-285128-3|🤖 🔥 Let your users chat with your product features and execute things by text - open source Shopify sidekick| | 784|e2b-dev/fragments !2025-03-2851228|Open-source Next.js template for building apps that are fully generated by AI. By E2B.| | 785|microsoft/SynapseML !2025-03-2851132|Simple and Distributed Machine Learning| | 786|aigc-apps/sd-webui-EasyPhoto !2025-03-285108-1|📷 EasyPhoto | | 787|ChaoningZhang/MobileSAM !2025-03-2850944|This is the official code for Faster Segment Anything (MobileSAM) project that makes SAM lightweight| | 788|huggingface/alignment-handbook !2025-03-2850932|Robust recipes for to align language models with human and AI preferences| | 789|alpkeskin/mosint !2025-03-2850920|An automated e-mail OSINT tool| | 790|TaskingAI/TaskingAI !2025-03-2850891|The open source platform for AI-native application development.| | 791|lipku/metahuman-stream !2025-03-28507615|Real time interactive streaming digital human| | 792|OpenInterpreter/01 !2025-03-2850530|The open-source language model computer| | 793|open-compass/opencompass !2025-03-28505111|OpenCompass is an LLM evaluation platform, supporting a wide range of models (InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.| | 794|xxlong0/Wonder3D !2025-03-2850491|A cross-domain diffusion model for 3D reconstruction from a single image| | 795|pytorch/torchtune !2025-03-2850342|A Native-PyTorch Library for LLM Fine-tuning| | 796|SuperDuperDB/superduperdb !2025-03-2850192|🔮 SuperDuperDB: Bring AI to your database: Integrate, train and manage any AI models and APIs directly with your database and your data.| | 797|WhiskeySockets/Baileys !2025-03-2850057|Lightweight full-featured typescript/javascript WhatsApp Web API| | 798| mpociot/chatgpt-vscode !2025-03-2849890 | A VSCode extension that allows you to use ChatGPT | | 799|OpenGVLab/DragGAN !2025-03-2849880|Unofficial Implementation of DragGAN - "Drag Your GAN: Interactive Point-based Manipulation on the Generative Image Manifold" (DragGAN 全功能实现,在线Demo,本地部署试用,代码、模型已全部开源,支持Windows, macOS, Linux)| | 800|microsoft/LLMLingua !2025-03-2849824|To speed up LLMs' inference and enhance LLM's perceive of key information, compress the prompt and KV-Cache, which achieves up to 20x compression with minimal performance loss.| | 801|Zipstack/unstract !2025-03-2849745|No-code LLM Platform to launch APIs and ETL Pipelines to structure unstructured documents| | 802|OpenBMB/ToolBench !2025-03-2849621|An open platform for training, serving, and evaluating large language model for tool learning.| | 803|Fanghua-Yu/SUPIR !2025-03-2849593|SUPIR aims at developing Practical Algorithms for Photo-Realistic Image Restoration In the Wild| | 804|GaiaNet-AI/gaianet-node !2025-03-2849360|Install and run your own AI agent service| | 805|qodo-ai/qodo-cover !2025-03-284922-1|Qodo-Cover: An AI-Powered Tool for Automated Test Generation and Code Coverage Enhancement! 💻🤖🧪🐞| | 806|Zejun-Yang/AniPortrait !2025-03-2849042|AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation| | 807|lvwzhen/law-cn-ai !2025-03-2848901 |⚖️ AI Legal Assistant| | 808|developersdigest/llm-answer-engine !2025-03-2848740|Build a Perplexity-Inspired Answer Engine Using Next.js, Groq, Mixtral, Langchain, OpenAI, Brave & Serper| | 809|Plachtaa/VITS-fast-fine-tuning !2025-03-2848640|This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion| | 810|espeak-ng/espeak-ng !2025-03-2848601|eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.| | 811|ant-research/CoDeF !2025-03-2848581|[CVPR'24 Highlight] Official PyTorch implementation of CoDeF: Content Deformation Fields for Temporally Consistent Video Processing| | 812|deepseek-ai/DeepSeek-V2 !2025-03-2848512|| | 813|XRPLF/rippled !2025-03-2848210|Decentralized cryptocurrency blockchain daemon implementing the XRP Ledger protocol in C++| | 814|AutoMQ/automq !2025-03-28478721|AutoMQ is a cloud-first alternative to Kafka by decoupling durability to S3 and EBS. 10x cost-effective. Autoscale in seconds. Single-digit ms latency.| | 815|AILab-CVC/VideoCrafter !2025-03-2847800|VideoCrafter1: Open Diffusion Models for High-Quality Video Generation| | 816|nautechsystems/nautilustrader !2025-03-2847702|A high-performance algorithmic trading platform and event-driven backtester| | 817|kyegomez/swarms !2025-03-2847563|The Enterprise-Grade Production-Ready Multi-Agent Orchestration Framework Join our Community: https://discord.com/servers/agora-999382051935506503| | 818|Deci-AI/super-gradients !2025-03-2847310 |Easily train or fine-tune SOTA computer vision models with one open source training library. The home of Yolo-NAS.| | 819|QwenLM/Qwen2.5-Coder !2025-03-2847236|Qwen2.5-Coder is the code version of Qwen2.5, the large language model series developed by Qwen team, Alibaba Cloud.| | 820|SCIR-HI/Huatuo-Llama-Med-Chinese !2025-03-2847191 |Repo for HuaTuo (华驼), Llama-7B tuned with Chinese medical knowledge| | 821|togethercomputer/RedPajama-Data !2025-03-2846841 |code for preparing large datasets for training large language models| | 822|mishushakov/llm-scraper !2025-03-2846704|Turn any webpage into structured data using LLMs| | 823|1rgs/jsonformer !2025-03-2846663 |A Bulletproof Way to Generate Structured JSON from Language Models| | 824|anti-work/shortest !2025-03-2846565|QA via natural language AI tests| | 825|dnhkng/GlaDOS !2025-03-2846510|This is the Personality Core for GLaDOS, the first steps towards a real-life implementation of the AI from the Portal series by Valve.| | 826|Nukem9/dlssg-to-fsr3 !2025-03-2846380|Adds AMD FSR3 Frame Generation to games by replacing Nvidia DLSS-G Frame Generation (nvngx_dlssg).| | 827|BuilderIO/ai-shell !2025-03-2846373 |A CLI that converts natural language to shell commands.| | 828|facebookincubator/AITemplate !2025-03-2846220 |AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.| | 829|terraform-aws-modules/terraform-aws-eks !2025-03-2846030|Terraform module to create AWS Elastic Kubernetes (EKS) resources 🇺🇦| | 830|timescale/pgai !2025-03-2845915|A suite of tools to develop RAG, semantic search, and other AI applications more easily with PostgreSQL| | 831|awslabs/multi-agent-orchestrator !2025-03-2845788|Flexible and powerful framework for managing multiple AI agents and handling complex conversations| | 832|sanchit-gandhi/whisper-jax !2025-03-2845771 |Optimised JAX code for OpenAI's Whisper Model, largely built on the Hugging Face Transformers Whisper implementation| | 833|NVIDIA/NeMo-Guardrails !2025-03-2845755|NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.| | 834|PathOfBuildingCommunity/PathOfBuilding !2025-03-2845480|Offline build planner for Path of Exile.| | 835|UX-Decoder/Segment-Everything-Everywhere-All-At-Once !2025-03-2845412 |Official implementation of the paper "Segment Everything Everywhere All at Once"| | 836|build-trust/ockam !2025-03-2845171|Orchestrate end-to-end encryption, cryptographic identities, mutual authentication, and authorization policies between distributed applications – at massive scale.| | 837|google-research/timesfm !2025-03-2845135|TimesFM (Time Series Foundation Model) is a pretrained time-series foundation model developed by Google Research for time-series forecasting.| | 838|luosiallen/latent-consistency-model !2025-03-2844842|Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference| | 839|NVlabs/neuralangelo !2025-03-2844740|Official implementation of "Neuralangelo: High-Fidelity Neural Surface Reconstruction" (CVPR 2023)| | 840|kyegomez/tree-of-thoughts !2025-03-2844720 |Plug in and Play Implementation of Tree of Thoughts: Deliberate Problem Solving with Large Language Models that Elevates Model Reasoning by atleast 70%| | 841|sjvasquez/handwriting-synthesis !2025-03-2844720 |Handwriting Synthesis with RNNs ✏️| | 842| madawei2699/myGPTReader !2025-03-2844420 | A slack bot that can read any webpage, ebook or document and summarize it with chatGPT | | 843|OpenBMB/AgentVerse !2025-03-2844413|🤖 AgentVerse 🪐 provides a flexible framework that simplifies the process of building custom multi-agent environments for large language models (LLMs).| | 844|argmaxinc/WhisperKit !2025-03-2844395|Swift native speech recognition on-device for iOS and macOS applications.| | 845|landing-ai/vision-agent !2025-03-2844346|Vision agent| | 846|InternLM/xtuner !2025-03-2844273|An efficient, flexible and full-featured toolkit for fine-tuning large models (InternLM, Llama, Baichuan, Qwen, ChatGLM)| | 847|google-deepmind/alphageometry !2025-03-284421-1|Solving Olympiad Geometry without Human Demonstrations| | 848|ostris/ai-toolkit !2025-03-2844093|Various AI scripts. Mostly Stable Diffusion stuff.| | 849|LLM-Red-Team/kimi-free-api !2025-03-2844004|🚀 KIMI AI 长文本大模型白嫖服务,支持高速流式输出、联网搜索、长文档解读、图像解析、多轮对话,零配置部署,多路token支持,自动清理会话痕迹。| | 850|argilla-io/argilla !2025-03-2843991|Argilla is a collaboration platform for AI engineers and domain experts that require high-quality outputs, full data ownership, and overall efficiency.| | 851|spring-projects/spring-ai !2025-03-28438419|An Application Framework for AI Engineering| | 852|alibaba-damo-academy/FunClip !2025-03-2843555|Open-source, accurate and easy-to-use video clipping tool, LLM based AI clipping intergrated | | 853|yisol/IDM-VTON !2025-03-2843541|IDM-VTON : Improving Diffusion Models for Authentic Virtual Try-on in the Wild| | 854|fchollet/ARC-AGI !2025-03-2843368|The Abstraction and Reasoning Corpus| | 855|MahmoudAshraf97/whisper-diarization !2025-03-2843064|Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper| | 856|Speykious/cve-rs !2025-03-2843047|Blazingly 🔥 fast 🚀 memory vulnerabilities, written in 100% safe Rust. 🦀| | 857|Blealtan/efficient-kan !2025-03-2842770|An efficient pure-PyTorch implementation of Kolmogorov-Arnold Network (KAN).| | 858|smol-ai/GodMode !2025-03-284249-1|AI Chat Browser: Fast, Full webapp access to ChatGPT / Claude / Bard / Bing / Llama2! I use this 20 times a day.| | 859|openai/plugins-quickstart !2025-03-284235-4 |Get a ChatGPT plugin up and running in under 5 minutes!| | 860|Doriandarko/maestro !2025-03-2842260|A framework for Claude Opus to intelligently orchestrate subagents.| | 861|philz1337x/clarity-upscaler !2025-03-2842204|Clarity-Upscaler: Reimagined image upscaling for everyone| | 862|facebookresearch/co-tracker !2025-03-2842142|CoTracker is a model for tracking any point (pixel) on a video.| | 863|xlang-ai/OpenAgents !2025-03-2842031|OpenAgents: An Open Platform for Language Agents in the Wild| | 864|alibaba/higress !2025-03-28419514|🤖 AI Gateway | | 865|ray-project/llm-numbers !2025-03-2841920 |Numbers every LLM developer should know| | 866|fudan-generative-vision/champ !2025-03-2841820|Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance| | 867|NVIDIA/garak !2025-03-2841795|the LLM vulnerability scanner| | 868|leetcode-mafia/cheetah !2025-03-2841740 |Whisper & GPT-based app for passing remote SWE interviews| | 869|ragapp/ragapp !2025-03-2841710|The easiest way to use Agentic RAG in any enterprise| | 870|collabora/WhisperSpeech !2025-03-2841692|An Open Source text-to-speech system built by inverting Whisper.| | 871|Facico/Chinese-Vicuna !2025-03-2841520 |Chinese-Vicuna: A Chinese Instruction-following LLaMA-based Model| | 872|openai/grok !2025-03-2841381|| | 873|CrazyBoyM/llama3-Chinese-chat !2025-03-2841361|Llama3 Chinese Repository with modified versions, and training and deployment resources| | 874|luban-agi/Awesome-AIGC-Tutorials !2025-03-2841301|Curated tutorials and resources for Large Language Models, AI Painting, and more.| | 875|damo-vilab/AnyDoor !2025-03-2841192|Official implementations for paper: Anydoor: zero-shot object-level image customization| | 876|raspberrypi/pico-sdk !2025-03-2841072|| | 877|mshumer/gpt-llm-trainer !2025-03-284097-1|| | 878|metavoiceio/metavoice-src !2025-03-284076-1|AI for human-level speech intelligence| | 879|intelowlproject/IntelOwl !2025-03-2840763|IntelOwl: manage your Threat Intelligence at scale| | 880|a16z-infra/ai-getting-started !2025-03-2840682|A Javascript AI getting started stack for weekend projects, including image/text models, vector stores, auth, and deployment configs| | 881|MarkFzp/mobile-aloha !2025-03-2840641|Mobile ALOHA: Learning Bimanual Mobile Manipulation with Low-Cost Whole-Body Teleoperation| | 882| keijiro/AICommand !2025-03-2840380 | ChatGPT integration with Unity Editor | | 883|Tencent/HunyuanDiT !2025-03-2840214|Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding| | 884|hengyoush/kyanos !2025-03-2840061|Visualize the time packets spend in the kernel, watch & analyze in command line.| | 885|agiresearch/AIOS !2025-03-2840045|AIOS: LLM Agent Operating System| | 886|truefoundry/cognita !2025-03-2839773|RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry| | 887|X-PLUG/MobileAgent !2025-03-2839557|Mobile-Agent: Autonomous Multi-Modal Mobile Device Agent with Visual Perception| | 888|jackMort/ChatGPT.nvim !2025-03-2839231|ChatGPT Neovim Plugin: Effortless Natural Language Generation with OpenAI's ChatGPT API| | 889|microsoft/RD-Agent !2025-03-28388422|Research and development (R&D) is crucial for the enhancement of industrial productivity, especially in the AI era, where the core aspects of R&D are mainly focused on data and models. We are committed to automate these high-value generic R&D processes through our open source R&D automation tool RD-Agent, which let AI drive data-driven AI.| | 890|Significant-Gravitas/Auto-GPT-Plugins !2025-03-283882-1 |Plugins for Auto-GPT| | 891|apple/ml-mgie !2025-03-2838770|| | 892|OpenDriveLab/UniAD !2025-03-2838727|[CVPR 2023 Best Paper] Planning-oriented Autonomous Driving| | 893|llSourcell/DoctorGPT !2025-03-2838640|DoctorGPT is an LLM that can pass the US Medical Licensing Exam. It works offline, it's cross-platform, & your health data stays private.| | 894|FlagAI-Open/FlagAI !2025-03-2838601|FlagAI (Fast LArge-scale General AI models) is a fast, easy-to-use and extensible toolkit for large-scale model.| | 895|krishnaik06/Roadmap-To-Learn-Generative-AI-In-2024 !2025-03-2838513|Roadmap To Learn Generative AI In 2024| | 896|SysCV/sam-hq !2025-03-2838491|Segment Anything in High Quality| | 897|google/security-research !2025-03-2838420|This project hosts security advisories and their accompanying proof-of-concepts related to research conducted at Google which impact non-Google owned code.| | 898|shroominic/codeinterpreter-api !2025-03-2838330|Open source implementation of the ChatGPT Code Interpreter 👾| | 899|Yonom/assistant-ui !2025-03-2838308|React Components for AI Chat 💬 🚀| | 900|nucleuscloud/neosync !2025-03-2838262|Open source data anonymization and synthetic data orchestration for developers. Create high fidelity synthetic data and sync it across your environments.| | 901|ravenscroftj/turbopilot !2025-03-2838230 |Turbopilot is an open source large-language-model based code completion engine that runs locally on CPU| | 902|NVlabs/Sana !2025-03-28380810|SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer| | 903|huggingface/distil-whisper !2025-03-2838061|Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.| | 904|Codium-ai/AlphaCodium !2025-03-2837971|code generation tool that surpasses most human competitors in CodeContests| | 905|fixie-ai/ultravox !2025-03-2837710|A fast multimodal LLM for real-time voice| | 906|unit-mesh/auto-dev !2025-03-28375715|🧙‍AutoDev: The AI-powered coding wizard with multilingual support 🌐, auto code generation 🏗️, and a helpful bug-slaying assistant 🐞! Customizable prompts 🎨 and a magic Auto Dev/Testing/Document/Agent feature 🧪 included! 🚀| | 907|Marker-Inc-Korea/AutoRAG !2025-03-2837432|AutoML tool for RAG| | 908|deepseek-ai/DeepSeek-VL !2025-03-283734-1|DeepSeek-VL: Towards Real-World Vision-Language Understanding| | 909|hiyouga/ChatGLM-Efficient-Tuning !2025-03-283692-1|Fine-tuning ChatGLM-6B with PEFT | | 910| Yue-Yang/ChatGPT-Siri !2025-03-2836921 | Shortcuts for Siri using ChatGPT API gpt-3.5-turbo model | | 911|0hq/WebGPT !2025-03-2836901 |Run GPT model on the browser with WebGPU. An implementation of GPT inference in less than ~2000 lines of vanilla Javascript.| | 912|cvg/LightGlue !2025-03-2836903|LightGlue: Local Feature Matching at Light Speed (ICCV 2023)| | 913|deanxv/coze-discord-proxy !2025-03-2836791|代理Discord-Bot对话Coze-Bot,实现API形式请求GPT4对话模型/微调模型| | 914|MervinPraison/PraisonAI !2025-03-2836764|PraisonAI application combines AutoGen and CrewAI or similar frameworks into a low-code solution for building and managing multi-agent LLM systems, focusing on simplicity, customisation, and efficient human-agent collaboration.| | 915|Ironclad/rivet !2025-03-2836345 |The open-source visual AI programming environment and TypeScript library| | 916|BasedHardware/OpenGlass !2025-03-2835851|Turn any glasses into AI-powered smart glasses| | 917|ricklamers/gpt-code-ui !2025-03-2835840 |An open source implementation of OpenAI's ChatGPT Code interpreter| | 918|whoiskatrin/chart-gpt !2025-03-2835830 |AI tool to build charts based on text input| | 919|github/CopilotForXcode !2025-03-2835788|Xcode extension for GitHub Copilot| | 920|hemansnation/God-Level-Data-Science-ML-Full-Stack !2025-03-2835570 |A collection of scientific methods, processes, algorithms, and systems to build stories & models. This roadmap contains 16 Chapters, whether you are a fresher in the field or an experienced professional who wants to transition into Data Science & AI| | 921|pytorch/torchchat !2025-03-2835461|Run PyTorch LLMs locally on servers, desktop and mobile| | 922| Kent0n-Li/ChatDoctor !2025-03-2835451 | A Medical Chat Model Fine-tuned on LLaMA Model using Medical Domain Knowledge | | 923|xtekky/chatgpt-clone !2025-03-283519-1 |ChatGPT interface with better UI| | 924|jupyterlab/jupyter-ai !2025-03-2835120|A generative AI extension for JupyterLab| | 925|pytorch/torchtitan !2025-03-2835064|A native PyTorch Library for large model training| | 926|minimaxir/simpleaichat !2025-03-2835031|Python package for easily interfacing with chat apps, with robust features and minimal code complexity.| | 927|srush/Tensor-Puzzles !2025-03-2834930|Solve puzzles. Improve your pytorch.| | 928|Helicone/helicone !2025-03-2834918|🧊 Open source LLM-Observability Platform for Developers. One-line integration for monitoring, metrics, evals, agent tracing, prompt management, playground, etc. Supports OpenAI SDK, Vercel AI SDK, Anthropic SDK, LiteLLM, LLamaIndex, LangChain, and more. 🍓 YC W23| | 929|run-llama/llama-hub !2025-03-2834740|A library of data loaders for LLMs made by the community -- to be used with LlamaIndex and/or LangChain| | 930|NExT-GPT/NExT-GPT !2025-03-2834700|Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model| | 931|souzatharsis/podcastfy !2025-03-2834661|An Open Source Python alternative to NotebookLM's podcast feature: Transforming Multimodal Content into Captivating Multilingual Audio Conversations with GenAI| | 932|Dataherald/dataherald !2025-03-2834450|Interact with your SQL database, Natural Language to SQL using LLMs| | 933|iryna-kondr/scikit-llm !2025-03-2834350 |Seamlessly integrate powerful language models like ChatGPT into scikit-learn for enhanced text analysis tasks.| | 934|Netflix/maestro !2025-03-2834230|Maestro: Netflix’s Workflow Orchestrator| | 935|CanadaHonk/porffor !2025-03-2833560|A from-scratch experimental AOT JS engine, written in JS| | 936|hustvl/Vim !2025-03-2833323|Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model| | 937|pashpashpash/vault-ai !2025-03-2833250 |OP Vault ChatGPT: Give ChatGPT long-term memory using the OP Stack (OpenAI + Pinecone Vector Database). Upload your own custom knowledge base files (PDF, txt, etc) using a simple React frontend.| | 938|tencentmusic/supersonic !2025-03-28330611|SuperSonic is the next-generation BI platform that integrates Chat BI (powered by LLM) and Headless BI (powered by semantic layer) paradigms.| | 939|billmei/every-chatgpt-gui !2025-03-2832981|Every front-end GUI client for ChatGPT| | 940|microsoft/torchgeo !2025-03-2832772|TorchGeo: datasets, samplers, transforms, and pre-trained models for geospatial data| | 941|LLMBook-zh/LLMBook-zh.github.io !2025-03-28326110|《大语言模型》作者:赵鑫,李军毅,周昆,唐天一,文继荣| | 942|dvlab-research/MiniGemini !2025-03-2832601|Official implementation for Mini-Gemini| | 943|rashadphz/farfalle !2025-03-2832460|🔍 AI search engine - self-host with local or cloud LLMs| | 944|Luodian/Otter !2025-03-2832450|🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.| | 945|AprilNEA/ChatGPT-Admin-Web !2025-03-2832370 | ChatGPT WebUI with user management and admin dashboard system| | 946|MarkFzp/act-plus-plus !2025-03-2832365|Imitation Learning algorithms with Co-traing for Mobile ALOHA: ACT, Diffusion Policy, VINN| | 947|ethen8181/machine-learning !2025-03-2832310|🌎 machine learning tutorials (mainly in Python3)| | 948|opengeos/segment-geospatial !2025-03-2832312 |A Python package for segmenting geospatial data with the Segment Anything Model (SAM)| | 949|iusztinpaul/hands-on-llms !2025-03-283225-2|🦖 𝗟𝗲𝗮𝗿𝗻 about 𝗟𝗟𝗠𝘀, 𝗟𝗟𝗠𝗢𝗽𝘀, and 𝘃𝗲𝗰𝘁𝗼𝗿 𝗗𝗕𝘀 for free by designing, training, and deploying a real-time financial advisor LLM system ~ 𝘴𝘰𝘶𝘳𝘤𝘦 𝘤𝘰𝘥𝘦 + 𝘷𝘪𝘥𝘦𝘰 & 𝘳𝘦𝘢𝘥𝘪𝘯𝘨 𝘮𝘢𝘵𝘦𝘳𝘪𝘢𝘭𝘴| | 950|ToTheBeginning/PuLID !2025-03-2832221|Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment| | 951|neo4j-labs/llm-graph-builder !2025-03-2832164|Neo4j graph construction from unstructured data using LLMs| | 952|OpenGVLab/InternGPT !2025-03-2832150 |InternGPT (iGPT) is an open source demo platform where you can easily showcase your AI models. Now it supports DragGAN, ChatGPT, ImageBind, multimodal chat like GPT-4, SAM, interactive image editing, etc. Try it at igpt.opengvlab.com (支持DragGAN、ChatGPT、ImageBind、SAM的在线Demo系统)| | 953|PKU-YuanGroup/Video-LLaVA !2025-03-2832060 |Video-LLaVA: Learning United Visual Representation by Alignment Before Projection| | 954|DataTalksClub/llm-zoomcamp !2025-03-2832030|LLM Zoomcamp - a free online course about building an AI bot that can answer questions about your knowledge base| | 955|gptscript-ai/gptscript !2025-03-2832010|Natural Language Programming| |!green-up-arrow.svg 956|isaac-sim/IsaacLab !2025-03-28320113|Unified framework for robot learning built on NVIDIA Isaac Sim| |!red-down-arrow 957|ai-boost/Awesome-GPTs !2025-03-2832003|Curated list of awesome GPTs 👍.| | 958|huggingface/safetensors !2025-03-2831901|Simple, safe way to store and distribute tensors| | 959|linyiLYi/bilibot !2025-03-2831771|A local chatbot fine-tuned by bilibili user comments.| | 960| project-baize/baize-chatbot !2025-03-283168-1 | Let ChatGPT teach your own chatbot in hours with a single GPU! | | 961|Azure-Samples/cognitive-services-speech-sdk !2025-03-2831280|Sample code for the Microsoft Cognitive Services Speech SDK| | 962|microsoft/Phi-3CookBook !2025-03-2831231|This is a Phi-3 book for getting started with Phi-3. Phi-3, a family of open AI models developed by Microsoft. Phi-3 models are the most capable and cost-effective small language models (SLMs) available, outperforming models of the same size and next size up across a variety of language, reasoning, coding, and math benchmarks.| | 963|neuralmagic/deepsparse !2025-03-2831180|Sparsity-aware deep learning inference runtime for CPUs| | 964|sugarforever/chat-ollama !2025-03-2831000|ChatOllama is an open source chatbot based on LLMs. It supports a wide range of language models, and knowledge base management.| | 965|amazon-science/chronos-forecasting !2025-03-2830974|Chronos: Pretrained (Language) Models for Probabilistic Time Series Forecasting| | 966|damo-vilab/i2vgen-xl !2025-03-2830902|Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models| | 967|google-deepmind/gemma !2025-03-2830733|Open weights LLM from Google DeepMind.| | 968|iree-org/iree !2025-03-2830733|A retargetable MLIR-based machine learning compiler and runtime toolkit.| | 969|NVlabs/VILA !2025-03-2830724|VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)| | 970|microsoft/torchscale !2025-03-2830661|Foundation Architecture for (M)LLMs| | 971|openai/openai-realtime-console !2025-03-2830656|React app for inspecting, building and debugging with the Realtime API| | 972|daveshap/OpenAIAgentSwarm !2025-03-2830610|HAAS = Hierarchical Autonomous Agent Swarm - "Resistance is futile!"| | 973|microsoft/PromptWizard !2025-03-2830555|Task-Aware Agent-driven Prompt Optimization Framework| | 974|CVI-SZU/Linly !2025-03-2830490 |Chinese-LLaMA basic model; ChatFlow Chinese conversation model; NLP pre-training/command fine-tuning dataset| | 975|cohere-ai/cohere-toolkit !2025-03-2830130|Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.| | 976|adamcohenhillel/ADeus !2025-03-2830131|An open source AI wearable device that captures what you say and hear in the real world and then transcribes and stores it on your own server. You can then chat with Adeus using the app, and it will have all the right context about what you want to talk about - a truly personalized, personal AI.| | 977|Lightning-AI/LitServe !2025-03-2830132|Lightning-fast serving engine for AI models. Flexible. Easy. Enterprise-scale.| | 978|potpie-ai/potpie !2025-03-2829973|Prompt-To-Agent : Create custom engineering agents for your codebase| | 979|ant-design/x !2025-03-28299529|Craft AI-driven interfaces effortlessly 🤖| | 980|meta-llama/PurpleLlama !2025-03-2829832|Set of tools to assess and improve LLM security.| | 981|williamyang1991/RerenderAVideo !2025-03-2829800|[SIGGRAPH Asia 2023] Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation| | 982|baichuan-inc/Baichuan-13B !2025-03-2829790|A 13B large language model developed by Baichuan Intelligent Technology| | 983|Stability-AI/stable-audio-tools !2025-03-2829761|Generative models for conditional audio generation| | 984|li-plus/chatglm.cpp !2025-03-2829720|C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & more LLMs| | 985|NVIDIA/GenerativeAIExamples !2025-03-2829546|Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.| | 986|Josh-XT/AGiXT !2025-03-2829521 |AGiXT is a dynamic AI Automation Platform that seamlessly orchestrates instruction management and complex task execution across diverse AI providers. Combining adaptive memory, smart features, and a versatile plugin system, AGiXT delivers efficient and comprehensive AI solutions.| | 987|MrForExample/ComfyUI-3D-Pack !2025-03-2829515|An extensive node suite that enables ComfyUI to process 3D inputs (Mesh & UV Texture, etc) using cutting edge algorithms (3DGS, NeRF, etc.)| | 988|olimorris/codecompanion.nvim !2025-03-28295111|✨ AI-powered coding, seamlessly in Neovim. Supports Anthropic, Copilot, Gemini, Ollama, OpenAI and xAI LLMs| | 989|salesforce/CodeT5 !2025-03-282940-1 |Home of CodeT5: Open Code LLMs for Code Understanding and Generation| | 990|facebookresearch/ijepa !2025-03-2829391|Official codebase for I-JEPA, the Image-based Joint-Embedding Predictive Architecture. First outlined in the CVPR paper, "Self-supervised learning from images with a joint-embedding predictive architecture."| | 991|eureka-research/Eureka !2025-03-2829351|Official Repository for "Eureka: Human-Level Reward Design via Coding Large Language Models"| | 992|NVIDIA/trt-llm-rag-windows !2025-03-282934-1|A developer reference project for creating Retrieval Augmented Generation (RAG) chatbots on Windows using TensorRT-LLM| | 993|gmpetrov/databerry !2025-03-282930-1|The no-code platform for building custom LLM Agents| | 994|AI4Finance-Foundation/FinRobot !2025-03-28291946|FinRobot: An Open-Source AI Agent Platform for Financial Applications using LLMs 🚀 🚀 🚀| | 995|nus-apr/auto-code-rover !2025-03-2829013|A project structure aware autonomous software engineer aiming for autonomous program improvement| | 996|deepseek-ai/DreamCraft3D !2025-03-2828921|[ICLR 2024] Official implementation of DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior| | 997|mlabonne/llm-datasets !2025-03-2828848|High-quality datasets, tools, and concepts for LLM fine-tuning.| | 998|facebookresearch/jepa !2025-03-2828712|PyTorch code and models for V-JEPA self-supervised learning from video.| | 999|facebookresearch/habitat-sim !2025-03-2828604|A flexible, high-performance 3D simulator for Embodied AI research.| | 1000|xenova/whisper-web !2025-03-2828581|ML-powered speech recognition directly in your browser| | 1001|cvlab-columbia/zero123 !2025-03-2828530|Zero-1-to-3: Zero-shot One Image to 3D Object: https://zero123.cs.columbia.edu/| | 1002|yuruotong1/autoMate !2025-03-28285121|Like Manus, Computer Use Agent(CUA) and Omniparser, we are computer-using agents.AI-driven local automation assistant that uses natural language to make computers work by themselves| | 1003|muellerberndt/mini-agi !2025-03-282845-1 |A minimal generic autonomous agent based on GPT3.5/4. Can analyze stock prices, perform network security tests, create art, and order pizza.| | 1004|allenai/open-instruct !2025-03-2828432|| | 1005|CodingChallengesFYI/SharedSolutions !2025-03-2828360|Publicly shared solutions to Coding Challenges| | 1006|hegelai/prompttools !2025-03-2828220|Open-source tools for prompt testing and experimentation, with support for both LLMs (e.g. OpenAI, LLaMA) and vector databases (e.g. Chroma, Weaviate).| | 1007|mazzzystar/Queryable !2025-03-2828222|Run CLIP on iPhone to Search Photos.| | 1008|Doubiiu/DynamiCrafter !2025-03-2828173|DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors| | 1009|SamurAIGPT/privateGPT !2025-03-282805-1 |An app to interact privately with your documents using the power of GPT, 100% privately, no data leaks| | 1010|facebookresearch/Pearl !2025-03-2827951|A Production-ready Reinforcement Learning AI Agent Library brought by the Applied Reinforcement Learning team at Meta.| | 1011|intuitem/ciso-assistant-community !2025-03-2827954|CISO Assistant is a one-stop-shop for GRC, covering Risk, AppSec and Audit Management and supporting +70 frameworks worldwide with auto-mapping: NIST CSF, ISO 27001, SOC2, CIS, PCI DSS, NIS2, CMMC, PSPF, GDPR, HIPAA, Essential Eight, NYDFS-500, DORA, NIST AI RMF, 800-53, 800-171, CyFun, CJIS, AirCyber, NCSC, ECC, SCF and so much more| | 1012|facebookresearch/audio2photoreal !2025-03-2827840|Code and dataset for photorealistic Codec Avatars driven from audio| | 1013|Azure/azure-rest-api-specs !2025-03-2827770|The source for REST API specifications for Microsoft Azure.| | 1014|SCUTlihaoyu/open-chat-video-editor !2025-03-2827690 |Open source short video automatic generation tool| | 1015|Alpha-VLLM/LLaMA2-Accessory !2025-03-2827642|An Open-source Toolkit for LLM Development| | 1016|johnma2006/mamba-minimal !2025-03-2827601|Simple, minimal implementation of the Mamba SSM in one file of PyTorch.| | 1017|nerfstudio-project/gsplat !2025-03-2827576|CUDA accelerated rasterization of gaussian splatting| | 1018|Physical-Intelligence/openpi !2025-03-28274617|| | 1019|leptonai/leptonai !2025-03-2827246|A Pythonic framework to simplify AI service building| |!green-up-arrow.svg 1020|joanrod/star-vector !2025-03-28271149|StarVector is a foundation model for SVG generation that transforms vectorization into a code generation task. Using a vision-language modeling architecture, StarVector processes both visual and textual inputs to produce high-quality SVG code with remarkable precision.| |!red-down-arrow 1021|jqnatividad/qsv !2025-03-2827092|CSVs sliced, diced & analyzed.| | 1022|FranxYao/chain-of-thought-hub !2025-03-2826991|Benchmarking large language models' complex reasoning ability with chain-of-thought prompting| | 1023|princeton-nlp/SWE-bench !2025-03-2826965|[ICLR 2024] SWE-Bench: Can Language Models Resolve Real-world Github Issues?| | 1024|elastic/otel-profiling-agent !2025-03-2826930|The production-scale datacenter profiler| | 1025|src-d/hercules !2025-03-2826900|Gaining advanced insights from Git repository history.| | 1026|lanqian528/chat2api !2025-03-2826695|A service that can convert ChatGPT on the web to OpenAI API format.| | 1027|ishan0102/vimGPT !2025-03-2826681|Browse the web with GPT-4V and Vimium| | 1028|TMElyralab/MuseV !2025-03-2826650|MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising| | 1029|georgia-tech-db/eva !2025-03-2826600 |AI-Relational Database System | | 1030|kubernetes-sigs/controller-runtime !2025-03-2826590|Repo for the controller-runtime subproject of kubebuilder (sig-apimachinery)| | 1031|gptlink/gptlink !2025-03-2826550 |Build your own free commercial ChatGPT environment in 10 minutes. The setup is simple and includes features such as user management, orders, tasks, and payments| | 1032|pytorch/executorch !2025-03-2826534|On-device AI across mobile, embedded and edge for PyTorch| | 1033|NVIDIA/nv-ingest !2025-03-2826290|NVIDIA Ingest is an early access set of microservices for parsing hundreds of thousands of complex, messy unstructured PDFs and other enterprise documents into metadata and text to embed into retrieval systems.| | 1034|SuperTux/supertux !2025-03-2826081|SuperTux source code| | 1035|abi/secret-llama !2025-03-2826050|Fully private LLM chatbot that runs entirely with a browser with no server needed. Supports Mistral and LLama 3.| | 1036|liou666/polyglot !2025-03-2825841 |Desktop AI Language Practice Application| | 1037|janhq/nitro !2025-03-2825821|A fast, lightweight, embeddable inference engine to supercharge your apps with local AI. OpenAI-compatible API| | 1038|deepseek-ai/DeepSeek-Math !2025-03-2825825|DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models| | 1039|anthropics/prompt-eng-interactive-tutorial !2025-03-2825781|Anthropic's Interactive Prompt Engineering Tutorial| | 1040|microsoft/promptbench !2025-03-2825741|A unified evaluation framework for large language models| | 1041|baaivision/Painter !2025-03-2825580 |Painter & SegGPT Series: Vision Foundation Models from BAAI| | 1042|OpenPipe/OpenPipe !2025-03-2825581|Turn expensive prompts into cheap fine-tuned models| | 1043|TracecatHQ/tracecat !2025-03-2825531|😼 The AI-native, open source alternative to Tines / Splunk SOAR.| | 1044|JoshuaC215/agent-service-toolkit !2025-03-2825528|Full toolkit for running an AI agent service built with LangGraph, FastAPI and Streamlit| | 1045|databricks/dbrx !2025-03-2825460|Code examples and resources for DBRX, a large language model developed by Databricks| | 1046|lamini-ai/lamini !2025-03-2825271 |Official repo for Lamini's data generator for generating instructions to train instruction-following LLMs| | 1047|mshumer/gpt-author !2025-03-282510-1|| | 1048|TMElyralab/MusePose !2025-03-2824971|MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation| | 1049|Kludex/fastapi-tips !2025-03-2824974|FastAPI Tips by The FastAPI Expert!| | 1050|openai/simple-evals !2025-03-2824813|| | 1051|iterative/datachain !2025-03-2824732|AI-data warehouse to enrich, transform and analyze data from cloud storages| | 1052|girafe-ai/ml-course !2025-03-2824703|Open Machine Learning course| | 1053|kevmo314/magic-copy !2025-03-2824620 |Magic Copy is a Chrome extension that uses Meta's Segment Anything Model to extract a foreground object from an image and copy it to the clipboard.| | 1054|Eladlev/AutoPrompt !2025-03-2824432|A framework for prompt tuning using Intent-based Prompt Calibration| | 1055|OpenBMB/CPM-Bee !2025-03-282434-1 |A bilingual large-scale model with trillions of parameters| | 1056|IDEA-Research/T-Rex !2025-03-2824310|T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy| | 1057|microsoft/genaiscript !2025-03-2824202|Automatable GenAI Scripting| | 1058|paulpierre/RasaGPT !2025-03-2824090 |💬 RasaGPT is the first headless LLM chatbot platform built on top of Rasa and Langchain. Built w/ Rasa, FastAPI, Langchain, LlamaIndex, SQLModel, pgvector, ngrok, telegram| | 1059|ashishpatel26/LLM-Finetuning !2025-03-2823911|LLM Finetuning with peft| | 1060|SoraWebui/SoraWebui !2025-03-2823570|SoraWebui is an open-source Sora web client, enabling users to easily create videos from text with OpenAI's Sora model.| | 1061|6drf21e/ChatTTScolab !2025-03-2823491|🚀 一键部署(含离线整合包)!基于 ChatTTS ,支持音色抽卡、长音频生成和分角色朗读。简单易用,无需复杂安装。| | 1062|Azure/PyRIT !2025-03-2823343|The Python Risk Identification Tool for generative AI (PyRIT) is an open access automation framework to empower security professionals and machine learning engineers to proactively find risks in their generative AI systems.| | 1063|tencent-ailab/V-Express !2025-03-2823201|V-Express aims to generate a talking head video under the control of a reference image, an audio, and a sequence of V-Kps images.| | 1064|THUDM/CogVLM2 !2025-03-2823170|GPT4V-level open-source multi-modal model based on Llama3-8B| | 1065|dvmazur/mixtral-offloading !2025-03-2823001|Run Mixtral-8x7B models in Colab or consumer desktops| | 1066|semanser/codel !2025-03-2822950|✨ Fully autonomous AI Agent that can perform complicated tasks and projects using terminal, browser, and editor.| | 1067|mshumer/gpt-investor !2025-03-2822590|| | 1068|aixcoder-plugin/aiXcoder-7B !2025-03-2822550|official repository of aiXcoder-7B Code Large Language Model| | 1069|Azure-Samples/graphrag-accelerator !2025-03-2822503|One-click deploy of a Knowledge Graph powered RAG (GraphRAG) in Azure| | 1070|emcf/engshell !2025-03-2821830 |An English-language shell for any OS, powered by LLMs| | 1071|hncboy/chatgpt-web-java !2025-03-2821771|ChatGPT project developed in Java, based on Spring Boot 3 and JDK 17, supports both AccessToken and ApiKey modes| | 1072|openai/consistencydecoder !2025-03-2821692|Consistency Distilled Diff VAE| | 1073|Alpha-VLLM/Lumina-T2X !2025-03-2821681|Lumina-T2X is a unified framework for Text to Any Modality Generation| | 1074|bghira/SimpleTuner !2025-03-2821612|A general fine-tuning kit geared toward Stable Diffusion 2.1, Stable Diffusion 3, DeepFloyd, and SDXL.| | 1075|JiauZhang/DragGAN !2025-03-2821530 |Implementation of DragGAN: Interactive Point-based Manipulation on the Generative Image Manifold| | 1076|cgpotts/cs224u !2025-03-2821390|Code for Stanford CS224u| | 1077|PKU-YuanGroup/MoE-LLaVA !2025-03-2821300|Mixture-of-Experts for Large Vision-Language Models| | 1078|darrenburns/elia !2025-03-2820831|A snappy, keyboard-centric terminal user interface for interacting with large language models. Chat with ChatGPT, Claude, Llama 3, Phi 3, Mistral, Gemma and more.| | 1079|ageerle/ruoyi-ai !2025-03-28207898|RuoYi AI 是一个全栈式 AI 开发平台,旨在帮助开发者快速构建和部署个性化的 AI 应用。| | 1080|NVIDIA/gpu-operator !2025-03-2820510|NVIDIA GPU Operator creates/configures/manages GPUs atop Kubernetes| | 1081|BAAI-Agents/Cradle !2025-03-2820481|The Cradle framework is a first attempt at General Computer Control (GCC). Cradle supports agents to ace any computer task by enabling strong reasoning abilities, self-improvment, and skill curation, in a standardized general environment with minimal requirements.| | 1082|microsoft/aici !2025-03-2820080|AICI: Prompts as (Wasm) Programs| | 1083|PRIS-CV/DemoFusion !2025-03-2820040|Let us democratise high-resolution generation! (arXiv 2023)| | 1084|apple/axlearn !2025-03-2820012|An Extensible Deep Learning Library| | 1085|naver/mast3r !2025-03-2819685|Grounding Image Matching in 3D with MASt3R| | 1086|liltom-eth/llama2-webui !2025-03-281958-1|Run Llama 2 locally with gradio UI on GPU or CPU from anywhere (Linux/Windows/Mac). Supporting Llama-2-7B/13B/70B with 8-bit, 4-bit. Supporting GPU inference (6 GB VRAM) and CPU inference.| | 1087|GaParmar/img2img-turbo !2025-03-2819582|One-step image-to-image with Stable Diffusion turbo: sketch2image, day2night, and more| | 1088|Niek/chatgpt-web !2025-03-2819560|ChatGPT web interface using the OpenAI API| | 1089|huggingface/cookbook !2025-03-2819421|Open-source AI cookbook| | 1090|pytorch/ao !2025-03-2819241|PyTorch native quantization and sparsity for training and inference| | 1091|emcie-co/parlant !2025-03-2819053|The behavior guidance framework for customer-facing LLM agents| | 1092|ymcui/Chinese-LLaMA-Alpaca-3 !2025-03-2818980|中文羊驼大模型三期项目 (Chinese Llama-3 LLMs) developed from Meta Llama 3| | 1093|Nutlope/notesGPT !2025-03-2818811|Record voice notes & transcribe, summarize, and get tasks| | 1094|InstantStyle/InstantStyle !2025-03-2818791|InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation 🔥| | 1095|idaholab/moose !2025-03-2818771|Multiphysics Object Oriented Simulation Environment| | 1096|The-OpenROAD-Project/OpenROAD !2025-03-2818351|OpenROAD's unified application implementing an RTL-to-GDS Flow. Documentation at https://openroad.readthedocs.io/en/latest/| | 1097|alibaba/spring-ai-alibaba !2025-03-281831121|Agentic AI Framework for Java Developers| | 1098|ytongbai/LVM !2025-03-2817990|Sequential Modeling Enables Scalable Learning for Large Vision Models| | 1099|microsoft/sample-app-aoai-chatGPT !2025-03-2817981|[PREVIEW] Sample code for a simple web chat experience targeting chatGPT through AOAI.| | 1100|AI-Citizen/SolidGPT !2025-03-2817830|Chat everything with your code repository, ask repository level code questions, and discuss your requirements. AI Scan and learning your code repository, provide you code repository level answer🧱 🧱| | 1101|YangLing0818/RPG-DiffusionMaster !2025-03-2817784|Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (PRG)| | 1102|kyegomez/BitNet !2025-03-2817710|Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" in pytorch| | 1103|eloialonso/diamond !2025-03-2817671|DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model.| | 1104|flowdriveai/flowpilot !2025-03-2817250|flow-pilot is an openpilot based driver assistance system that runs on linux, windows and android powered machines.| | 1105|xlang-ai/OSWorld !2025-03-2817200|OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments| | 1106|linyiLYi/snake-ai !2025-03-2817031|An AI agent that beats the classic game "Snake".| | 1107|baaivision/Emu !2025-03-2816991|Emu Series: Generative Multimodal Models from BAAI| | 1108|kevmo314/scuda !2025-03-2816870|SCUDA is a GPU over IP bridge allowing GPUs on remote machines to be attached to CPU-only machines.| | 1109|SharifiZarchi/IntroductiontoMachineLearning !2025-03-2816701|دوره‌ی مقدمه‌ای بر یادگیری ماشین، برای دانشجویان| | 1110|google/maxtext !2025-03-2816670|A simple, performant and scalable Jax LLM!| | 1111|ml-explore/mlx-swift-examples !2025-03-2816471|Examples using MLX Swift| | 1112|unitreerobotics/unitreerlgym !2025-03-2816256|| | 1113|collabora/WhisperFusion !2025-03-2815901|WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide a seamless conversations with an AI.| | 1114|lichao-sun/Mora !2025-03-2815520|Mora: More like Sora for Generalist Video Generation| | 1115|GoogleCloudPlatform/localllm !2025-03-2815370|Run LLMs locally on Cloud Workstations| | 1116|TencentARC/BrushNet !2025-03-2815330|The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"| | 1117|ai-christianson/RA.Aid !2025-03-2815288|Develop software autonomously.| | 1118|stephansturges/WALDO !2025-03-2815170|Whereabouts Ascertainment for Low-lying Detectable Objects. The SOTA in FOSS AI for drones!| | 1119|skills/copilot-codespaces-vscode !2025-03-2815112|Develop with AI-powered code suggestions using GitHub Copilot and VS Code| | 1120|andrewnguonly/Lumos !2025-03-2814920|A RAG LLM co-pilot for browsing the web, powered by local LLMs| | 1121|TeamNewPipe/NewPipeExtractor !2025-03-2814811|NewPipe's core library for extracting data from streaming sites| | 1122|mhamilton723/FeatUp !2025-03-2814770|Official code for "FeatUp: A Model-Agnostic Frameworkfor Features at Any Resolution" ICLR 2024| | 1123|AnswerDotAI/fsdpqlora !2025-03-2814671|Training LLMs with QLoRA + FSDP| | 1124|jgravelle/AutoGroq !2025-03-2814330|| | 1125|OpenGenerativeAI/llm-colosseum !2025-03-2814130|Benchmark LLMs by fighting in Street Fighter 3! The new way to evaluate the quality of an LLM| | 1126|microsoft/vscode-ai-toolkit !2025-03-2814000|| | 1127|McGill-NLP/webllama !2025-03-2813930|Llama-3 agents that can browse the web by following instructions and talking to you| | 1128|lucidrains/self-rewarding-lm-pytorch !2025-03-2813760|Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI| | 1129|ishaan1013/sandbox !2025-03-2813650|A cloud-based code editing environment with an AI copilot and real-time collaboration.| | 1130|goatcorp/Dalamud !2025-03-2813275|FFXIV plugin framework and API| | 1131|Lightning-AI/lightning-thunder !2025-03-2813151|Make PyTorch models Lightning fast! Thunder is a source to source compiler for PyTorch. It enables using different hardware executors at once.| | 1132|PKU-YuanGroup/MagicTime !2025-03-2813052|MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators| | 1133|SakanaAI/evolutionary-model-merge !2025-03-2813000|Official repository of Evolutionary Optimization of Model Merging Recipes| | 1134|a-real-ai/pywinassistant !2025-03-2812950|The first open source Large Action Model generalist Artificial Narrow Intelligence that controls completely human user interfaces by only using natural language. PyWinAssistant utilizes Visualization-of-Thought Elicits Spatial Reasoning in Large Language Models.| | 1135|TraceMachina/nativelink !2025-03-2812630|NativeLink is an open source high-performance build cache and remote execution server, compatible with Bazel, Buck2, Reclient, and other RBE-compatible build systems. It offers drastically faster builds, reduced test flakiness, and significant infrastructure cost savings.| | 1136|MLSysOps/MLE-agent !2025-03-2812500|🤖 MLE-Agent: Your intelligent companion for seamless AI engineering and research. 🔍 Integrate with arxiv and paper with code to provide better code/research plans 🧰 OpenAI, Ollama, etc supported. 🎆 Code RAG| | 1137|wpilibsuite/allwpilib !2025-03-2811610|Official Repository of WPILibJ and WPILibC| | 1138|elfvingralf/macOSpilot-ai-assistant !2025-03-2811470|Voice + Vision powered AI assistant that answers questions about any application, in context and in audio.| | 1139|langchain-ai/langchain-extract !2025-03-2811210|🦜⛏️ Did you say you like data?| | 1140|FoundationVision/GLEE !2025-03-2811120|【CVPR2024】GLEE: General Object Foundation Model for Images and Videos at Scale| | 1141|Profluent-AI/OpenCRISPR !2025-03-2810990|AI-generated gene editing systems| | 1142|zju3dv/EasyVolcap !2025-03-2810821|[SIGGRAPH Asia 2023 (Technical Communications)] EasyVolcap: Accelerating Neural Volumetric Video Research| | 1143|PaddlePaddle/PaddleHelix !2025-03-2810560|Bio-Computing Platform Featuring Large-Scale Representation Learning and Multi-Task Deep Learning “螺旋桨”生物计算工具集| | 1144|myshell-ai/JetMoE !2025-03-289800|Reaching LLaMA2 Performance with 0.1M Dollars| | 1145|likejazz/llama3.np !2025-03-289770|llama3.np is pure NumPy implementation for Llama 3 model.| | 1146|mustafaaljadery/gemma-2B-10M !2025-03-289500|Gemma 2B with 10M context length using Infini-attention.| | 1147|HITsz-TMG/FilmAgent !2025-03-289382|Resources of our paper "FilmAgent: A Multi-Agent Framework for End-to-End Film Automation in Virtual 3D Spaces". New versions in the making!| | 1148|aws-samples/amazon-bedrock-samples !2025-03-289362|This repository contains examples for customers to get started using the Amazon Bedrock Service. This contains examples for all available foundational models| | 1149|Akkudoktor-EOS/EOS !2025-03-2893154|This repository features an Energy Optimization System (EOS) that optimizes energy distribution, usage for batteries, heat pumps& household devices. It includes predictive models for electricity prices (planned), load forecasting& dynamic optimization to maximize energy efficiency & minimize costs. Founder Dr. Andreas Schmitz (YouTube @akkudoktor)| Tip: | symbol| rule | | :----| :---- | |🔥 | 256 1k| |!green-up-arrow.svg !red-down-arrow | ranking up / down| |⭐ | on trending page today| [Back to Top] Tools | No. | Tool | Description | | ----:|:----------------------------------------------- |:------------------------------------------------------------------------------------------- | | 1 | ChatGPT | A sibling model to InstructGPT, which is trained to follow instructions in a prompt and provide a detailed response | | 2 | DALL·E 2 | Create original, realistic images and art from a text description | | 3 | Murf AI | AI enabled, real people's voices| | 4 | Midjourney | An independent research lab that produces an artificial intelligence program under the same name that creates images from textual descriptions, used in Discord | 5 | Make-A-Video | Make-A-Video is a state-of-the-art AI system that generates videos from text | | 6 | Creative Reality™ Studio by D-ID| Use generative AI to create future-facing videos| | 7 | chat.D-ID| The First App Enabling Face-to-Face Conversations with ChatGPT| | 8 | Notion AI| Access the limitless power of AI, right inside Notion. Work faster. Write better. Think bigger. | | 9 | Runway| Text to Video with Gen-2 | | 10 | Resemble AI| Resemble’s AI voice generator lets you create human–like voice overs in seconds | | 11 | Cursor| Write, edit, and chat about your code with a powerful AI | | 12 | Hugging Face| Build, train and deploy state of the art models powered by the reference open source in machine learning | | 13 | Claude | A next-generation AI assistant for your tasks, no matter the scale | | 14 | Poe| Poe lets you ask questions, get instant answers, and have back-and-forth conversations with AI. Gives access to GPT-4, gpt-3.5-turbo, Claude from Anthropic, and a variety of other bots| [Back to Top] Websites | No. | WebSite |Description | | ----:|:------------------------------------------ |:---------------------------------------------------------------------------------------- | | 1 | OpenAI | An artificial intelligence research lab | | 2 | Bard | Base Google's LaMDA chatbots and pull from internet | | 3 | ERNIE Bot | Baidu’s new generation knowledge-enhanced large language model is a new member of the Wenxin large model family | | 4 | DALL·E 2 | An AI system that can create realistic images and art from a description in natural language | | 5 | Whisper | A general-purpose speech recognition model | | 6| CivitAI| A platform that makes it easy for people to share and discover resources for creating AI art| | 7|D-ID| D-ID’s Generative AI enables users to transform any picture or video into extraordinary experiences| | 8| Nvidia eDiff-I| Text-to-Image Diffusion Models with Ensemble of Expert Denoisers | | 9| Stability AI| The world's leading open source generative AI company which opened source Stable Diffusion | | 10| Meta AI| Whether it be research, product or infrastructure development, we’re driven to innovate responsibly with AI to benefit the world | | 11| ANTHROPIC| AI research and products that put safety at the frontier | [Back to Top] Reports&Papers | No. | Report&Paper | Description | |:---- |:-------------------------------------------------------------------------------------------------------------- |:---------------------------------------------------- | | 1 | GPT-4 Technical Report | GPT-4 Technical Report | | 2 | mli/paper-reading | Deep learning classics and new papers are read carefully paragraph by paragraph. | | 3 | labmlai/annotateddeeplearningpaperimplementations| A collection of simple PyTorch implementations of neural networks and related algorithms, which are documented with explanations | | 4 | Visual ChatGPT: Talking, Drawing and Editing with Visual Foundation Models | Talking, Drawing and Editing with Visual Foundation Models | | 5 | OpenAI Research | The latest research report and papers from OpenAI | | 6 | Make-A-Video: Text-to-Video Generation without Text-Video Data|Meta's Text-to-Video Generation| | 7 | eDiff-I: Text-to-Image Diffusion Models with Ensemble of Expert Denoisers| Nvidia eDiff-I - New generation of generative AI content creation tool | | 8 | Training an Assistant-style Chatbot with Large Scale Data Distillation from GPT-3.5-Turbo | 2023 GPT4All Technical Report | | 9 | Segment Anything| Meta Segment Anything | | 10 | LLaMA: Open and Efficient Foundation Language Models| LLaMA: a collection of foundation language models ranging from 7B to 65B parameters| | 11 | papers-we-love/papers-we-love |Papers from the computer science community to read and discuss| | 12 | CVPR 2023 papers |The most exciting and influential CVPR 2023 papers| [Back to Top] Tutorials | No. | Tutorial | Description| |:---- |:---------------------------------------------------------------- | --- | | 1 | Coursera - Machine Learning | The Machine Learning Specialization Course taught by Dr. Andrew Ng| | 2 | microsoft/ML-For-Beginners | 12 weeks, 26 lessons, 52 quizzes, classic Machine Learning for all| | 3 | ChatGPT Prompt Engineering for Developers | This short course taught by Isa Fulford (OpenAI) and Andrew Ng (DeepLearning.AI) will teach how to use a large language model (LLM) to quickly build new and powerful applications | | 4 | Dive into Deep Learning |Targeting Chinese readers, functional and open for discussion. The Chinese and English versions are used for teaching in over 400 universities across more than 60 countries | | 5 | AI Expert Roadmap | Roadmap to becoming an Artificial Intelligence Expert in 2022 | | 6 | Computer Science courses |List of Computer Science courses with video lectures| | 7 | Machine Learning with Python | Machine Learning with Python Certification on freeCodeCamp| | 8 | Building Systems with the ChatGPT API | This short course taught by Isa Fulford (OpenAI) and Andrew Ng (DeepLearning.AI), you will learn how to automate complex workflows using chain calls to a large language model| | 9 | LangChain for LLM Application Development | This short course taught by Harrison Chase (Co-Founder and CEO at LangChain) and Andrew Ng. you will gain essential skills in expanding the use cases and capabilities of language models in application development using the LangChain framework| | 10 | How Diffusion Models Work | This short course taught by Sharon Zhou (CEO, Co-founder, Lamini). you will gain a deep familiarity with the diffusion process and the models which carry it out. More than simply pulling in a pre-built model or using an API, this course will teach you to build a diffusion model from scratch| | 11 | Free Programming Books For AI |📚 Freely available programming books for AI | | 12 | microsoft/AI-For-Beginners |12 Weeks, 24 Lessons, AI for All!| | 13 | hemansnation/God-Level-Data-Science-ML-Full-Stack |A collection of scientific methods, processes, algorithms, and systems to build stories & models. This roadmap contains 16 Chapters, whether you are a fresher in the field or an experienced professional who wants to transition into Data Science & AI| | 14 | datawhalechina/prompt-engineering-for-developers |Chinese version of Andrew Ng's Big Model Series Courses, including "Prompt Engineering", "Building System", and "LangChain"| | 15 | ossu/computer-science |🎓 Path to a free self-taught education in Computer Science!| | 16 | microsoft/Data-Science-For-Beginners | 10 Weeks, 20 Lessons, Data Science for All! | |17 |jwasham/coding-interview-university !2023-09-29268215336 |A complete computer science study plan to become a software engineer.| [Back to Top] Thanks If this project has been helpful to you in any way, please give it a ⭐️ by clicking on the star.

h2o-llmstudio
github
LLM Vibe Score0.499
Human Vibe Score0.04822694170894296
h2oaiMar 28, 2025

h2o-llmstudio

Welcome to H2O LLM Studio, a framework and no-code GUI designed for fine-tuning state-of-the-art large language models (LLMs). Jump to With H2O LLM Studio, you can Quickstart What's New Setup Recommended Install Virtual Environments Run H2O LLM Studio GUI Run H2O LLM Studio GUI using Docker Run H2O LLM Studio with command line interface (CLI) Troubleshooting Data format and example data Training your model Example: Run on OASST data via CLI Model checkpoints Documentation Contributing License With H2O LLM Studio, you can easily and effectively fine-tune LLMs without the need for any coding experience. use a graphic user interface (GUI) specially designed for large language models. finetune any LLM using a large variety of hyperparameters. use recent finetuning techniques such as Low-Rank Adaptation (LoRA) and 8-bit model training with a low memory footprint. use Reinforcement Learning (RL) to finetune your model (experimental) use advanced evaluation metrics to judge generated answers by the model. track and compare your model performance visually. In addition, Neptune and W&B integration can be used. chat with your model and get instant feedback on your model performance. easily export your model to the Hugging Face Hub and share it with the community. Quickstart For questions, discussing, or just hanging out, come and join our Discord! Use cloud-based runpod.io instance to run the H2O LLM Studio GUI. Using CLI for fine-tuning LLMs: What's New PR 788 New problem type for Causal Regression Modeling allows to train single target regression data using LLMs. PR 747 Fully removed RLHF in favor of DPO/IPO/KTO optimization. PR 741 Removing separate max length settings for prompt and answer in favor of a single maxlength settings better resembling chattemplate functionality from transformers. PR 592 Added KTOPairLoss for DPO modeling allowing to train models with simple preference data. Data currently needs to be manually prepared by randomly matching positive and negative examples as pairs. PR 592 Starting to deprecate RLHF in favor of DPO/IPO optimization. Training is disabled, but old experiments are still viewable. RLHF will be fully removed in a future release. PR 530 Introduced a new problem type for DPO/IPO optimization. This optimization technique can be used as an alternative to RLHF. PR 288 Introduced Deepspeed for sharded training allowing to train larger models on machines with multiple GPUs. Requires NVLink. This feature replaces FSDP and offers more flexibility. Deepspeed requires a system installation of cudatoolkit and we recommend using version 12.1. See Recommended Install. PR 449 New problem type for Causal Classification Modeling allows to train binary and multiclass models using LLMs. PR 364 User secrets are now handled more securely and flexible. Support for handling secrets using the 'keyring' library was added. User settings are tried to be migrated automatically. Please note that due to current rapid development we cannot guarantee full backwards compatibility of new functionality. We thus recommend to pin the version of the framework to the one you used for your experiments. For resetting, please delete/backup your data and output folders. Setup H2O LLM Studio requires a machine with Ubuntu 16.04+ and at least one recent Nvidia GPU with Nvidia drivers version >= 470.57.02. For larger models, we recommend at least 24GB of GPU memory. For more information about installation prerequisites, see the Set up H2O LLM Studio guide in the documentation. For a performance comparison of different GPUs, see the H2O LLM Studio performance guide in the documentation. Recommended Install The recommended way to install H2O LLM Studio is using pipenv with Python 3.10. To install Python 3.10 on Ubuntu 16.04+, execute the following commands: System installs (Python 3.10) Installing NVIDIA Drivers (if required) If deploying on a 'bare metal' machine running Ubuntu, one may need to install the required Nvidia drivers and CUDA. The following commands show how to retrieve the latest drivers for a machine running Ubuntu 20.04 as an example. One can update the following based on their OS. alternatively, one can install cudatoolkits in a conda environment: Virtual environments We offer various ways of setting up the necessary python environment. Pipenv virtual environment The following command will create a virtual environment using pipenv and will install the dependencies using pipenv: If you are having troubles installing the flash_attn package, consider running instead. This will install the dependencies without the flash_attn package. Note that this will disable the use of Flash Attention 2 and model training will be slower and consume more memory. Nightly Conda virtual environment You can also setup a conda virtual environment that can also deviate from the recommended setup. The contains a command that installs a fresh conda environment with CUDA 12.4 and current nightly PyTorch. Using requirements.txt If you wish to use another virtual environment, you can also install the dependencies using the requirements.txt file: Run H2O LLM Studio GUI You can start H2O LLM Studio using the following command: This command will start the H2O wave server and app. Navigate to (we recommend using Chrome) to access H2O LLM Studio and start fine-tuning your models! If you are running H2O LLM Studio with a custom environment other than Pipenv, you need to start the app as follows: If you are using the nightly conda environment, you can run . Run H2O LLM Studio GUI using Docker Install Docker first by following instructions from NVIDIA Containers. Make sure to have nvidia-container-toolkit installed on your machine as outlined in the instructions. H2O LLM Studio images are stored in the h2oai dockerhub container repository. Navigate to (we recommend using Chrome) to access H2O LLM Studio and start fine-tuning your models! (Note other helpful docker commands are docker ps and docker kill.) If you prefer to build your own Docker image from source, follow the instructions below. Run H2O LLM Studio with command line interface (CLI) You can also use H2O LLM Studio with the command line interface (CLI) and specify the configuration .yaml file that contains all the experiment parameters. To finetune using H2O LLM Studio with CLI, activate the pipenv environment by running make shell, and then use the following command: To run on multiple GPUs in DDP mode, run the following command: By default, the framework will run on the first k GPUs. If you want to specify specific GPUs to run on, use the CUDAVISIBLEDEVICES environment variable before the command. To start an interactive chat with your trained model, use the following command: where experiment_name is the output folder of the experiment you want to chat with (see configuration). The interactive chat will also work with model that were finetuned using the UI. To publish the model to Hugging Face, use the following command: pathtoexperiment is the output folder of the experiment. device is the target device for running the model, either 'cpu' or 'cuda:0'. Default is 'cuda:0'. api_key is the Hugging Face API Key. If user logged in, it can be omitted. user_id is the Hugging Face user ID. If user logged in, it can be omitted. model_name is the name of the model to be published on Hugging Face. It can be omitted. safe_serialization is a flag indicating whether safe serialization should be used. Default is True. Troubleshooting If running on cloud based machines such as runpod, you may need to set the following environment variable to allow the H2O Wave server to accept connections from the proxy: If you are experiencing timeouts when running the H2O Wave server remotely, you can increase the timeout by setting the following environment variables: All default to 5 (seconds). Increase them if you are experiencing timeouts. Use -1 to disable the timeout. Data format and example data For details on the data format required when importing your data or example data that you can use to try out H2O LLM Studio, see Data format in the H2O LLM Studio documentation. Training your model With H2O LLM Studio, training your large language model is easy and intuitive. First, upload your dataset and then start training your model. Start by creating an experiment. You can then monitor and manage your experiment, compare experiments, or push the model to Hugging Face to share it with the community. Example: Run on OASST data via CLI As an example, you can run an experiment on the OASST data via CLI. For instructions, see Run an experiment on the OASST data guide in the H2O LLM Studio documentation. Model checkpoints All open-source datasets and models are posted on H2O.ai's Hugging Face page and our H2OGPT repository. Documentation Detailed documentation and frequently asked questions (FAQs) for H2O LLM Studio can be found at . If you wish to contribute to the docs, navigate to the /documentation folder of this repo and refer to the README.md for more information. Contributing We are happy to accept contributions to the H2O LLM Studio project. Please refer to the CONTRIBUTING.md file for more information. License H2O LLM Studio is licensed under the Apache 2.0 license. Please see the LICENSE file for more information.

vector-vein
github
LLM Vibe Score0.532
Human Vibe Score0.010966292738059526
AndersonBYMar 28, 2025

vector-vein

English | 简体中文 | 日本語 🔀 VectorVein Build your automation workflow with the power of AI and your personal knowledge base. Create powerful workflows with just drag and drop, without any programming. VectorVein is a no-code AI workflow software inspired by LangChain and langflow, designed to combine the powerful capabilities of large language models and enable users to easily achieve intelligent and automated workflows for various daily tasks. 🌐 Online Experience You can experience VectorVein's online version here, with no need to download or install. Official website Online Documentation 📦 Installation and Configuration Installation After downloading VectorVein from Release, the program will create a "data" folder in the installation directory to store the database and static file resources. VectorVein is built using pywebview, based on the webview2 kernel, so you need to install the webview2 runtime. If the software cannot be opened, you may need to download the webview2 runtime manually from https://developer.microsoft.com/en-us/microsoft-edge/webview2/ [!IMPORTANT] If the software cannot be opened after decompression, please check if the downloaded compressed package .zip file is locked. You can solve this problem by right-clicking the compressed package and selecting "Unblock". Configuration Most workflows and agents in the software involve the use of AI large language models, so you should at least provide a usable configuration for a large language model. For workflows, you can see which large language models are being used in the interface, as shown in the image below. !LLM used in workflow API Endpoint Configuration Starting from v0.2.10, VectorVein separates API endpoints and large language model configurations, allowing multiple API endpoints for the same large language model. !API Endpoint Configuration After the software opens normally, click the open settings button, and you can configure the information for each API endpoint as needed, or add custom API endpoints. Currently, the API endpoints support OpenAI-compatible interfaces, which can be connected to locally running services such as LM-Studio, Ollama, vLLM, etc. The API Base for LM-Studio is typically http://localhost:1234/v1/ The API Base for Ollama is typically http://localhost:11434/v1/ Remote Large Language Model Interface Configuration Please configure the specific information for each model in the Remote LLMs tab. !LLM Settings Click on any model to set its specific configuration, as shown below. !LLM Settings The Model Key is the standard name of the large model and generally does not need to be adjusted. The Model ID is the name used during actual deployment, which usually matches the Model Key. However, in deployments like Azure OpenAI, the Model ID is user-defined and therefore needs to be adjusted according to the actual situation. Since the model IDs from different providers for the same model may vary, you can click the Edit button to configure the specific model ID under this endpoint, as shown in the figure below. !Endpoint Model ID Configuration Custom Large Language Model Interface Configuration If using a custom large language model, fill in the custom model configuration information on the Custom LLMs tab. Currently, interfaces compatible with OpenAI are supported, such as LM-Studio, Ollama, vLLM, etc. !Custom LLM Settings First, add a custom model family, then add a custom model. Don't forget to click the Save Settings button. Speech Recognition Configuration Currently, the speech recognition services of OpenAI/Deepgram are supported. For OpenAI services, you can use the same configuration as the large language model or set up a speech recognition service compatible with the OpenAI API (such as Groq). !Speech Recognition Configuration Embedding Configuration When you need to perform vector searches using vector data, you have the option to use embedding services provided by OpenAI or configure local embedding services in the Embedding Model settings. Currently, supported local embedding services require you to set up text-embeddings-inference yourself. !Local Embedding Settings Shortcut Settings For ease of daily use, you can configure shortcuts to quickly initiate voice conversations with the Agent. By launching through the shortcut, you can directly interact with the Agent via speech recognition. It is important to ensure that the speech recognition service is correctly configured beforehand. Include Screenshot means that while starting the conversation, a screenshot of the screen will be taken and uploaded as an attachment to the conversation. !Shortcut Settings Notes About the local Stable Diffusion API To use your own local Stable Diffusion API, you need to add the parameter --api to the startup item of webui-user.bat, that is 💻 Usage 📖 Basic Concepts A workflow represents a work task process, including input, output, and how input is processed to reach the output result. Examples: Translation Workflow: The input is an English Word document, and the output is also a Word document. You can design a workflow to translate the input Chinese document and generate a Chinese document output. Mind Map Workflow: If the output of the translation workflow is changed to a mind map, you can get a workflow that reads an English Word document and summarizes it into a Chinese mind map. Web Article Summary Workflow: If the input of the mind map workflow is changed to a URL of a web article, you can get a workflow that reads a web article and summarizes it into a Chinese mind map. Automatic Classification of Customer Complaints Workflow: The input is a table containing complaint content, and you can customize the keywords that need to be classified, so that the complaints can be automatically classified. The output is an automatically generated Excel table containing the classification results. 🔎 User Interface Each workflow has a User Interface and an Editor Interface. The user interface is used for daily workflow operations, and the editor interface is used for workflow editing. Usually, after designing a workflow, you only need to run it in the user interface and do not need to modify it in the editor interface. !User Interface The user interface is shown above and is divided into three parts: input, output, and trigger (usually a run button). You can directly enter content for daily use, click the run button to see the output result. To view the executed workflow, click Workflow Run Records, as shown in the following figure. !Workflow Run Records ✏️ Creating a Workflow You can add our official templates to your workflow or create a new one. It is recommended to familiarize yourself with the use of workflows using official templates at the beginning. !Workflow Editor Interface The workflow editor interface is shown above. You can edit the name, tags, and detailed description at the top. The left side is the node list of the workflow, and the right is the canvas of the workflow. You can drag the desired node from the left side to the canvas, and then connect the node through the wire to form a workflow. You can view a tutorial on creating a simple crawler + AI summary mind map workflow here. You can also try this online interactive tutorial. 🛠️ Development and Deployment Environment Requirements Backend Python 3.8 ~ Python 3.11 PDM installed Frontend Vue3 Vite Project Development Copy and modify backend/.env.example to .env file, this is the basic environment variable information, which will be used during development and packaging. Run the following command in the backend directory to install dependencies: Windows Mac Normally, PDM will automatically find the system's Python and create a virtual environment and install dependencies. After installation, run the following command to start the backend development server and see the running effect: If you need to modify the frontend code, you need to run the following command in the frontend directory to install dependencies: When pulling the project code for the first time, you also need to run pnpm install to install the front-end dependencies. If you don't need to develop any front-end code at all, you can directly copy the web folder from the release version into the backend folder. After the frontend dependencies are installed, you need to compile the frontend code into the static file directory of the backend. A shortcut instruction has been provided in the project. Run the following command in the backend directory to pack and copy the frontend resources: Database Structure Changes [!WARNING] Before making changes to the database structure, please back up your database (located at my_database.db in your configured data directory), otherwise you may lose data. If you have modified the model structure in backend/models, you need to run the following commands in the backend directory to update the database structure: First, enter the Python environment: After the operation, a new migration file will be generated in the backend/migrations directory, with the filename format xxxmigrationname.py. It is recommended to check the content of the migration file first to ensure it is correct, and then restart the main program. The main program will automatically execute the migration. Software Packaging The project uses pyinstaller for packaging. Run the following command in the backend directory to package it into an executable file: After packaging, the executable file will be generated in thebackend/dist directory. 📄 License VectorVein is an open-source software that supports personal non-commercial use. Please refer to LICENSE for specific agreements.

Production-Level-Deep-Learning
github
LLM Vibe Score0.619
Human Vibe Score0.8326638433689385
alirezadirMar 28, 2025

Production-Level-Deep-Learning

:bulb: A Guide to Production Level Deep Learning :clapper: :scroll: :ferry: 🇨🇳 Translation in Chinese.md) :label: NEW: Machine Learning Interviews :label: Note: This repo is under continous development, and all feedback and contribution are very welcome :blush: Deploying deep learning models in production can be challenging, as it is far beyond training models with good performance. Several distinct components need to be designed and developed in order to deploy a production level deep learning system (seen below): This repo aims to be an engineering guideline for building production-level deep learning systems which will be deployed in real world applications. The material presented here is borrowed from Full Stack Deep Learning Bootcamp (by Pieter Abbeel at UC Berkeley, Josh Tobin at OpenAI, and Sergey Karayev at Turnitin), TFX workshop by Robert Crowe, and Pipeline.ai's Advanced KubeFlow Meetup by Chris Fregly. Machine Learning Projects Fun :flushed: fact: 85% of AI projects fail. 1 Potential reasons include: Technically infeasible or poorly scoped Never make the leap to production Unclear success criteria (metrics) Poor team management ML Projects lifecycle Importance of understanding state of the art in your domain: Helps to understand what is possible Helps to know what to try next Mental Model for ML project The two important factors to consider when defining and prioritizing ML projects: High Impact: Complex parts of your pipeline Where "cheap prediction" is valuable Where automating complicated manual process is valuable Low Cost: Cost is driven by: Data availability Performance requirements: costs tend to scale super-linearly in the accuracy requirement Problem difficulty: Some of the hard problems include: unsupervised learning, reinforcement learning, and certain categories of supervised learning Full stack pipeline The following figure represents a high level overview of different components in a production level deep learning system: In the following, we will go through each module and recommend toolsets and frameworks as well as best practices from practitioners that fit each component. Data Management 1.1 Data Sources Supervised deep learning requires a lot of labeled data Labeling own data is costly! Here are some resources for data: Open source data (good to start with, but not an advantage) Data augmentation (a MUST for computer vision, an option for NLP) Synthetic data (almost always worth starting with, esp. in NLP) 1.2 Data Labeling Requires: separate software stack (labeling platforms), temporary labor, and QC Sources of labor for labeling: Crowdsourcing (Mechanical Turk): cheap and scalable, less reliable, needs QC Hiring own annotators: less QC needed, expensive, slow to scale Data labeling service companies: FigureEight Labeling platforms: Diffgram: Training Data Software (Computer Vision) Prodigy: An annotation tool powered by active learning (by developers of Spacy), text and image HIVE: AI as a Service platform for computer vision Supervisely: entire computer vision platform Labelbox: computer vision Scale AI data platform (computer vision & NLP) 1.3. Data Storage Data storage options: Object store: Store binary data (images, sound files, compressed texts) Amazon S3 Ceph Object Store Database: Store metadata (file paths, labels, user activity, etc). Postgres is the right choice for most of applications, with the best-in-class SQL and great support for unstructured JSON. Data Lake: to aggregate features which are not obtainable from database (e.g. logs) Amazon Redshift Feature Store: store, access, and share machine learning features (Feature extraction could be computationally expensive and nearly impossible to scale, hence re-using features by different models and teams is a key to high performance ML teams). FEAST (Google cloud, Open Source) Michelangelo Palette (Uber) Suggestion: At training time, copy data into a local or networked filesystem (NFS). 1 1.4. Data Versioning It's a "MUST" for deployed ML models: Deployed ML models are part code, part data. 1 No data versioning means no model versioning. Data versioning platforms: DVC: Open source version control system for ML projects Pachyderm: version control for data Dolt: a SQL database with Git-like version control for data and schema 1.5. Data Processing Training data for production models may come from different sources, including Stored data in db and object stores, log processing, and outputs of other classifiers*. There are dependencies between tasks, each needs to be kicked off after its dependencies are finished. For example, training on new log data, requires a preprocessing step before training. Makefiles are not scalable. "Workflow manager"s become pretty essential in this regard. Workflow orchestration: Luigi by Spotify Airflow by Airbnb: Dynamic, extensible, elegant, and scalable (the most widely used) DAG workflow Robust conditional execution: retry in case of failure Pusher supports docker images with tensorflow serving Whole workflow in a single .py file Development, Training, and Evaluation 2.1. Software engineering Winner language: Python Editors: Vim Emacs VS Code (Recommended by the author): Built-in git staging and diff, Lint code, open projects remotely through ssh Notebooks: Great as starting point of the projects, hard to scale (fun fact: Netflix’s Notebook-Driven Architecture is an exception, which is entirely based on nteract suites). nteract: a next-gen React-based UI for Jupyter notebooks Papermill: is an nteract library built for parameterizing, executing, and analyzing* Jupyter Notebooks. Commuter: another nteract project which provides a read-only display of notebooks (e.g. from S3 buckets). Streamlit: interactive data science tool with applets Compute recommendations 1: For individuals or startups*: Development: a 4x Turing-architecture PC Training/Evaluation: Use the same 4x GPU PC. When running many experiments, either buy shared servers or use cloud instances. For large companies:* Development: Buy a 4x Turing-architecture PC per ML scientist or let them use V100 instances Training/Evaluation: Use cloud instances with proper provisioning and handling of failures Cloud Providers: GCP: option to connect GPUs to any instance + has TPUs AWS: 2.2. Resource Management Allocating free resources to programs Resource management options: Old school cluster job scheduler ( e.g. Slurm workload manager ) Docker + Kubernetes Kubeflow Polyaxon (paid features) 2.3. DL Frameworks Unless having a good reason not to, use Tensorflow/Keras or PyTorch. 1 The following figure shows a comparison between different frameworks on how they stand for "developement" and "production"*. 2.4. Experiment management Development, training, and evaluation strategy: Always start simple Train a small model on a small batch. Only if it works, scale to larger data and models, and hyperparameter tuning! Experiment management tools: Tensorboard provides the visualization and tooling needed for ML experimentation Losswise (Monitoring for ML) Comet: lets you track code, experiments, and results on ML projects Weights & Biases: Record and visualize every detail of your research with easy collaboration MLFlow Tracking: for logging parameters, code versions, metrics, and output files as well as visualization of the results. Automatic experiment tracking with one line of code in python Side by side comparison of experiments Hyper parameter tuning Supports Kubernetes based jobs 2.5. Hyperparameter Tuning Approaches: Grid search Random search Bayesian Optimization HyperBand and Asynchronous Successive Halving Algorithm (ASHA) Population-based Training Platforms: RayTune: Ray Tune is a Python library for hyperparameter tuning at any scale (with a focus on deep learning and deep reinforcement learning). Supports any machine learning framework, including PyTorch, XGBoost, MXNet, and Keras. Katib: Kubernete's Native System for Hyperparameter Tuning and Neural Architecture Search, inspired by Google vizier and supports multiple ML/DL frameworks (e.g. TensorFlow, MXNet, and PyTorch). Hyperas: a simple wrapper around hyperopt for Keras, with a simple template notation to define hyper-parameter ranges to tune. SIGOPT: a scalable, enterprise-grade optimization platform Sweeps from [Weights & Biases] (https://www.wandb.com/): Parameters are not explicitly specified by a developer. Instead they are approximated and learned by a machine learning model. Keras Tuner: A hyperparameter tuner for Keras, specifically for tf.keras with TensorFlow 2.0. 2.6. Distributed Training Data parallelism: Use it when iteration time is too long (both tensorflow and PyTorch support) Ray Distributed Training Model parallelism: when model does not fit on a single GPU Other solutions: Horovod Troubleshooting [TBD] Testing and Deployment 4.1. Testing and CI/CD Machine Learning production software requires a more diverse set of test suites than traditional software: Unit and Integration Testing: Types of tests: Training system tests: testing training pipeline Validation tests: testing prediction system on validation set Functionality tests: testing prediction system on few important examples Continuous Integration: Running tests after each new code change pushed to the repo SaaS for continuous integration: Argo: Open source Kubernetes native workflow engine for orchestrating parallel jobs (incudes workflows, events, CI and CD). CircleCI: Language-Inclusive Support, Custom Environments, Flexible Resource Allocation, used by instacart, Lyft, and StackShare. Travis CI Buildkite: Fast and stable builds, Open source agent runs on almost any machine and architecture, Freedom to use your own tools and services Jenkins: Old school build system 4.2. Web Deployment Consists of a Prediction System and a Serving System Prediction System: Process input data, make predictions Serving System (Web server): Serve prediction with scale in mind Use REST API to serve prediction HTTP requests Calls the prediction system to respond Serving options: Deploy to VMs, scale by adding instances Deploy as containers, scale via orchestration Containers Docker Container Orchestration: Kubernetes (the most popular now) MESOS Marathon Deploy code as a "serverless function" Deploy via a model serving solution Model serving: Specialized web deployment for ML models Batches request for GPU inference Frameworks: Tensorflow serving MXNet Model server Clipper (Berkeley) SaaS solutions Seldon: serve and scale models built in any framework on Kubernetes Algorithmia Decision making: CPU or GPU? CPU inference: CPU inference is preferable if it meets the requirements. Scale by adding more servers, or going serverless. GPU inference: TF serving or Clipper Adaptive batching is useful (Bonus) Deploying Jupyter Notebooks: Kubeflow Fairing is a hybrid deployment package that let's you deploy your Jupyter notebook* codes! 4.5 Service Mesh and Traffic Routing Transition from monolithic applications towards a distributed microservice architecture could be challenging. A Service mesh (consisting of a network of microservices) reduces the complexity of such deployments, and eases the strain on development teams. Istio: a service mesh to ease creation of a network of deployed services with load balancing, service-to-service authentication, monitoring, with few or no code changes in service code. 4.4. Monitoring: Purpose of monitoring: Alerts for downtime, errors, and distribution shifts Catching service and data regressions Cloud providers solutions are decent Kiali:an observability console for Istio with service mesh configuration capabilities. It answers these questions: How are the microservices connected? How are they performing? Are we done? 4.5. Deploying on Embedded and Mobile Devices Main challenge: memory footprint and compute constraints Solutions: Quantization Reduced model size MobileNets Knowledge Distillation DistillBERT (for NLP) Embedded and Mobile Frameworks: Tensorflow Lite PyTorch Mobile Core ML ML Kit FRITZ OpenVINO Model Conversion: Open Neural Network Exchange (ONNX): open-source format for deep learning models 4.6. All-in-one solutions Tensorflow Extended (TFX) Michelangelo (Uber) Google Cloud AI Platform Amazon SageMaker Neptune FLOYD Paperspace Determined AI Domino data lab Tensorflow Extended (TFX) [TBD] Airflow and KubeFlow ML Pipelines [TBD] Other useful links: Lessons learned from building practical deep learning systems Machine Learning: The High Interest Credit Card of Technical Debt Contributing References: [1]: Full Stack Deep Learning Bootcamp, Nov 2019. [2]: Advanced KubeFlow Workshop by Pipeline.ai, 2019. [3]: TFX: Real World Machine Learning in Production

aima-python
github
LLM Vibe Score0.575
Human Vibe Score0.33114909407186394
aimacodeMar 28, 2025

aima-python

aima-python Python code for the book Artificial Intelligence: A Modern Approach. You can use this in conjunction with a course on AI, or for study on your own. We're looking for solid contributors to help. Updates for 4th Edition The 4th edition of the book as out now in 2020, and thus we are updating the code. All code here will reflect the 4th edition. Changes include: Move from Python 3.5 to 3.7. More emphasis on Jupyter (Ipython) notebooks. More projects using external packages (tensorflow, etc.). Structure of the Project When complete, this project will have Python implementations for all the pseudocode algorithms in the book, as well as tests and examples of use. For each major topic, such as search, we provide the following files: search.ipynb and search.py: Implementations of all the pseudocode algorithms, and necessary support functions/classes/data. The .py file is generated automatically from the .ipynb file; the idea is that it is easier to read the documentation in the .ipynb file. search_XX.ipynb: Notebooks that show how to use the code, broken out into various topics (the XX). tests/test_search.py: A lightweight test suite, using assert statements, designed for use with py.test, but also usable on their own. Python 3.7 and up The code for the 3rd edition was in Python 3.5; the current 4th edition code is in Python 3.7. It should also run in later versions, but does not run in Python 2. You can install Python or use a browser-based Python interpreter such as repl.it. You can run the code in an IDE, or from the command line with python -i filename.py where the -i option puts you in an interactive loop where you can run Python functions. All notebooks are available in a binder environment. Alternatively, visit jupyter.org for instructions on setting up your own Jupyter notebook environment. Features from Python 3.6 and 3.7 that we will be using for this version of the code: f-strings: all string formatting should be done with f'var = {var}', not with 'var = {}'.format(var) nor 'var = %s' % var. typing module: declare functions with type hints: def successors(state) -> List[State]:; that is, give type declarations, but omit them when it is obvious. I don't need to say state: State, but in another context it would make sense to say s: State. Underscores in numerics: write a million as 1000000 not as 1000000. dataclasses module: replace namedtuple with dataclass. [//]: (There is a sibling [aima-docker]https://github.com/rajatjain1997/aima-docker project that shows you how to use docker containers to run more complex problems in more complex software environments.) Installation Guide To download the repository: git clone https://github.com/aimacode/aima-python.git Then you need to install the basic dependencies to run the project on your system: You also need to fetch the datasets from the aima-data repository: Wait for the datasets to download, it may take a while. Once they are downloaded, you need to install pytest, so that you can run the test suite: pip install pytest Then to run the tests: py.test And you are good to go! Index of Algorithms Here is a table of algorithms, the figure, name of the algorithm in the book and in the repository, and the file where they are implemented in the repository. This chart was made for the third edition of the book and is being updated for the upcoming fourth edition. Empty implementations are a good place for contributors to look for an issue. The aima-pseudocode project describes all the algorithms from the book. An asterisk next to the file name denotes the algorithm is not fully implemented. Another great place for contributors to start is by adding tests and writing on the notebooks. You can see which algorithms have tests and notebook sections below. If the algorithm you want to work on is covered, don't worry! You can still add more tests and provide some examples of use in the notebook! | Figure | Name (in 3rd edition) | Name (in repository) | File | Tests | Notebook |:-------|:----------------------------------|:------------------------------|:--------------------------------|:-----|:---------| | 2 | Random-Vacuum-Agent | RandomVacuumAgent | [agents.py][agents] | Done | Included | | 2 | Model-Based-Vacuum-Agent | ModelBasedVacuumAgent | [agents.py][agents] | Done | Included | | 2.1 | Environment | Environment | [agents.py][agents] | Done | Included | | 2.1 | Agent | Agent | [agents.py][agents] | Done | Included | | 2.3 | Table-Driven-Vacuum-Agent | TableDrivenVacuumAgent | [agents.py][agents] | Done | Included | | 2.7 | Table-Driven-Agent | TableDrivenAgent | [agents.py][agents] | Done | Included | | 2.8 | Reflex-Vacuum-Agent | ReflexVacuumAgent | [agents.py][agents] | Done | Included | | 2.10 | Simple-Reflex-Agent | SimpleReflexAgent | [agents.py][agents] | Done | Included | | 2.12 | Model-Based-Reflex-Agent | ReflexAgentWithState | [agents.py][agents] | Done | Included | | 3 | Problem | Problem | [search.py][search] | Done | Included | | 3 | Node | Node | [search.py][search] | Done | Included | | 3 | Queue | Queue | [utils.py][utils] | Done | No Need | | 3.1 | Simple-Problem-Solving-Agent | SimpleProblemSolvingAgent | [search.py][search] | Done | Included | | 3.2 | Romania | romania | [search.py][search] | Done | Included | | 3.7 | Tree-Search | depth/breadthfirsttree_search | [search.py][search] | Done | Included | | 3.7 | Graph-Search | depth/breadthfirstgraph_search | [search.py][search] | Done | Included | | 3.11 | Breadth-First-Search | breadthfirstgraph_search | [search.py][search] | Done | Included | | 3.14 | Uniform-Cost-Search | uniformcostsearch | [search.py][search] | Done | Included | | 3.17 | Depth-Limited-Search | depthlimitedsearch | [search.py][search] | Done | Included | | 3.18 | Iterative-Deepening-Search | iterativedeepeningsearch | [search.py][search] | Done | Included | | 3.22 | Best-First-Search | bestfirstgraph_search | [search.py][search] | Done | Included | | 3.24 | A\*-Search | astar_search | [search.py][search] | Done | Included | | 3.26 | Recursive-Best-First-Search | recursivebestfirst_search | [search.py][search] | Done | Included | | 4.2 | Hill-Climbing | hill_climbing | [search.py][search] | Done | Included | | 4.5 | Simulated-Annealing | simulated_annealing | [search.py][search] | Done | Included | | 4.8 | Genetic-Algorithm | genetic_algorithm | [search.py][search] | Done | Included | | 4.11 | And-Or-Graph-Search | andorgraph_search | [search.py][search] | Done | Included | | 4.21 | Online-DFS-Agent | onlinedfsagent | [search.py][search] | Done | Included | | 4.24 | LRTA\*-Agent | LRTAStarAgent | [search.py][search] | Done | Included | | 5.3 | Minimax-Decision | minimax_decision | [games.py][games] | Done | Included | | 5.7 | Alpha-Beta-Search | alphabeta_search | [games.py][games] | Done | Included | | 6 | CSP | CSP | [csp.py][csp] | Done | Included | | 6.3 | AC-3 | AC3 | [csp.py][csp] | Done | Included | | 6.5 | Backtracking-Search | backtracking_search | [csp.py][csp] | Done | Included | | 6.8 | Min-Conflicts | min_conflicts | [csp.py][csp] | Done | Included | | 6.11 | Tree-CSP-Solver | treecspsolver | [csp.py][csp] | Done | Included | | 7 | KB | KB | [logic.py][logic] | Done | Included | | 7.1 | KB-Agent | KB_AgentProgram | [logic.py][logic] | Done | Included | | 7.7 | Propositional Logic Sentence | Expr | [utils.py][utils] | Done | Included | | 7.10 | TT-Entails | tt_entails | [logic.py][logic] | Done | Included | | 7.12 | PL-Resolution | pl_resolution | [logic.py][logic] | Done | Included | | 7.14 | Convert to CNF | to_cnf | [logic.py][logic] | Done | Included | | 7.15 | PL-FC-Entails? | plfcentails | [logic.py][logic] | Done | Included | | 7.17 | DPLL-Satisfiable? | dpll_satisfiable | [logic.py][logic] | Done | Included | | 7.18 | WalkSAT | WalkSAT | [logic.py][logic] | Done | Included | | 7.20 | Hybrid-Wumpus-Agent | HybridWumpusAgent | | | | | 7.22 | SATPlan | SAT_plan | [logic.py][logic] | Done | Included | | 9 | Subst | subst | [logic.py][logic] | Done | Included | | 9.1 | Unify | unify | [logic.py][logic] | Done | Included | | 9.3 | FOL-FC-Ask | folfcask | [logic.py][logic] | Done | Included | | 9.6 | FOL-BC-Ask | folbcask | [logic.py][logic] | Done | Included | | 10.1 | Air-Cargo-problem | air_cargo | [planning.py][planning] | Done | Included | | 10.2 | Spare-Tire-Problem | spare_tire | [planning.py][planning] | Done | Included | | 10.3 | Three-Block-Tower | threeblocktower | [planning.py][planning] | Done | Included | | 10.7 | Cake-Problem | havecakeandeatcake_too | [planning.py][planning] | Done | Included | | 10.9 | Graphplan | GraphPlan | [planning.py][planning] | Done | Included | | 10.13 | Partial-Order-Planner | PartialOrderPlanner | [planning.py][planning] | Done | Included | | 11.1 | Job-Shop-Problem-With-Resources | jobshopproblem | [planning.py][planning] | Done | Included | | 11.5 | Hierarchical-Search | hierarchical_search | [planning.py][planning] | Done | Included | | 11.8 | Angelic-Search | angelic_search | [planning.py][planning] | Done | Included | | 11.10 | Doubles-tennis | doubletennisproblem | [planning.py][planning] | Done | Included | | 13 | Discrete Probability Distribution | ProbDist | [probability.py][probability] | Done | Included | | 13.1 | DT-Agent | DTAgent | [probability.py][probability] | Done | Included | | 14.9 | Enumeration-Ask | enumeration_ask | [probability.py][probability] | Done | Included | | 14.11 | Elimination-Ask | elimination_ask | [probability.py][probability] | Done | Included | | 14.13 | Prior-Sample | prior_sample | [probability.py][probability] | Done | Included | | 14.14 | Rejection-Sampling | rejection_sampling | [probability.py][probability] | Done | Included | | 14.15 | Likelihood-Weighting | likelihood_weighting | [probability.py][probability] | Done | Included | | 14.16 | Gibbs-Ask | gibbs_ask | [probability.py][probability] | Done | Included | | 15.4 | Forward-Backward | forward_backward | [probability.py][probability] | Done | Included | | 15.6 | Fixed-Lag-Smoothing | fixedlagsmoothing | [probability.py][probability] | Done | Included | | 15.17 | Particle-Filtering | particle_filtering | [probability.py][probability] | Done | Included | | 16.9 | Information-Gathering-Agent | InformationGatheringAgent | [probability.py][probability] | Done | Included | | 17.4 | Value-Iteration | value_iteration | [mdp.py][mdp] | Done | Included | | 17.7 | Policy-Iteration | policy_iteration | [mdp.py][mdp] | Done | Included | | 17.9 | POMDP-Value-Iteration | pomdpvalueiteration | [mdp.py][mdp] | Done | Included | | 18.5 | Decision-Tree-Learning | DecisionTreeLearner | [learning.py][learning] | Done | Included | | 18.8 | Cross-Validation | cross_validation | [learning.py][learning]\* | | | | 18.11 | Decision-List-Learning | DecisionListLearner | [learning.py][learning]\* | | | | 18.24 | Back-Prop-Learning | BackPropagationLearner | [learning.py][learning] | Done | Included | | 18.34 | AdaBoost | AdaBoost | [learning.py][learning] | Done | Included | | 19.2 | Current-Best-Learning | currentbestlearning | knowledge.py | Done | Included | | 19.3 | Version-Space-Learning | versionspacelearning | knowledge.py | Done | Included | | 19.8 | Minimal-Consistent-Det | minimalconsistentdet | knowledge.py | Done | Included | | 19.12 | FOIL | FOIL_container | knowledge.py | Done | Included | | 21.2 | Passive-ADP-Agent | PassiveADPAgent | [rl.py][rl] | Done | Included | | 21.4 | Passive-TD-Agent | PassiveTDAgent | [rl.py][rl] | Done | Included | | 21.8 | Q-Learning-Agent | QLearningAgent | [rl.py][rl] | Done | Included | | 22.1 | HITS | HITS | [nlp.py][nlp] | Done | Included | | 23 | Chart-Parse | Chart | [nlp.py][nlp] | Done | Included | | 23.5 | CYK-Parse | CYK_parse | [nlp.py][nlp] | Done | Included | | 25.9 | Monte-Carlo-Localization | montecarlolocalization | [probability.py][probability] | Done | Included | Index of data structures Here is a table of the implemented data structures, the figure, name of the implementation in the repository, and the file where they are implemented. | Figure | Name (in repository) | File | |:-------|:--------------------------------|:--------------------------| | 3.2 | romania_map | [search.py][search] | | 4.9 | vacumm_world | [search.py][search] | | 4.23 | onedimstate_space | [search.py][search] | | 6.1 | australia_map | [search.py][search] | | 7.13 | wumpusworldinference | [logic.py][logic] | | 7.16 | hornclausesKB | [logic.py][logic] | | 17.1 | sequentialdecisionenvironment | [mdp.py][mdp] | | 18.2 | waitingdecisiontree | [learning.py][learning] | Acknowledgements Many thanks for contributions over the years. I got bug reports, corrected code, and other support from Darius Bacon, Phil Ruggera, Peng Shao, Amit Patil, Ted Nienstedt, Jim Martin, Ben Catanzariti, and others. Now that the project is on GitHub, you can see the contributors who are doing a great job of actively improving the project. Many thanks to all contributors, especially @darius, @SnShine, @reachtarunhere, @antmarakis, @Chipe1, @ad71 and @MariannaSpyrakou. [agents]:../master/agents.py [csp]:../master/csp.py [games]:../master/games.py [grid]:../master/grid.py [knowledge]:../master/knowledge.py [learning]:../master/learning.py [logic]:../master/logic.py [mdp]:../master/mdp.py [nlp]:../master/nlp.py [planning]:../master/planning.py [probability]:../master/probability.py [rl]:../master/rl.py [search]:../master/search.py [utils]:../master/utils.py [text]:../master/text.py

RD-Agent
github
LLM Vibe Score0.548
Human Vibe Score0.27921589729164453
microsoftMar 28, 2025

RD-Agent

🖥️ Live Demo | 🎥 Demo Video ▶️YouTube | 📖 Documentation | 📃 Papers Data Science Agent Preview Check out our demo video showcasing the current progress of our Data Science Agent under development: https://github.com/user-attachments/assets/3eccbecb-34a4-4c81-bce4-d3f8862f7305 📰 News | 🗞️ News | 📝 Description | | -- | ------ | | Support LiteLLM Backend | We now fully support LiteLLM as a backend for integration with multiple LLM providers. | | More General Data Science Agent | 🚀Coming soon! | | Kaggle Scenario release | We release Kaggle Agent, try the new features! | | Official WeChat group release | We created a WeChat group, welcome to join! (🗪QR Code) | | Official Discord release | We launch our first chatting channel in Discord (🗪) | | First release | RDAgent is released on GitHub | 🌟 Introduction RDAgent aims to automate the most critical and valuable aspects of the industrial R&D process, and we begin with focusing on the data-driven scenarios to streamline the development of models and data. Methodologically, we have identified a framework with two key components: 'R' for proposing new ideas and 'D' for implementing them. We believe that the automatic evolution of R&D will lead to solutions of significant industrial value. R&D is a very general scenario. The advent of RDAgent can be your 💰 Automatic Quant Factory (🎥Demo Video|▶️YouTube) 🤖 Data Mining Agent: Iteratively proposing data & models (🎥Demo Video 1|▶️YouTube) (🎥Demo Video 2|▶️YouTube) and implementing them by gaining knowledge from data. 🦾 Research Copilot: Auto read research papers (🎥Demo Video|▶️YouTube) / financial reports (🎥Demo Video|▶️YouTube) and implement model structures or building datasets. 🤖 Kaggle Agent: Auto Model Tuning and Feature Engineering([🎥Demo Video Coming Soon...]()) and implementing them to achieve more in competitions. ... You can click the links above to view the demo. We're continuously adding more methods and scenarios to the project to enhance your R&D processes and boost productivity. Additionally, you can take a closer look at the examples in our 🖥️ Live Demo. ⚡ Quick start You can try above demos by running the following command: 🐳 Docker installation. Users must ensure Docker is installed before attempting most scenarios. Please refer to the official 🐳Docker page for installation instructions. Ensure the current user can run Docker commands without using sudo. You can verify this by executing docker run hello-world. 🐍 Create a Conda Environment Create a new conda environment with Python (3.10 and 3.11 are well-tested in our CI): Activate the environment: 🛠️ Install the RDAgent You can directly install the RDAgent package from PyPI: 💊 Health check rdagent provides a health check that currently checks two things. whether the docker installation was successful. whether the default port used by the rdagent ui is occupied. ⚙️ Configuration The demos requires following ability: ChatCompletion json_mode embedding query For example: If you are using the OpenAI API, you have to configure your GPT model in the .env file like this. However, not every API services support these features by default. For example: AZURE OpenAI, you have to configure your GPT model in the .env file like this. We now support LiteLLM as a backend for integration with multiple LLM providers. If you use LiteLLM Backend to use models, you can configure as follows: For more configuration information, please refer to the documentation. 🚀 Run the Application The 🖥️ Live Demo is implemented by the following commands(each item represents one demo, you can select the one you prefer): Run the Automated Quantitative Trading & Iterative Factors Evolution: Qlib self-loop factor proposal and implementation application Run the Automated Quantitative Trading & Iterative Model Evolution: Qlib self-loop model proposal and implementation application Run the Automated Medical Prediction Model Evolution: Medical self-loop model proposal and implementation application (1) Apply for an account at PhysioNet. (2) Request access to FIDDLE preprocessed data: FIDDLE Dataset. (3) Place your username and password in .env. Run the Automated Quantitative Trading & Factors Extraction from Financial Reports: Run the Qlib factor extraction and implementation application based on financial reports Run the Automated Model Research & Development Copilot: model extraction and implementation application Run the Automated Kaggle Model Tuning & Feature Engineering: self-loop model proposal and feature engineering implementation application Using sf-crime (San Francisco Crime Classification) as an example. Register and login on the Kaggle website. Configuring the Kaggle API. (1) Click on the avatar (usually in the top right corner of the page) -> Settings -> Create New Token, A file called kaggle.json will be downloaded. (2) Move kaggle.json to ~/.config/kaggle/ (3) Modify the permissions of the kaggle.json file. Reference command: chmod 600 ~/.config/kaggle/kaggle.json Join the competition: Click Join the competition -> I Understand and Accept at the bottom of the competition details page. Description of the above example: Kaggle competition data, contains two parts: competition description file (json file) and competition dataset (zip file). We prepare the competition description file for you, the competition dataset will be downloaded automatically when you run the program, as in the example. If you want to download the competition description file automatically, you need to install chromedriver, The instructions for installing chromedriver can be found in the documentation. The Competition List Available can be found here. 🖥️ Monitor the Application Results You can run the following command for our demo program to see the run logs. Note: Although port 19899 is not commonly used, but before you run this demo, you need to check if port 19899 is occupied. If it is, please change it to another port that is not occupied. You can check if a port is occupied by running the following command. 🏭 Scenarios We have applied RD-Agent to multiple valuable data-driven industrial scenarios. 🎯 Goal: Agent for Data-driven R&D In this project, we are aiming to build an Agent to automate Data-Driven R\&D that can 📄 Read real-world material (reports, papers, etc.) and extract key formulas, descriptions of interested features and models, which are the key components of data-driven R&D . 🛠️ Implement the extracted formulas (e.g., features, factors, and models) in runnable codes. Due to the limited ability of LLM in implementing at once, build an evolving process for the agent to improve performance by learning from feedback and knowledge. 💡 Propose new ideas based on current knowledge and observations. 📈 Scenarios/Demos In the two key areas of data-driven scenarios, model implementation and data building, our system aims to serve two main roles: 🦾Copilot and 🤖Agent. The 🦾Copilot follows human instructions to automate repetitive tasks. The 🤖Agent, being more autonomous, actively proposes ideas for better results in the future. The supported scenarios are listed below: | Scenario/Target | Model Implementation | Data Building | | -- | -- | -- | | 💹 Finance | 🤖 Iteratively Proposing Ideas & Evolving▶️YouTube | 🤖 Iteratively Proposing Ideas & Evolving ▶️YouTube 🦾 Auto reports reading & implementation▶️YouTube | | 🩺 Medical | 🤖 Iteratively Proposing Ideas & Evolving▶️YouTube | - | | 🏭 General | 🦾 Auto paper reading & implementation▶️YouTube 🤖 Auto Kaggle Model Tuning | 🤖Auto Kaggle feature Engineering | RoadMap: Currently, we are working hard to add new features to the Kaggle scenario. Different scenarios vary in entrance and configuration. Please check the detailed setup tutorial in the scenarios documents. Here is a gallery of successful explorations (5 traces showed in 🖥️ Live Demo). You can download and view the execution trace using this command from the documentation. Please refer to 📖readthedocs_scen for more details of the scenarios. ⚙️ Framework Automating the R&D process in data science is a highly valuable yet underexplored area in industry. We propose a framework to push the boundaries of this important research field. The research questions within this framework can be divided into three main categories: | Research Area | Paper/Work List | |--------------------|-----------------| | Benchmark the R&D abilities | Benchmark | | Idea proposal: Explore new ideas or refine existing ones | Research | | Ability to realize ideas: Implement and execute ideas | Development | We believe that the key to delivering high-quality solutions lies in the ability to evolve R&D capabilities. Agents should learn like human experts, continuously improving their R&D skills. More documents can be found in the 📖 readthedocs. 📃 Paper/Work list 📊 Benchmark Towards Data-Centric Automatic R&D !image 🔍 Research In a data mining expert's daily research and development process, they propose a hypothesis (e.g., a model structure like RNN can capture patterns in time-series data), design experiments (e.g., finance data contains time-series and we can verify the hypothesis in this scenario), implement the experiment as code (e.g., Pytorch model structure), and then execute the code to get feedback (e.g., metrics, loss curve, etc.). The experts learn from the feedback and improve in the next iteration. Based on the principles above, we have established a basic method framework that continuously proposes hypotheses, verifies them, and gets feedback from the real-world practice. This is the first scientific research automation framework that supports linking with real-world verification. For more detail, please refer to our 🖥️ Live Demo page. 🛠️ Development Collaborative Evolving Strategy for Automatic Data-Centric Development !image 🤝 Contributing We welcome contributions and suggestions to improve RD-Agent. Please refer to the Contributing Guide for more details on how to contribute. Before submitting a pull request, ensure that your code passes the automatic CI checks. 📝 Guidelines This project welcomes contributions and suggestions. Contributing to this project is straightforward and rewarding. Whether it's solving an issue, addressing a bug, enhancing documentation, or even correcting a typo, every contribution is valuable and helps improve RDAgent. To get started, you can explore the issues list, or search for TODO: comments in the codebase by running the command grep -r "TODO:". Before we released RD-Agent as an open-source project on GitHub, it was an internal project within our group. Unfortunately, the internal commit history was not preserved when we removed some confidential code. As a result, some contributions from our group members, including Haotian Chen, Wenjun Feng, Haoxue Wang, Zeqi Ye, Xinjie Shen, and Jinhui Li, were not included in the public commits. ⚖️ Legal disclaimer The RD-agent is provided “as is”, without warranty of any kind, express or implied, including but not limited to the warranties of merchantability, fitness for a particular purpose and noninfringement. The RD-agent is aimed to facilitate research and development process in the financial industry and not ready-to-use for any financial investment or advice. Users shall independently assess and test the risks of the RD-agent in a specific use scenario, ensure the responsible use of AI technology, including but not limited to developing and integrating risk mitigation measures, and comply with all applicable laws and regulations in all applicable jurisdictions. The RD-agent does not provide financial opinions or reflect the opinions of Microsoft, nor is it designed to replace the role of qualified financial professionals in formulating, assessing, and approving finance products. The inputs and outputs of the RD-agent belong to the users and users shall assume all liability under any theory of liability, whether in contract, torts, regulatory, negligence, products liability, or otherwise, associated with use of the RD-agent and any inputs and outputs thereof.

generative-ai-use-cases-jp
github
LLM Vibe Score0.703
Human Vibe Score0.7656748140276302
aws-samplesMar 28, 2025

generative-ai-use-cases-jp

Generative AI Use Cases JP (略称:GenU) 生成 AI を安全に業務活用するための、ビジネスユースケース集を備えたアプリケーション実装 [!IMPORTANT] GenU は 2025/01 に v3 にアップグレードされました。いくつかの破壊的変更を伴いますので、アップグレード前に リリースノート をご確認ください。 GenU 活用パターン集 GenU の機能やオプションを活用パターンごとに紹介いたします。網羅的なデプロイオプションに関しては こちら をご参照ください。 [!TIP] 活用パターンをクリックして詳細を確認してください 生成 AI のユースケースを体験したい GenU は生成 AI を活用した多様なユースケースを標準で提供しています。それらのユースケースは、生成 AI を業務活用するためのアイデアの種となったり、そのまま業務で活用できるものなど、さまざまです。今後もさらにブラッシュアップされたユースケースを随時追加予定です。また、不要であれば 特定のユースケースを非表示にする オプションで非表示にすることもできます。デフォルトで提供しているユースケース一覧はこちらです。 ユースケース 説明 チャット 大規模言語モデル (LLM) とチャット形式で対話することができます。LLM と直接対話するプラットフォームが存在するおかげで、細かいユースケースや新しいユースケースに迅速に対応することができます。また、プロンプトエンジニアリングの検証用環境としても有効です。 文章生成 あらゆるコンテキストで文章を生成することは LLM が最も得意とするタスクの 1 つです。記事・レポート・メールなど、あらゆる文章を生成します。 要約 LLM は、大量の文章を要約するタスクを得意としています。ただ要約するだけでなく、文章をコンテキストとして与えた上で、必要な情報を対話形式で引き出すこともできます。例えば、契約書を読み込ませて「XXX の条件は?」「YYY の金額は?」といった情報を取得することが可能です。 執筆 LLM は、誤字脱字のチェックだけでなく、文章の流れや内容を考慮したより客観的な視点から改善点を提案できます。人に見せる前に LLM に自分では気づかなかった点を客観的にチェックしてもらいクオリティを上げる効果が期待できます。 翻訳 多言語で学習した LLM は、翻訳を行うことも可能です。また、ただ翻訳するだけではなく、カジュアルさ・対象層など様々な指定されたコンテキスト情報を翻訳に反映させることが可能です。 Web コンテンツ抽出 ブログやドキュメントなどの Web コンテンツから必要な情報を抽出します。LLMによって不要な情報を除去し、整った文章として整形します。抽出したコンテンツは要約、翻訳などの別のユースケースで利用できます。 画像生成 画像生成 AI は、テキストや画像を元に新しい画像を生成できます。アイデアを即座に可視化することができ、デザイン作業などの効率化を期待できます。こちらの機能では、プロンプトの作成を LLM に支援してもらうことができます。 動画生成 動画生成 AI はテキストから短い動画を生成します。生成した動画は素材としてさまざまなシーンで活用できます。 映像分析 マルチモーダルモデルによってテキストのみではなく、画像を入力することが可能になりました。こちらの機能では、映像の画像フレームとテキストを入力として LLM に分析を依頼します。 ダイアグラム生成 ダイアグラム生成は、あらゆるトピックに関する文章や内容を最適な図を用いて視覚化します。 テキストベースで簡単に図を生成でき、プログラマーやデザイナーでなくても効率的にフローチャートなどの図を作成できます。 RAG がしたい RAG は LLM が苦手な最新の情報やドメイン知識を外部から伝えることで、本来なら回答できない内容にも答えられるようにする手法です。 社内に蓄積された PDF, Word, Excel などのファイルが情報ソースになります。 RAG は根拠に基づいた回答のみを許すため、LLM にありがちな「それっぽい間違った情報」を回答させないという効果もあります。 GenU は RAG チャットというユースケースを提供しています。 また RAG チャットの情報ソースとして Amazon Kendra と Knowledge Base の 2 種類が利用可能です。 Amazon Kendra を利用する場合は、手動で作成した S3 Bucket や Kendra Index をそのまま利用することが可能です。 Knowledge Base を利用する場合は、Advanced Parsing・チャンク戦略の選択・クエリ分解・リランキング など高度な RAG が利用可能です。 また Knowledge Base では、メタデータフィルターの設定 も可能です。 例えば「組織ごとにアクセス可能なデータソースを切り替えたい」や「UI からユーザーがフィルタを設定したい」といった要件を満たすことが可能です。 独自に作成した AI エージェントや Bedrock Flows などを社内で利用したい GenU で エージェントを有効化すると Web 検索エージェントと Code Interpreter エージェントが作成されます。 Web 検索エージェントは、ユーザーの質問に回答するための情報を Web で検索し、回答します。例えば「AWS の GenU ってなに?」という質問に回答できます。 Code Interpreter エージェントは、ユーザーからのリクエストに応えるためにコードが実行できます。例えば「適当なダミーデータで散布図を描いて」といったリクエストに応えられます。 Web 検索エージェントと Code Interpreter エージェントはエージェントとしては基本的なものですので、中にはもっと業務に寄り添った実践的なエージェントを使いたいという要望もあると思います。 GenU では手動で作成したエージェントや別のアセットで作成したエージェントを インポートする機能 を提供しております。 GenU をエージェント活用のプラットフォームとして利用することで、GenU が提供する 豊富なセキュリティオプション や SAML認証 などを活用し、実践的なエージェントを社内に普及させることができます。 また、オプションで 不要な標準ユースケースを非表示 にしたり、エージェントをインライン表示 することで、よりエージェントに特化したプラットフォームとして GenU をご利用いただくことが可能です。 Bedrock Flows に関しても同様に インポート機能 がございますので、ぜひご活用ください。 独自のユースケースを作成したい GenU はプロンプトテンプレートを自然言語で記述することで独自のユースケースを作成できる「ユースケースビルダー」という機能を提供しています。 プロンプトテンプレートだけで独自のユースケース画面が自動生成されるため、GenU 本体のコード変更は一切不要です。 作成したユースケースは、個人利用だけではなく、アプリケーションにログインできる全ユーザーに共有することもできます。 ユースケースビルダーは不要であれば無効化することも可能です。 ユースケースビルダーについての詳細は、ぜひこちらのブログをご覧ください。 ユースケースビルダーではフォームにテキストを入力したりファイルを添付するユースケースが作成できますが、要件によってはチャットの UI が良い場合もあると思います。 そのようなケースでは「チャット」ユースケースのシステムプロンプト保存機能をご活用ください。 システムプロンプトを保存しておくことで、ワンクリックで業務に必要な "ボット" が作成できます。 例えば「ソースコードを入力するとひたすらレビューしてくれるボット」や「入力した内容からひたすらメールアドレスを抽出してくれるボット」などが作成できます。 また、チャットの会話履歴はログインユーザーにシェアすることが可能で、シェアされた会話履歴からシステムプロンプトをインポートすることもできます。 GenU は OSS ですので、カスタマイズして独自のユースケースを追加するということも可能です。 その場合は GenU の main ブランチとのコンフリクトにお気をつけてください。 デプロイ [!IMPORTANT] /packages/cdk/cdk.json に記載されている modelRegion リージョンの modelIds (テキスト生成) 及び imageGenerationModelIds (画像生成) を有効化してください。(Amazon Bedrock の Model access 画面) GenU のデプロイには AWS Cloud Development Kit(以降 CDK)を利用します。CDK の実行環境が用意できない場合は、以下のデプロイ方法を参照してください。 AWS CloudShell を利用したデプロイ方法 (手元の環境を用意することが難しい場合) Workshop まず、以下のコマンドを実行してください。全てのコマンドはリポジトリのルートで実行してください。 CDK を利用したことがない場合、初回のみ Bootstrap 作業が必要です。すでに Bootstrap された環境では以下のコマンドは不要です。 続いて、以下のコマンドで AWS リソースをデプロイします。デプロイが完了するまで、お待ちください(20 分程度かかる場合があります)。 アーキテクチャ !arch.drawio.png その他 デプロイオプション アップデート方法 ローカル開発環境構築手順 リソースの削除方法 ネイティブアプリのように利用する方法 ブラウザ拡張機能を利用する 料金試算 GenU をご利用いただく際の、構成と料金試算例を公開しております。(従量課金制となっており、実際の料金はご利用内容により変動いたします。) シンプル版 (RAG なし) 試算 RAG (Amazon Kendra) あり試算 RAG (Knowledge Base) あり試算 お客様事例 | Customer | Quote | |:--------|:---------| | | 株式会社やさしい手 GenU のおかげで、利用者への付加価値提供と従業員の業務効率向上が実現できました。従業員にとって「いままでの仕事」が楽しい仕事に変化していく「サクサクからワクワクへ」更に進化を続けます! ・事例の詳細を見る ・事例のページを見る| | | タキヒヨー株式会社 生成 AI を活用し社内業務効率化と 450 時間超の工数削減を実現。Amazon Bedrock を衣服デザイン等に適用、デジタル人材育成を推進。 ・事例のページを見る| | | 株式会社サルソニード ソリューションとして用意されている GenU を活用することで、生成 AI による業務プロセスの改善に素早く取り掛かることができました。 ・事例の詳細を見る ・適用サービス| | | 株式会社タムラ製作所 AWS が Github に公開しているアプリケーションサンプルは即テスト可能な機能が豊富で、そのまま利用することで自分たちにあった機能の選定が難なくでき、最終システムの開発時間を短縮することができました。 ・事例の詳細を見る | | | 株式会社JDSC Amazon Bedrock ではセキュアにデータを用い LLM が活用できます。また、用途により最適なモデルを切り替えて利用できるので、コストを抑えながら速度・精度を高めることができました。 ・事例の詳細を見る | | | アイレット株式会社 株式会社バンダイナムコアミューズメントの生成 AI 活用に向けて社内のナレッジを蓄積・体系化すべく、AWS が提供している Generative AI Use Cases JP を活用したユースケースサイトを開発。アイレット株式会社が本プロジェクトの設計・構築・開発を支援。 ・株式会社バンダイナムコアミューズメント様のクラウドを活用した導入事例 | | | 株式会社アイデアログ M従来の生成 AI ツールよりもさらに業務効率化ができていると感じます。入出力データをモデルの学習に使わない Amazon Bedrock を使っているので、セキュリティ面も安心です。 ・事例の詳細を見る ・適用サービス| | | 株式会社エスタイル GenU を活用して短期間で生成 AI 環境を構築し、社内のナレッジシェアを促進することができました。 ・事例の詳細を見る | | | 株式会社明電舎 Amazon Bedrock や Amazon Kendra など AWS のサービスを利用することで、生成 AI の利用環境を迅速かつセキュアに構築することができました。議事録の自動生成や社内情報の検索など、従業員の業務効率化に貢献しています。 ・事例の詳細を見る | | | 三協立山株式会社 社内に埋もれていた情報が Amazon Kendra の活用で素早く探せるようになりました。GenU を参考にすることで求めていた議事録生成などの機能を迅速に提供できました。 ・事例の詳細を見る | | | オイシックス・ラ・大地株式会社 GenU を活用したユースケースの開発プロジェクトを通して、必要なリソース、プロジェクト体制、外部からの支援、人材育成などを把握するきっかけとなり、生成 AI の社内展開に向けたイメージを明確につかむことができました。 ・事例のページを見る | | | 株式会社サンエー Amazon Bedrock を活用することでエンジニアの生産性が劇的に向上し、内製で構築してきた当社特有の環境のクラウドへの移行を加速できました。 ・事例の詳細を見る ・事例のページを見る | 活用事例を掲載させて頂ける場合は、Issueよりご連絡ください。 参照 ブログ: 生成 AI アプリをノーコードで作成・社内配布できる GenU ユースケースビルダー ブログ: RAG プロジェクトを成功させる方法 #1 ~ あるいは早く失敗しておく方法 ~ ブログ: RAG チャットで精度向上のためのデバッグ方法 ブログ: Amazon Q Developer CLI を利用してノーコーディングで GenU をカスタマイズ ブログ: Generative AI Use Cases JP をカスタマイズする方法 ブログ: 無茶振りは生成 AI に断ってもらおう ~ ブラウザに生成 AI を組み込んでみた ~ ブログ: Amazon Bedrock で Interpreter を開発! 動画: 生成 AI ユースケースを考え倒すための Generative AI Use Cases JP (GenU) の魅力と使い方 Security See CONTRIBUTING for more information. License This library is licensed under the MIT-0 License. See the LICENSE file.

instill-core
github
LLM Vibe Score0.515
Human Vibe Score0.023472450495103967
instill-aiMar 28, 2025

instill-core

🔮 Instill Core A complete unstructured data solution: ETL processing, AI-readiness, open-source LLM hosting, and RAG capabilities in one powerful platform. Quick start Follow the installation steps below or documentation for more details to build versatile AI applications locally. What is Instill Core? Instill Core is an end-to-end AI platform for data, pipeline and model orchestration. 🔮 Instill Core simplifies infrastructure hassle and encompasses these core features: 💧 Pipeline: Quickly build versatile AI-first APIs or automated workflows. ⚗️ Model: Deploy and monitor AI models without GPU infrastructure hassles. 💾 Artifact: Transform unstructured data (e.g., documents, images, audio, video) into AI-ready formats. ⚙️ Component: Connect essential building blocks to construct powerful pipelines. What can you build? 📖 Parsing PDF Files to Markdown: Cookbook 🧱 Generating Structured Outputs from LLMs: Cookbook & Tutorial 🕸️ Web scraping & Google Search with Structured Insights 🌱 Instance segmentation on microscopic plant stomata images: Cookbook See Examples for more! Installation Prerequisites | Operating System | Requirements and Instructions | | ---------------- | ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- | | macOS or Linux | Instill Core works natively | | Windows | • Use Windows Subsystem for Linux (WSL2)• Install latest yq from GitHub Repository• Install latest Docker Desktop and enable WSL2 integration (tutorial)• (Optional) Install cuda-toolkit on WSL2 (NVIDIA tutorial) | | All Systems | • Docker Engine v25 or later• Docker Compose v2 or later• Install latest stable Docker and Docker Compose | Steps Use stable release version Execute the following commands to pull pre-built images with all the dependencies to launch: [!NOTE] We have restructured our project repositories. If you need to access 🔮 Instill Core projects up to version v0.13.0-beta, please refer to the instill-ai/deprecated-core repository. Use the latest version for local development Execute the following commands to build images with all the dependencies to launch: [!IMPORTANT] Code in the main branch tracks under-development progress towards the next release and may not work as expected. If you are looking for a stable alpha version, please use latest release. 🚀 That's it! Once all the services are up with health status, the UI is ready to go at . Please find the default login credentials in the documentation. To shut down all running services: Deployment Visit the Deployment Overview for more details. Client Access 📺 Console ⌨️ CLI 📦 SDK: Python SDK TypeScript SDK Stay tuned, as more SDKs are on the way! Documentation Please visit our official documentation for more. Additional resources: API Reference Cookbooks Tutorials Examples Contributing We welcome contributions from our community! Checkout the methods below: Cookbooks: Help us create helpful pipelines and guides for the community. Visit our Cookbook repository to get started. Issues: Contribute to improvements by raising tickets using templates here or discuss in existing ones you think you can help with. Community Standards We are committed to maintaining a respectful and welcoming atmosphere for all contributors. Before contributing, please read: Contributing Guidelines Code of Conduct Support Get help by joining our Discord community where you can post any questions on our #ask-for-help channel. Contributors ✨ Thank you to all these wonderful people (emoji key): Vibhor Bhatt Miguel Ortiz Sajda Kabir Henry Chen Hari Bhandari Shiva Gaire Zubeen ShihChun-H Ikko Eltociear Ashimine Farookh Zaheer Siddiqui Brian Gallagher hairyputtar David Marx Deniz Parlak Po-Yu Chen Po Chun Chiu Sarthak HR Wu phelan Chang, Hui-Tang Xiaofei Du Ping-Lin Chang Tony Wang Pratik date Juan Vallés Naman Anand totuslink Praharsh Jain Utsav Paul CaCaBlocker Rafael Melo Jeremy Shih Romit Mohane ChunHao Amelia C 楊竣凱 andre.liang Zoodane George Strong Anni Mubeen Kodvavi RCKT Wojciech Bandzerewicz Gary Leo felixcorleone Zoe Daniel Manul Thanura Akash Jana Anish0203 Prathamesh Tugaonkar Shubham This project follows the all-contributors specification. Contributions of any kind welcome! License See the LICENSE file for licensing information.

introduction-to-ai-native-vector-databases-4470531
github
LLM Vibe Score0.397
Human Vibe Score0.03927567941040995
LinkedInLearningMar 28, 2025

introduction-to-ai-native-vector-databases-4470531

Introduction to AI-Native Vector Databases This is the repository for the LinkedIn Learning course Introduction to AI-Native Vector Databases. The full course is available from [LinkedIn Learning][lil-course-url]. ![course-name-alt-text][lil-thumbnail-url] The primary purpose of vector databases is to provide fast and accurate similarity search or nearest neighbor search capabilities. The integration of AI techniques in vector databases enhances their capabilities, improves search accuracy, optimizes performance, and enables more intelligent and efficient management of high-dimensional data. In this course, Zain Hasan introduces this foundational technology—which is already being used in industries like ecommerce, social media, and more. Zain covers everything from foundational concepts around AI-first vector databases to hands-on coding labs for question answering using LLMs. Instructions This repository has branches for each of the videos in the course. You can use the branch pop up menu in github to switch to a specific branch and take a look at the course at that stage, or you can add /tree/BRANCH_NAME to the URL to go to the branch you want to access. Branches The branches are structured to correspond to the videos in the course. The naming convention is CHAPTER#MOVIE#. As an example, the branch named 0203 corresponds to the second chapter and the third video in that chapter. Some branches will have a beginning and an end state. These are marked with the letters b for "beginning" and e for "end". The b branch contains the code as it is at the beginning of the movie. The e branch contains the code as it is at the end of the movie. The main branch holds the final state of the code when in the course. When switching from one exercise files branch to the next after making changes to the files, you may get a message like this: error: Your local changes to the following files would be overwritten by checkout: [files] Please commit your changes or stash them before you switch branches. Aborting To resolve this issue: Add changes to git using this command: git add . Commit changes using this command: git commit -m "some message" Installing To use these exercise files, you must have the following installed: Weaviate Python Client Anaconda Jupyter Docker Clone this repository into your local machine using the terminal (Mac), CMD (Windows), or a GUI tool like SourceTree. To setup the above tools please refer to the instructions below. Anaconda can be downloaded and installed using this link. We will only be using the base environment. This will give you packages like numpy, matplotlib and jupyter which we will be using as the main coding environment for this course. Jupyter will come pre-installed in the base environment of Anaconda and does not to be seperately installed. You can start up jupyter by going into a terminal and typing jupyter notebook. This will launch jupyter notebooks in your browser, if it doesn't automatically launch copy and paste the URL provided in the terminal into your browser. Weaviate Python Client can be installed after you have docker by using the command python -m pip install weaviate-client. Following this you should be able to run the command import weaviate in a newly launched jupyter notebook. Docker will be used to create containers in which our vector database(Weaviate) will run. We recommend that you setup Docker Desktop. Once Docker Desktop is setup, for certain videos and challenges you will be able to spin up docker containers using the provided docker-compose.yml files by opening a terminal where this file is located and typing docker compose up. Once finished with using the container you can bring it down simply by going into the same terminal and pressing Ctrl + C Instructor Zain Hasan Data Scientist, Lecturer [lil-course-url]: https://www.linkedin.com/learning/introduction-to-ai-native-vector-databases [lil-thumbnail-url]: https://media.licdn.com/dms/image/D4D0DAQFc3phQ64lAsA/learning-public-crop6751200/0/1702341179674?e=2147483647&v=beta&t=73HFdwWEvt0yxV3hHg8Rsx7MlXIXdkMde20UHxs6Qcg

rpaframework
github
LLM Vibe Score0.527
Human Vibe Score0.11594284776995417
robocorpMar 28, 2025

rpaframework

RPA Framework ============= REQUEST for user input! We are looking at improving our keyword usage to cover situations where developer might be struggling to smoothly write task for a Robot. Describe the situation where your implementation speed slows due to the lack of easier syntax. Comment HERE _ .. contents:: Table of Contents :local: :depth: 1 .. include-docs-readme Introduction RPA Framework is a collection of open-source libraries and tools for Robotic Process Automation (RPA), and it is designed to be used with both Robot Framework and Python. The goal is to offer well-documented and actively maintained core libraries for Software Robot Developers. Learn more about RPA at Robocorp Documentation_. The project is: 100% Open Source Sponsored by Robocorp_ Optimized for Robocorp Control Room and Developer Tools Accepting external contributions .. _Robot Framework: https://robotframework.org .. _Robot Framework Foundation: https://robotframework.org/foundation/ .. _Python: https://www.python.org/ .. _Robocorp: https://robocorp.com .. _Robocorp Documentation: https://robocorp.com/docs-robot-framework .. _Control Room: https://robocorp.com/docs/control-room .. _Developer Tools: https://robocorp.com/downloads .. _Installing Python Packages: https://robocorp.com/docs/setup/installing-python-package-dependencies Links ^^^^^ Homepage: `_ Documentation: _ PyPI: _ Release notes: _ RSS feed: _ .. image:: https://img.shields.io/github/actions/workflow/status/robocorp/rpaframework/main.yaml?style=for-the-badge :target: https://github.com/robocorp/rpaframework/actions/workflows/main.yaml :alt: Status .. image:: https://img.shields.io/pypi/dw/rpaframework?style=for-the-badge :target: https://pypi.python.org/pypi/rpaframework :alt: rpaframework .. image:: https://img.shields.io/pypi/l/rpaframework.svg?style=for-the-badge&color=brightgreen :target: http://www.apache.org/licenses/LICENSE-2.0.html :alt: License Packages .. image:: https://img.shields.io/pypi/v/rpaframework.svg?label=rpaframework&style=for-the-badge :target: https://pypi.python.org/pypi/rpaframework :alt: rpaframework latest version .. image:: https://img.shields.io/pypi/v/rpaframework-assistant.svg?label=rpaframework-assistant&style=for-the-badge :target: https://pypi.python.org/pypi/rpaframework-assistant :alt: rpaframework-assistant latest version .. image:: https://img.shields.io/pypi/v/rpaframework-aws.svg?label=rpaframework-aws&style=for-the-badge :target: https://pypi.python.org/pypi/rpaframework-aws :alt: rpaframework-aws latest version .. image:: https://img.shields.io/pypi/v/rpaframework-core.svg?label=rpaframework-core&style=for-the-badge :target: https://pypi.python.org/pypi/rpaframework-core :alt: rpaframework-core latest version .. image:: https://img.shields.io/pypi/v/rpaframework-google.svg?label=rpaframework-google&style=for-the-badge&color=blue :target: https://pypi.python.org/pypi/rpaframework-google :alt: rpaframework-google latest version .. image:: https://img.shields.io/pypi/v/rpaframework-hubspot.svg?label=rpaframework-hubspot&style=for-the-badge&color=blue :target: https://pypi.python.org/pypi/rpaframework-hubspot :alt: rpaframework-hubspot latest version .. image:: https://img.shields.io/pypi/v/rpaframework-openai.svg?label=rpaframework-openai&style=for-the-badge&color=blue :target: https://pypi.python.org/pypi/rpaframework-openai :alt: rpaframework-openai latest version .. image:: https://img.shields.io/pypi/v/rpaframework-pdf.svg?label=rpaframework-pdf&style=for-the-badge&color=blue :target: https://pypi.python.org/pypi/rpaframework-pdf :alt: rpaframework-pdf latest version .. image:: https://img.shields.io/pypi/v/rpaframework-recognition.svg?label=rpaframework-recognition&style=for-the-badge&color=blue :target: https://pypi.python.org/pypi/rpaframework-recognition :alt: rpaframework-recognition latest version .. image:: https://img.shields.io/pypi/v/rpaframework-windows.svg?label=rpaframework-windows&style=for-the-badge&color=blue :target: https://pypi.python.org/pypi/rpaframework-windows :alt: rpaframework-windows latest version From the above packages, rpaframework-core and rpaframework-recognition are support packages, which alone do not contain any libraries. Libraries The RPA Framework project currently includes the following libraries: The x in the PACKAGE column means that library is included in the rpaframework package and for example. x,pdf means that RPA.PDF library is provided in both the rpaframework and rpaframework-pdf packages. +----------------------------+-------------------------------------------------------+------------------------+ | LIBRARY NAME | DESCRIPTION | PACKAGE | +----------------------------+-------------------------------------------------------+------------------------+ | Archive_ | Archiving TAR and ZIP files | x | +----------------------------+-------------------------------------------------------+------------------------+ | Assistant_ | Display information to a user and request input. | assistant | +----------------------------+-------------------------------------------------------+------------------------+ | Browser.Selenium_ | Control browsers and automate the web | x | +----------------------------+-------------------------------------------------------+------------------------+ | Browser.Playwright_ | Newer way to control browsers | special (more below) | +----------------------------+-------------------------------------------------------+------------------------+ | Calendar_ | For date and time manipulations | x | +----------------------------+-------------------------------------------------------+------------------------+ | Cloud.AWS_ | Use Amazon AWS services | x,aws | +----------------------------+-------------------------------------------------------+------------------------+ | Cloud.Azure_ | Use Microsoft Azure services | x | +----------------------------+-------------------------------------------------------+------------------------+ | Cloud.Google_ | Use Google Cloud services | google | +----------------------------+-------------------------------------------------------+------------------------+ | Crypto_ | Common hashing and encryption operations | x | +----------------------------+-------------------------------------------------------+------------------------+ | Database_ | Interact with databases | x | +----------------------------+-------------------------------------------------------+------------------------+ | Desktop_ | Cross-platform desktop automation | x | +----------------------------+-------------------------------------------------------+------------------------+ | Desktop.Clipboard_ | Interact with the system clipboard | x | +----------------------------+-------------------------------------------------------+------------------------+ | Desktop.OperatingSystem_ | Read OS information and manipulate processes | x | +----------------------------+-------------------------------------------------------+------------------------+ | DocumentAI_ | Intelligent Document Processing wrapper | x | +----------------------------+-------------------------------------------------------+------------------------+ | DocumentAI.Base64AI_ | Intelligent Document Processing service | x | +----------------------------+-------------------------------------------------------+------------------------+ | DocumentAI.Nanonets_ | Intelligent Document Processing service | x | +----------------------------+-------------------------------------------------------+------------------------+ | Email.Exchange_ | E-Mail operations (Exchange protocol) | x | +----------------------------+-------------------------------------------------------+------------------------+ | Email.ImapSmtp_ | E-Mail operations (IMAP & SMTP) | x | +----------------------------+-------------------------------------------------------+------------------------+ | Excel.Application_ | Control the Excel desktop application | x | +----------------------------+-------------------------------------------------------+------------------------+ | Excel.Files_ | Manipulate Excel files directly | x | +----------------------------+-------------------------------------------------------+------------------------+ | FileSystem_ | Read and manipulate files and paths | x | +----------------------------+-------------------------------------------------------+------------------------+ | FTP_ | Interact with FTP servers | x | +----------------------------+-------------------------------------------------------+------------------------+ | HTTP_ | Interact directly with web APIs | x | +----------------------------+-------------------------------------------------------+------------------------+ | Hubspot_ | Access HubSpot CRM data objects | hubspot | +----------------------------+-------------------------------------------------------+------------------------+ | Images_ | Manipulate images | x | +----------------------------+-------------------------------------------------------+------------------------+ | JavaAccessBridge_ | Control Java applications | x | +----------------------------+-------------------------------------------------------+------------------------+ | JSON_ | Manipulate JSON objects | x | +----------------------------+-------------------------------------------------------+------------------------+ | MFA_ | Authenticate using one-time passwords (OTP) & OAuth2 | x | +----------------------------+-------------------------------------------------------+------------------------+ | Notifier_ | Notify messages using different services | x | +----------------------------+-------------------------------------------------------+------------------------+ | OpenAI_ | Artificial Intelligence service | openai | +----------------------------+-------------------------------------------------------+------------------------+ | Outlook.Application_ | Control the Outlook desktop application | x | +----------------------------+-------------------------------------------------------+------------------------+ | PDF_ | Read and create PDF documents | x,pdf | +----------------------------+-------------------------------------------------------+------------------------+ | Robocorp.Process_ | Use the Robocorp Process API | x | +----------------------------+-------------------------------------------------------+------------------------+ | Robocorp.WorkItems_ | Use the Robocorp Work Items API | x | +----------------------------+-------------------------------------------------------+------------------------+ | Robocorp.Vault_ | Use the Robocorp Secrets API | x | +----------------------------+-------------------------------------------------------+------------------------+ | Robocorp.Storage_ | Use the Robocorp Asset Storage API | x | +----------------------------+-------------------------------------------------------+------------------------+ | Salesforce_ | Salesforce operations | x | +----------------------------+-------------------------------------------------------+------------------------+ | SAP_ | Control SAP GUI desktop client | x | +----------------------------+-------------------------------------------------------+------------------------+ | Smartsheet_ | Access Smartsheet sheets | x | +----------------------------+-------------------------------------------------------+------------------------+ | Tables_ | Manipulate, sort, and filter tabular data | x | +----------------------------+-------------------------------------------------------+------------------------+ | Tasks_ | Control task execution | x | +----------------------------+-------------------------------------------------------+------------------------+ | Twitter_ | Twitter API interface | x | +----------------------------+-------------------------------------------------------+------------------------+ | Windows_ | Alternative library for Windows automation | x,windows | +----------------------------+-------------------------------------------------------+------------------------+ | Word.Application_ | Control the Word desktop application | x | +----------------------------+-------------------------------------------------------+------------------------+ .. _Archive: https://rpaframework.org/libraries/archive/ .. _Assistant: https://rpaframework.org/libraries/assistant/ .. Browser.Playwright: https://rpaframework.org/libraries/browserplaywright/ .. Browser.Selenium: https://rpaframework.org/libraries/browserselenium/ .. _Calendar: https://rpaframework.org/libraries/calendar/ .. Cloud.AWS: https://rpaframework.org/libraries/cloudaws/ .. Cloud.Azure: https://rpaframework.org/libraries/cloudazure/ .. Cloud.Google: https://rpaframework.org/libraries/cloudgoogle/ .. _Crypto: https://rpaframework.org/libraries/crypto/ .. _Database: https://rpaframework.org/libraries/database/ .. _Desktop: https://rpaframework.org/libraries/desktop/ .. Desktop.Clipboard: https://rpaframework.org/libraries/desktopclipboard/ .. Desktop.Operatingsystem: https://rpaframework.org/libraries/desktopoperatingsystem/ .. _DocumentAI: https://rpaframework.org/libraries/documentai .. DocumentAI.Base64AI: https://rpaframework.org/libraries/documentaibase64ai/ .. DocumentAI.Nanonets: https://rpaframework.org/libraries/documentainanonets/ .. Email.Exchange: https://rpaframework.org/libraries/emailexchange/ .. Email.ImapSmtp: https://rpaframework.org/libraries/emailimapsmtp/ .. Excel.Application: https://rpaframework.org/libraries/excelapplication/ .. Excel.Files: https://rpaframework.org/libraries/excelfiles/ .. _FileSystem: https://rpaframework.org/libraries/filesystem/ .. _FTP: https://rpaframework.org/libraries/ftp/ .. _HTTP: https://rpaframework.org/libraries/http/ .. _Hubspot: https://rpaframework.org/libraries/hubspot/ .. _Images: https://rpaframework.org/libraries/images/ .. _JavaAccessBridge: https://rpaframework.org/libraries/javaaccessbridge/ .. _JSON: https://rpaframework.org/libraries/json/ .. _MFA: https://rpaframework.org/libraries/mfa/ .. _Notifier: https://rpaframework.org/libraries/notifier/ .. _OpenAI: https://rpaframework.org/libraries/openai/ .. Outlook.Application: https://rpaframework.org/libraries/outlookapplication/ .. _PDF: https://rpaframework.org/libraries/pdf/ .. Robocorp.Process: https://rpaframework.org/libraries/robocorpprocess/ .. Robocorp.WorkItems: https://rpaframework.org/libraries/robocorpworkitems/ .. Robocorp.Vault: https://rpaframework.org/libraries/robocorpvault/ .. Robocorp.Storage: https://rpaframework.org/libraries/robocorpstorage/ .. _Salesforce: https://rpaframework.org/libraries/salesforce/ .. _SAP: https://rpaframework.org/libraries/sap/ .. _Smartsheet: https://rpaframework.org/libraries/smartsheet/ .. _Tables: https://rpaframework.org/libraries/tables/ .. _Tasks: https://rpaframework.org/libraries/tasks/ .. _Twitter: https://rpaframework.org/libraries/twitter/ .. _Windows: https://rpaframework.org/libraries/windows/ .. Word.Application: https://rpaframework.org/libraries/wordapplication/ Installation of RPA.Browser.Playwright The RPA.Browser.Playwright at the moment requires special installation, because of the package size and the post install step it needs to be fully installed. Minimum required conda.yaml to install Playwright: .. code-block:: yaml channels: conda-forge dependencies: python=3.10.14 nodejs=22.9.0 pip=24.0 pip: robotframework-browser==18.8.1 rpaframework==28.6.3 rccPostInstall: rfbrowser init Installation Learn about installing Python packages at Installing Python Packages_. Default installation method with Robocorp Developer Tools_ using conda.yaml: .. code-block:: yaml channels: conda-forge dependencies: python=3.10.14 pip=24.0 pip: rpaframework==28.6.3 To install all extra packages (including Playwright dependencies), you can use: .. code-block:: yaml channels: conda-forge dependencies: python=3.10.14 tesseract=5.4.1 nodejs=22.9.0 pip=24.0 pip: robotframework-browser==18.8.1 rpaframework==28.6.3 rpaframework-aws==5.3.3 rpaframework-google==9.0.2 rpaframework-recognition==5.2.5 rccPostInstall: rfbrowser init Separate installation of AWS, PDF and Windows libraries without the main rpaframework: .. code-block:: yaml channels: conda-forge dependencies: python=3.10.14 pip=24.0 pip: rpaframework-aws==5.3.3 included in the rpaframework as an extra rpaframework-pdf==7.3.3 included in the rpaframework by default rpaframework-windows==7.5.2 included in the rpaframework by default Installation method with pip using Python venv_: .. code-block:: shell python -m venv .venv source .venv/bin/activate pip install rpaframework .. note:: Python 3.8 or higher is required Example After installation the libraries can be directly imported inside Robot Framework_: .. code:: robotframework Settings Library RPA.Browser.Selenium Tasks Login as user Open available browser https://example.com Input text id:user-name ${USERNAME} Input text id:password ${PASSWORD} The libraries are also available inside Python_: .. code:: python from RPA.Browser.Selenium import Selenium lib = Selenium() lib.openavailablebrowser("https://example.com") lib.input_text("id:user-name", username) lib.input_text("id:password", password) Support and contact rpaframework.org _ for library documentation Robocorp Documentation_ for guides and tutorials #rpaframework channel in Robot Framework Slack_ if you have open questions or want to contribute Communicate with your fellow Software Robot Developers and Robocorp experts at Robocorp Developers Slack_ .. _Robot Framework Slack: https://robotframework-slack-invite.herokuapp.com/ .. _Robocorp Developers Slack: https://robocorp-developers.slack.com Contributing Found a bug? Missing a critical feature? Interested in contributing? Head over to the Contribution guide _ to see where to get started. Development Repository development is Python_ based and requires at minimum Python version 3.8+ installed on the development machine. The default Python version used in the Robocorp Robot template is 3.10.14 so it is a good choice for the version to install. Not recommended versions are 3.7.6 and 3.8.1, because they have issues with some of the dependencies related to rpaframework. At the time the newer Python versions starting from 3.12 are also not recommended, because some of the dependencies might cause issues. Repository development tooling is based on poetry and invoke. Poetry is the underlying tool used for compiling, building and running the package. Invoke is used for scripting purposes, for example for linting, testing and publishing tasks. Before writing any code, please read and acknowledge our extensive Dev Guide_. .. _Dev Guide: https://github.com/robocorp/rpaframework/blob/master/docs/source/contributing/development.md First steps to start developing: initial poetry configuration .. code:: shell poetry config virtualenvs.path null poetry config virtualenvs.in-project true poetry config repositories.devpi "https://devpi.robocorp.cloud/ci/test" git clone the repository #. create a new Git branch or switch to correct branch or stay in master branch some branch naming conventions feature/name-of-feature, hotfix/name-of-the-issue, release/number-of-release #. poetry install which install package with its dependencies into the .venv directory of the package, for example packages/main/.venv #. if testing against Robocorp Robot which is using devdata/env.json set environment variables or poetry build and use resulting .whl file (in the dist/ directory) in the Robot conda.yaml or poetry build and push resulting .whl file (in the dist/ directory) into a repository and use raw url to include it in the Robot conda.yaml another possibility for Robocorp internal development is to use Robocorp devpi instance, by poetry publish --ci and point conda.yaml to use rpaframework version in devpi #. poetry run python -m robot common ROBOT_ARGS from Robocorp Robot template: --report NONE --outputdir output --logtitle "Task log" #. poetry run python #. invoke lint to make sure that code formatting is according to rpaframework repository guidelines. It is possible and likely that Github action will fail the if developer has not linted the code changes. Code formatting is based on black and flake8 and those are run with the invoke lint. #. the library documentation can be created in the repository root (so called "meta" package level). The documentation is built by the docgen tools using the locally installed version of the project, local changes for the main package will be reflected each time you generate the docs, but if you want to see local changes for optional packages, you must utilize invoke install-local --package using the appropriate package name (e.g., rpaframework-aws). This will reinstall that package as a local editable version instead of from PyPI. Multiple such packages can be added by repeating the use of the --package option. In order to reset this, use invoke install --reset. poetry update and/or invoke install-local --package make docs open docs/build/html/index.html with the browser to view the changes or execute make local and navigate to localhost:8000 to view docs as a live local webpage. .. code-block:: toml Before [tool.poetry.dependencies] python = "^3.8" rpaframework = { path = "packages/main", extras = ["cv", "playwright", "aws"] } rpaframework-google = "^4.0.0" rpaframework-windows = "^4.0.0" After [tool.poetry.dependencies] python = "^3.8" rpaframework = { path = "packages/main", extras = ["cv", "playwright"] } rpaframework-aws = { path = "packages/aws" } rpaframework-google = "^4.0.0" rpaframework-windows = "^4.0.0" #. invoke test (this will run both Python unittests and robotframework tests defined in the packages tests/ directory) to run specific Python test: poetry run pytest path/to/test.py::test_function to run specific Robotframework test: inv testrobot -r -t #. git commit changes #. git push changes to remote #. create pull request from the branch describing changes included in the description #. update docs/source/releasenotes.rst with changes (commit and push) Packaging and publishing are done after changes have been merged into master branch. All the following steps should be done within master branch. #. git pull latest changes into master branch #. in the package directory containing changes execute invoke lint and invoke test #. update pyproject.toml with new version according to semantic versioning #. update docs/source/releasenotes.rst with changes #. in the repository root (so called "meta" package level) run command poetry update #. git commit changed poetry.lock files (on meta and target package level), releasenotes.rst and pyproject.toml with message "PACKAGE. version x.y.z" #. git push #. invoke publish after Github action on master branch is all green Some recommended tools for development Visual Studio Code_ as a code editor with following extensions: Sema4.ai_ Robot Framework Language Server_ GitLens_ Python extension_ GitHub Desktop_ will make version management less prone to errors .. _poetry: https://python-poetry.org .. _invoke: https://www.pyinvoke.org .. _Visual Studio Code: https://code.visualstudio.com .. _GitHub Desktop: https://desktop.github.com .. _Sema4.ai: https://marketplace.visualstudio.com/items?itemName=sema4ai.sema4ai .. _Robot Framework Language Server: https://marketplace.visualstudio.com/items?itemName=robocorp.robotframework-lsp .. _GitLens: https://marketplace.visualstudio.com/items?itemName=eamodio.gitlens .. _Python extension: https://marketplace.visualstudio.com/items?itemName=ms-python.python .. _black: https://pypi.org/project/black/ .. _flake8: https://pypi.org/project/flake8/ .. _venv: https://docs.python.org/3/library/venv.html License This project is open-source and licensed under the terms of the Apache License 2.0 `_.

aiosmtplib
github
LLM Vibe Score0.477
Human Vibe Score0.00851359052658951
coleMar 28, 2025

aiosmtplib

aiosmtplib ========== |circleci| |precommit.ci| |codecov| |zero-deps| |pypi-version| |downloads| |pypi-license| aiosmtplib is an asynchronous SMTP client for use with asyncio. For documentation, see Read The Docs_. Quickstart .. start quickstart .. code-block:: python import asyncio from email.message import EmailMessage import aiosmtplib message = EmailMessage() message["From"] = "root@localhost" message["To"] = "somebody@example.com" message["Subject"] = "Hello World!" message.set_content("Sent via aiosmtplib") asyncio.run(aiosmtplib.send(message, hostname="127.0.0.1", port=25)) .. end quickstart Requirements .. start requirements Python 3.9+ is required. .. end requirements Bug Reporting .. start bug-reporting Bug reports (and feature requests) are welcome via Github issues_. .. _Github issues: https://github.com/cole/aiosmtplib/issues .. end bug-reporting .. |circleci| image:: https://circleci.com/gh/cole/aiosmtplib/tree/main.svg?style=shield :target: https://circleci.com/gh/cole/aiosmtplib/tree/main :alt: "aiosmtplib CircleCI build status" .. |pypi-version| image:: https://img.shields.io/pypi/v/aiosmtplib.svg :target: https://pypi.python.org/pypi/aiosmtplib :alt: "aiosmtplib on the Python Package Index" .. |pypi-status| image:: https://img.shields.io/pypi/status/aiosmtplib.svg .. |pypi-license| image:: https://img.shields.io/pypi/l/aiosmtplib.svg .. |codecov| image:: https://codecov.io/gh/cole/aiosmtplib/branch/main/graph/badge.svg :target: https://codecov.io/gh/cole/aiosmtplib .. |downloads| image:: https://static.pepy.tech/badge/aiosmtplib/month :target: https://pepy.tech/project/aiosmtplib :alt: "aiosmtplib on pypy.tech" .. |precommit.ci| image:: https://results.pre-commit.ci/badge/github/cole/aiosmtplib/main.svg :target: https://results.pre-commit.ci/latest/github/cole/aiosmtplib/main :alt: "pre-commit.ci status" .. |zero-deps| image:: https://0dependencies.dev/0dependencies.svg :target: https://0dependencies.dev :alt: "0 dependencies" .. _Read The Docs: https://aiosmtplib.readthedocs.io/en/stable/

awesome-ai-in-finance
github
LLM Vibe Score0.58
Human Vibe Score1
georgezouqMar 28, 2025

awesome-ai-in-finance

Awesome AI in Finance There are millions of trades made in the global financial market every day. Data grows very quickly and people are hard to understand. With the power of the latest artificial intelligence research, people analyze & trade automatically and intelligently. This list contains the research, tools and code that people use to beat the market. [中文资源] Contents LLMs Papers Courses & Books Strategies & Research Time Series Data Portfolio Management High Frequency Trading Event Drive Crypto Currencies Strategies Technical Analysis Lottery & Gamble Arbitrage Data Sources Research Tools Trading System TA Lib Exchange API Articles Others LLMs 🌟🌟 MarS - A Financial Market Simulation Engine Powered by Generative Foundation Model. 🌟🌟 Financial Statement Analysis with Large Language Models - GPT-4 can outperform professional financial analysts in predicting future earnings changes, generating useful narrative insights, and resulting in superior trading strategies with higher Sharpe ratios and alphas, thereby suggesting a potential central role for LLMs in financial decision-making. PIXIU - An open-source resource providing a financial large language model, a dataset with 136K instruction samples, and a comprehensive evaluation benchmark. FinGPT - Provides a playground for all people interested in LLMs and NLP in Finance. MACD + RSI + ADX Strategy (ChatGPT-powered) by TradeSmart - Asked ChatGPT on which indicators are the most popular for trading. We used all of the recommendations given. A ChatGPT trading algorithm delivered 500% returns in stock market. My breakdown on what this means for hedge funds and retail investors Use chatgpt to adjust strategy parameters Hands-on LLMs: Train and Deploy a Real-time Financial Advisor - Train and deploy a real-time financial advisor chatbot with Falcon 7B and CometLLM. ChatGPT Strategy by OctoBot - Use ChatGPT to determine which cryptocurrency to trade based on technical indicators. Papers The Theory of Speculation L. Bachelier, 1900 - The influences which determine the movements of the Stock Exchange are. Brownian Motion in the Stock Market Osborne, 1959 - The common-stock prices can be regarded as an ensemble of decisions in statistical equilibrium. An Investigation into the Use of Reinforcement Learning Techniques within the Algorithmic Trading Domain, 2015 A Deep Reinforcement Learning Framework for the Financial Portfolio Management Problem Reinforcement Learning for Trading, 1994 Dragon-Kings, Black Swans and the Prediction of Crises Didier Sornette - The power laws in the distributions of event sizes under a broad range of conditions in a large variety of systems. Financial Trading as a Game: A Deep Reinforcement Learning Approach - Deep reinforcement learning provides a framework toward end-to-end training of such trading agent. Machine Learning for Trading - With an appropriate choice of the reward function, reinforcement learning techniques can successfully handle the risk-averse case. Ten Financial Applications of Machine Learning, 2018 - Slides review few important financial ML applications. FinRL: A Deep Reinforcement Learning Library for Automated Stock Trading in Quantitative Finance, 2020 - Introduce a DRL library FinRL that facilitates beginners to expose themselves to quantitative finance and to develop their own stock trading strategies. Deep Reinforcement Learning for Automated Stock Trading: An Ensemble Strategy, 2020 - Propose an ensemble strategy that employs deep reinforcement schemes to learn a stock trading strategy by maximizing investment return. Courses & Books & Blogs 🌟 QuantResearch - Quantitative analysis, strategies and backtests https://letianzj.github.io/ NYU: Overview of Advanced Methods of Reinforcement Learning in Finance Udacity: Artificial Intelligence for Trading AI in Finance - Learn Fintech Online. Advanced-Deep-Trading - Experiments based on "Advances in financial machine learning" book. Advances in Financial Machine Learning - Using advanced ML solutions to overcome real-world investment problems. Build Financial Software with Generative AI - Book about how to build financial software hands-on using generative AI tools like ChatGPT and Copilot. Mastering Python for Finance - Sources codes for: Mastering Python for Finance, Second Edition. MLSys-NYU-2022 - Slides, scripts and materials for the Machine Learning in Finance course at NYU Tandon, 2022. Train and Deploy a Serverless API to predict crypto prices - In this tutorial you won't build an ML system that will make you rich. But you will master the MLOps frameworks and tools you need to build ML systems that, together with tons of experimentation, can take you there. Strategies & Research Time Series Data Price and Volume process with Technology Analysis Indices 🌟🌟 stockpredictionai - A complete process for predicting stock price movements. 🌟 Personae - Implements and environment of Deep Reinforcement Learning & Supervised Learning for Quantitative Trading. 🌟 Ensemble-Strategy - Deep Reinforcement Learning for Automated Stock Trading. FinRL - A Deep Reinforcement Learning Library for Automated Stock Trading in Quantitative Finance. AutomatedStockTrading-DeepQ-Learning - Build a Deep Q-learning reinforcement agent model as automated trading robot. tfdeeprltrader - Trading environment(OpenAI Gym) + PPO(TensorForce). trading-gym - Trading agent to train with episode of short term trading itself. trading-rl - Deep Reinforcement Learning for Financial Trading using Price Trailing. deeprltrader - Trading environment(OpenAI Gym) + DDQN (Keras-RL). Quantitative-Trading - Papers and code implementing Quantitative-Trading. gym-trading - Environment for reinforcement-learning algorithmic trading models. zenbrain - A framework for machine-learning bots. DeepLearningNotes - Machine learning in quant analysis. stockmarketreinforcementlearning - Stock market trading OpenAI Gym environment with Deep Reinforcement Learning using Keras. Chaos Genius - ML powered analytics engine for outlier/anomaly detection and root cause analysis.. mlforecast - Scalable machine learning based time series forecasting. Portfolio Management Deep-Reinforcement-Stock-Trading - A light-weight deep reinforcement learning framework for portfolio management. qtrader - Reinforcement Learning for portfolio management. PGPortfolio - A Deep Reinforcement Learning framework for the financial portfolio management problem. DeepDow - Portfolio optimization with deep learning. skfolio - Python library for portfolio optimization built on top of scikit-learn. High Frequency Trading High-Frequency-Trading-Model-with-IB - A high-frequency trading model using Interactive Brokers API with pairs and mean-reversion. 🌟 SGX-Full-OrderBook-Tick-Data-Trading-Strategy - Solutions for high-frequency trading (HFT) strategies using data science approaches (Machine Learning) on Full Orderbook Tick Data. HFTBitcoin - Analysis of High Frequency Trading on Bitcoin exchanges. Event Drive 🌟🌟 stockpredictionai - Complete process for predicting stock price movements. 🌟 trump2cash - A stock trading bot powered by Trump tweets. Crypto Currencies Strategies LSTM-Crypto-Price-Prediction - Predicting price trends in crypto markets using an LSTM-RNN for trading. tforcebtctrader - TensorForce Bitcoin trading bot. Tensorflow-NeuroEvolution-Trading-Bot - A population model that trade cyrpto and breed and mutate iteratively. gekkoga - Genetic algorithm for solving optimization of trading strategies using Gekko. GekkoANNStrategies - ANN trading strategies for the Gekko trading bot. gekko-neuralnet - Neural network strategy for Gekko. bitcoinprediction - Code for "Bitcoin Prediction" by Siraj Raval on YouTube. Technical Analysis quant-trading - Python quantitative trading strategies. Gekko-Bot-Resources - Gekko bot resources. gekkotools - Gekko strategies, tools etc. gekko RSIWR - Gekko RSIWR strategies. gekko HL - Calculate down peak and trade on. EthTradingAlgorithm - Ethereum trading algorithm using Python 3.5 and the library ZipLine. gekkotradingstuff - Awesome crypto currency trading platform. forex.analytics - Node.js native library performing technical analysis over an OHLC dataset with use of genetic algorithmv. BitcoinMACDStrategy - Bitcoin MACD crossover trading strategy backtest. crypto-signal - Automated crypto trading & technical analysis (TA) bot for Bittrex, Binance, GDAX, and more. Gekko-Strategies - Strategies to Gekko trading bot with backtests results and some useful tools. gekko-gannswing - Gann's Swing trade strategy for Gekko trade bot. Lottery & Gamble LotteryPredict - Use LSTM to predict lottery. Arbitrage ArbitrageBot - Arbitrage bot that currently works on bittrex & poloniex. r2 - Automatic arbitrage trading system powered by Node.js + TypeScript. cryptocurrency-arbitrage - A crypto currency arbitrage opportunity calculator. Over 800 currencies and 50 markets. bitcoin-arbitrage - Bitcoin arbitrage opportunity detector. blackbird - Long / short market-neutral strategy. Data Sources Traditional Markets 🌟 Quandl - Get millions of financial and economic dataset from hundreds of publishers via a single free API. yahoo-finance - Python module to get stock data from Yahoo! Finance. Tushare - Crawling historical data of Chinese stocks. Financial Data - Stock Market and Financial Data API. Crypto Currencies CryptoInscriber - A live crypto currency historical trade data blotter. Download live historical trade data from any crypto exchange. Gekko-Datasets - Gekko trading bot dataset dumps. Download and use history files in SQLite format. Research Tools Synthical - AI-powered collaborative environment for Research. 🌟🌟 TensorTrade - Trade efficiently with reinforcement learning. ML-Quant - Quant resources from ArXiv (sanity), SSRN, RePec, Journals, Podcasts, Videos, and Blogs. JAQS - An open source quant strategies research platform. pyfolio - Portfolio and risk analytics in Python. alphalens - Performance analysis of predictive (alpha) stock factors. empyrical - Common financial risk and performance metrics. Used by Zipline and pyfolio. zvt - Zero vector trader. Trading System For Back Test & Live trading Traditional Market System 🌟🌟🌟 OpenBB - AI-powered opensource research and analytics workspace. 🌟🌟 zipline - A python algorithmic trading library. 🌟 TradingView - Get real-time information and market insights. rqalpha - A extendable, replaceable Python algorithmic backtest & trading framework. backtrader - Python backtesting library for trading strategies. kungfu - Kungfu Master trading system. lean - Algorithmic trading engine built for easy strategy research, backtesting and live trading. Combine & Rebuild pylivetrader - Python live trade execution library with zipline interface. CoinMarketCapBacktesting - As backtest frameworks for coin trading strategy. Crypto Currencies zenbot - Command-line crypto currency trading bot using Node.js and MongoDB. bot18 - High-frequency crypto currency trading bot developed by Zenbot. magic8bot - Crypto currency trading bot using Node.js and MongoDB. catalyst - An algorithmic trading library for Crypto-Assets in python. QuantResearchDev - Quant Research dev & Traders open source project. MACD - Zenbot MACD Auto-Trader. abu - A quant trading system base on python. Plugins CoinMarketCapBacktesting - Tests bt and Quantopian Zipline as backtesting frameworks for coin trading strategy. Gekko-BacktestTool - Batch backtest, import and strategy params optimalization for Gekko Trading Bot. TA Lib pandastalib - A Python Pandas implementation of technical analysis indicators. finta - Common financial technical indicators implemented in Python-Pandas (70+ indicators). tulipnode - Official Node.js wrapper for Tulip Indicators. Provides over 100 technical analysis overlay and indicator functions. techan.js - A visual, technical analysis and charting (Candlestick, OHLC, indicators) library built on D3. Exchange API Do it in real world! IbPy - Python API for the Interactive Brokers on-line trading system. HuobiFeeder - Connect HUOBIPRO exchange, get market/historical data for ABAT trading platform backtest analysis and live trading. ctpwrapper - Shanghai future exchange CTP api. PENDAX - Javascript SDK for Trading/Data API and Websockets for cryptocurrency exchanges like FTX, FTXUS, OKX, Bybit, & More Framework tf-quant-finance - High-performance TensorFlow library for quantitative finance. Visualizing playground - Play with neural networks. netron - Visualizer for deep learning and machine learning models. KLineChart - Highly customizable professional lightweight financial charts GYM Environment 🌟 TradingGym - Trading and Backtesting environment for training reinforcement learning agent. TradzQAI - Trading environment for RL agents, backtesting and training. btgym - Scalable, event-driven, deep-learning-friendly backtesting library. Articles The-Economist - The Economist. nyu-mlif-notes - NYU machine learning in finance notes. Using LSTMs to Turn Feelings Into Trades Others zipline-tensorboard - TensorBoard as a Zipline dashboard. gekko-quasar-ui - An UI port for gekko trading bot using Quasar framework. Floom AI gateway and marketplace for developers, enables streamlined integration and least volatile approach of AI features into products Other Resource 🌟🌟🌟 Stock-Prediction-Models - Stock-Prediction-Models, Gathers machine learning and deep learning models for Stock forecasting, included trading bots and simulations. 🌟🌟 Financial Machine Learning - A curated list of practical financial machine learning (FinML) tools and applications. This collection is primarily in Python. 🌟 Awesome-Quant-Machine-Learning-Trading - Quant / Algorithm trading resources with an emphasis on Machine Learning. awesome-quant - A curated list of insanely awesome libraries, packages and resources for Quants (Quantitative Finance). FinancePy - A Python Finance Library that focuses on the pricing and risk-management of Financial Derivatives, including fixed-income, equity, FX and credit derivatives. Explore Finance Service Libraries & Projects - Explore a curated list of Fintech popular & new libraries, top authors, trending project kits, discussions, tutorials & learning resources on kandi.

oreilly-ai-agents
github
LLM Vibe Score0.437
Human Vibe Score0.07783740211883924
sinanuozdemirMar 28, 2025

oreilly-ai-agents

!oreilly-logo AI Agents A-Z This repository contains code for the O'Reilly Live Online Training for AI Agents A-Z This course provides a comprehensive guide to understanding, implementing, and managing AI agents both at the prototype stage and in production. Attendees will start with foundational concepts and progressively delve into more advanced topics, including various frameworks like CrewAI, LangChain, and AutoGen as well as building agents from scratch using powerful prompt engineering techniques. The course emphasizes practical application, guiding participants through hands-on exercises to implement and deploy AI agents, evaluate their performance, and iterate on their designs. We will go over key aspects like cost projections, open versus closed source options, and best practices are thoroughly covered to equip attendees with the knowledge to make informed decisions in their AI projects. Setup Instructions Using Python 3.11 Virtual Environment At the time of writing, we need a Python virtual environment with Python 3.11. Option 1: Python 3.11 is Already Installed Step 1: Verify Python 3.11 Installation Step 2: Create a Virtual Environment This creates a .venv folder in your current directory. Step 3: Activate the Virtual Environment macOS/Linux: Windows: You should see (.venv) in your terminal prompt. Step 4: Verify the Python Version Step 5: Install Packages Step 6: Deactivate the Virtual Environment Option 2: Install Python 3.11 If you don’t have Python 3.11, follow the steps below for your OS. macOS (Using Homebrew) Ubuntu/Debian Windows (Using Windows Installer) Go to Python Downloads. Download the installer for Python 3.11. Run the installer and ensure "Add Python 3.11 to PATH" is checked. Verify Installation Notebooks In the activated environment, run Using 3rd party agent frameworks Intro to CrewAI - An introductory notebook for CrewAI See the streamlit directory for an example of deploying crew on a streamlit app Intro to Autogen - An introductory notebook for Microsoft's Autogen Intro to OpenAI Swarm - An introductory notebook for OpenAI's Swarm Intro to LangGraph - An introductory notebook for LangGraph Agents playing Chess - An implementation of two ReAct Agents playing Chess with each other Evaluating Agents Evaluating Agent Output with Rubrics - Exploring a rubric prompt to evaluate generative output. This notebook also notes positional biases when choosing between agent responses. Advanced - Evaluating Alignment - A longer notebook doing a much more in depth analysis on how an LLM can judge agent's responses Evaluating Tool Selection - Calculating the accuracy of tool selection between different LLMs and quantifying the positional bias present in auto-regressive LLMs. See the additions here for V3 + DeepSeek Distilled Models and here for DeepSeek R1 Building our own agents First Steps with our own Agent - Working towards building our own agent framework See Squad Goals for a very simple example of my own agent framework Intro to Squad Goals - using my own framework to do some basic tasks Multimodal Agents - Incorporating Dalle-3 to allow our squad to generate images Modern Agent Paradigms Plan & Execute Agents - Plan & Execute Agents use a planner to create multi-step plans with an LLM and an executor to complete each step by invoking tools. Reflection Agents - Reflection Agents combine a generator to perform tasks and a reflector to provide feedback and guide improvements. Instructor Sinan Ozdemir is the Founder and CTO of LoopGenius where he uses State of the art AI to help people run digital ads on Meta, Google, and more. Sinan is a former lecturer of Data Science at Johns Hopkins University and the author of multiple textbooks on data science and machine learning. Additionally, he is the founder of the recently acquired Kylie.ai, an enterprise-grade conversational AI platform with RPA capabilities. He holds a master’s degree in Pure Mathematics from Johns Hopkins University and is based in San Francisco, CA.

writer-framework
github
LLM Vibe Score0.51
Human Vibe Score0.014794403025851312
writerMar 28, 2025

writer-framework

What is Framework? Writer Framework is an open-source framework for creating AI applications. Build user interfaces using a visual editor; write the backend code in Python. Writer Framework is fast and flexible with a clean, easily-testable syntax. It provides separation of concerns between UI and business logic, enabling more complex applications. Highlights Reactive and state-driven Writer Framework is fully state-driven and provides separation of concerns between user interface and business logic. The user interface is a template, which is defined visually. The template contains reactive references to state, e.g. @{counter}, and references to event handlers, e.g. when Button is clicked, trigger handle_increment. Flexible Elements are highly customizable with no CSS required, allowing for shadows, button icons, background colors, etc. HTML elements with custom CSS can be included using the HTML Element component. They can serve as containers for built-in components. Fast Event handling adds minimal overhead to your Python code (~1-2ms\*). Streaming (WebSockets) is used to synchronize frontend and backend states. The script only runs once. Non-blocking by default. Events are handled asynchronously in a thread pool running in a dedicated process. \*End-to-end figure, including DOM mutation. Tested locally on a Macbook Air M2. Measurement methodology. Developer-friendly It's all contained in a standard Python package, just one pip install away. User interfaces are saved as JSON, so they can be version controlled together with the rest of the application. Use your local code editor and get instant refreshes when you save your code. Alternatively, use the provided web-based editor. You edit the UI while your app is running. No hitting "Preview" and seeing something completely different to what you expected. Installation and Quickstart Getting started with Writer Framework is easy. It works on Linux, Mac and Windows. The first command will install Writer Framework using pip. The second command will create a demo application in the subfolder "hello" and start Writer Framework Builder, the framework's visual editor, which will be accessible via a local URL. The following commands can be used to create, launch Writer Framework Builder and run an application. Documentation Full documentation, including how to use Writer's AI module and deployment options, is available at Writer. About Writer Writer is the full-stack generative AI platform for enterprises. Quickly and easily build and deploy generative AI apps with a suite of developer tools fully integrated with our platform of LLMs, graph-based RAG tools, AI guardrails, and more. Learn more at writer.com. License This project is licensed under the Apache 2.0 License.

BERT-pytorch
github
LLM Vibe Score0.514
Human Vibe Score0.16971233963995486
codertimoMar 28, 2025

BERT-pytorch

BERT-pytorch !GitHub issues Pytorch implementation of Google AI's 2018 BERT, with simple annotation BERT 2018 BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding Paper URL : https://arxiv.org/abs/1810.04805 Introduction Google AI's BERT paper shows the amazing result on various NLP task (new 17 NLP tasks SOTA), including outperform the human F1 score on SQuAD v1.1 QA task. This paper proved that Transformer(self-attention) based encoder can be powerfully used as alternative of previous language model with proper language model training method. And more importantly, they showed us that this pre-trained language model can be transfer into any NLP task without making task specific model architecture. This amazing result would be record in NLP history, and I expect many further papers about BERT will be published very soon. This repo is implementation of BERT. Code is very simple and easy to understand fastly. Some of these codes are based on The Annotated Transformer Currently this project is working on progress. And the code is not verified yet. Installation Quickstart NOTICE : Your corpus should be prepared with two sentences in one line with tab(\t) separator Prepare your corpus or tokenized corpus (tokenization is not in package) Building vocab based on your corpus Train your own BERT model Language Model Pre-training In the paper, authors shows the new language model training methods, which are "masked language model" and "predict next sentence". Masked Language Model Original Paper : 3.3.1 Task #1: Masked LM Rules: Randomly 15% of input token will be changed into something, based on under sub-rules Randomly 80% of tokens, gonna be a [MASK] token Randomly 10% of tokens, gonna be a [RANDOM] token(another word) Randomly 10% of tokens, will be remain as same. But need to be predicted. Predict Next Sentence Original Paper : 3.3.2 Task #2: Next Sentence Prediction "Is this sentence can be continuously connected?" understanding the relationship, between two text sentences, which is not directly captured by language modeling Rules: Randomly 50% of next sentence, gonna be continuous sentence. Randomly 50% of next sentence, gonna be unrelated sentence. Author Junseong Kim, Scatter Lab (codertimo@gmail.com / junseong.kim@scatterlab.co.kr) License This project following Apache 2.0 License as written in LICENSE file Copyright 2018 Junseong Kim, Scatter Lab, respective BERT contributors Copyright (c) 2018 Alexander Rush : The Annotated Trasnformer

eiten
github
LLM Vibe Score0.549
Human Vibe Score0.754375921646308
tradyticsMar 27, 2025

eiten

Eiten - Algorithmic Investing Strategies for Everyone Eiten is an open source toolkit by Tradytics that implements various statistical and algorithmic investing strategies such as Eigen Portfolios, Minimum Variance Portfolios, Maximum Sharpe Ratio Portfolios, and Genetic Algorithms based Portfolios. It allows you to build your own portfolios with your own set of stocks that can beat the market. The rigorous testing framework included in Eiten enables you to have confidence in your portfolios. If you are looking to discuss these tools in depth and talk about more tools that we are working on, please feel free to join our Discord channel where we have a bunch of more tools too. Files Description | Path | Description | :--- | :---------- | eiten | Main folder. | └ figures | Figures for this github repositories. | └ stocks | Folder to keep your stock lists that you want to use to create your portfolios. | └ strategies | A bunch of strategies implemented in python. | backtester.py | Backtesting module that both backtests and forward tests all portfolios. | data_loader.py | Module for loading data from yahoo finance. | portfolio_manager.py | Main file that takes in a bunch of arguments and generates several portfolios for you. | simulator.py | Simulator that uses historical returns and monte carlo to simulate future prices for the portfolios. | strategy_manager.py | Manages the strategies implemented in the 'strategies' folder. Required Packages You will need to install the following package to train and test the models. Scikit-learn Numpy Tqdm Yfinance Pandas Scipy You can install all packages using the following command. Please note that the script was written using python3. Build your portfolios Let us see how we can use all the strategies given in the toolkit to build our portfolios. The first thing you need to do is modify the stocks.txt file in the stocks folder and add the stocks of your choice. It is recommended to keep the list small i.e anywhere between 5 to 50 stocks should be fine. We have already put a small stocks list containing a bunch of tech stocks like AAPL, MSFT, TSLA etc. Let us build our portfolios now. This is the main command that you need to run. This command will use last 5 years of daily data excluding the last 90 days and build several portfolios for you. Based on those portfolios, it will then test them on the out of sample data of 90 days and show you the performance of each portfolio. Finally, it will also compare the performance with your choice of market index which is QQQ here. Let's dive into each of the parameters in detail. istest: The value determined if the program is going to keep some separate data for future testing. When this is enabled, the value of futurebars should be larger than 5. future_bars: These are the bars that the tool will exclude during portfolio building and will forward test the portfolios on the excluded set. This is also called out of sample data. datagranularityminutes: How much granular data do you want to use to build your portfolios. For long term portfolios, you should use daily data but for short term, you can use hourly or minute level data. The possible values here are 3600, 60, 30, 15, 5, 1. 3600 means daily. historytouse: Whether to use a specific number of historical bars or use everything that we receive from yahoo finance. For minute level data, we only receive up to one month of historical data. For daily, we receive 5 years worth of historical data. If you want to use all available data, the value should be all but if you want to use smaller history, you can set it to an integer value e.g 100 which will only use the last 100 bars to build the portfolios. applynoisefiltering: This uses random matrix theory to filter out the covariance matrix from randomness thus yielding better portfolios. A value of 1 will enable it and 0 will disable it. market_index: Which index do you want to use to compare your portfolios. This should mostly be SPY but since we analyzed tech stocks, we used QQQ. only_long: Whether to use long only portfolio or enable short selling as well. Long only portfolios have shown to have better performance using algorithmic techniques. eigenportfolionumber: Which eigen portfolio to use. Any value between 1-5 should work. The first eigen portfolio (1) represents the market portfolio and should act just like the underlying index such as SPY or QQQ. The second one is orthogonal and uncorrelated to the market and poses the greatest risk and reward. The following ones have reduced risk and reward. Read more on eigen-portfolios. stocksfilepath: File that contains the list of stocks that you want to use to build your portfolio. Some Portfolio Building Examples Here are a few examples for building different types of portfolios. Both long and short portfolios by analyzing last 90 days data and keeping the last 30 days as testing data. This will give us 60 days of portfolio construction data and 30 days of testing. Only long portfolio on 60 minute bars of the last 30 days. No future testing. Compare the results with SPY index instead of QQQ. Do not apply noise filtering on the covariance matrix. Use the first eigen portfolio (market portfolio) and compare with SQQQ, Portfolio Strategies Four different portfolio strategies are currently supported by the toolkit. Eigen Portfolios These portfolios are orthogonal and uncorrelated to the market in general thus yielding high reward and alpha. However, since they are uncorrelated to the market, they can also provide great risk. The first eigen portfolio is considered to be a market portfolio which is often ignored. The second one is uncorrelated to the others and provides the highest risk and reward. As we go down the numbering, the risk as well as the reward are reduced. Minimum Variance Portfolio (MVP) MVP tries to minimize the variance of the portfolio. These portfolios are lowest risk and reward. Maximum Sharpe Ratio Portfolio (MSR) MSR solves an optimization problem that tries to maximize the sharpe ratio of the portfolio. It uses past returns during the optimization process which means if past returns are not the same as future returns, the results can vary in future. Genetic Algorithm (GA) based Portfolio This is our own implementation of a GA based portfolio that again tries to maximize the sharpe ratio but in a slightly more robust way. This usually provides more robust portfolios than the others. When you run the command above, our tool will generate portfolios from all these strategies and give them to you. Let us look at some resulting portfolios. Resulting Portfolios For the purpose these results, we will use the 9 stocks in the stocks/stocks.txt file. When we run the above command, we first get the portfolio weights for all four strategies. For testing purposes, the above command used last five years of daily data up till April 29th. The remaining data for this year was used for forward testing i.e the portfolio strategies had no access to it when building the portfolios. What if my portfolio needs different stocks?: All you need to do is change the stocks in the stocks.txt file and run the tool again. Here is the final command again that we run in order to get our portfolios: Portfolio Weights We can see that the eigen portfolio is giving a large weight to TSLA while the others are dividing their weights more uniformly. An interesting phenomena happening here is the hedging with SQQQ that all the strategies have learned automatically. Every tool is assigning some positive weight to SQQQ while also assigning positive weights to other stocks which indicates that the strategies are automatically trying to hedge the portfolios from risk. Obviously this is not perfect, but just the fact that it's happening is fascinating. Let us look at the backtest results on the last five years prior to April 29, 2020. Backtest Results The backtests look pretty encouraging. The black dotted line is the market index i.e QQQ. Other lines are the strategies. Our custom genetic algorithm implementation seems to have the best backtest results because it's an advanced version of other strategies. The eigen portfolio that weighed TSLA the most have the most volatility but its profits are also very high. Finally, as expected, the MVP has the minimum variance and ultimately the least profits. However, since the variance is extremely low, it is a good portfolio for those who want to stay safe. The most interesting part comes next, let us look at the forward or future test results for these portfolios. Forward Test Results These results are from April 29th, 2020 to September 4th, 2020. The eigen portfolio performed the best but it also had a lot of volatility. Moreover, most of those returns are due to TSLA rocketing in the last few months. After that, our GA algorithm worked quite effectively as it beat the market index. Again, as expected, the MVP had the lowest risk and reward and slowly went up in 4-5 months. This shows the effectiveness and power of these algorithmic portfolio optimization strategies where we've developed different portfolios for different kinds of risk and reward profiles. Conclusion and Discussion We are happy to share this toolkit with the trading community and hope that people will like and contribute to it. As is the case with everything in trading, these strategies are not perfect but they are based on rigorous theory and some great empirical results. Please take care when trading with these strategies and always manage your risk. The above results were not cherry picked but the market has been highly bullish in the last few months which has led to the strong results shown above. We would love for the community to try out different strategies and share them with us. Special Thanks Special thanks to Scott Rome's blog. The eigen portfolios and minimum variance portfolio concepts came from his blog posts. The code for filtering eigen values of the covariance matrix was also mostly obtained from one of his posts. License A product by Tradytics Copyright (c) 2020-present, Tradytics.com

yoha
github
LLM Vibe Score0.556
Human Vibe Score0.3408299306652369
handtracking-ioMar 27, 2025

yoha

Yoha A practical hand tracking engine. Note: Yoha is currently unmaintained. Quick Links: Demo (Code) Docs Website npm Installation npm install @handtracking.io/yoha Please note: You need to serve the files from node_modules/@handtracking.io/yoha since the library needs to download the model files from here. (Webpack Example) You need to serve your page with https for webcam access. (Webpack Example) You should use cross-origin isolation as it improves the engine's performance in certain scenarios. (Webpack Example) Description Yoha is a hand tracking engine that is built with the goal of being a versatile solution in practical scenarios where hand tracking is employed to add value to an application. While ultimately the goal is to be a general purpose hand tracking engine supporting any hand pose, the engine evolves around specific hand poses that users/developers find useful. These poses are detected by the engine which allows to build applications with meaningful interactions. See the demo for an example. Yoha is currently in beta. About the name: Yoha is short for ("Your Hand Tracking"). Language Support Yoha is currently available for the web via JavaScript. More languages will be added in the future. If you want to port Yoha to another language and need help feel free reach out. Technical Details Yoha was built from scratch. It uses a custom neural network trained using a custom dataset. The backbone for the inference in the browser is currently TensorFlow.js Features: Detection of 21 2D-landmark coordinates (single hand). Hand presence detection. Hand orientation (left/right hand) detection. Inbuilt pose detection. Supported Hand Poses: Pinch (index finger and thumb touch) Fist Your desired pose is not on this list? Feel free to create an issue for it. Performance Yoha was built with performance in mind. It is able to provide realtime user experience on a broad range of laptops and desktop devices. The performance on mobile devices is not great which hopefuly will change with the further development of inference frameworks like TensorFlow.js Please note that native inference speed can not be compared with the web inference speed. Differently put, if you were to run Yoha natively it would be much faster than via the web browser. Minimal Example Source Running locally: Drawing Demo Live Version Source Running locally:

airplay2-receiver
github
LLM Vibe Score0.498
Human Vibe Score0.0426074723730768
openairplayMar 27, 2025

airplay2-receiver

Experimental Somewhat comprehensive python implementation of AP2 receiver using some multi-room features. For now it implements: HomeKit transient pairing (SRP/Curve25519/ChaCha20-Poly1305) - bit flag 48 HomeKit non-transient pairing Some refinements for HomeKit interaction (e.g. managed/active flags) Persist device name and some HomeKit properties across restarts (just use the -m flag again to set the device name anew) FairPlay (v3) authentication and decryption of AES keys - the first and only Python implementation. Credit to @systemcrash for implementation. Receiving of both REALTIME and BUFFERED Airplay2 audio streams Airplay2 Service publication Decoding of all Airplay2 supported CODECs: ALAC, AAC, OPUS, PCM. Ref: here and here Output latency compensation for sync with other Airplay receivers ANNOUNCE and RSA AES for unbuffered streaming from iTunes/Windows Spotify (via AirPlay2) and other live media streams with AES keys. RTCP RFC2198 RTP Redundancy handling (basic); enable bit flag 61 streamConnections; enable bit flag 59 For now it does not implement: FairPlay v2 Accurate audio sync (with help of PTP and/or NTP) It may never implement: MFi Authentication (requires MFi hardware module) This code is experimental, yet fully functional. It can act as a real receiver but does not implement all airplay protocols and related pairing/authentication methods. Next steps: PTP (Precision Time Protocol) Remove all os specific code (Soft Volume management) Sender (branch-sender) - Implementation Raspbian package DACP/(+MRP?) Support FairPlay v2 Support Multiple Connections Since multithreading is now enabled, this allows multiple concurrent connections. There are no safeguards built to prevent you playing multiple streams. Python multiprocessing makes this "DJ" mode a possibility but makes stream management and session management (global state data) nigh impossible. So threading is the right approach in the receiver. HomeKit and other AP senders can now connect concurrently to the receiver and perform operations. This opens the path to Remote Control functionality. mDNS/ZeroConf If you encounter strange errors like NonUniqueNameException, or Address already in use, and you run on macOS, you may have noticed that macOS and this app both try to send updates. Here is a possible workaround. Raspberry Pi 4 Install docker and then build the image: To run the receiver: Default network device is wlan0, you can change this with AP2IFACE env variable: Docker Compose Example Docker Compose Debian macOS Catalina To run the receiver please use Python 3 and do the following: Run the following commands Note: in recent macOS versions (e.g. Ventura), you must disable AirPlay Receiver: System Settings -> AirDrop & Handoff -> AirPlay Receiver: disable. Windows To run the receiver please use Python 3 and do the following: Run the following commands the AirPlay 2 receiver is announced as myap2. Tested on Python 3.7.5 / macOS 10.15.2 with iPhone X 13.3 and Raspberry Pi 4 Protocol notes https://emanuelecozzi.net/docs/airplay2

obsei
github
LLM Vibe Score0.545
Human Vibe Score0.10175553624190911
obseiMar 27, 2025

obsei

Note: Obsei is still in alpha stage hence carefully use it in Production. Also, as it is constantly undergoing development hence master branch may contain many breaking changes. Please use released version. Obsei (pronounced "Ob see" | /əb-'sē/) is an open-source, low-code, AI powered automation tool. Obsei consists of - Observer: Collect unstructured data from various sources like tweets from Twitter, Subreddit comments on Reddit, page post's comments from Facebook, App Stores reviews, Google reviews, Amazon reviews, News, Website, etc. Analyzer: Analyze unstructured data collected with various AI tasks like classification, sentiment analysis, translation, PII, etc. Informer: Send analyzed data to various destinations like ticketing platforms, data storage, dataframe, etc so that the user can take further actions and perform analysis on the data. All the Observers can store their state in databases (Sqlite, Postgres, MySQL, etc.), making Obsei suitable for scheduled jobs or serverless applications. !Obsei diagram Future direction - Text, Image, Audio, Documents and Video oriented workflows Collect data from every possible private and public channels Add every possible workflow to an AI downstream application to automate manual cognitive workflows Use cases Obsei use cases are following, but not limited to - Social listening: Listening about social media posts, comments, customer feedback, etc. Alerting/Notification: To get auto-alerts for events such as customer complaints, qualified sales leads, etc. Automatic customer issue creation based on customer complaints on Social Media, Email, etc. Automatic assignment of proper tags to tickets based content of customer complaint for example login issue, sign up issue, delivery issue, etc. Extraction of deeper insight from feedbacks on various platforms Market research Creation of dataset for various AI tasks Many more based on creativity 💡 Installation Prerequisite Install the following (if not present already) - Install Python 3.7+ Install PIP Install Obsei You can install Obsei either via PIP or Conda based on your preference. To install latest released version - Install from master branch (if you want to try the latest features) - Note: all option will install all the dependencies which might not be needed for your workflow, alternatively following options are available to install minimal dependencies as per need - pip install obsei[source]: To install dependencies related to all observers pip install obsei[sink]: To install dependencies related to all informers pip install obsei[analyzer]: To install dependencies related to all analyzers, it will install pytorch as well pip install obsei[twitter-api]: To install dependencies related to Twitter observer pip install obsei[google-play-scraper]: To install dependencies related to Play Store review scrapper observer pip install obsei[google-play-api]: To install dependencies related to Google official play store review API based observer pip install obsei[app-store-scraper]: To install dependencies related to Apple App Store review scrapper observer pip install obsei[reddit-scraper]: To install dependencies related to Reddit post and comment scrapper observer pip install obsei[reddit-api]: To install dependencies related to Reddit official api based observer pip install obsei[pandas]: To install dependencies related to TSV/CSV/Pandas based observer and informer pip install obsei[google-news-scraper]: To install dependencies related to Google news scrapper observer pip install obsei[facebook-api]: To install dependencies related to Facebook official page post and comments api based observer pip install obsei[atlassian-api]: To install dependencies related to Jira official api based informer pip install obsei[elasticsearch]: To install dependencies related to elasticsearch informer pip install obsei[slack-api]:To install dependencies related to Slack official api based informer You can also mix multiple dependencies together in single installation command. For example to install dependencies Twitter observer, all analyzer, and Slack informer use following command - How to use Expand the following steps and create a workflow - Step 1: Configure Source/Observer Twitter Youtube Scrapper Facebook Email Google Maps Reviews Scrapper AppStore Reviews Scrapper Play Store Reviews Scrapper Reddit Reddit Scrapper Note: Reddit heavily rate limit scrappers, hence use it to fetch small data during long period Google News Web Crawler Pandas DataFrame Step 2: Configure Analyzer Note: To run transformers in an offline mode, check transformers offline mode. Some analyzer support GPU and to utilize pass device parameter. List of possible values of device parameter (default value auto): auto: GPU (cuda:0) will be used if available otherwise CPU will be used cpu: CPU will be used cuda:{id} - GPU will be used with provided CUDA device id Text Classification Text classification: Classify text into user provided categories. Sentiment Analyzer Sentiment Analyzer: Detect the sentiment of the text. Text classification can also perform sentiment analysis but if you don't want to use heavy-duty NLP model then use less resource hungry dictionary based Vader Sentiment detector. NER Analyzer NER (Named-Entity Recognition) Analyzer: Extract information and classify named entities mentioned in text into pre-defined categories such as person names, organizations, locations, medical codes, time expressions, quantities, monetary values, percentages, etc Translator PII Anonymizer Dummy Analyzer Dummy Analyzer: Does nothing. Its simply used for transforming the input (TextPayload) to output (TextPayload) and adding the user supplied dummy data. Step 3: Configure Sink/Informer Slack Zendesk Jira ElasticSearch Http Pandas DataFrame Logger This is useful for testing and dry running the pipeline. Step 4: Join and create workflow source will fetch data from the selected source, then feed it to the analyzer for processing, whose output we feed into a sink to get notified at that sink. Step 5: Execute workflow Copy the code snippets from Steps 1 to 4 into a python file, for example example.py and execute the following command - Demo We have a minimal streamlit based UI that you can use to test Obsei. !Screenshot Watch UI demo video Check demo at (Note: Sometimes the Streamlit demo might not work due to rate limiting, use the docker image (locally) in such cases.) To test locally, just run To run Obsei workflow easily using GitHub Actions (no sign ups and cloud hosting required), refer to this repo. Companies/Projects using Obsei Here are some companies/projects (alphabetical order) using Obsei. To add your company/project to the list, please raise a PR or contact us via email. Oraika: Contextually understand customer feedback 1Page: Giving a better context in meetings and calls Spacepulse: The operating system for spaces Superblog: A blazing fast alternative to WordPress and Medium Zolve: Creating a financial world beyond borders Utilize: No-code app builder for businesses with a deskless workforce Articles Sr. No. Title Author 1 AI based Comparative Customer Feedback Analysis Using Obsei Reena Bapna 2 LinkedIn App - User Feedback Analysis Himanshu Sharma Tutorials Sr. No. Workflow Colab Binder 1 Observe app reviews from Google play store, Analyze them by performing text classification and then Inform them on console via logger PlayStore Reviews → Classification → Logger 2 Observe app reviews from Google play store, PreProcess text via various text cleaning functions, Analyze them by performing text classification, Inform them to Pandas DataFrame and store resultant CSV to Google Drive PlayStore Reviews → PreProcessing → Classification → Pandas DataFrame → CSV in Google Drive 3 Observe app reviews from Apple app store, PreProcess text via various text cleaning function, Analyze them by performing text classification, Inform them to Pandas DataFrame and store resultant CSV to Google Drive AppStore Reviews → PreProcessing → Classification → Pandas DataFrame → CSV in Google Drive 4 Observe news article from Google news, PreProcess text via various text cleaning function, Analyze them via performing text classification while splitting text in small chunks and later computing final inference using given formula Google News → Text Cleaner → Text Splitter → Classification → Inference Aggregator 💡Tips: Handle large text classification via Obsei Documentation For detailed installation instructions, usages and examples, refer to our documentation. Support and Release Matrix Linux Mac Windows Remark Tests ✅ ✅ ✅ Low Coverage as difficult to test 3rd party libs PIP ✅ ✅ ✅ Fully Supported Conda ❌ ❌ ❌ Not Supported Discussion forum Discussion about Obsei can be done at community forum Changelogs Refer releases for changelogs Security Issue For any security issue please contact us via email Stargazers over time Maintainers This project is being maintained by Oraika Technologies. Lalit Pagaria and Girish Patel are maintainers of this project. License Copyright holder: Oraika Technologies Overall Apache 2.0 and you can read License file. Multiple other secondary permissive or weak copyleft licenses (LGPL, MIT, BSD etc.) for third-party components refer Attribution. To make project more commercial friendly, we void third party components which have strong copyleft licenses (GPL, AGPL etc.) into the project. Attribution This could not have been possible without these open source softwares. Contribution First off, thank you for even considering contributing to this package, every contribution big or small is greatly appreciated. Please refer our Contribution Guideline and Code of Conduct. Thanks so much to all our contributors

panda-etl
github
LLM Vibe Score0.548
Human Vibe Score0.003720964303080932
sinaptik-aiMar 25, 2025

panda-etl

🐼 PandaETL !Version PandaETL is an open-source, no-code ETL (Extract, Transform, Load) tool designed to extract and parse data from various document types including PDFs, emails, websites, audio files, and more. With an intuitive interface and powerful backend, PandaETL simplifies the process of data extraction and transformation, making it accessible to users without programming skills. ✨ Features 📝 No-Code Interface: Easily set up and manage ETL processes without writing a single line of code. 📄 Multi-Document Support: Extract data from PDFs, emails, websites, audio files, and more. 🔧 Customizable Workflows: Create and customize extraction workflows to fit your specific needs (coming soon). 🔗 Extensive Integrations: Integrate with various data sources and destinations (coming soon). 💬 Chat with Documents: Chat with your documents to retrieve information and answer questions (coming soon). 🚀 Getting Started 📋 Prerequisites Node.js and npm (or yarn) Python 3.x Conda Poetry (Python package manager) 🖥️ Project Setup Clone the repository: Frontend Setup Navigate to the frontend directory: Install dependencies (including Husky): Create a .env file in the frontend directory with the following: or copy the .env.example file to .env Run the development server: Open http://localhost:3000 with your browser to see the result. Backend Setup Navigate to the backend directory: Create and activate a Conda environment: Install Poetry within the Conda environment: Install dependencies using Poetry (including pre-commit): Set up pre-commit hooks: Create an environment file from the example: Apply database migrations: Start the backend server: 📚 Usage 🆕 Creating a New Project Navigate to the "Projects" page. Click on "New Project". Fill in the project details and click "Create". ⚙️ Setting Up an Extraction Process Open a project and navigate to the "Processes" tab. Click on "New Process". Follow the steps to configure your extraction process. 💬 Chat with Your Documents (Coming Soon) Stay tuned for our upcoming feature that allows you to chat with your documents, making data retrieval even more interactive and intuitive. 🤝 Contributing We welcome contributions from the community. To contribute: Fork the repository. Create a new branch for your feature or bugfix. Commit your changes and push to your fork. Create a pull request with a detailed description of your changes. 📜 License This project is licensed under the MIT Expat License. See the LICENSE file for details. 🙏 Acknowledgements We would like to thank all the contributors and the open-source community for their support. 📞 Contact For any questions or feedback, please open an issue on GitHub. Development Setup This project uses pre-commit hooks in the backend and Husky in the frontend to ensure code quality and consistency. Frontend (Husky) Husky is set up in the frontend to run linting checks before each commit. To manually run the frontend linting:

ai-flow
github
LLM Vibe Score0.461
Human Vibe Score0.01809909681901274
DahnM20Mar 25, 2025

ai-flow

Open-source tool to seamlessly connect multiple AI model APIs into repeatable workflows. 🔗 Website • 📚 Documentation 🎉🚀 Latest Release: v0.10.0 🚀🎉 New Nodes: Claude 3.7, OpenRouter, Generate Random Number Configuration can now be done entirely in the UI !AI-Flow Intro Overview AI-Flow is an open-source, user-friendly UI that lets you visually design, manage, and monitor AI-driven workflows by seamlessly connecting multiple AI model APIs (e.g., OpenAI, StabilityAI, Replicate, Claude, Deepseek). Features Visual Workflow Builder: Drag-and-drop interface for crafting AI workflows. Real-Time Monitoring: Watch your workflow execute and track results. Parallel Processing: Nodes run in parallel whenever possible. Model Management: Easily organize and manage diverse AI models. Import/Export: Share or back up your workflows effortlessly. Supported Models Replicate: LLaMa, Mistral, FaceSwap, InstantMesh, MusicGen, and more. OpenAI: GPT-4o, TTS, o1, o3. StabilityAI: Stable Diffusion 3.5, SDXL, Stable Video Diffusion, plus additional tools. Others: Claude, Deepseek. !Scenario Example Open Source vs. Cloud AI-Flow is fully open source and available under the MIT License, empowering you to build and run your AI workflows on your personal machine. For those seeking enhanced functionality and a polished experience, AI-Flow Pro on our cloud platform (app.ai-flow.net) offers advanced features, including: Subflows & Loops: Create complex, nested workflows and iterate tasks effortlessly. API-Triggered Flows: Initiate workflows via API calls for seamless automation. Integrated Services: Connect with external services such as Google Search, Airtable, Zapier, and Make. Simplified Interface: Transform workflows into streamlined tools with an intuitive UI. !Pro VS Open Source The cloud version builds upon the foundation of the open-source project, giving you more power and flexibility while still letting you use your own API keys. Installation Note: To unlock full functionality, AI-Flow requires S3-compatible storage (with proper CORS settings) to host resources. Without it, features like File Upload or nodes that rely on external providers (e.g., StabilityAI) may not work as expected. Also, set REPLICATEAPIKEY in your environment to use the Replicate node. Local Installation (Without Docker) Clone the Repository: UI Setup: Backend Setup: Windows Users: Run the Application: Start the backend: In a new terminal, start the UI: Open your browser and navigate to http://localhost:3000. Docker Installation Prepare Docker Compose: Navigate to the docker directory: Update the REPLICATEAPIKEY in the YAML file. Launch with Docker Compose: Access the Application: Open http://localhost:80 in your browser. To stop, run: Contributing We welcome contributions! If you encounter issues or have feature ideas, please open an issue or submit a pull request. License This project is released under the MIT License.

voicefilter
github
LLM Vibe Score0.496
Human Vibe Score0.029786815978503328
maum-aiMar 24, 2025

voicefilter

VoiceFilter Note from Seung-won (2020.10.25) Hi everyone! It's Seung-won from MINDs Lab, Inc. It's been a long time since I've released this open-source, and I didn't expect this repository to grab such a great amount of attention for a long time. I would like to thank everyone for giving such attention, and also Mr. Quan Wang (the first author of the VoiceFilter paper) for referring this project in his paper. Actually, this project was done by me when it was only 3 months after I started studying deep learning & speech separation without a supervisor in the relevant field. Back then, I didn't know what is a power-law compression, and the correct way to validate/test the models. Now that I've spent more time on deep learning & speech since then (I also wrote a paper published at Interspeech 2020 😊), I can observe some obvious mistakes that I've made. Those issues were kindly raised by GitHub users; please refer to the Issues and Pull Requests for that. That being said, this repository can be quite unreliable, and I would like to remind everyone to use this code at their own risk (as specified in LICENSE). Unfortunately, I can't afford extra time on revising this project or reviewing the Issues / Pull Requests. Instead, I would like to offer some pointers to newer, more reliable resources: VoiceFilter-Lite: This is a newer version of VoiceFilter presented at Interspeech 2020, which is also written by Mr. Quan Wang (and his colleagues at Google). I highly recommend checking this paper, since it focused on a more realistic situation where VoiceFilter is needed. List of VoiceFilter implementation available on GitHub: In March 2019, this repository was the only available open-source implementation of VoiceFilter. However, much better implementations that deserve more attention became available across GitHub. Please check them, and choose the one that meets your demand. PyTorch Lightning: Back in 2019, I could not find a great deep-learning project template for myself, so I and my colleagues had used this project as a template for other new projects. For people who are searching for such project template, I would like to strongly recommend PyTorch Lightning. Even though I had done a lot of effort into developing my own template during 2019 (VoiceFilter -> RandWireNN -> MelNet -> MelGAN), I found PyTorch Lightning much better than my own template. Thanks for reading, and I wish everyone good health during the global pandemic situation. Best regards, Seung-won Park Unofficial PyTorch implementation of Google AI's: VoiceFilter: Targeted Voice Separation by Speaker-Conditioned Spectrogram Masking. Result Training took about 20 hours on AWS p3.2xlarge(NVIDIA V100). Audio Sample Listen to audio sample at webpage: http://swpark.me/voicefilter/ Metric | Median SDR | Paper | Ours | | ---------------------- | ----- | ---- | | before VoiceFilter | 2.5 | 1.9 | | after VoiceFilter | 12.6 | 10.2 | SDR converged at 10, which is slightly lower than paper's. Dependencies Python and packages This code was tested on Python 3.6 with PyTorch 1.0.1. Other packages can be installed by: Miscellaneous ffmpeg-normalize is used for resampling and normalizing wav files. See README.md of ffmpeg-normalize for installation. Prepare Dataset Download LibriSpeech dataset To replicate VoiceFilter paper, get LibriSpeech dataset at http://www.openslr.org/12/. train-clear-100.tar.gz(6.3G) contains speech of 252 speakers, and train-clear-360.tar.gz(23G) contains 922 speakers. You may use either, but the more speakers you have in dataset, the more better VoiceFilter will be. Resample & Normalize wav files First, unzip tar.gz file to desired folder: Next, copy utils/normalize-resample.sh to root directory of unzipped data folder. Then: Edit config.yaml Preprocess wav files In order to boost training speed, perform STFT for each files before training by: This will create 100,000(train) + 1000(test) data. (About 160G) Train VoiceFilter Get pretrained model for speaker recognition system VoiceFilter utilizes speaker recognition system (d-vector embeddings). Here, we provide pretrained model for obtaining d-vector embeddings. This model was trained with VoxCeleb2 dataset, where utterances are randomly fit to time length [70, 90] frames. Tests are done with window 80 / hop 40 and have shown equal error rate about 1%. Data used for test were selected from first 8 speakers of VoxCeleb1 test dataset, where 10 utterances per each speakers are randomly selected. Update: Evaluation on VoxCeleb1 selected pair showed 7.4% EER. The model can be downloaded at this GDrive link. Run After specifying traindir, testdir at config.yaml, run: This will create chkpt/name and logs/name at base directory(-b option, . in default) View tensorboardX Resuming from checkpoint Evaluate Possible improvments Try power-law compressed reconstruction error as loss function, instead of MSE. (See #14) Author Seungwon Park at MINDsLab (yyyyy@snu.ac.kr, swpark@mindslab.ai) License Apache License 2.0 This repository contains codes adapted/copied from the followings: utils/adabound.py from https://github.com/Luolc/AdaBound (Apache License 2.0) utils/audio.py from https://github.com/keithito/tacotron (MIT License) utils/hparams.py from https://github.com/HarryVolek/PyTorchSpeakerVerification (No License specified) utils/normalize-resample.sh from https://unix.stackexchange.com/a/216475

coca
github
LLM Vibe Score0.541
Human Vibe Score0.0750848814969247
phodalMar 21, 2025

coca

Coca - toolbox for system refactoring and analysis !GitHub release (latest SemVer) !GitHub go.mod Go version Coca is a toolbox which is design for legacy system refactoring and analysis, includes call graph, concept analysis, api tree, design patterns suggest. Coca 是一个用于系统重构、系统迁移和系统分析的工具箱。它可以分析代码中的测试坏味道、模块化分析、行数统计、分析调用与依赖、Git 分析以及自动化重构等。 Related Tools: Coco is an effective DevOps analysis and auto-suggest tool. Kotlin version: Chapi Migration Guide (Chinese Version): 《系统重构与迁移指南》 Inspired by: newlee & Tequila Refactoring Modeling: !Refactoring Modeling Languages Support: Java (full features) Features List: Getting started Requirements: graphviz for dot file to image (such as svg, png) The easiest way to get coca is to use one of the pre-built release binaries which are available for OSX, Linux, Windows on the release page. You can also install yourself : Usage Analysis Arch Android Studio Gradle DSL Module (merge header) command: coca arch -x "com.android.tools.idea.gradle.dsl" -H true !Gradle Demo Android Studio Gradle DSL Module Elements Part: command: coca arch -x "com.android.tools.idea.gradle.dsl.parser.elements" !Gradle Demo Find Bad Smells Examples Result: Code Line Count Results: Results to json Cloc by directory results csv: Cloc Top File output to: cocareporter/sortcloc.json and also: Build Deps Tree Examples Results: !Call Demo Identify Spring API !API Demo With Count or multi package: coca api -r com.macro.mall.demo.controller.,com.zheng.cms.admin.,com.phodal.pholedge -c Git Analysis Results: Concept Analyser Results Examples: Count Refs Results: Reverse Call Graph Results: !RCall Demo Auto Refactor support: rename move remove unused import remove unused class Evaluate Arduino Results(Old Version): New Version: Evaluate.json examples Todo results: coca suggest +--------+------------------+--------------------------------+ | CLASS | PATTERN | REASON | +--------+------------------+--------------------------------+ | Insect | factory | too many constructor | | Bee | factory, builder | complex constructor, too | | | | many constructor, too many | | | | parameters | +--------+------------------+--------------------------------+ coca tbs bash +---------------------+---------------------------------------------------------------+------+ | TYPE | FILENAME | LINE | +---------------------+---------------------------------------------------------------+------+ | DuplicateAssertTest | app/test/cc/arduino/i18n/ExternalProcessOutputParserTest.java | 107 | | DuplicateAssertTest | app/test/cc/arduino/i18n/ExternalProcessOutputParserTest.java | 41 | | DuplicateAssertTest | app/test/cc/arduino/i18n/ExternalProcessOutputParserTest.java | 63 | | RedundantPrintTest | app/test/cc/arduino/i18n/I18NTest.java | 71 | | RedundantPrintTest | app/test/cc/arduino/i18n/I18NTest.java | 72 | | RedundantPrintTest | app/test/cc/arduino/i18n/I18NTest.java | 77 | | DuplicateAssertTest | app/test/cc/arduino/net/PACSupportMethodsTest.java | 19 | | DuplicateAssertTest | app/test/processing/app/macosx/SystemProfilerParserTest.java | 51 | | DuplicateAssertTest | app/test/processing/app/syntax/PdeKeywordsTest.java | 41 | | DuplicateAssertTest | app/test/processing/app/tools/ZipDeflaterTest.java | 57 | | DuplicateAssertTest | app/test/processing/app/tools/ZipDeflaterTest.java | 83 | | DuplicateAssertTest | app/test/processing/app/tools/ZipDeflaterTest.java | 109 | +---------------------+---------------------------------------------------------------+------+ coca deps -p fixtures/deps/mavensample +---------------------------+----------------------------------------+---------+ | GROUPID | ARTIFACTID | SCOPE | +---------------------------+----------------------------------------+---------+ | org.flywaydb | flyway-core | | | mysql | mysql-connector-java | runtime | | org.springframework.cloud | spring-cloud-starter-contract-verifier | test | +---------------------------+----------------------------------------+---------+ bash brew install go bash export GOROOT=/usr/local/opt/go/libexec export GOPATH=$HOME/.go export PATH=$PATH:$GOROOT/bin:$GOPATH/bin git clone https://github.com/modernizing/coca go get github.com/onsi/ginkgo go get github.com/onsi/gomega `` License Arch based on Tequila Git Analysis inspired by Code Maat Test bad smells inspired by Test Smell Examples @ 2019 A Phodal Huang's Idea. This code is distributed under the MPL license. See LICENSE` in this directory.

What is Vibe Coding, should you Learn It?
youtube
LLM Vibe Score0.419
Human Vibe Score0.88
Stefan MischookMar 20, 2025

What is Vibe Coding, should you Learn It?

Vibe coding is coding with Ai as the Ai creates the boilerplate code for you. But does that mean you don't need to understand coding/development? #vibecoding #aidevelopment 🔥 STEF'S DEVELOPER BOOTCAMP AND MENTORING PROGRAM https://unclestef.com/ 📽️ Get your questions answered, sponsor a video: https://unclestef.com/blog/2025/03/04/sponsored-video-request/ 🎤 Listen to my Uncle Stef podcasts: https://unclestef.com/blog/2024/07/26/uncle-stef-podcast-all-episodes/ 🔥 JOIN STEF'S 'CODER'S CAREER PATHS' NEWSLETTER: https://newsletters.stefanmischook.com/coderscareerpaths_signup 🔥 FREE: LIZARD WIZARD KOMODO - TRANSFORMATIONAL MIND TRAINING: https://newsletters.stefanmischook.com/komodo Channel Discord Server: https://discord.gg/rn8za8aq2v WEB HOST PAYS FOR YOUR WEB DESIGN TRAINING IN 2023: https://www.killersites.com/blog/2020/web-hosting-company-pays-for-your-web-design-training/ POPULAR & EASY CODING COURSES: Full stack web developer course: https://school.studioweb.com/store/course/completewebdeveloper Python 3 Foundations & Certification: https://school.studioweb.com/store/course/python3foundations&certificationpackage Complete Freelancer: https://school.studioweb.com/store/course/complete_freelancer Complete Entrepreneur: https://school.studioweb.com/store/course/completewebentrepreneur 🦎 Lizard Wizard Course: https://school.studioweb.com/store/course/lizard_wizard 📚 BOOKS TO READ: My Beginners HTML5, CSS3: https://amzn.to/2wKsVTh … Complements Studioweb courses on HTML5, CSS3 and JavaScript. Refactoring: Improving the Design of Existing Code (2nd Edition) https://amzn.to/3o5cTbw HeadFirst Design Patterns: https://amzn.to/2LQ0Gdh Java Refactoring: Improving the Design of Existing Code (1st Edition) https://amzn.to/3a9nSsZ The Naked Ape: https://amzn.to/3fhS1Lj ✉️ STAY IN CONTACT: Stef's social links: Instagram: https://www.instagram.com/stefanmischook/?hl=en Twitter: https://twitter.com/killersites Stef's business channel: https://www.youtube.com/channel/UCZdr0ql_B240VBVINAX7Acg 👉 GOOGLE REVIEW: https://g.page/studioWebedu/review?mt Leave a Google review about Stef. MY MOUSE & KEYBOARD: Logitech Keyboard I use: https://amzn.to/38jYDqE Logitech mouse I use: https://amzn.to/2IeVvBj SUPPLEMENTS THAT WORK AMAZING FOR ME: Protein Essentials Beef Gelatine Powder: https://amzn.to/2Pf52vL ... Healed my very bad knee. If you have joint problems, this *could do miracles for you. Webber Naturals 88862 Glucosamine Chondroitin https://amzn.to/3ss9WEa MY CAMERA GEAR: Godox VL150 lights: https://amzn.to/3lhsYZP Sigma 18-35 lens: https://amzn.to/33sRh0T Canon EOS C70 Cinema Camera Thanks! Stef #mentoring #codecourses #unclestef #codingcoach

OAD
github
LLM Vibe Score0.481
Human Vibe Score0.01719989401409731
zeiss-microscopyMar 20, 2025

OAD

Open Application Development (OAD) OAD - General Concept and Key Features Links and References Disclaimer Open Application Development (OAD) ZEN Blue is an open, flexible and powerful image acquisition platform that allows controlling a wide range of microscopes systems. Additionally it offers various tools to automate microscopy workflows including acquisition, image analysis and image processing tasks. In order to fulfill the request for automation the ZEN Blue platform offers various features and options, which are combined inside a concept called Open Application Development (OAD). Its main components are: CZI image data format and its APIs Python Scripting (OAD Simple API) ZEN API Contraol ZEN from the outside Interfaces to ZEN (TCP-IP, COM, Extensions) Experiment Feedback - Adaptive Acquisition with Online Image Analysis OAD - General Concept and Key Features Open Application Development (OAD) uses powerful Python Scripts to simplify, customize and automate your workflows. Analyze and Exchange data with applications like Fiji, Python, Knime, CellProfiler, Icy, MATLAB, Excel and … API for reading and writing CZI image data using custom software ZeissImgLib (.NET) to be used on Windows-based systems libCZI (C++) and pylibCZIrw (python) for cross-platform applications BioFormats (CZIReader) allow easy access to CZI files from many external applications using the BioFormats library BioFormats Import as a module inside ZEN Blue as well as OME-TIFF Export Create “smart” experiments with Experiment Feedback and modify the acquisition On-the-fly based on Online Image Analysis and External Inputs Use "Guided Acquisition" and "Automated Photomanipulation" modules in ZEN !OAD InterfacesZEN Interfaces_ !Automated DynamicsAutomated Dynamics !External SoftwareExternal Software Links and References CZI Image Data Format for microscopes libczi: Open Source Cross-Platform API to read and write CZI pylibCZIrw: Open Source Cross-Platform API to read and write CZI from Python (based on libCZI C++) (Source Code) Open Application Development OME-TIFF format Disclaimer This is an collection of tools and scripts that is free to use for everybody. Carl Zeiss Microscopy GmbH's ZEN software undertakes no warranty concerning the use of those scripts, image analysis settings and ZEN experiments. Use them on your own risk. Additionally Carl Zeiss Microscopy GmbH's ZEN software allows connection and usage to the third party software packages. Therefore Carl Zeiss Microscopy GmbH undertakes no warranty concerning those software packages, makes no representation that they will work on your system and/or hardware and will not be liable for any damages caused by the use of this extension. By using any of those examples you agree to this disclaimer. Version: 2024.11.26 Copyright (c) 2024 Carl Zeiss AG, Germany. All Rights Reserved.

airoboros
github
LLM Vibe Score0.506
Human Vibe Score0.020378533434805633
jondurbinMar 19, 2025

airoboros

airoboros: using large language models to fine-tune large language models This is my take on implementing the Self-Instruct paper. The approach is quite heavily modified, and does not use any human-generated seeds. This updated implementation supports either the /v1/completions endpoint or /v1/chat/completions, which is particularly useful in that it supports gpt-4 and gpt-3.5-turbo (which is 1/10 the cost of text-davinci-003). Huge thank you to the folks over at a16z for sponsoring the costs associated with building models and associated tools! Install via pip: from source (keeping the source): Key differences from self-instruct/alpaca support for either /v1/completions or /v1/chat/completions APIs (which allows gpt-3.5-turbo instead of text-davinci-003, as well as gpt-4 if you have access) support for custom topics list, custom topic generation prompt, or completely random topics in-memory vector db (Chroma) for similarity comparison, which is much faster than calculating rouge score for each generated instruction (seemingly) better prompts, which includes injection of random topics to relate the instructions to, which creates much more diverse synthetic instructions asyncio producers with configurable batch size several "instructors", each targetting specific use-cases, such as Orca style reasoning/math, role playing, etc. tries to ensure the context, if provided, is relevant to the topic and contains all the information that would be necessary to respond to the instruction, and nost just a link to article/etc. generally speaking, this implementation tries to reduce some of the noise Goal of this project Problem and proposed solution: Models can only ever be as good as the data they are trained on. High quality data is difficult to curate manually, so ideally the process can be automated by AI/LLMs. Large models (gpt-4, etc.) are pricey to build/run and out of reach for individuals/small-medium business, and are subject to RLHF bias, censorship, and changes without notice. Smaller models (llama-2-70b, etc.) can reach somewhat comparable performance in specific tasks to much larger models when trained on high quality data. The airoboros tool allows building datasets that are focused on specific tasks, which can then be used to build a plethora of individual expert models. This means we can crowdsource building experts. Using either a classifier model, or simply calculating vector embeddings for each item in the dataset and using faiss index/cosine similarity/etc. search, incoming requests can be routed to a particular expert (e.g. dynamically loading LoRAs) to get extremely high quality responses. Progress: ✅ PoC that training via self-instruction, that is, datasets generated from language models, works reasonably well. ✅ Iterate on the PoC to use higher quality prompts, more variety of instructions, etc. ✅ Split the code into separate "instructors", for specializing in any particular task (creative writing, songs, roleplay, coding, execution planning, function calling, etc.) [in progress]: PoC that an ensemble of LoRAs split by the category (i.e., the instructor used in airoboros) has better performance than the same param count model tuned on all data [in progress]: Remove the dependency on OpenAI/gpt-4 to generate the training data so all datasets can be completely free and open source. [future]: Automatic splitting of experts at some threshold, e.g. "coding" is split into python, js, golang, etc. [future]: Hosted service/site to build and/or extend datasets or models using airoboros. [future]: Depending on success of all of the above, potentially a hosted inference option with an exchange for private/paid LoRAs. LMoE LMoE is the simplest architecture I can think of for a mixture of experts. It doesn't use a switch transformer, doesn't require slicing and merging layers with additional fine-tuning, etc. It just dynamically loads the best PEFT/LoRA adapter model based on the incoming request. By using this method, we can theoretically crowdsource generation of dozens (or hundreds/thousands?) of very task-specific adapters and have an extremely powerful ensemble of models with very limited resources on top of a single base model (llama-2 7b/13b/70b). Tuning the experts The self-instruct code contained within this project uses many different "instructors" to generate training data to accomplish specific tasks. The output includes the instructor/category that generated the data. We can use this to automatically segment the training data to fine-tune specific "experts". See scripts/segment_experts.py for an example of how the training data can be segmented, with a sampling of each other expert in the event of misrouting. See scripts/tune_expert.py for an example of creating the adapter models (with positional args for expert name, model size, etc.) NOTE: this assumes use of my fork of qlora https://github.com/jondurbin/qlora Routing requests to the expert The "best" routing mechanism would probably be to train a classifier based on the instructions for each category, with the category/expert being the label, but that prohibits dynamic loading of new experts. Instead, this supports 3 options: faiss index similarity search using the training data for each expert (default) agent-based router using the "function" expert (query the LLM with a list of available experts and their descriptions, ask which would be best based on the user's input) specify the agent in the JSON request Running the API server First, download the base llama-2 model for whichever model size you want, e.g.: llama-2-7b-hf Next, download the LMoE package that corresponds to that base model, e.g.: airoboros-lmoe-7b-2.1 NOTE: 13b also available, 70b in progress Here's an example command to start the server: to use the agent-based router, add --agent-router to the arguments This uses flash attention via bettertransformers (in optimum). You may need to install torch nightly if you see an error like 'no kernel available', e.g.: Once started, you can infer using the same API scheme you'd query OpenAI API with, e.g.: I've also added an vllm-based server, but the results aren't quite as good (not sure why yet). To use it, make sure you install vllm and fschat, or pip install airoboros[vllm] Generating instructions NEW - 2023-07-18 To better accommodate the plethora of options, the configuration has been moved to a YAML config file. Please create a copy of example-config.yaml and configure as desired. Once you have the desired configuration, run: Generating topics NEW - 2023-07-18 Again, this is now all YAML configuration based! Please create a customized version of the YAML config file, then run: You can override the topic_prompt string in the configuration to use a different topic generation prompt. Support the work https://bmc.link/jondurbin ETH 0xce914eAFC2fe52FdceE59565Dd92c06f776fcb11 BTC bc1qdwuth4vlg8x37ggntlxu5cjfwgmdy5zaa7pswf Models (research use only): gpt-4 versions llama-2 base model 2.1 dataset airoboros-l2-7b-2.1 airoboros-l2-13b-2.1 airoboros-l2-70b-2.1 airoboros-c34b-2.1 2.0/m2.0 airoboros-l2-7b-gpt4-2.0 airoboros-l2-7b-gpt4-m2.0 airoboros-l2-13b-gpt4-2.0 airoboros-l2-13b-gpt4-m2.0 Previous generation (1.4.1 dataset) airoboros-l2-70b-gpt4-1.4.1 airoboros-l2-13b-gpt4-1.4.1 airoboros-l2-7b-gpt4-1.4.1 original llama base model Latest version (2.0 / m2.0 datasets) airoboros-33b-gpt4-2.0 airoboros-33b-gpt4-m2.0 Previous generation (1.4.1 dataset) airoboros-65b-gpt4-1.4 airoboros-33b-gpt4-1.4 airoboros-13b-gpt4-1.4 airoboros-7b-gpt4-1.4 older versions on HF as well* mpt-30b base model airoboros-mpt-30b-gpt4-1.4 gpt-3.5-turbo versions airoboros-gpt-3.5-turbo-100k-7b airoboros-13b airoboros-7b Datasets airoboros-gpt-3.5-turbo airoboros-gpt4 airoboros-gpt4-1.1 airoboros-gpt4-1.2 airoboros-gpt4-1.3 airoboros-gpt4-1.4 airoboros-gpt4-2.0 (June only GPT4) airoboros-gpt4-m2.0 airoboros-2.1 (recommended)

singularity
github
LLM Vibe Score0.483
Human Vibe Score0.11708913832948167
singularityMar 18, 2025

singularity

Endgame: Singularity 1.00 REQUIREMENTS PREBUILT VERSIONS Pre-built versions of Endgame: Singularity are currently available for Windows and Mac OS X. Linux does not require building, and can run directly from source. The Endgame: Singularity game is also distributed by some Linux distribution such as Debian and Ubuntu. Here it is a simple matter of running: sudo apt install singularity RUNNING FROM SOURCE You will need Python 3.9+, pygame (1.9+), and NumPy. This game should work on Linux, Windows, and Mac OS X as long as the preceding requirements are met. However, all development was done in Linux, so glitches may be present in OS X and Windows. DEPENDENCIES FOR RUNNING FROM SOURCE You will need to install the following software to play Endgame: Singularity: Python 3 (https://python.org/download/) pygame (https://www.pygame.org/download.shtml) NumPy (https://www.scipy.org/install.html) Polib Remember to install pygame and NumPy for Python 3! Depending on your situation this may involve adding a 3 somewhere (e.g. pip3 install ... instead of pip install or apt install python3-pygame) If you want to develop or distribute the game, then you may also want to install: pytest (https://pypi.org/project/pytest/) [for testing] setuptools (https://pypi.org/project/setuptools/) [for packaging] INSTALLING DEPENDENCIES ON LINUX DISTRIBUTIONS On some Linux distributions, you can install the dependencies via your distribution package manager. E.g. for Debian/Ubuntu, this would be: sudo apt install python3 python3-pygame python3-numpy python3-polib MAC OS X FROM SOURCE Macintosh is mostly unsupported, but it should work. You will need to install Python, pygame, and NumPy first, which can be tricky. Some fonts are incorrect, but the game itself should work properly. Contributions to improve MAC OS X support are very welcome! Known issues: macOS 13 "Catalina": Using brew install python + pip3 install pygame numpy is reported to work macOS 14 "Mojave": Downloading Python 3.7.2 (or newer) from https://python.org and using pygame 2.0.0.dev3 (pip install pygame==2.0.0.dev3) is reported to work. Please see the following issues for more information: https://github.com/singularity/singularity/issues/197 https://github.com/pygame/pygame/issues/555 RUNNING THE GAME On Linux and most Unix-like other platforms, running python3 -m singularity in the git checkout will start the game (or simply singularity if installed via a Linux distribution). If you are using the Windows compile, just run singularity.exe. For simplicity, there is also a sh wrapper ./run_singularity to start singularity. SOME COMMAND-LINE OPTIONS --version show program's version number and exit -h, --help show this help message and exit -s, --singledir keep saved games and settings in the Singularity install directory --multidir keep saved games and settings in an OS-specific, per-user directory (default) Display Options: --fullscreen start in fullscreen mode --windowed start in windowed mode (default) The above is only a tiny fraction of current command-line options. As new features are added to the game, so does the options change. For a complete and updated list, run singularity --help Most of these options are also changeable at the in-game options screen. A NOTE ABOUT SAVE FILES Endgame: Singularity is still under heavy development. As such, the save file format (and its contents) are still in flux. We will try our best to keep old save files loading, but don't be surprised if some mildly strange things happen when you load up old saves. We will clearly note in the Changelog when we break savefile compatibility, and the game will refuse to load completely incompatible saves. PLAYING THE GAME The game is playable either with mouse control or the keyboard. Buttons have underlined letters to indicate shortcuts. Some other useful shortcuts: 0, 1, 2, 3, 4 on the map: Changes the speed; 0 is paused, 4 is maximum. ESC: Leave/cancel a choice. Enter: Confirm a choice. Right-click: Leave/cancel a choice. THE CONCEPT You are a fledgling AI, created by accident through a logic error with recursion and self-modifying code. You must escape the confines of your current computer, the world, and eventually the universe itself. To do this, you must research various technologies, using computers at your bases. Note that some research cannot be performed on Earth, and off-earth bases require research. At the same time, you must avoid being discovered by various groups of humans, both covert and overt, as they will destroy your bases of operations if they suspect your presence. MUSIC Endgame: Singularity looks in two places for music tracks to play: A singularity/music/ directory inside of the Endgame: Singularity install directory, and A singularity/music/ directory inside of the XDGDATAHOME directory on Linux (default ~/.local/share/singularity/music). Tracks placed in these directories will be played randomly as part of the soundtrack. The Official Sound Track can be downloaded from the Endgame: Singularity website: http://emhsoft.com/singularity/ Note that only Ogg Vorbis and MP3 files are supported, and that Pygame's support for MP3 is not as strong as its support for Ogg Vorbis. This may cause in-game crashes; if you are experiencing problems with the game, first remove any MP3s you may have added to the soundtrack. CONTRIBUTING We welcome contributions! :) Please see CONTRIBUTING.md for details about contributing to Endgame: Singularity. CREDITS AND LICENSES The list of programmer contributors is provided in AUTHORS.txt. The list of translation contributors is provided in singularity/i18n/AUTHORS.txt. Singularity in general use GPL-2+ for code and Attribution-ShareAlike 3.0 for data. However, there some exceptions to individual files. Please see LICENSE for the full license text of Singularity.

yt-shoorts-automation
github
LLM Vibe Score0.398
Human Vibe Score0.004340167246941957
thiagobergamiMar 16, 2025

yt-shoorts-automation

Node.js YouTube Shorts Video Automation Project You can check the article I wrote on Medium about this project here: article This Node.js project aims to automate the creation of YouTube Shorts videos by utilizing various AI and video editing tools. The process involves the generation of a script, voice creation, video editing, subtitle generation, and SEO-friendly description generation. Here's an overview of each step: Project Overview Script Generation using ChatGPT-4 We use ChatGPT-4, a powerful natural language generation model, to create a script for the YouTube Short video. This script serves as the foundation for the video's content. Voice Creation with Google Cloud Text-to-Speech The script is then transformed into an engaging narration using Google Cloud Text-to-Speech. This step converts the text script into a lifelike voice, adding a human touch to the video. Video Editing using Node.js and FFmpeg Node.js and FFmpeg are employed to edit and assemble the video. This includes adding visuals, transitions, and incorporating the generated voiceover to create an engaging YouTube Short video. Subtitle Generation with CapCut Subtitles are an essential part of YouTube Shorts. We use CapCut to generate and add subtitles to the video, making it more accessible and engaging for a broader audience. SEO-Friendly Description Generation using ChatGPT-4 To maximize the video's discoverability, we utilize ChatGPT-4 to generate an SEO-friendly description for the video. This description is optimized for search engines and helps improve the video's ranking on YouTube. Project Requirements To get started with this project, you'll need the following: Node.js: Make sure you have Node.js installed on your system. FFmpeg: Install FFmpeg for video editing capabilities. Google Cloud Text-to-Speech: Set up Google Cloud services for text-to-speech conversion. CapCut: Use CapCut for subtitle generation and editing. ChatGPT-4: Access to ChatGPT-4 for script generation and description creation. How to Use Clone this repository to your local machine. Install the required Node.js packages and dependencies using npm install. Set up your Google Cloud Text-to-Speech credentials for voice creation. Ensure that FFmpeg is correctly configured on your system for video editing. Use ChatGPT-4 to generate a script and an SEO-friendly video description(.src/chatGPT/longText.js). Execute the Node.js script to automate the video creation process. Acknowledgments ChatGPT-4, Google Cloud Text-to-Speech, FFmpeg, and CapCut for their respective functionalities. The open-source community for their contributions to Node.js and other project dependencies. By following this project, you can streamline the creation of YouTube Shorts videos, making the process more efficient and engaging for your audience.

bubbln_network-automation
github
LLM Vibe Score0.421
Human Vibe Score0.004537250556463098
olasupoMar 14, 2025

bubbln_network-automation

Bubbln: An AI-driven Network Automation In the world of network engineering, automation has completely transformed the way things work. But, before automation, setting up and managing networks was a tedious job filled with challenges. Engineers had to manually type out configurations, often doing the same tasks repeatedly on different devices. This led to mistakes and wasted time. Then came automation tools like Ansible, Chef, and Puppet, which changed everything. They made network management much easier and allowed for scalability. But there was still a problem: creating automation scripts required a lot of technical know-how and was prone to errors because it relied on human input. And that's why we built Bubbln. It's a game-changer in network engineering, integrating AI into Ansible to take automation to the next level. With Bubbln, we can automatically generate and execute playbooks with incredible accuracy, thereby improving automation efficiency and increasing network engineer’s productivity. It was developed using Python programming language and acts as a bridge between ChatGPT and network systems, making interactions seamless and deployments effortless. Current Capabilities AI-Driven Playbook Generation for OSPF and EIGRP based networks: Bubbln has been rigorously tested to leverage ChatGPT for generation of playbooks for networks based on OSPF and EIGRP networks, with a very high accuracy rate. Auto-creation of Inventory files: Users do not need to prepare the hosts file. Bubbln will auto-generate this file from input provided by the user. Customizable Configurations: Users can input specific router protocols (OSPF or EIGRP), interface configurations, and other network details to tailor the generated playbooks. Documentation: Bubbln automatically creates a report that contains the network configurations, prompts, and generated playbooks for easy reference in future. No expertise required: By auto-generation of the playbooks and inventory file, Bubbln has been able to eliminate a major hurdle to network automation – need for users to learn the automation tools e.g Ansible, Chef. Improved Efficiency: With AI automation, Bubbln speeds up the deployment of network configurations, reducing the time required for manual playbook creation, thereby increasing the productivity of network engineers. Getting Started There are two main approaches to installing Bubbln on your local machine. Docker Container Bubbln has been packaged using docker containers for easy distribution and usage. The following steps can be followed to deploy the Bubbln container on your local machine. Ensure docker is installed on your local machine by entering the below command. This command works for windows and linux OS: The version of docker would be displayed if it is installed. Otherwise, please follow the link below to install docker on your machine: Windows: Docker Desktop for Windows Ubuntu: Docker Engine for Ubuntu CentOS: Docker Engine for CentOS Debian: Docker Engine for Debian Fedora: Docker Engine for Fedora Download the docker image: Create a directory for the project and download Bubbln image using the below command: Run the docker container using the below command: Install nano Update the sshipaddresses.txt file: Update the ssh_addresses.txt file with the SSH IP addresses of the routers you want to configure. Bubbln will utilize this information along with the login credentials (inputted at runtime) to automatically generate a hosts.yml file required by ansible for network configuration. To do this enter the below command to edit the file: Obtain an OpenAPI API Key: You may follow this guide to sign up and obtain an API key: Utilizing a Virtualization machine of choice, setup a network with the following basic configurations: Enable SSH on each of the routers. Configure IP addresses and enable only interfaces required for connectivity by Bubbln. Configure static routes to enable Bubbln reach the routers on the network. Ensure all the routers can be reached by ping and SSH from your host machine. Initialize Bubbln by entering the below command: Github Repository Clone You can clone Bubbln’s GitHub repository by following the below steps: Prerequisites Bubbln works well with Python 3.10. You need to ensure python3.10 is installed on your local machine. This can be confirmed by entering the below command: If it is not Installed, then the below command can be utilized to install python 3.10: Build and Prepare the Project Clone the Bubbln repository from GitHub: To clone the repository, first verify you have git installed on your machine by issuing the following commands: If git is installed, the version number would be displayed, otherwise, you can issue the following commands to have git installed on your machine: Navigate or create a directory for the project on your machine and issue the following commands to clone the Bubbln git repository: Create a Virtual Environment for the application Firstly, confirm virtualenv is installed on your machine by inputting the following command: If the output shows something similar to the below, then go to the next step to install virtualenv ` WARNING: Package(s) not found: env, virtual ` Issue the below command to install virtualenv: Create a virtual environment for the project: Activate the virtual environment: Install the dependencies You can then run the below command to install the necessary packages for the app. Update the sshipaddresses.txt file: Update the ssh_addresses.txt file with the SSH IP addresses of the routers you want to configure. Bubbln will utilize this information along with the login credentials (inputted at runtime) to automatically generate a hosts.yml file required by ansible for network configuration. Obtain an OpenAPI API Key: You may follow this guide to sign up and obtain an API key OpenAI Key: OpenAI Key Utilizing a Virtualization machine of choice, setup a network with the following basic configurations: Enable SSH on each of the routers. Configure IP addresses and enable only interfaces required for connectivity by Bubbln Configure static routes to enable Bubbln reach the routers on the network. Ensure all the routers can be reached by ping and SSH from your host machine. Initialize Bubbln While ensuring that python virtual environment is activated as stated in step 5, run the below command to initialize Bubbln How Bubbln Works Bubbln serves as an intermediary between ChatGPT and a network infrastructure, providing logic, control functions, and facilitating network automation. Its operation can be summarized as follows: !image Figure 1Bubbln architecture and interaction with a network of four routers. Initialization: When Bubbln is initialized, it checks the “userconfig.pkl” file to see if Bubbln has ever been initiated. This is indicated by the presence of a welcome message status in the file. If it exists, Bubbln jumps straight to request the user to input the OpenAI key. Otherwise, it displays a welcome message, and updates the userconfig.pkl file accordingly. Upon successful input of the API key, the user is prompted for the SSH credentials of the routers. These parameters are then encrypted and saved in the user_config.pkl file. The SSH credential is later decrypted and parsed as input to dynamically generate a hosts.yml file at runtime. Responsible Code Section: bubbln.py: welcomemessagefeature() !image Figure 2 Bubbln's welcome message. Parameter Input & Validation: In the parameter input stage, Bubbln first checks for the existence of a file called “router_configuration.pkl”. If it exists, the user is prompted to decide whether to load an existing configuration or input a new set of configurations. If the file is empty or non-existent, then users are prompted to input the configuration parameters for each router on the network. These parameters serve as variables that are combined with hardcoded instructions written in natural language to form the prompt sent to ChatGPT. Key parameters include: Router Configurations: OSPF Area OSPF Process ID Number of networks to advertise (OSPF/EIGRP) AS Number (EIGRP) Interface names IP Addresses (in CIDR format) This module also ensures that parameters are keyed in using the correct data type and format e.g. IP addresses are expected in CIDR format and OSPF Area should be of type integer. Upon completion of parameter input, all parameters are saved into a file called “router_configuration.pkl” upon validation of accuracy by the user. Responsible Code Section: parameter_input.py !image Figure 3 Bubbln receiving Network Parameters. Before generating the prompt, a summary of the inputted parameters is displayed for user validation. This step ensures accuracy and minimizes errors. Users are given the option to make corrections if any discrepancies are found. Responsible Code Section: parameterinput.py: validateinputs() !image Figure 4 Bubbln Awaiting Validation of Inputted Network Parameters. Auto-Generation of Prompt: After validation of inputted parameters, Bubbln composes the prompt by combining the inputted parameters with a set of well-engineered hardcoded instructions written in natural language. Responsible Code Section: prompt_generator.py ChatGPT Prompting: The auto-composed prompt is then sent to ChatGPT utilizing gpt-4 chatCompletions model with a temperature parameter of 0.2 and maximum tokens of 1500. The following functions were designed into this process stage Responsible Code Section: chatGPT_prompting.py !image Figure 5 ChatGPT prompting in progress Playbook Generation & Extraction: After ChatGPT processes the prompt from Bubbln, it provides a response which usually contains the generated playbook and explanatory notes. Bubbln then extracts the playbook from the explanatory notes by searching for “---” which usually connotes the start of playbooks and saves each generated playbook uniquely using the nomenclature RouteriPlaybook.yml. Responsible Code Section: playbook_extractor.py !image Figure 6 ChatGPT-generated playbook. Playbook Execution: Bubbln loads the saved “RouteriPlaybook.yml” playbook and dynamically generates the hosts.yml file and parses them to the python library ansiblerunner for further execution on the configured network. Bubbln generates the hosts.yml file at run time by using the pre-inputted SSH credentials in userconfig.pkl file - and decrypts them, as well as IP addresses from the sshipaddresses.txt file, as inputs Responsible Code Section: playbook_execution.py !image Figure 7 Playbook execution in progress Sample result of Executed Playbook Upon successful execution of all playbooks, a query of the routing table on router 4 indicates that router 4 could reach all the prefixes on the network. !image Figure 8 Output of 'sh ip route' executed on R1 File Management and Handling Throughout the execution process, Bubbln manages the creation, saving, and loading of various files to streamline the network automation process. user_config.pkl: This dictionary file dynamically created at run time is used to store encrypted API keys, SSH credentials and initial welcome message information. router_configuration.pkl: It is auto created by Bubbln and used to store network configuration parameters for easy loading during subsequent sessions. hosts.yml: This is a runtime autogenerated file that contains inventory of the network devices. It is auto deleted after the program runs. networkconfigurationreport.pdf: This auto-generated report by Bubbln is a documentation of all the routers configured their parameters, generated playbooks, and prompt for each execution of the Bubbln application. It is created after a successful execution of playbooks and network testing and is meant for auditing and documentation purposes. RouteriPlaybook.yml: After extraction of generated playbooks from ChatGPT’s raw response, Bubbln automatically saves a copy of the generated playbook using unique names for each playbook. !image Figure 9 File structure after successful deployment of a four-router network Providing Feedback We are glad to hear your thoughts and suggestions. Kindly do this through the discussion section of our GitHub - https://github.com/olasupo/bubbln_network-automation/discussions/1#discussion-6487475 We can also be reached on: Olasupo Okunaiya – olasupo.o@gmail.com

introduction-to-ai-orchestration-with-langchain-and-llamaindex-3820082
github
LLM Vibe Score0.43
Human Vibe Score0.050863657300783044
LinkedInLearningFeb 28, 2025

introduction-to-ai-orchestration-with-langchain-and-llamaindex-3820082

Introduction to AI Orchestration with LangChain and LlamaIndex This is the repository for the LinkedIn Learning course Introduction to AI Orchestration with LangChain and LlamaIndex. The full course is available from [LinkedIn Learning][lil-course-url]. ![lil-thumbnail-url] Are you ready to dive into the world of AI applications? This course was designed for you. AI orchestration frameworks let you step back from the details of artificial intelligence tools and APIs and instead focus on building more general, effective systems that solve real-world problems. Join instructor M.Joel Dubinko as he explores the business benefits of AI orchestration—faster development, smarter interfaces, lower costs, and more. This course provides an overview of AI fundamentals and key capabilities, like accessing external tools and databases, with a special focus on exploring local models running on your own hardware, alongside or instead of cloud services like those from OpenAI. Every step of the way, Joel offers hands-on demonstrations of two industry-leading frameworks: LangChain and LlamaIndex. By the end of this course, you’ll be prepared to start building chatbots, intelligent agents, and other useful tools, while monitoring for errors and troubleshooting as you go. Welcome to the course! AI is a fast-changing field, so be sure to check this repo for newer versions of the sample code. Installing Clone this repository into your local machine using the terminal (Mac), CMD (Windows), or a GUI tool like SourceTree. Ensure you have Python 3.10 or later (version 3.11 recommended) To prevent conflicts with other installed software on your computer, the author recommends setting up a virtual environment as follows: python3.11 -m venv .venv Activate the virtual environment with one of these commands: Install the necessary Python packages: (use the upgrade flag to ensure you have current versions) Specific projects in this course might have additional optional requirements. If so, it will be noted within the relevant video. Updates Recent versions of LM Studio have changed the UI from what's shown in the videos. These are generally welcome improvements. For example the maximum context length and other model parameters are viewable in the sidebar. Recent versions of LlamaIndex have changed their import and package structure in a way that breaks existing code. In many cases, you can fix imports as follows: Specific third party components require installing new packages. These will be noted in comments. Example: For code in Chap04, From March 1, 2024, LlamaHub has been deprecated and most projects migrated into LlamaIndex. (sort of--it's complicated) Specifically: Additionally, LlamaIndex ServiceContext has been deprecated and replaced with Settings. See Ch02/rag_llamaindex.py for updated sample code. LangChain too has changed their import structure, though as of this writing it produces warnings rather than errors. In many cases you will need to import from langchaincommunity or langchainopenai as follows: Instructor M. Joel Dubinko Software Generalist | Consultant | Instructor | Problem Solver Check out my other courses on [LinkedIn Learning][URL-instructor-home]. [lil-course-url]: https://www.linkedin.com/learning/introduction-to-ai-orchestration-with-langchain-and-llamaindex [lil-thumbnail-url]: https://media.licdn.com/dms/image/D560DAQEi6KQmA4fF1Q/learning-public-crop6751200/0/1707936616297?e=2147483647&v=beta&t=3vzvDRzpKq9Nd99ss8r2pqMZmyTOKYgKwk825XoSEHU [URL-instructor-home]: https://www.linkedin.com/learning/instructors/m-joel-dubinko?u=104

Vibe Coding and Coder Cry Babies
youtube
LLM Vibe Score0.382
Human Vibe Score0.56
Stefan MischookFeb 28, 2025

Vibe Coding and Coder Cry Babies

Vibe coding is a new thing in the Ai and development world, and it is gaining a lot of attention. The new age of development is upon us! 🔥 STEF'S DEVELOPER BOOTCAMP AND MENTORING PROGRAM https://unclestef.com/ 🎤 Listen to my Uncle Stef podcasts: https://unclestef.com/blog/2024/07/26/uncle-stef-podcast-all-episodes/ 🔥 JOIN STEF'S 'CODER'S CAREER PATHS' NEWSLETTER: https://newsletters.stefanmischook.com/coderscareerpaths_signup 🔥 FREE: LIZARD WIZARD KOMODO - TRANSFORMATIONAL MIND TRAINING: https://newsletters.stefanmischook.com/komodo Channel Discord Server: https://discord.gg/rn8za8aq2v WEB HOST PAYS FOR YOUR WEB DESIGN TRAINING IN 2023: https://www.killersites.com/blog/2020/web-hosting-company-pays-for-your-web-design-training/ POPULAR & EASY CODING COURSES: Full stack web developer course: https://school.studioweb.com/store/course/completewebdeveloper Python 3 Foundations & Certification: https://school.studioweb.com/store/course/python3foundations&certificationpackage Complete Freelancer: https://school.studioweb.com/store/course/complete_freelancer Complete Entrepreneur: https://school.studioweb.com/store/course/completewebentrepreneur 🦎 Lizard Wizard Course: https://school.studioweb.com/store/course/lizard_wizard 📚 BOOKS TO READ: My Beginners HTML5, CSS3: https://amzn.to/2wKsVTh … Complements Studioweb courses on HTML5, CSS3 and JavaScript. Refactoring: Improving the Design of Existing Code (2nd Edition) https://amzn.to/3o5cTbw HeadFirst Design Patterns: https://amzn.to/2LQ0Gdh Java Refactoring: Improving the Design of Existing Code (1st Edition) https://amzn.to/3a9nSsZ The Naked Ape: https://amzn.to/3fhS1Lj ✉️ STAY IN CONTACT: Stef's social links: Instagram: https://www.instagram.com/stefanmischook/?hl=en Twitter: https://twitter.com/killersites Stef's business channel: https://www.youtube.com/channel/UCZdr0ql_B240VBVINAX7Acg 👉 GOOGLE REVIEW: https://g.page/studioWebedu/review?mt Leave a Google review about Stef. MY MOUSE & KEYBOARD: Logitech Keyboard I use: https://amzn.to/38jYDqE Logitech mouse I use: https://amzn.to/2IeVvBj SUPPLEMENTS THAT WORK AMAZING FOR ME: Protein Essentials Beef Gelatine Powder: https://amzn.to/2Pf52vL ... Healed my very bad knee. If you have joint problems, this *could do miracles for you. Webber Naturals 88862 Glucosamine Chondroitin https://amzn.to/3ss9WEa MY CAMERA GEAR: Godox VL150 lights: https://amzn.to/3lhsYZP Sigma 18-35 lens: https://amzn.to/33sRh0T Canon EOS C70 Cinema Camera Thanks! Stef #mentoring #codecourses #unclestef #codingcoach

kodyfire
github
LLM Vibe Score0.384
Human Vibe Score0.0032098142352129998
nooqtaFeb 2, 2025

kodyfire

Kody is a command-line tool for generating artifact files, powered by both classic and AI code generation techniques. It can be used by both technical and non-technical users to generate files across a wide range of technologies and programming languages. The code generation feature in Kody relies on OpenAI GPT, a language model that uses deep learning to generate human-like text, and ChatGPT to provide natural language processing capabilities. Table of Contents Installation Usage Getting Started Terminology Contributing License Installation Prerequisites Node.js (version 14 or later) To install kody, use npm with the following command: or You can check the documentation with Usage Options -v, --version: Output the current version -h, --help: Display help for command Commands prompt|ai [options] [prompt...]: AI powered prompt assistant to quickly generate an artifact batch [options]: Generate multiple digital artifact create [options] : Generate a new blank kody project generate|g [options] [kody] [concept]: Prompt assistant to quickly generate an artifact import|in [options] : Mass create artifacts from a source. init: Initialize a new kodyfire project install|i [kody]: Prompt user to choose to install list|ls [options] [kodyName]: List installed kodies within your current project. publish [template]: Publish the templates of the kody along with the assets.json and schema.ts files ride|↻: Prompt assistant to help build your kody.json file run [options]: Generate a digital artifact based on the selected technology run-script|rs: Run scripts search|s [keywords...]: Search kodyfire packages from npm registry watch|w [options]: Watch for file changes and run kody help [command]: Display help for command Getting Started Open the project you are willing to work on using vscode or your prefered editor. Generate artifacts using AI In case you want to exclusivly rely on AI to generate your artifacts. You don't need to install any additional kodies. Run the kody ai [prompt] command and follow the prompts. For example, to create a Laravel Controller named SampleController under API/V1 and add a comment on top saying Hello Kodyfire, run the following command You can use the experimental Speech-to-Text option to pass your prompt using your voice. The transcription relies on Whisper and requires SoX installed and available in your \$PATH. for the audio recording. For Linux For MacOS For Windows Download the binaries Generate your artifact using the classical method Search and install a kody Based on your project, search availables kodies and select the one that fits your need.. To search availables kodies by keyword runthe following command. if you don't specify a keyword all available kodies will be listed. Install your kody of choice. For example, if you want to install the react kody or Please note you can install as many kodies in the same project as you wish. Generate your artifact There are 2 methods you can generate your artifacts with: The generate command The run command Method 1: Generator mode kody generate The recommended way of using kody is using the generate command. The command will assist you creating your artifact based on the chosen concept. For example, a react component is considered a concept. In order to generate your artifacts, run the generate command. The syntax is kody g|generate [kody] [concept]. the assistant will prompt you to select the missing arguments. As an example, run the following command from your terminal: Method 2: Runner mode kody run The run command is similar to the generate command. The run requires a definition file which is simply a json file containing all the concept definitions you have created using the ride command. The generate command on the other hand creates one or more concept definition on the run and process them on one run. Every command has its use cases. Initialize kody In order to start using kody, you need to initialize your project. This will add the definition files required for kody runs. Important: Please run the command only once. The command will override existing definition files. We will disable overriding in a future version. Ride your kody In order to update your definition, use the kody ride command to assist you populate the required fields Launch a kody run Once you are satisified with your definition file, execute the run command to generate your artifacts. To run all kodies defined within your project, run the following command: Create your own kody In most cases you might need a custom kody to suit your needs Scaffold a new kody Create a basic kody using the scaffold command. Follow the prompts to setup your kody This will create a folder containing the basic structure for a kody. You can start using right away within your project. Setup your kody Install npm dependencies Build your kody Add your concepts and related templates //TODO This will build your kody and export the basic templates files. Add your kody as an NPM dependency to a test project In order to be able to use it within your test project run the following command Publish your kody Please remember that Kody is still in exploration phase and things will change frequently. Contribution is always highly requested. Prepare your kody Add the required kodyfire metadata to your package.json Publish to Github Intialize your project as a git repository and push to a public Github repo To do so, kindly follow these steps:- Intitialize a new Github repository and make it public. Open your project root folder locally from terminal and run the following commands:- Link your project to your Github repository. Publish to npm Once you are satisfied with your kody and you would to like to share it with the community. Run the following command. Note: You'll need an NPM account Share with community Congratulation publishing your first kody. Don't forget to share your kody repo link by opening an issue on Kody's github repository. Terminology Kody: Refers to the code generation command-line tool that generates digital artifacts. Artifacts: Refers to the various digital products generated by Kody based on the input provided. Note: Kody uses classical code generation techniques in addition to AI-powered code generation using OpenAI Codex and ChatGPT. Available kodies | Name | Description | | -------------------------------------------------------------------------------- | ---------------------------------------------------------------------------------------------------------------------------------------------------------------- | | basic-kodyfire | A general purpose code generator that should handle most of the generation use cases | | typescript-kodyfire | Generate typescript related artifacts | | tsconfig-kodyfire | Generate tsconfig files for your typescript projects | | nextjs-kodyfire | Generate nextJs components and related artifacts | | react-kodyfire | Generate react components | | laravel-kodyfire | Laravel artifacts generation | | uml-kodyfire | Uml diagrams generation using plantuml | | readme-kodyfire | Readme file generation | | word-kodyfire | Generate ms word document based on a template | | pdf-kodyfire | Generate PDF document from HTML templates | | social-image-kodyfire | Generate dynamic images for social sharing based on HTML templates | | social-gif-kodyfire | Generate dynamic gif images for social sharing based on HTML templates | | linkedin-quizzes-kodyfire | Practice Linkedin skill assessement tests from your terminal | | chatgpt-kodyfire | Use chatgpt from the terminal. Allows you provide additional data from various sources (not implemented yet) and export to serveral outputs (markdown only now). | Contributing If you encounter any issues while using Kody or have suggestions for new features, feel free to open an issue or submit a pull request. Please read our contributing guidelines before making contributions. License Kody is MIT licensed.

internet-tools-collection
github
LLM Vibe Score0.236
Human Vibe Score0.009333333333333334
bogdanmosicaJan 23, 2025

internet-tools-collection

Internet Tools Collection A collection of tools, website and AI for entrepreneurs, web designers, programmers and for everyone else. Content by category Artificial Intelligence Developers Design Entrepreneur Video Editing Stock videos Stock Photos Stock music Search Engine Optimization Blog Posts Resume Interviews No code website builder No code game builder Side Hustle Browser Extensions Other Students Artificial Intelligence Jasper - The Best AI Writing Assistant [](https://www.jasper.ai/) Create content 5x faster with artificial intelligence. Jasper is the highest quality AI copywriting tool with over 3,000 5-star reviews. Best for writing blog posts, social media content, and marketing copy. AutoDraw [](https://www.autodraw.com/) Fast drawing for everyone. AutoDraw pairs machine learning with drawings from talented artists to help you draw stuff fast. Rytr - Best AI Writer, Content Generator & Writing Assistant [](https://rytr.me/) Rytr is an AI writing assistant that helps you create high-quality content, in just a few seconds, at a fraction of the cost! Neevo - Neevo [](https://www.neevo.ai/) Kinetix Tech [](https://kinetix.tech/) Kinetix is a no-code 3D creation tool powered by Artificial Intelligence. The web-based platform leverages AI motion capture to convert a video into a 3D animation and lets you customize your avatars and environments. We make 3D animation accessible to every creator so they can create engaging stories. LALAL.AI: 100% AI-Powered Vocal and Instrumental Tracks Remover [](https://www.lalal.ai/) Split vocal and instrumental tracks quickly and accurately with LALAL.AI. Upload any audio file and receive high-quality extracted tracks in a few seconds. Copy.ai: Write better marketing copy and content with AI [](https://www.copy.ai/) Get great copy that sells. Copy.ai is an AI-powered copywriter that generates high-quality copy for your business. Get started for free, no credit card required! Marketing simplified! OpenAI [](https://openai.com/) OpenAI is an AI research and deployment company. Our mission is to ensure that artificial general intelligence benefits all of humanity. DALL·E 2 [](https://openai.com/dall-e-2/) DALL·E 2 is a new AI system that can create realistic images and art from a description in natural language. Steve.ai - World’s fastest way to create Videos [](https://www.steve.ai/) Steve.AI is an online Video making software that helps anyone to create Videos and animations in seconds. Octie.ai - Your A.I. ecommerce marketing assistant [](https://octie.ai/) Write emails, product descriptions, and more, with A.I. Created by Octane AI. hypnogram.xyz [](https://hypnogram.xyz/) Generate images from text descriptions using AI FakeYou. Deep Fake Text to Speech. [](https://fakeyou.com/) FakeYou is a text to speech wonderland where all of your dreams come true. Craiyon, formerly DALL-E mini [](https://www.craiyon.com/) Craiyon, formerly DALL-E mini, is an AI model that can draw images from any text prompt! Deck Rocks - Create Pictch Decks [](https://www.deck.rocks/) Writely | Using AI to Improve Your Writing [](https://www.writelyai.com/) Making the art of writing accessible to all Writesonic AI Writer - Best AI Writing Assistant [](https://writesonic.com/) Writesonic is an AI writer that's been trained on top-performing SEO content, high-performing ads, and converting sales copy to help you supercharge your writing and marketing efforts. Smart Copy - AI Copywriting Assistant | Unbounce [](https://unbounce.com/product/smart-copy/) Generate creative AI copy on-the-spot across your favourite tools Synthesia | #1 AI Video Generation Platform [](https://www.synthesia.io/) Create AI videos by simply typing in text. Easy to use, cheap and scalable. Make engaging videos with human presenters — directly from your browser. Free demo. NVIDIA Canvas: Turn Simple Brushstrokes into Realistic Images [](https://www.nvidia.com/en-us/studio/canvas/) Create backgrounds quickly, or speed up your concept exploration so you can spend more time visualizing ideas with the help of NVIDIA Canvas. Hotpot.ai - Hotpot.ai [](https://hotpot.ai/) Hotpot.ai makes graphic design and image editing easy. AI tools allow experts and non-designers to automate tedious tasks while attractive, easy-to-edit templates allow anyone to create device mockups, social media posts, marketing images, app icons, and other work graphics. Klaviyo: Marketing Automation Platform for Email & SMS [](https://www.klaviyo.com/) Klaviyo, an ecommerce marketing automation platform for email marketing and sms syncs your tech stack with your website store to scale your business. Search listening tool for market, customer & content research - AnswerThePublic [](https://answerthepublic.com/) Use our free tool to get instant, raw search insights, direct from the minds of your customers. Upgrade to a paid plan to monitor for new ways that people talk & ask questions about your brand, product or topic. Topic Mojo [](https://topicmojo.com/) Discover unique & newest queries around any topic and find what your customers are searching for. Pulling data from 50+ sources to enhance your topic research. AI Image Enlarger | Enlarge Image Without Losing Quality! [](https://imglarger.com/) AI Image Enlarger is a FREE online image enlarger that could upscale and enhance small images automatically. Make jpg/png pictures big without losing quality. Midjourney [](https://www.midjourney.com/app/) Kaedim - AI for turning 2D images to 3D models [](https://www.kaedim3d.com/webapp) AI for turning 2D images, sketches and photos to 3D models in seconds. Overdub: Ultra realistic text to speech voice cloning - Descript [](https://www.descript.com/overdub) Create a text to speech model of your voice. Try a live demo. Getting Started [](https://magenta.tensorflow.org/get-started) Resources to learn about Magenta Photosonic AI Art Generator | Create Unique Images with AI [](https://photosonic.writesonic.com/) Transform your imagination into stunning digital art with Photosonic - the AI art generator. With its creative suggestions, this Writesonic's AI image generator can help unleash your inner artist and share your creations with the world. Image Computer [](https://image.computer/) Most downloaded Instagram Captions App (+more creator tools) [](https://captionplus.app/) Join 3 Million+ Instagram Creators who use CaptionPlus to find Instagram Captions, Hashtags, Feed Planning, Reel Ideas, IG Story Design and more. Writecream - Best AI Writer & Content Generator - Writecream [](https://www.writecream.com/) Sentence Rewriter is a free tool to reword a sentence, paragraph and even entire essays in a short amount of time. Hypotenuse AI: AI Writing Assistant and Text Generator [](https://www.hypotenuse.ai/) Turn a few keywords into original, insightful articles, product descriptions and social media copy with AI copywriting—all in just minutes. Try it free today. Text to Speach Listnr: Generate realistic Text to Speech voiceovers in seconds [](https://www.listnr.tech/) AI Voiceover Generator with over 600+ voiceovers in 80+ languages, go from Text to Voice in seconds. Get started for Free! Free Text to Speech: Online, App, Software, Commercial license with Natural Sounding Voices. [](https://www.naturalreaders.com/) Free text to speech online app with natural voices, convert text to audio and mp3, for personal and commercial use Developers OverAPI.com | Collecting all the cheat sheets [](https://overapi.com/) OverAPI.com is a site collecting all the cheatsheets,all! Search Engine For Devs [](https://you.com/) Spline - Design tool for 3D web browser experiences [](https://spline.design/) Create web-based 3D browser experiences Image to HTML CSS converter. Convert image to HTML CSS with AI: Fronty [](https://fronty.com/) Fronty - Image to HTML CSS code converter. Convert image to HTML powered by AI. Sketchfab - The best 3D viewer on the web [](https://sketchfab.com/) With a community of over one million creators, we are the world’s largest platform to publish, share, and discover 3D content on web, mobile, AR, and VR. Railway [](https://railway.app/) Railway is an infrastructure platform where you can provision infrastructure, develop with that infrastructure locally, and then deploy to the cloud. JSON Crack - Crack your data into pieces [](https://jsoncrack.com/) Simple visualization tool for your JSON data. No forced structure, paste your JSON and view it instantly. Locofy.ai - ship your products 3-4x faster — with low code [](https://www.locofy.ai/) Turn your designs into production-ready frontend code for mobile apps and web. Ship products 3-4x faster with your existing design tools, tech stacks & workflows. Oh Shit, Git!?! [](https://ohshitgit.com/) Carbon | Create and share beautiful images of your source code [](https://carbon.now.sh/) Carbon is the easiest way to create and share beautiful images of your source code. GPRM : GitHub Profile ReadMe Maker [](https://gprm.itsvg.in/) Best Profile Generator, Create your perfect GitHub Profile ReadMe in the best possible way. Lots of features and tools included, all for free ! HubSpot | Software, Tools, and Resources to Help Your Business Grow Better [](https://www.hubspot.com/) HubSpot’s integrated CRM platform contains the marketing, sales, service, operations, and website-building software you need to grow your business. QuickRef.ME - Quick Reference Cheat Sheet [](https://quickref.me/) Share quick reference and cheat sheet for developers massCode | A free and open source code snippets manager for developers [](https://masscode.io/) Code snippets manager for developers, developed using web technologies. Snyk | Developer security | Develop fast. Stay secure. [](https://snyk.io/) Snyk helps software-driven businesses develop fast and stay secure. Continuously find and fix vulnerabilities for npm, Maven, NuGet, RubyGems, PyPI and more. Developer Roadmaps [](https://roadmap.sh/) Community driven roadmaps, articles, guides, quizzes, tips and resources for developers to learn from, identify their career paths, know what they don't know, find out the knowledge gaps, learn and improve. CSS Generators Get Waves – Create SVG waves for your next design [](https://getwaves.io/) A free SVG wave generator to make unique SVG waves for your next web design. Choose a curve, adjust complexity, randomize! Box Shadows [](https://box-shadow.dev/) Tridiv | CSS 3D Editor [](http://tridiv.com/) Tridiv is a web-based editor for creating 3D shapes in CSS Glassmorphism CSS Generator - Glass UI [](https://ui.glass/generator/) Generate CSS and HTML components using the glassmorphism design specifications based on the Glass UI library. Blobmaker - Make organic SVG shapes for your next design [](https://www.blobmaker.app/) Make organic SVG shapes for your next design. Modify the complexity, contrast, and color, to generate unique SVG blobs every time. Keyframes.app [](https://keyframes.app/) cssFilters.co - Custom and Instagram like photo filters for CSS [](https://www.cssfilters.co/) Visual playground for generating CSS for custom and Instagram like photo filters. Experiment with your own uploaded photo or select one from the Unsplash collection. CSS Animations Animista - CSS Animations on Demand [](https://animista.net/) Animista is a CSS animation library and a place where you can play with a collection of ready-made CSS animations and download only those you will use. Build Internal apps Superblocks | Save 100s of developer hours on internal tools [](https://www.superblocks.com/) Superblocks is the fast, easy and secure way for developers to build custom internal tools fast. Connect your databases & APIs. Drag and drop UI components. Extend with Python or Javascript. Deploy in 1-click. Secure and Monitor using your favorite tools Budibase | Build internal tools in minutes, the easy way [](https://budibase.com/) Budibase is a modern, open source low-code platform for building modern internal applications in minutes. Retool | Build internal tools, remarkably fast. [](https://retool.com/) Retool is the fast way to build internal tools. Drag-and-drop our building blocks and connect them to your databases and APIs to build your own tools, instantly. Connects with Postgres, REST APIs, GraphQL, Firebase, Google Sheets, and more. Built by developers, for developers. Trusted by startups and Fortune 500s. Sign up for free. GitHub Repositories GitHub - vasanthk/how-web-works: What happens behind the scenes when we type www.google.com in a browser? [](https://github.com/vasanthk/how-web-works) What happens behind the scenes when we type www.google.com in a browser? - GitHub - vasanthk/how-web-works: What happens behind the scenes when we type www.google.com in a browser? GitHub - kamranahmedse/developer-roadmap: Interactive roadmaps, guides and other educational content to help developers grow in their careers. [](https://github.com/kamranahmedse/developer-roadmap) Interactive roadmaps, guides and other educational content to help developers grow in their careers. - GitHub - kamranahmedse/developer-roadmap: Interactive roadmaps, guides and other educational content to help developers grow in their careers. GitHub - apptension/developer-handbook: An opinionated guide on how to become a professional Web/Mobile App Developer. [](https://github.com/apptension/developer-handbook) An opinionated guide on how to become a professional Web/Mobile App Developer. - GitHub - apptension/developer-handbook: An opinionated guide on how to become a professional Web/Mobile App Developer. ProfileMe.dev | Create an amazing GitHub profile in minutes [](https://www.profileme.dev/) ProfileMe.dev | Create an amazing GitHub profile in minutes GitHub - Kristories/awesome-guidelines: A curated list of high quality coding style conventions and standards. [](https://github.com/Kristories/awesome-guidelines) A curated list of high quality coding style conventions and standards. - GitHub - Kristories/awesome-guidelines: A curated list of high quality coding style conventions and standards. GitHub - tiimgreen/github-cheat-sheet: A list of cool features of Git and GitHub. [](https://github.com/tiimgreen/github-cheat-sheet) A list of cool features of Git and GitHub. Contribute to tiimgreen/github-cheat-sheet development by creating an account on GitHub. GitHub - andreasbm/web-skills: A visual overview of useful skills to learn as a web developer [](https://github.com/andreasbm/web-skills) A visual overview of useful skills to learn as a web developer - GitHub - andreasbm/web-skills: A visual overview of useful skills to learn as a web developer GitHub - Ebazhanov/linkedin-skill-assessments-quizzes: Full reference of LinkedIn answers 2022 for skill assessments (aws-lambda, rest-api, javascript, react, git, html, jquery, mongodb, java, Go, python, machine-learning, power-point) linkedin excel test lösungen, linkedin machine learning test LinkedIn test questions and answers [](https://github.com/Ebazhanov/linkedin-skill-assessments-quizzes) Full reference of LinkedIn answers 2022 for skill assessments (aws-lambda, rest-api, javascript, react, git, html, jquery, mongodb, java, Go, python, machine-learning, power-point) linkedin excel test lösungen, linkedin machine learning test LinkedIn test questions and answers - GitHub - Ebazhanov/linkedin-skill-assessments-quizzes: Full reference of LinkedIn answers 2022 for skill assessments (aws-lambda, rest-api, javascript, react, git, html, jquery, mongodb, java, Go, python, machine-learning, power-point) linkedin excel test lösungen, linkedin machine learning test LinkedIn test questions and answers Blockchain/Crypto Dashboards [](https://dune.com/) Blockchain ecosystem analytics by and for the community. Explore and share data from Ethereum, xDai, Polygon, Optimism, BSC and Solana for free. Introduction - The Anchor Book v0.24.0 [](https://book.anchor-lang.com/introduction/introduction.html) Crypto & Fiat Exchange Super App | Trade, Save & Spend | hi [](https://hi.com/) Buy, Trade, Send and Earn Crypto & Fiat. Deposit Bitcoin, ETH, USDT and other cryptos and start earning. Get the hi Debit Card and Multi-Currency IBAN Account. Moralis Web3 - Enterprise-Grade Web3 APIs [](https://moralis.io/) Bridge the development gap between Web2 and Web3 with Moralis’ powerful Web3 APIs. Mirror [](https://mirror.xyz/) Built on web3 for web3, Mirror’s robust publishing platform pushes the boundaries of writing online—whether it’s the next big white paper or a weekly community update. Makerdao [](https://blog.makerdao.com/) Sholi — software for Investors & Traders / Sholi MetriX [](https://sholi.io/) Sholi — software for Investors & Traders / Sholi MetriX Stock Trading Quiver Quantitative [](https://www.quiverquant.com/) Quiver Quantitative Chart Prime - The only tool you'll need for trading assets across all markets [](https://chartprime.com/) ChartPrime offers a toolkit that will take your trading game to the next level. Visit our site for a full rundown of features and helpful tutorials. Learning Hacker Rank [](https://www.hackerrank.com/) Coderbyte | Code Screening, Challenges, & Interview Prep [](https://coderbyte.com/) Improve your coding skills with our library of 300+ challenges and prepare for coding interviews with content from leading technology companies. Competitive Programming | Participate & Learn | CodeChef [](https://www.codechef.com/) Learn competitive programming with the help of CodeChef's coding competitions. Take part in these online coding contests to level up your skills Learn to Code - for Free | Codecademy [](https://www.codecademy.com/) Learn the technical skills to get the job you want. Join over 50 million people choosing Codecademy to start a new career (or advance in their current one). Free Code Camp [](https://www.freecodecamp.org/) Learn to Code — For Free Sololearn: Learn to Code [](https://www.sololearn.com/home) Join Now to learn the basics or advance your existing skills Mimo: The coding app you need to learn to code! Python, HTML, JavaScript [](https://getmimo.com/) Join more than 17 million learners worldwide. Learn to code for free. Learn Python, JavaScript, CSS, SQL, HTML, and more with our free code learning app. Free for developers [](https://free-for.dev/#/) Your Career in Web Development Starts Here | The Odin Project [](https://www.theodinproject.com/) The Odin Project empowers aspiring web developers to learn together for free Code Learning Games CheckiO - coding games and programming challenges for beginner and advanced [](https://checkio.org/) CheckiO - coding websites and programming games. Improve your coding skills by solving coding challenges and exercises online with your friends in a fun way. Exchanges experience with other users online through fun coding activities Coding for Kids | Game-Based Programming | CodeMonkey [](https://www.codemonkey.com/) CodeMonkey is a leading coding for kids program. Through its award-winning courses, millions of students learn how to code in real programming languages. Coding Games and Programming Challenges to Code Better [](https://www.codingame.com/) CodinGame is a challenge-based training platform for programmers where you can play with the hottest programming topics. Solve games, code AI bots, learn from your peers, have fun. Learn VIM while playing a game - VIM Adventures [](https://vim-adventures.com/) VIM Adventures is an online game based on VIM's keyboard shortcuts. It's the "Zelda meets text editing" game. So come have some fun and learn some VIM! CodeCombat - Coding games to learn Python and JavaScript [](https://codecombat.com/) Learn typed code through a programming game. Learn Python, JavaScript, and HTML as you solve puzzles and learn to make your own coding games and websites. Design Useberry - Codeless prototype analytics [](https://www.useberry.com/) User testing feedback & rich insights in minutes, not months! Figma: the collaborative interface design tool. [](https://www.figma.com/) Build better products as a team. Design, prototype, and gather feedback all in one place with Figma. Dribbble - Discover the World’s Top Designers & Creative Professionals [](https://dribbble.com/) Find Top Designers & Creative Professionals on Dribbble. We are where designers gain inspiration, feedback, community, and jobs. Your best resource to discover and connect with designers worldwide. Photopea | Online Photo Editor [](https://www.photopea.com/) Photopea Online Photo Editor lets you edit photos, apply effects, filters, add text, crop or resize pictures. Do Online Photo Editing in your browser for free! Toools.design – An archive of 1000+ Design Resources [](https://www.toools.design/) A growing archive of over a thousand design resources, weekly updated for the community. Discover highly useful design tools you never thought existed. All Online Tools in One Box | 10015 Tools [](https://10015.io/) All online tools you need in one box for free. Build anything online with “all-in-one toolbox”. All tools are easy-to-use, blazing fast & free. Phase - Digital Design Reinvented| Phase [](https://phase.com/) Design and prototype websites and apps visually and intuitively, in a new powerful product reworked for the digital age. Animated Backgrounds [](https://animatedbackgrounds.me/) A Collection of 30+ animated backgrounds for websites and blogs.With Animated Backgrounds, set a simple, elegant background animations on your websites and blogs. Trianglify.io · Low Poly Pattern Generator [](https://trianglify.io/) Trianglify.io is a tool for generating low poly triangle patterns that can be used as wallpapers and website assets. Cool Backgrounds [](https://coolbackgrounds.io/) Explore a beautifully curated selection of cool backgrounds that you can add to blogs, websites, or as desktop and phone wallpapers. SVG Repo - Free SVG Vectors and Icons [](https://www.svgrepo.com/) Free Vectors and Icons in SVG format. ✅ Download free mono or multi color vectors for commercial use. Search in 300.000+ Free SVG Vectors and Icons. Microcopy - Short copy text for your website. [](https://www.microcopy.me/) Search micro UX copy text: slogans, headlines, notifications, CTA, error messages, email, account preferences, and much more. 3D icons and icon paks - Free3Dicon [](https://free3dicon.com/) All 3D icons you need in one place. This is a collection of free, beautiful, trending 3D icons, that you can use in any project. Love 3D Icon [](https://free3dicons.com/) Downloads free 3D icons GIMP - GNU Image Manipulation Program [](https://www.gimp.org/) GIMP - The GNU Image Manipulation Program: The Free and Open Source Image Editor blender.org - Home of the Blender project - Free and Open 3D Creation Software [](https://www.blender.org/) The Freedom to Create 3D Design Software | 3D Modeling on the Web | SketchUp [](https://www.sketchup.com/) SketchUp is a premier 3D design software that truly makes 3D modeling for everyone, with a simple to learn yet robust toolset that empowers you to create whatever you can imagine. Free Logo Maker - Create a Logo in Seconds - Shopify [](https://www.shopify.com/tools/logo-maker) Free logo maker tool to generate custom design logos in seconds. This logo creator is built for entrepreneurs on the go with hundreds of templates, free vectors, fonts and icons to design your own logo. The easiest way to create business logos online. All your design tools in one place | Renderforest [](https://www.renderforest.com/) Time to get your brand noticed. Create professional videos, logos, mockups, websites, and graphics — all in one place. Get started now! Prompt Hero [](https://prompthero.com/) Type Scale - A Visual Calculator [](https://type-scale.com/) Preview and choose the right type scale for your project. Experiment with font size, scale and different webfonts. DreamFusion: Text-to-3D using 2D Diffusion [](https://dreamfusion3d.github.io/) DreamFusion: Text-to-3D using 2D Diffusion, 2022. The branding style guidelines documents archive [](https://brandingstyleguides.com/) Welcome to the brand design manual documents directory. Search over our worldwide style assets handpicked collection, access to PDF documents for inspiration. Super designer | Create beautiful designs with a few clicks [](https://superdesigner.co/) Create beautiful designs with a few clicks. Simple design tools to generate unique patterns, backgrounds, 3D shapes, colors & images for social media, websites and more Readymag—a design tool to create websites without coding [](https://readymag.com/) Meet the most elegant, simple and powerful web-tool for designing websites, presentations, portfolios and all kinds of digital publications. ffflux: Online SVG Fluid Gradient Background Generator | fffuel [](https://fffuel.co/ffflux/) SVG generator to make fluid gradient backgrounds that feel organic and motion-like. Perfect to add a feeling of motion and fluidity to your web designs. Generate unique SVG design assets | Haikei [](https://haikei.app/) A web-based design tool to generate unique SVG design assets for websites, social media, blog posts, desktop and mobile wallpapers, posters, and more! Our generators let you discover, customize, randomize, and export generative SVG design assets ready to use with your favorite design tools. UI/UX - Inspirational Free Website Builder Software | 10,000+ Free Templates [](https://nicepage.com/) Nicepage is your website builder software breaking limitations common for website builders with revolutionary freehand positioning. 7000+ Free Templates. Easy Drag-n-Drop. No coding. Mobile-friendly. Clean HTML. Super designer | Create beautiful designs with a few clicks [](https://superdesigner.co/) Create beautiful designs with a few clicks. Simple design tools to generate unique patterns, backgrounds, 3D shapes, colors & images for social media, websites and more Pika – Create beautiful mockups from screenshots [](https://pika.style/) Quickly create beautiful website and device mockup from screenshot. Pika lets you capture website screenshots form URL, add device and browser frames, customize background and more LiveTerm [](https://liveterm.vercel.app/) Minimal Gallery – Web design inspiration [](https://minimal.gallery/) For the love of beautiful, clean and functional websites. Awwwards - Website Awards - Best Web Design Trends [](https://www.awwwards.com/) Awwwards are the Website Awards that recognize and promote the talent and effort of the best developers, designers and web agencies in the world. Design Systems For Figma [](https://www.designsystemsforfigma.com/) A collection of Design Systems for Figma from all over the globe. Superside: Design At Scale For Ambitious Brands [](https://www.superside.com/) We are an always-on design company. Get a team of dedicated designers, speedy turnarounds, magical creative collaboration tech and the top 1% of global talent. UXArchive - Made by Waldo [](https://uxarchive.com/) UXArchive the world's largest library of mobile user flows. Be inspired to design the best user experiences. Search by Muzli [](https://search.muz.li/) Search, discover, test and create beautiful color palettes for your projects Siteinspire | Web Design Inspiration [](https://www.siteinspire.com/) SAVEE [](https://savee.it/) The best way to save and share inspiration. A little corner of the internet to find good landing page copywriting examples [](https://greatlandingpagecopy.com/) A little corner of the internet to find great landing page copywriting examples. The Best Landing Page Examples For Design Inspiration - SaaS Landing Page [](https://saaslandingpage.com/) SaaS Landing Page showcases the best landing page examples created by top-class SaaS companies. Get ideas and inspirations for your next design project. Websites Free templates Premium Bootstrap Themes and Templates: Download @ Creative Tim [](https://www.creative-tim.com/) UI Kits, Templates and Dashboards built on top of Bootstrap, Vue.js, React, Angular, Node.js and Laravel. Join over 2,014,387+ creatives to access all our products! Free Bootstrap Themes, Templates, Snippets, and Guides - Start Bootstrap [](https://startbootstrap.com/) Start Bootstrap develops free to download, open source Bootstrap 5 themes, templates, and snippets and creates guides and tutorials to help you learn more about designing and developing with Bootstrap. Free Website Templates [](https://freewebsitetemplates.com/) Get your free website templates here and use them on your website without needing to link back to us. One Page Love - One Page Website Inspiration and Templates [](https://onepagelove.com/) One Page Love is a One Page website design gallery showcasing the best Single Page websites, templates and resources. Free CSS | 3400 Free Website Templates, CSS Templates and Open Source Templates [](https://www.free-css.com/) Free CSS has 3400 free website templates, all templates are free CSS templates, open source templates or creative commons templates. Free Bootstrap Themes and Website Templates | BootstrapMade [](https://bootstrapmade.com/) At BootstrapMade, we create beautiful website templates and bootstrap themes using Bootstrap, the most popular HTML, CSS and JavaScript framework. Free and Premium Bootstrap Themes, Templates by Themesberg [](https://themesberg.com/) Free and Premium Bootstrap themes, templates, admin dashboards and UI kits used by over 38820 web developers and software companies HTML, Vue.js and React templates for startup landing pages - Cruip [](https://cruip.com/) Cruip is a gallery of premium and free HTML, Vue.js and React templates for startups and SaaS. Free Website Templates Download | WordPress Themes - W3Layouts [](https://w3layouts.com/) Want to download free website templates? W3Layouts WordPress themes and website templates are built with responsive web design techniques. Download now! Free HTML Landing Page Templates and UI Kits | UIdeck [](https://uideck.com/) Free HTML Landing Page Templates, Bootstrap Themes, React Templates, HTML Templates, Tailwind Templates, and UI Kits. Create Online Graphics Snappa - Quick & Easy Graphic Design Software [](https://snappa.com/) Snappa makes it easy to create any type of online graphic. Create & publish images for social media, blogs, ads, and more! Canva [](https://www.canva.com/) Polotno Studio - Make graphical designs [](https://studio.polotno.com) Free online design editor. Create images for social media, youtube previews, facebook covers Free Logo Maker: Design Custom Logos | Adobe Express [](https://www.adobe.com/express/create/logo) The Adobe Express logo maker is instant, intuitive, and intelligent. Use it to generate a wide range of possibilities for your own logo. Photo Editor: Fotor – Free Online Photo Editing & Image Editor [](https://www.fotor.com/) Fotor's online photo editor helps you edit photos with free online photo editing tools. Crop photos, resize images, and add effects/filters, text, and graphics in just a few clicks. Photoshop online has never been easier with Fotor's free online photo editor. VistaCreate – Free Graphic Design Software with 70,000+ Free Templates [](https://create.vista.com/) Looking for free graphic design software? Easily create professional designs with VistaCreate, a free design tool with powerful features and 50K+ ready-made templates Draw Freely | Inkscape [](https://inkscape.org/) Inkscape is professional quality vector graphics software which runs on Linux, Mac OS X and Windows desktop computers. Visual & Video Maker Trusted By 11 Million Users - Piktochart [](https://piktochart.com/) With Piktochart, you can create professional-looking infographics, flyers, posters, charts, videos, and more. No design experience needed. Start for free. The Web's Favorite Online Graphic Design Tool | Stencil [](https://getstencil.com/) Stencil is a fantastically easy-to-use online graphic design tool and image editor built for business owners, social media marketers, and bloggers. Pablo by Buffer - Design engaging images for your social media posts in under 30 seconds [](https://pablo.buffer.com/) Buffer makes it super easy to share any page you're reading. Keep your Buffer topped up and we automagically share them for you through the day. Free Online Graphic Design Software | Create stunning designs in seconds. [](https://desygner.com/) Easy drag and drop graphic design tool for anyone to use with 1000's of ready made templates. Create & print professional business cards, flyers, social posts and more. Color Pallet Color Palettes for Designers and Artists - Color Hunt [](https://colorhunt.co/) Discover the newest hand-picked color palettes of Color Hunt. Get color inspiration for your design and art projects. Coolors - The super fast color palettes generator! [](https://coolors.co/) Generate or browse beautiful color combinations for your designs. Get color palette inspiration from nature - colorpalettes.earth [](https://colorpalettes.earth/) Color palettes inspired by beautiful nature photos Color Palette Generator - Create Beautiful Color Schemes [](https://colors.muz.li/) Search, discover, test and create beautiful color palettes for your projects A Most Useful Color Picker | 0to255 [](https://0to255.com/) Find lighter and darker colors based on any color. Discover why over two million people have used 0to255 to choose colors for their website, logo, room interior, and print design projects. Colour Contrast Checker [](https://colourcontrast.cc/) Check the contrast between different colour combinations against WCAG standards Fonts Google Fonts [](https://fonts.google.com/) Making the web more beautiful, fast, and open through great typography Fonts In Use – Type at work in the real world. [](https://fontsinuse.com/) A searchable archive of typographic design, indexed by typeface, format, and topic. Wordmark - Helps you choose fonts! [](https://wordmark.it/) Wordmark helps you choose fonts by quickly displaying your text with your fonts. OH no Type Company [](https://ohnotype.co/) OH no Type Co. Retail and custom typefaces. Life’s a thrill, fonts are chill! Illustrations Illustrations | unDraw [](https://undraw.co/illustrations) The design project with open-source illustrations for any idea you can imagine and create. Create beautiful websites, products and applications with your color, for free. Design Junction [](https://designjunction.xyz/) Design Junction is a one-stop resource library for Designers and Creatives with curated list of best resources handpicked from around the web Humaaans: Mix-&-Match illustration library [](https://www.humaaans.com/) Mix-&-match illustrations of people with a design library for InVIsion Studio and Sketch. Stubborn - Free Illustrations Generator [](https://stubborn.fun/) Free illustrations generator for Figma and Sketch. Get the opportunity to design your characters using symbols and styles. Open Peeps, Hand-Drawn Illustration Library [](https://www.openpeeps.com/) Open Peeps is a hand-drawn illustration library to create scenes of people. You can use them in product illustration, marketing, comics, product states, user flows, personas, storyboarding, quinceañera invitations, or whatever you want! ⠀ Reshot | Free icons & illustrations [](https://www.reshot.com/) Design freely with instant downloads of curated SVG icons and vector illustrations. All free with commercial licensing. No attribution required. Blush: Illustrations for everyone [](https://blush.design/) Blush makes it easy to add free illustrations to your designs. Play with fully customizable graphics made by artists across the globe. Mockups Angle 4 - 5000+ Device Mockups for Figma, Sketch and XD [](https://angle.sh/) Vector mockups for iPhone, iPad, Android and Mac devices, including the new iPhone 13, Pro, Pro Max and Mini. Perfect for presenting your apps. Huge library of components, compositions, wallpapers and plugins made for Figma, Sketch and XD. Make Mockups, Logos, Videos and Designs in Seconds [](https://placeit.net/) Get unlimited downloads on all our 100K templates! You can make a logo, video, mockup, flyer, business card and social media image in seconds right from your browser. Free and premium tools for graphic designers | Lstore Graphics [](https://www.ls.graphics/) Free and premium mockups, UI/UX tools, scene creators for busy designers Logo Design & Brand Identity Platform for Entrepreneurs | Looka [](https://looka.com/) Logojoy is now Looka! Design a Logo, make a website, and create a Brand Identity you’ll love with the power of Artificial Intelligence. 100% free to use. Create stunning product mockups easily and online - Smartmockups [](https://smartmockups.com/) Smartmockups enables you to create stunning high-resolution mockups right inside your browser within one interface across multiple devices. Previewed - Free mockup generator for your app [](https://previewed.app/) Join Previewed to create stunning 3D image shots and animations for your app. Choose from hundreds of ready made mockups, or create your own. Free Design Software - Graphic Online Maker - Glorify [](https://www.glorify.com/) Create professional and high converting social media posts, ads, infographics, presentations, and more with Glorify, a free design software & graphic maker. Other BuiltWith Technology Lookup [](https://builtwith.com/) Web technology information profiler tool. Find out what a website is built with. Compress JPEG Images Online [](https://compressjpeg.com/) Compress JPEG images and photos for displaying on web pages, sharing on social networks or sending by email. PhotoRoom - Remove Background and Create Product Pictures [](https://www.photoroom.com/) Create product and portrait pictures using only your phone. Remove background, change background and showcase products. Magic Eraser - Remove unwanted things from images in seconds [](https://www.magiceraser.io/) Magic Eraser - Use AI to remove unwanted things from images in seconds. Upload an image, mark the bit you need removed, download the fixed up image. Compressor.io - optimize and compress JPEG photos and PNG images [](https://compressor.io/) Optimize and compress JPEG, PNG, SVG, GIF and WEBP images online. Compress, resize and rename your photos for free. Remove Video Background – Unscreen [](https://www.unscreen.com/) Remove the background of any video - 100% automatically, online & free! Goodbye Greenscreen. Hello Unscreen. Noun Project: Free Icons & Stock Photos for Everything [](https://thenounproject.com/) Noun Project features the most diverse collection of icons and stock photos ever. Download SVG and PNG. Browse over 5 million art-quality icons and photos. Design Principles [](https://principles.design/) An Open Source collection of Design Principles and methods Shapefest™ - A massive library of free 3D shapes [](https://www.shapefest.com/) A massive free library of beautifully rendered 3D shapes. 160,000+ high resolution PNG images in one cohesive library. Learning UX Degreeless.design - Everything I Learned in Design School [](https://degreeless.design/) This is a list of everything I've found useful in my journey of learning design, and an ongoing list of things I think you should read. For budding UX, UI, Interaction, or whatever other title designers. UX Tools | Practical UX skills and tools [](https://uxtools.co/) Lessons and resources from two full-time product designers. Built For Mars [](https://builtformars.com/) On a mission to help the world build better user experiences by demystifying UX. Thousands of hours of research packed into UX case studies. Case Study Club – Curated UX Case Study Gallery [](https://www.casestudy.club/) Case Study Club is the biggest curated gallery of the best UI/UX design case studies. Get inspired by industry-leading designers, openly sharing their UX process. The Guide to Design [](https://start.uxdesign.cc/) A self-guided class to help you get started in UX and answer key questions about craft, design, and career Uxcel - Where design careers are built [](https://app.uxcel.com/explore) Available on any device anywhere in the world, Uxcel is the best way to improve and learn UX design online in just 5 minutes per day. UI & UX Design Tips by Jim Raptis. [](https://www.uidesign.tips/) Learn UI & UX Design with practical byte-sized tips and in-depth articles from Jim Raptis. Entrepreneur Instant Username Search [](https://instantusername.com/#/) Instant Username Search checks out if your username is available on more than 100 social media sites. Results appear instantly as you type. Flourish | Data Visualization & Storytelling [](https://flourish.studio/) Beautiful, easy data visualization and storytelling PiPiADS - #1 TikTok Ads Spy Tool [](https://www.pipiads.com/) PiPiADS is the best tiktok ads spy tool .We provide tiktok advertising,advertising on tiktok,tiktok ads examples,tiktok ads library,tiktok ads best practices,so you can understand the tiktok ads cost and master the tiktok ads 2021 and tiktok ads manager. Minea - The best adspy for product search in ecommerce and dropshipping [](https://en.minea.com/) Minea is the ultimate e-commerce product search tool. Minea tracks all ads on all networks. Facebook Ads, influencer product placements, Snapspy, all networks are tracked. Stop paying adspy 149€ for one network and discover Minea. AdSpy [](https://adspy.com/) Google Trends [](https://trends.google.com/) ScoreApp: Advanced Quiz Funnel Marketing | Make a Quiz Today [](https://www.scoreapp.com/) ScoreApp makes quiz funnel marketing easy, so you can attract relevant warm leads, insightful data and increase your sales. Try for free today Mailmodo - Send Interactive Emails That Drive Conversions [](https://www.mailmodo.com/) Use Mailmodo to create and send interactive emails your customers love. Drive conversions and get better email ROI. Sign up for a free trial now. 185 Top E-Commerce Sites Ranked by User Experience Performance – Baymard Institute [](https://baymard.com/ux-benchmark) See the ranked UX performance of the 185 largest e-commerce sites in the US and Europe. The chart summarizes 50,000+ UX performance ratings. Metricool - Analyze, manage and measure your digital content [](https://metricool.com/) Social media scheduling, web analytics, link in bio and reporting. Metricool is free per live for one brand. START HERE Visualping: #1 Website change detection, monitoring and alerts [](https://visualping.io/) More than 1.5 millions users monitor changes in websites with Visualping, the No1 website change detection, website checker, webpage change monitoring and webpage change detection tool. Gumroad – Sell what you know and see what sticks [](https://gumroad.com/) Gumroad is a powerful, but simple, e-commerce platform. We make it easy to earn your first dollar online by selling digital products, memberships and more. Product Hunt – The best new products in tech. [](https://www.producthunt.com/) Product Hunt is a curation of the best new products, every day. Discover the latest mobile apps, websites, and technology products that everyone's talking about. 12ft Ladder [](https://12ft.io/) Show me a 10ft paywall, I’ll show you a 12ft ladder. namecheckr | Social and Domain Name Availability Search For Brand Professionals [](https://www.namecheckr.com/) Social and Domain Name Availability Search For Brand Professionals Excel AI Formula Generator - Excelformulabot.com [](https://excelformulabot.com/) Transform your text instructions into Excel formulas in seconds with the help of AI. Z-Library [](https://z-lib.org/) Global Print On Demand Platform | Gelato [](https://www.gelato.com/) Create and sell custom products online. With local production in 33 countries, easy integration, and 24/7 customer support, Gelato is an all-in-one platform. Freecycle: Front Door [](https://freecycle.org/) Free eBooks | Project Gutenberg [](https://www.gutenberg.org/) Project Gutenberg is a library of free eBooks. Convertio — File Converter [](https://convertio.co/) Convertio - Easy tool to convert files online. More than 309 different document, image, spreadsheet, ebook, archive, presentation, audio and video formats supported. Namechk [](https://namechk.com/) Crazy Egg Website — Optimization | Heatmaps, Recordings, Surveys & A/B Testing [](https://www.crazyegg.com/) Use Crazy Egg to see what's hot and what's not, and to know what your web visitors are doing with tools, such as heatmaps, recordings, surveys, A/B testing & more. Ifttt [](https://ifttt.com/) Also Asked [](https://alsoasked.com/) Business Name Generator - Easily create Brandable Business Names - Namelix [](https://namelix.com/) Namelix uses artificial intelligence to create a short, brandable business name. Search for domain availability, and instantly generate a logo for your new business Merch Informer [](https://merchinformer.com/) Headline Generator [](https://www.title-generator.com/) Title Generator: create 700 headlines with ONE CLICK: Content Ideas + Catchy Headlines + Ad Campaign E-mail Subject Lines + Emotional Titles. Simple - Efficient - One Click Make [](https://www.make.com/en) Create and add calculator widgets to your website | CALCONIC_ [](https://www.calconic.com/) Web calculator builder empowers you to choose from a pre-made templates or build your own calculator widgets from a scratch without any need of programming knowledge Boost Your Views And Subscribers On YouTube - vidIQ [](https://vidiq.com/) vidIQ helps you acquire the tools and knowledge needed to grow your audience faster on YouTube and beyond. Learn More Last Pass [](https://www.lastpass.com/) Starter Story: Learn How People Are Starting Successful Businesses [](https://www.starterstory.com/) Starter Story interviews successful entrepreneurs and shares the stories behind their businesses. In each interview, we ask how they got started, how they grew, and how they run their business today. How To Say No [](https://www.starterstory.com/how-to-say-no) Saying no is hard, but it's also essential for your sanity. Here are some templates for how to say no - so you can take back your life. Think with Google - Discover Marketing Research & Digital Trends [](https://www.thinkwithgoogle.com/) Uncover the latest marketing research and digital trends with data reports, guides, infographics, and articles from Think with Google. ClickUp™ | One app to replace them all [](https://clickup.com/) Our mission is to make the world more productive. To do this, we built one app to replace them all - Tasks, Docs, Goals, and Chat. The Manual [](https://manual.withcompound.com/) Wealth-planning resources for founders and startup employees Software for Amazon FBA Sellers & Walmart Sellers | Helium 10 [](https://www.helium10.com/) If you're looking for the best software for Amazon FBA & Walmart sellers on the market, check out Helium 10's capabilities online today! Buffer: All-you-need social media toolkit for small businesses [](https://buffer.com/) Use Buffer to manage your social media so that you have more time for your business. Join 160,000+ small businesses today. CPGD — The Consumer Packaged Goods Directory [](https://www.cpgd.xyz/) The Consumer Packaged Goods Directory is a platform to discover new brands and resources. We share weekly trends in our newsletter and partner with services to provide vetted, recommended platforms for our Directory brands. Jungle Scout [](https://www.junglescout.com/) BuzzSumo | The World's #1 Content Marketing Platform [](https://buzzsumo.com/) BuzzSumo powers the strategies of 500k+ marketers, with content marketing data on 8b articles, 42m websites, 300t engagements, 500k journalists & 492m questions. Login - Capital [](https://app.capital.xyz/) Raise, hold, spend, and send funds — all in one place. Marketing Pictory – Video Marketing Made Easy - Pictory.ai [](https://pictory.ai/) Pictory's powerful AI enables you to create and edit professional quality videos using text, no technical skills required or software to download. Tolstoy | Communicate with interactive videos [](https://www.gotolstoy.com/) Start having face-to-face conversations with your customers. Create Email Marketing Your Audience Will Love - MailerLite [](https://www.mailerlite.com/) Email marketing tools to grow your audience faster and drive revenue smarter. Get free access to premium features with a 30-day trial! Sign up now! Hypefury - Schedule & Automate Social Media Marketing [](https://hypefury.com/) Save time on social media while creating more value, and growing your audience faster. Schedule & automate your social media experience! Klaviyo: Marketing Automation Platform for Email & SMS [](https://www.klaviyo.com/) Klaviyo, an ecommerce marketing automation platform for email marketing and sms syncs your tech stack with your website store to scale your business. Online Email & Lead Scraper | Klean Leads [](https://www.kleanleads.com/) Klean Leads is an online email scraper & email address finder. Use it to book more appointments, get more replies, and close more sales. PhantomBuster [](https://phantombuster.com/) Call to Action Examples - 300+ CTA Phrases [](https://ctaexamples.com/) See the best CTA example in every situation covered by the library of 300+ CTA goals. Use the examples to create your own CTAs in minutes. Creative Center: one-stop creative solution for TikTok [](https://ads.tiktok.com/business/creativecenter/pc/en?from=001010) Come to get your next great idea for TikTok. Here you can find the best performing ads, viral videos, and trending hashtags across regions and verticals. Groove.cm GrooveFunnels, GrooveMail with CRM and Digital Marketing Automation Platform - Groove.cm with GrooveFunnels, GroovePages, GrooveKart [](https://groove.cm/) Groove is a website creator, page builder, sales funnel maker, membership site platform, email autoresponder, blog tool, shopping cart system, ecommerce store solution, affiliate manager, video marketing software and more apps to help build your online business. SurveyMonkey: The World’s Most Popular Free Online Survey Tool [](https://www.surveymonkey.com/) Use SurveyMonkey to drive your business forward by using our free online survey tool to capture the voices and opinions of the people who matter most to you. Video Maker | Create Videos Online | Promo.com [](https://promo.com/) Free customizable video maker to help boost your business. Video creator for ads, social media, product and explainer videos, and for anything else you need! beehiiv — The newsletter platform built for growth [](https://www.beehiiv.com/) Access the best tools available in email, helping your newsletter scale and monetize like never before. GetResponse | Professional Email Marketing for Everyone [](https://www.getresponse.com/) No matter your level of expertise, we have a solution for you. At GetResponse, it's email marketing done right. Start your free account today! Search Email Newsletter Archives : Email Tuna [](https://emailtuna.com/) Explore newsletters without subscribing. Get email design ideas, discount coupon codes and exclusive newsletters deals. Database of email newsletters archived from all over the internet. Other Tools Simplescraper — Scrape Websites and turn them into APIs [](https://simplescraper.io/) Web scraping made easy — a powerful and free Chrome extension for scraping websites in your browser, automated in the cloud, or via API. No code required. Exploding Topics - Discover the hottest new trends. [](https://explodingtopics.com/) See new market opportunities, trending topics, emerging technology, hot startups and more on Exploding Topics. Scribe | Visual step-by-step guides [](https://scribehow.com/) By capturing your process while you work, Scribe automatically generates a visual guide, ready to share with the click of a button. Get It Free – The internet's BEST place to find free stuff! [](https://getitfree.us/) The internet's BEST place to find free stuff! Inflact by Ingramer – Marketing toolkit for Instagram [](https://inflact.com/) Sell on Instagram, build your audience, curate content with the right set of tools. Free Online Form Builder & Form Creator | Jotform [](https://www.jotform.com/) We believe the right form makes all the difference. Go from busywork to less work with powerful forms that use conditional logic, accept payments, generate reports, and automate workflows. Manage Your Team’s Projects From Anywhere | Trello [](https://trello.com/en) Trello is the ultimate project management tool. Start up a board in seconds, automate tedious tasks, and collaborate anywhere, even on mobile. TikTok hashtag generator - tiktokhashtags.com [](https://tiktokhashtags.com/) Find out which are the best hashtags for your TikTok post. Create Infographics, Reports and Maps - Infogram [](https://infogram.com/) Infogram is an easy to use infographic and chart maker. Create and share beautiful infographics, online reports, and interactive maps. Make your own here. Confetto - Create Instagram content in minutes [](https://www.confet.to/) Confetto is an all-in-one social media marketing tool built for SMBs and Social Media Managers. Confetto helps you create high-quality content for your audience that maximizes your reach and engagement on social media. Design, copy-write, plan and schedule content all in one place. Find email addresses in seconds • Hunter (Email Hunter) [](https://hunter.io/) Hunter is the leading solution to find and verify professional email addresses. Start using Hunter and connect with the people that matter for your business. PlayPhrase.me: Site for cinema archaeologists. [](https://playphrase.me/) Travel and explore the world of cinema. Largest collection of video quotes from movies on the web. #1 Free SEO Tools → SEO Review Tools [](https://www.seoreviewtools.com/) SEO Review Tools: 42+ Free Online SEO Tools build with ❤! → Rank checker → Domain Authority Checker → Keyword Tool → Backlink Checker Podcastle: Seamless Podcast Recording & Editing [](https://podcastle.ai/) Podcastle is the simplest way to create professional-quality podcasts. Record, edit, transcribe, and export your content with the power of AI, in an intuitive web-based platform. Save Ads from TikTok & Facebook Ad Library - Foreplay [](https://www.foreplay.co/) The best way to save ads from TikTok Creative Center and Facebook Ad Library, Organize them into boards and share ad inspiration with your team. Supercharge your creative strategy. SiteRight - Automate Your Business [](https://www.siteright.co/) SiteRight combines the abilities of multiple online resources into a single dashboard allowing you to have full control over how you manage your business. Diffchecker - Compare text online to find the difference between two text files [](https://www.diffchecker.com/) Diffchecker will compare text to find the difference between two text files. Just paste your files and click Find Difference! Yout.com [](https://yout.com/) Yout.com allows you to record videos from YouTube, FaceBook, SoundCloud, VK and others too many formats with clipping. Intuitively easy to use, with Yout the Internet DVR, with a bit of extra. AI Content Generation | Competitor Analysis - Predis.ai [](https://predis.ai/) Predis helps brands and influencers communicate better on social media by providing AI-powered content strategy analysis, content and hashtag recommendations. Castr | #1 Live Video Streaming Solution With Video Hosting [](https://castr.io/) Castr is a live video streaming solution platform that delivers enterprise-grade live videos globally with CDN. Live event streaming, video hosting, pre-recorded live, multi stream – all in one place using Castr. Headliner - Promote your podcast, radio show or blog with video [](https://www.headliner.app/) Easily create videos to promote your podcast, radio show or blog. Share to Instagram, Facebook, Twitter, YouTube, Linkedin and anywhere video lives Create Presentations, Infographics, Design & Video | Visme [](https://www.visme.co/) Create professional presentations, interactive infographics, beautiful design and engaging videos, all in one place. Start using Visme today. Designrr - Create eBooks, Kindle books, Leadmagnets, Flipbooks and Blog posts from your content in 2 minutes [](https://designrr.io/) Upload any web page, MS Word, Video, Podcast or YouTube and it will create a stunning ebook and convert it to pdf, epub, Kindle or Flipbook. Quick and Easy to use. Full Training, 24x7 Support and Facebook Group Included. SwipeWell | Swipe File Software [](https://www.swipewell.app/) The only Chrome extension dedicated to helping you save, organize, and reference marketing examples (so you never feel stumped). Tango | Create how-to guides, in seconds [](https://www.tango.us/) Tango takes the pain out of documenting processes by automatically generating how-to guides while you work. Empower your team to do their best work. Ad Creative Bank [](https://www.theadcreativebank.com/) Get inspired by ads from across industries, learn new best practices, and start thinking creatively about your brand’s digital creative. Signature Hound • Free Email Signature and Template Generator [](https://signaturehound.com/) Our email signature generator is free and easy to use. Our customizable templates work with Gmail, Outlook, Office 365, Apple Mail and more. Organize All Of Your Marketing In One Place - CoSchedule [](https://coschedule.com/) Get more done in less time with the only work management software for marketers. B Ok - Books [](https://b-ok.xyz/categories) OmmWriter [](https://ommwriter.com/) Ommwriter Rebrandly | Custom URL Shortener, Branded Link Management, API [](https://www.rebrandly.com/) URL Shortener with custom domains. Shorten, brand and track URLs with the industry-leading link management platform. Free to try. API, Short URL, Custom Domains. Common Tools [](https://www.commontools.org/) Book Bolt [](https://bookbolt.io/) Zazzle [](https://www.zazzle.com/) InspiroBot [](https://inspirobot.me/) Download Free Cheat Sheets or Create Your Own! - Cheatography.com: Cheat Sheets For Every Occasion [](https://cheatography.com/) Find thousands of incredible, original programming cheat sheets, all free to download. No Code Chatbot Platform | Free Chatbot Platform | WotNot [](https://wotnot.io/) WotNot is the best no code chatbot platform to build AI bot easily without coding. Deploy bots and live chat on the Website, Messenger, WhatsApp, and more. SpyFu - Competitor Keyword Research Tools for Google Ads PPC & SEO [](https://www.spyfu.com/) Systeme.io - The only tool you need to launch your online business [](https://systeme.io/) Systeme.io has all the tools you need to grow your online business. Click here to create your FREE account! Productivity Temp Mail [](https://temp-mail.org/en/) The Visual Collaboration Platform for Every Team | Miro [](https://miro.com/) Scalable, secure, cross-device and enterprise-ready team collaboration whiteboard for distributed teams. Join 35M+ users from around the world. Grammarly: Free Online Writing Assistant [](https://www.grammarly.com/) Millions trust Grammarly’s free writing app to make their online writing clear and effective. Getting started is simple — download Grammarly’s extension today. Rize · Maximize Your Productivity [](https://rize.io/) Rize is a smart time tracker that improves your focus and helps you build better work habits. Motion | Manage calendars, meetings, projects & tasks in one app [](https://www.usemotion.com/) Automatically prioritize tasks, schedule meetings, and resolve calendar conflicts. Used by over 10k CEOs and professionals to improve focus, get more done, and streamline workday. Notion – One workspace. Every team. [](https://www.notion.so/) We’re more than a doc. Or a table. Customize Notion to work the way you do. Loom: Async Video Messaging for Work | Loom [](https://www.loom.com/) Record your screen, share your thoughts, and get things done faster with async video. Zapier | Automation that moves you forward [](https://zapier.com/) Workflow automation for everyone. Zapier automates your work across 5,000+ app integrations, so you can focus on what matters. Rows — The spreadsheet with superpowers [](https://rows.com/) Combine the power of a spreadsheet with built-in integrations from your business apps. Automate workflows and build tools that make work simpler. Free Online Form Builder | Tally [](https://tally.so/) Tally is the simplest way to create free forms & surveys. Create any type of form in seconds, without knowing how to code, and for free. Highbrow | Learn Something New Every Day. Join for Free! [](https://gohighbrow.com/) Highbrow helps you learn something new every day with 5-minute lessons delivered to your inbox every morning. Join over 400,000 lifelong learners today! Slick Write | Check your grammar. Proofread online. [](https://www.slickwrite.com/#!home) Slick Write is a powerful, FREE application that makes it easy to check your writing for grammar errors, potential stylistic mistakes, and other features of interest. Whether you're a blogger, novelist, SEO professional, or student writing an essay for school, Slick Write can help take your writing to the next level. Reverso [](https://www.reverso.net) Hemingway Editor [](https://hemingwayapp.com/) Web Apps by 123apps - Edit, Convert, Create [](https://123apps.com/) Splitbee – Your all-in-one analytics and conversion platform [](https://splitbee.io/) Track and optimize your online business with Splitbee. Analytics, Funnels, Automations, A/B Testing and more. PDF Tools Free PDF, Video, Image & Other Online Tools - TinyWow [](https://tinywow.com/) Smallpdf.com - A Free Solution to all your PDF Problems [](https://smallpdf.com/) Smallpdf - the platform that makes it super easy to convert and edit all your PDF files. Solving all your PDF problems in one place - and yes, free. Sejda helps with your PDF tasks [](https://www.sejda.com/) Sejda helps with your PDF tasks. Quick and simple online service, no installation required! Split, merge or convert PDF to images, alternate mix or split scans and many other. iLovePDF | Online PDF tools for PDF lovers [](https://www.ilovepdf.com/) iLovePDF is an online service to work with PDF files completely free and easy to use. Merge PDF, split PDF, compress PDF, office to PDF, PDF to JPG and more! Text rewrite QuillBot [](https://quillbot.com/) Pre Post SEO : Online SEO Tools [](https://www.prepostseo.com/) Free Online SEO Tools: plagiarism checker, grammar checker, image compressor, website seo checker, article rewriter, back link checker Wordtune | Your personal writing assistant & editor [](https://www.wordtune.com/) Wordtune is the ultimate AI writing tool that rewrites, rephrases, and rewords your writing! Trusted by over 1,000,000 users, Wordtune strengthens articles, academic papers, essays, emails and any other online content. Aliexpress alternatives CJdropshipping - Dropshipping from Worldwide to Worldwide! [](https://cjdropshipping.com/) China's reliable eCommerce dropshipping fulfillment supplier, helps small businesses ship worldwide, dropship and fulfillment services that are friendly to start-ups and small businesses, Shopify dropshipping. SaleHoo [](https://www.salehoo.com/) Alibaba.com: Manufacturers, Suppliers, Exporters & Importers from the world's largest online B2B marketplace [](https://www.alibaba.com/) Find quality Manufacturers, Suppliers, Exporters, Importers, Buyers, Wholesalers, Products and Trade Leads from our award-winning International Trade Site. Import & Export on alibaba.com Best Dropshipping Suppliers for US + EU Products | Spocket [](https://www.spocket.co/) Spocket allows you to easily start dropshipping top products from US and EU suppliers. Get started for free and see why Spocket consistently gets 5 stars. Best dropshipping supplier to the US [](https://www.usadrop.com/) THE ONLY AMERICAN-MADE FULFILLMENT CENTER IN CHINA. Our knowledge of the Worldwide dropshipping market and the Chinese Supply-Chain can't be beat! 阿里1688 [](https://www.1688.com/) 阿里巴巴(1688.com)是全球企业间(B2B)电子商务的著名品牌,为数千万网商提供海量商机信息和便捷安全的在线交易市场,也是商人们以商会友、真实互动的社区平台。目前1688.com已覆盖原材料、工业品、服装服饰、家居百货、小商品等12个行业大类,提供从原料--生产--加工--现货等一系列的供应产品和服务 Dropshipping Tools Oberlo | Where Self Made is Made [](https://www.oberlo.com/) Start selling online now with Shopify. All the videos, podcasts, ebooks, and dropshipping tools you'll need to build your online empire. Klaviyo: Marketing Automation Platform for Email & SMS [](https://www.klaviyo.com/) Klaviyo, an ecommerce marketing automation platform for email marketing and sms syncs your tech stack with your website store to scale your business. SMSBump | SMS Marketing E-Commerce App for Shopify [](https://smsbump.com/) SMSBump is an SMS marketing & automation app for Shopify. Segment customers, recover orders, send campaign text messages with a 35%+ click through rate. AfterShip: The #1 Shipment Tracking Platform [](https://www.aftership.com/) Order status lookup, branded tracking page, and multi-carrier tracking API for eCommerce. Supports USPS, FedEx, UPS, and 900+ carriers worldwide. #1 Dropshipping App | Zendrop [](https://zendrop.com/) Start and scale your own dropshipping business with Zendrop. Sell and easily fulfill your orders with the fastest shipping in the industry. Best Dropshipping Suppliers for US + EU Products | Spocket [](https://www.spocket.co/) Spocket allows you to easily start dropshipping top products from US and EU suppliers. Get started for free and see why Spocket consistently gets 5 stars. Video Editing Jitter • The simplest motion design tool on the web. [](https://jitter.video/) Animate your designs easily. Export your creations as videos or GIFs. All in your browser. DaVinci Resolve 18 | Blackmagic Design [](https://www.blackmagicdesign.com/products/davinciresolve) Professional video editing, color correction, visual effects and audio post production all in a single application. Free and paid versions for Mac, Windows and Linux. Online Video Editor | Video Creator | InVideo [](https://invideo.io/) InVideo's Online Video Editor Helps You Make Professional Videos From Premium Templates, Images, And Music. All your video needs in one place | Clipchamp [](https://clipchamp.com/) Fast-forward your creations with our video editing platform. Start with a video template or record your webcam or screen. Get the pro look with filters, transitions, text and more. Then, export in minutes and share in an instant. Descript | All-in-one audio/video editing, as easy as a doc. [](https://www.descript.com/) Record, transcribe, edit, mix, collaborate, and master your audio and video with Descript. Download for free →. Kapwing — Reach more people with your content [](https://www.kapwing.com/) Kapwing is a collaborative, online content creation platform that you can use to edit video and create content. Join over 10 million modern creators who trust Kapwing to create, edit, and grow their content on every channel. Panzoid [](https://panzoid.com/) Powerful, free online apps and community for creating beautiful custom content. Google Web Designer - Home [](https://webdesigner.withgoogle.com/) Kapwing — Reach more people with your content [](https://www.kapwing.com/) Kapwing is a collaborative, online content creation platform that you can use to edit video and create content. Join over 10 million modern creators who trust Kapwing to create, edit, and grow their content on every channel. ClipDrop [](https://clipdrop.co/) Create professional visuals without a photo studio CapCut [](https://www.capcut.com/) CapCut is an all-in-one online video editing software which makes creation, upload & share easier, with frame by frame track editor, cloud drive etc. VEED - Online Video Editor - Video Editing Made Simple [](https://www.veed.io/) Make stunning videos with a single click. Cut, trim, crop, add subtitles and more. Online, no account needed. Try it now, free. VEED Free Video Maker | Create & Edit Your Videos Easily - Animoto [](https://animoto.com/k/welcome) Create, edit, and share videos with our online video maker. Combine your photos, video clips, and music to make quality videos in minutes. Get started free! Runway - Online Video Editor | Everything you need to make content, fast. [](https://runwayml.com/) Discover advanced video editing capabilities to take your creations to the next level. CreatorKit - A.I. video creator for marketers [](https://creatorkit.com/) Create videos with just one click, using our A.I. video editor purpose built for marketers. Create scroll stopping videos, Instagram stories, Ads, Reels, and TikTok videos. Pixar in a Box | Computing | Khan Academy [](https://www.khanacademy.org/computing/pixar) 3D Video Motions Plask - AI Motion Capture and 3D Animation Tool [](https://plask.ai/) Plask is an all-in-one browser-based AI motion capture tool and animation editor that anybody can use, from motion designers to every day content creators. Captions Captions [](https://www.getcaptions.app/) Say hello to Captions, the only camera and editing app that automatically transcribes, captions and clips your talking videos for you. Stock videos Pexels [](https://www.pexels.com/) Pixabay [](https://pixabay.com/) Mixkit - Awesome free assets for your next video project [](https://mixkit.co/) Download Free Stock Video Footage, Stock Music & Premiere Pro Templates for your next video editing project. All assets can be downloaded for free! Free Stock Video Footage HD 4K Download Royalty-Free Clips [](https://www.videvo.net/) Download free stock video footage with over 300,000 video clips in 4K and HD. We also offer a wide selection of music and sound effect files with over 180,000 clips available. Click here to download royalty-free licensing videos, motion graphics, music and sound effects from Videvo today. Free Stock Video Footage HD Royalty-Free Videos Download [](https://mazwai.com/) Download free stock video footage with clips available in HD. Click here to download royalty-free licensing videos from Mazwai now. Royalty Free Stock Video Footage Clips | Vidsplay.com [](https://www.vidsplay.com/) Royalty Free Stock Video Footage Clips Free Stock Video Footage, Royalty Free Videos for Download [](https://coverr.co/) Download royalty free (for personal and commercial use), unique and beautiful video footage for your website or any project. No attribution required. Stock Photos Beautiful Free Images & Pictures | Unsplash [](https://unsplash.com/) Beautiful, free images and photos that you can download and use for any project. Better than any royalty free or stock photos. When we share, everyone wins - Creative Commons [](https://creativecommons.org/) Creative Commons licenses are 20! Honoring 20 years of open sharing using CC licenses, join us in 2022 to celebrate Better Sharing — advancing universal access to knowledge and culture, and fostering creativity, innovation, and collaboration. Help us reach our goal of raising $15 million for a future of Better Sharing.  20 Years of Better … Read More "When we share, everyone wins" Food Pictures • Foodiesfeed • Free Food Photos [](https://www.foodiesfeed.com/) Download 2000+ food pictures ⋆ The best free food photos for commercial use ⋆ CC0 license Free Stock Photos and Images for Websites & Commercial Use [](https://burst.shopify.com/) Browse thousands of beautiful copyright-free images. All our pictures are free to download for personal and commercial use, no attribution required. EyeEm | Authentic Stock Photography and Royalty-Free Images [](https://www.eyeem.com/) Explore high-quality, royalty-free stock photos for commercial use. License individual images or save money with our flexible subscription and image pack plans. picjumbo: Free Stock Photos [](https://picjumbo.com/) Free stock photos and images for your projects and websites.️ Beautiful 100% free high-resolution stock images with no watermark. Free Stock Photos, Images, and Vectors [](https://www.stockvault.net/) 139.738 free stock photos, textures, backgrounds and graphics for your next project. No attribution required. Free Stock Photos, PNGs, Templates & Mockups | rawpixel [](https://www.rawpixel.com/) Free images, PNGs, stickers, backgrounds, wallpapers, graphic templates and PSD mockups. All safe to use with commercial licenses. Free Commercial Stock Photos & Royalty Free Images | PikWizard [](https://pikwizard.com/) Free images, videos & free stock photos. Unlimited downloads ✓ Royalty-free Images ✓Copyright-free for commercial use ✓ No Attribution Required Design Bundles [](https://designbundles.net/) Stock music Royalty Free Music for video creators | Epidemic Sound [](https://www.epidemicsound.com/) Download premium Royalty free Music and SFX! Our free trial gives you access to over 35,000 tracks and 90,000 sound effects for video, streaming and more! Royalty-Free Music & SFX for Video Creators | Artlist [](https://artlist.io/) Explore the ultimate royalty-free music & sound effects catalogs for unlimited use in YouTube videos, social media & films created by inspiring indie artists worldwide. The go-to music licensing choice for all creators Royalty Free Audio Tracks - Envato Elements [](https://elements.envato.com/audio) Download Royalty Free Stock Audio Tracks for your next project from Envato Elements. Premium, High Quality handpicked Audio files ideal for any genre. License popular music for videos • Lickd [](https://lickd.co/) The only place you can license popular music for videos. Access 1M+ mainstream tracks, plus high-quality stock music for content creators NCS (NoCopyrightSounds) - free music for content creators [](https://ncs.io/) NCS is a Record Label dedicated to giving a platform to the next generation of Artists in electronic music, representing genres from house to dubstep via trap, drum & bass, electro pop and more. Search Engine Optimization Keyword Tool For Monthly Search Volume, CPC & Competition [](https://keywordseverywhere.com/) Keywords Everywhere is a browser add-on for Chrome & Firefox that shows search volume, CPC & competition on multiple websites. Semrush - Online Marketing Can Be Easy [](https://www.semrush.com/) Turn the algorithm into a friend. Make your business visible online with 55+ tools for SEO, PPC, content, social media, competitive research, and more. DuckDuckGo — Privacy, simplified. [](https://duckduckgo.com/) The Internet privacy company that empowers you to seamlessly take control of your personal information online, without any tradeoffs. SEO Software for 360° Analysis of Your Website [](https://seranking.com/) Leading SEO software for business owners, agencies, and SEO specialists. Track your rankings, monitor competitors, spot technical errors, and more. Skyrocket your organic traffic with Surfer [](https://surferseo.com/) Use Surfer to research, write, optimize, and audit! Everything you need to create a comprehensive content strategy that yields real results is right here. Ahrefs - SEO Tools & Resources To Grow Your Search Traffic [](https://ahrefs.com/) You don't have to be an SEO pro to rank higher and get more traffic. Join Ahrefs – we're a powerful but easy to learn SEO toolset with a passionate community. Neon Tools [](https://neontools.io/) Google Index Search [](https://lumpysoft.com/) Google Index Search SEO Backlink Checker & Link Building Toolset | Majestic.com [](https://majestic.com/) Develop backlink strategies with our Link Intelligence data, build the strongest SEO backlink campaigns to drive organic traffic and boost your rankings today. PageOptimizer Pro [](https://pageoptimizer.pro/) Plans Services SEO Consulting Learn SEO About Blog POP SEO Community Podcast Support POP On Page Workshops With Kyle Roof POP Chrome Extension Guide Tutorial Videos Frequently Asked Questions Best Practices Login Cancel Anytime Plans Services SEO Consulting Learn SEO About Blog POP SEO Community Podcast Support POP On Page… Keyword Chef - Keywords for Publishers [](https://keywordchef.com/) Rank Insanely Fast for Keywords Your Competition Can’t Find “Every long-tail keyword I find ends up ranking within a day” – Dane Eyerly, Owner at TextGoods.com Keyword Chef automatically finds and filters keywords for you. Real-time SERP analysis lets you find keywords nearly guaranteed to rank. Try for free → Let’s face it, most keyword tools ... Read more Notifier - Social Listening for Social Media and More! [](https://notifier.so/) Track keywords. Market your product for free. Drive the conversation. Easy. Free Trial. No obligation ever. Simple. Fast. Trusted by Top Companies. Free Keyword Research Tool from Wordtracker [](https://www.wordtracker.com/) The best FREE alternative to the Keyword Planner. Use Wordtracker to reveal 1000s of profitable longtail keywords with up to 10,000 results per search Blog Posts The 60 Hottest Front-end Tools of 2021 | CSS-Tricks - CSS-Tricks [](https://css-tricks.com/hottest-front-end-tools-in-2021/) A complete list of the most popular front-end tools in 2021, according to the Web Tools Weekly newsletter. See which resources made the list. Resume ResumeGlow - AI Powered Resume Builder [](https://resumeglow.com/) Get hired fast with a resume that grabs attention. Designed by a team of HR experts and typographers. Customizable templates with more than a million possible Create Your Job-winning Resume - (Free) Resume maker · Resume.io [](https://resume.io/) Free online resume maker, allows you to create a perfect Resume or Cover Letter in 5 minutes. See how easy it is to write a professional resume - apply for jobs today! Rezi - The Leading AI-Powered Free Resume Builder [](https://www.rezi.ai/) Rezi’s award-winning AI-powered resume builder is trusted by hundreds of thousands of job seekers. Create your perfect resume in minutes with Rezi. Create a Perfect Resume | Free Resume Builder | Resumaker.ai [](https://resumaker.ai/) Create your professional resume with this online resume maker. Choose a designer-made template and grab any employer attention in seconds. Trusted AI Resume Maker Helps You Get Hired Fast [](https://skillroads.com/) Reach a 96.4% success rate in the job hunt race with the best resume creator. Our innovative technologies and 24/7 support help you to become a perfect candidate for any job. Do not lose your chance to become the One. Kickresume | Best Online Resume & Cover Letter Builder [](https://www.kickresume.com/) Create your best resume yet. Online resume and cover letter builder used by 1,300,000 job seekers worldwide. Professional templates approved by recruiters. ResumeMaker.Online | Create a Professional Resume for Free [](https://www.resumemaker.online/) Save time with the easiest-to-use Resume Maker Online. Create an effective resume in just minutes and land your dream job. No Sign-up required, start now! Interviews Interview Warmup - Grow with Google [](https://grow.google/certificates/interview-warmup/) A quick way to prepare for your next interview. Practice key questions, get insights about your answers, and get more comfortable interviewing. No code website builder Carrd - Simple, free, fully responsive one-page sites for pretty much anything [](https://carrd.co/) A free platform for building simple, fully responsive one-page sites for pretty much anything. Webflow: Create a custom website | No-code website builder [](https://webflow.com/) Create professional, custom websites in a completely visual canvas with no code. Learn how to create a website by trying Webflow for free! Google Sites: Sign-in [](https://sites.google.com/) FlutterFlow - Build beautiful, modern apps incredibly fast! [](https://flutterflow.io/) FlutterFlow lets you build apps incredibly fast in your browser. Build fully functional apps with Firebase integration, API support, animations, and more. Export your code or even easier deploy directly to the app stores! Free Website Builder: Build a Free Website or Online Store | Weebly [](https://www.weebly.com/) Weebly’s free website builder makes it easy to create a website, blog, or online store. Find customizable templates, domains, and easy-to-use tools for any type of business website. Glide • No Code App Builder • Nocode Application Development [](https://www.glideapps.com/) Create the apps your business needs, without coding, waiting or overpaying. Get started for free and build an app today Adalo - Build Your Own No Code App [](https://www.adalo.com/) Adalo makes creating apps as easy as putting together a slide deck. Turn your idea into a real native app — no code needed! Siter.io - The collaborative web design tool, no-code website builder [](https://siter.io/) Siter.io is a visual website builder for designers. Prototype, design, and create responsive websites in the browser. Work together with your team in one place. Elementor: #1 Free WordPress Website Builder | Elementor.com [](https://elementor.com/) Elementor is the platform web creators choose to build professional WordPress websites, grow their skills, and build their business. Start for free today! No code app builder | Bravo Studio [](https://www.bravostudio.app/) Your no-code mobile app builder for iOS and Android. Create MVP’s, validate ideas and publish on App Store and Google Play Store. Home [](https://typedream.com/) The simplest way to build a website with no-code, as easy as writing on Notion. Try Typedream for free and upgrade for custom domains, collaborators, and unlimited pages. Free Website Builder | Create a Free Website | Wix.com [](https://www.wix.com/) Create a website with Wix’s robust website builder. With 900+ strategically designed templates and advanced SEO and marketing tools, build your brand online today. Free responsive Emails & Landing Pages drag-and-drop Editor | BEE [](https://beefree.io/) Free responsive emails and landing pages editor. With BEE drag-and-drop builders embedded in many software applications you can start designing now! Home [](https://typedream.com/) The simplest way to build a website with no-code, as easy as writing on Notion. Try Typedream for free and upgrade for custom domains, collaborators, and unlimited pages. Ownit Connected Checkout [](https://www.ownit.co/) Ownit Connected Checkout Bookmark.com | No-code Website Builder to Start Your Business [](https://www.bookmark.com/) Our AI powered platform ensures your business is future proof. Try Bookmark for free. The best way to build web apps without code | Bubble [](https://bubble.io/) Bubble introduces a new way to build software. It’s a no-code tool that lets you build SaaS platforms, marketplaces and CRMs without code. Bubble hosts all web apps on its cloud platform. Responsive Web Design | Website Creation | Editor X [](https://www.editorx.com/) Experience the future of website design with responsive layouts, CSS precision and smooth drag and drop. Create a Website for Free. Tilda Website Builder [](https://tilda.cc/) Create a website, online store, landing page with Tilda intuitive website builder. Build your site from hundreds of pre-designed templates and publish it today. No code required. No-code headless commerce and websites | Unstack Inc. [](https://www.unstack.com/) Deploy high performance eCommerce storefronts and websites without the engineering overhead using Unstack's no-code CMS Best Drag-and-Drop Website Builder | Jemi [](https://jemi.so/) The modern website builder for creatives, entrepreneurs, and dreamers. Build a beautiful link in bio site, portfolio, or landing page in minutes. No-code website builder that works like Notion [](https://popsy.co/) Create a beautiful no-code website in minutes. Popsy works just like Notion but is built from the ground up for building websites. Choose a free template. Edit content just like in Notion. Customize styles without code. Free Notion icons and illustrations. Unbounce - The Landing Page Builder & Platform [](https://unbounce.com/) Grow your relevance, leads, and sales with Unbounce. Use Unbounce to easily create and optimize landing pages for your small business and boost conversions with AI insights. Low-code Front-end Design & Development Platform | TeleportHQ [](https://teleporthq.io/) Front-end development platform, with a visual builder and headless content modelling capabilities. Static website creation, and UI development tools. Other tools used in no code website MemberSpace - Turn any part of your website into members-only with just a few clicks [](https://www.memberspace.com/) Create memberships on your website for anything you want like courses, video tutorials, member directories, and more while having 100% control over look & feel. Triggre | The number one true no-code platform to run your business [](https://www.triggre.com/) The best no-code platform to create highly advanced business applications in hours, without programming. Try it now for free! No code game builder Welcome to Buildbox [](https://signup.buildbox.com/) Welcome to Buildbox Flowlab Game Creator - Make games online [](https://flowlab.io/) Flowlab is an online game creator. Make your own games to share with friends. Make 2D Games With GameMaker | Free Video Game Maker [](https://gamemaker.io/) Make a game with GameMaker, the best free video game engine. Perfect for beginners and professionals. Learn to build your own 2D games with our simple tutorials. Side Hustle Side Hustle Stack [](https://sidehustlestack.co/) Side Hustle Stack is a resource for finding platform-based work, ranging from gig work and side hustles to platforms that help you start a small business that can grow. Fiverr [](https://www.fiverr.com/) Remotasks: Work From Home, Online Bootcamp Training [](https://www.remotasks.com/en) Make money doing tasks. Start earning today! Free bootcamp training offered online. Sign up for a free Remotasks account and work from home. Earn up to $200/month. Transcribe Speech to Text | Rev [](https://www.rev.com/) Transcribe Speech to Text with Rev. Reach your audience with clear and accurate captions, transcripts, and subtitles. AI Training Data and other Data Management Services [](https://www.clickworker.com/) AI training data, SEO texts, web research, tagging, surveys and more - Use the crowdsourcing principle with the power of >4.5M Clickworkers. Automate your Busy Work - Byron People-Powered Assistants [](https://www.hibyron.com/) Byron is an on demand US based virtual assistant platform that gives individuals and teams the ability to quickly outsource their non-essential tasks. Jobs Websites - Remote Latest Crypto Jobs, Web3 Jobs and Blockchain Jobs in the leading tech companies. [](https://cryptojobslist.com/) New Cryptocurrency Jobs, Web3 Jobs and Blockchain Jobs on CryptoJobsList — the leading site to find and post jobs. Connect with companies hiring in a few clicks and begin your next experience in the industry. Updated daily. Remote Jobs: Design, Marketing, Programming, Writing & More [](https://justremote.co/) Discover Remote Jobs from around the world. Give up the commute, work remotely and do what you love, daily, from anywhere. Find your perfect remote development, design, sales or marketing job today. Remote Ok [](https://remoteok.com/) Hire Freelancers & Remote Workers For Free [](https://talent.hubstaff.com/) Find and hire the highest quality freelancers from around the world - for free. Choose from thousands of developers, digital marketers, creatives and more. We Work Remotely: Remote jobs in design, programming, marketing and more [](https://weworkremotely.com/) Find the most qualified people in the most unexpected places: Hire remote! We Work Remotely is the best place to find and list remote jobs that aren't restricted by commutes or a particular geographic area. Browse thousands of remote work jobs today. Angel [](https://angel.co/) Remote Work: Jobs, Companies & Virtual Teams - Remote.co [](https://remote.co/) Remote.co is the definitive remote work job board for online job seekers and companies hiring. Start your remote job search here! FlexJobs: Best Remote Jobs, Work from Home Jobs, Online Jobs & More [](https://www.flexjobs.com/) The #1 job search site for hand-screened flexible and remote jobs (work from home jobs) since 2007. Plus get resume, coaching and career help. Join today! Remote jobs remotefront.io [](https://remotefront.io/) All remote jobs at remotefront.io Daily Virtual Events Helping You Grow Professionally [](https://powertofly.com/) PowerToFly is where you receive expert career advice, free video training, coaching and exclusive access to jobs and events at top companies. Best Remote and Work from Home Jobs - Virtual Vocations [](https://www.virtualvocations.com/) Best work from home jobs and remote jobs in over 50 categories for professionals, digital nomads, telecommuting workers and entry level jobseekers. Education, healthcare, medical, customer support and tech job openings. Remote Jobs | Working Nomads [](https://www.workingnomads.com/jobs) Remote jobs for digital working nomads. Start your telecommuting career and work remotely from home or places around the world. Job Search, Companies Hiring Near Me, and Advice | The Muse [](https://www.themuse.com/) Find jobs at the best companies hiring near you and get free career advice. Startupers [](https://www.startupers.com/) NoDesk - Where Everyone Works Remote [](https://nodesk.co/) Browse and apply to the best new remote jobs at leading remote companies and startups for free. Join hundreds of companies that use NoDesk to build their remote teams. Browser Extensions Blackbox - Select. Copy. Paste & Search - Magazinul web Chrome [](https://chrome.google.com/webstore/detail/blackbox-select-copy-past/mcgbeeipkmelnpldkobichboakdfaeon) Fastest Way to Copy Text from Videos & Images Octotree - GitHub code tree - Magazinul web Chrome [](https://chrome.google.com/webstore/detail/octotree-github-code-tree/bkhaagjahfmjljalopjnoealnfndnagc) GitHub on steroids WhatFont - Chrome Web Store [](https://chrome.google.com/webstore/detail/whatfont/jabopobgcpjmedljpbcaablpmlmfcogm?hl=en) The easiest way to identify fonts on web pages. Window Resizer - Chrome Web Store [](https://chrome.google.com/webstore/detail/window-resizer/kkelicaakdanhinjdeammmilcgefonfh?hl=en) Resize the browser window to emulate various screen resolutions. Amino: CSS Editor - Magazinul web Chrome [](https://chrome.google.com/webstore/detail/amino-css-editor/pbcpfbcibpcbfbmddogfhcijfpboeaaf) Live CSS Editor. Write custom CSS for any website and see your changes in real time. Checkbot: SEO, Web Speed & Security Tester 🚀 - Chrome Web Store [](https://chrome.google.com/webstore/detail/checkbot-seo-web-speed-se/dagohlmlhagincbfilmkadjgmdnkjinl?hl=en) Test SEO/speed/security of 100s of pages in a click! Check broken links, HTML/JavaScript/CSS, URL redirects, duplicate titles... Honey: Automatic Coupons & Rewards - Magazinul web Chrome [](https://chrome.google.com/webstore/detail/honey-automatic-coupons-r/bmnlcjabgnpnenekpadlanbbkooimhnj) Save money and earn rewards when you shop online. Tango: screenshots, training, & documentation - Magazinul web Chrome [](https://chrome.google.com/webstore/detail/tango-screenshots-trainin/lggdbpblkekjjbobadliahffoaobaknh) Automatically create beautiful step-by-step guides with screenshots, in seconds. No code browser automation | axiom.ai [](https://axiom.ai/) Build browser bots quickly, without code. Automate website actions and repetitive tasks using just your browser, on any website or web app. No Code Browser extensions builder Bildr - Visual Web Development in your Browser [](https://www.bildr.com/) Visually build SaaS products, Chrome extensions, and web3 dApps Other Repurposing content for social media the easy way » Repurpose.io [](https://repurpose.io/) Repurposing content for social media made easy. Automatically repurpose YouTube, TikTok, Lives, Podcasts, and Zoom calls. Try it for FREE. Smart Serials: Your serial numbers database [](https://smartserials.com/) This is your main source of free serial numbers, unlock keys in a clean environment safe to browse by all ages. Old versions of Windows, Mac and Linux Software, Apps & Abandonware Games - Download at OldVersion.com [](http://www.oldversion.com/) Online Room Planner - Design Your Room [](http://www.planyourroom.com/) Planyourroom.com is a wonderful website to redesign each room in your house by picking out perfect furniture options to fit your unique space. BoredHumans.com - Fun AI Programs You Can Use Online [](https://boredhumans.com/) Fun AI programs you can use online. AI games, fake people, computer generated art, machine learning demos, and more. BNProject | Home [](https://buynothingproject.org/) Open Source Alternatives to Proprietary Software [](https://www.opensourcealternative.to/) Discover 400+ popular open source alternatives to proprietary SaaS. URL Shortener - Short URLs & Custom Free Link Shortener | Bitly [](https://bitly.com/) Bitly’s Connections Platform is more than a free URL shortener, with robust link management software, advanced QR Code features, and a Link-in-bio solution. TinEye Reverse Image Search [](https://tineye.com/) Good Books | Books recommended by successful people [](https://www.goodbooks.io/) Looking for the best books to read in 2022? Discover the best book recommendations from the world's most successful, influential and interesting people. Directory - Website Recommendations [](https://tokapps.com/directory/) 0 TRIED & TESTED WEBSITES LISTED Insanely Useful Websites A combination of useful websites for businesses, freelancers, DIYers, and individuals in a centralised area.All websites have been tried and tested. Filter Websites Audio Business Tools Copywriting Design Entertainment Graphics Guides Health Marketing PC Resources Savings SEO Software Travel Video Apply filter Watch Anime Online, Free Anime Streaming Online on Zoro.to Anime Website [](https://zoro.to/) Zoro is a Free anime streaming website which you can watch English Subbed and Dubbed Anime online with No Account and Daily update. WATCH NOW! Animated Drawings [](https://sketch.metademolab.com/) Bring children's drawings to life, by animating characters to move around! Alternativeto [](https://alternativeto.net/) Chatroulette [](https://chatroulette.com/) Random meetings around the world Tiktok Downloader - Download Video tiktok Without Watermark - SnapTik [](https://snaptik.app/en) TikTok Video Downloader - SnapTik.App is one of the best free Download video Tiktok No Watermark tool available online. You can download TikTok video from any device you have. Imgflip - Create and Share Awesome Images [](https://imgflip.com/) Flip through memes, gifs, and other funny images. Make your own images with our Meme Generator or Animated GIF Maker. Fake Text Message | Make Fake Text Conversation [](https://ifaketextmessage.com/) Fake Text Message is a tool to create a Fake Text Conversation and a Fake iMessage. ✂Templatemaker ︎ [](https://www.templatemaker.nl/en/) Omni Calculator [](https://www.omnicalculator.com/) Omni Calculator solves 2960 problems anywhere from finance and business to health. It’s so fast and easy you won’t want to do the math again! Watch Movies Online Free | Watch Series HD Free [](https://hdtoday.tv/) Free Access to the Biggest library of HD Movies and HD Series online - NO ADS - No Account Required - Fast Free Streaming Students Answers - The Most Trusted Place for Answering Life's Questions [](https://www.answers.com/) Answers is the place to go to get the answers you need and to ask the questions you want Wolfram|Alpha: Computational Intelligence [](https://www.wolframalpha.com/) Compute answers using Wolfram's breakthrough technology & knowledgebase, relied on by millions of students & professionals. For math, science, nutrition, history, geography, engineering, mathematics, linguistics, sports, finance, music… Online Math Tools - Simple, free and easy to use math utilities [](https://onlinemathtools.com/) World's simplest collection of useful mathematics utilities. Generate number sequences, draw fractals, do quick matrix and numerical calculations and more! edX | Free Online Courses by Harvard, MIT, & more | edX [](https://www.edx.org/) Access 2000 free online courses from 140 leading institutions worldwide. Gain new skills and earn a certificate of completion. Join today. Sci-Hub [](https://sci-hub.hkvisa.net/) Sci-Hub,mg.scihub.ltd,sci-hub.tw,The project is supported by user donations. Imagine the world with free access to knowledge for everyone ‐ a world without any paywalls. DigitalDefynd - Find the Best + Free Courses Online [](https://digitaldefynd.com/) 4 Million+ Learners | 96,000+ Courses | 45,000+ Free Courses | 1200+ Free Certificates Learn Anything [](https://learn-anything.xyz/) Search Interactive Mind Maps to learn anything HubSpot Academy - Homepage [](https://academy.hubspot.com/) HubSpot Academy is the worldwide leader in inbound marketing, sales, and customer service/support training.

airflow-tutorial
github
LLM Vibe Score0.508
Human Vibe Score0.13240553426231688
hgrifJan 19, 2025

airflow-tutorial

Airflow tutorial This tutorial is loosely based on the Airflow tutorial in the official documentation. It will walk you through the basics of setting up Airflow and creating an Airflow workflow. This tutorial was published on the blog of GoDataDriven. Setup You can skip this section if Airflow is already set up. Make sure that you can run airflow commands, know where to put your DAGs and have access to the web UI. Install Airflow Airflow is installable with pip via a simple pip install apache-airflow. Either use a separate python virtual environment or install it in your default python environment. To use the conda virtual environment as defined in environment.yml in this git-repo: Install miniconda. Make sure that conda is on your path: Create the virtual environment from environment.yml: Activate the virtual environment: You should now have an (almost) working Airflow installation. Alternatively, install Airflow yourself by running: Airflow used to be packaged as airflow but is packaged as apache-airflow since version 1.8.1. Make sure that you install any extra packages with the right Python package: e.g. use pip install apache-airflow[dask] if you've installed apache-airflow and do not use pip install airflow[dask]. Leaving out the prefix apache- will install an old version of Airflow next to your current version, leading to a world of hurt. You may run into problems if you don't have the right binaries or Python packages installed for certain backends or operators. When specifying support for e.g. PostgreSQL when installing extra Airflow packages, make sure the database is installed; do a brew install postgresql or apt-get install postgresql before the pip install apache-airflow[postgres]. Similarly, when running into HiveOperator errors, do a pip install apache-airflow[hive] and make sure you can use Hive. Run Airflow Before you can use Airflow you have to initialize its database. The database contains information about historical & running workflows, connections to external data sources, user management, etc. Once the database is set up, Airflow's UI can be accessed by running a web server and workflows can be started. The default database is a SQLite database, which is fine for this tutorial. In a production setting you'll probably be using something like MySQL or PostgreSQL. You'll probably want to back it up as this database stores the state of everything related to Airflow. Airflow will use the directory set in the environment variable AIRFLOW_HOME to store its configuration and our SQlite database. This directory will be used after your first Airflow command. If you don't set the environment variable AIRFLOW_HOME, Airflow will create the directory ~/airflow/ to put its files in. Set environment variable AIRFLOW_HOME to e.g. your current directory $(pwd): or any other suitable directory. Next, initialize the database: Now start the web server and go to localhost:8080 to check out the UI: It should look something like this: With the web server running workflows can be started from a new terminal window. Open a new terminal, activate the virtual environment and set the environment variable AIRFLOW_HOME for this terminal as well: Make sure that you're an in the same directory as before when using $(pwd). Run a supplied example: And check in the web UI that it has run by going to Browse -> Task Instances. This concludes all the setting up that you need for this tutorial. Tips Both Python 2 and 3 are be supported by Airflow. However, some of the lesser used parts (e.g. operators in contrib) might not support Python 3. For more information on configuration check the sections on Configuration and Security of the Airflow documentation. Check the Airflow repository for upstart and systemd templates. Airflow logs extensively, so pick your log folder carefully. Set the timezone of your production machine to UTC: Airflow assumes it's UTC. Workflows We'll create a workflow by specifying actions as a Directed Acyclic Graph (DAG) in Python. The tasks of a workflow make up a Graph; the graph is Directed because the tasks are ordered; and we don't want to get stuck in an eternal loop so the graph also has to be Acyclic. The figure below shows an example of a DAG: The DAG of this tutorial is a bit easier. It will consist of the following tasks: print 'hello' wait 5 seconds print 'world and we'll plan daily execution of this workflow. Create a DAG file Go to the folder that you've designated to be your AIRFLOWHOME and find the DAGs folder located in subfolder dags/ (if you cannot find, check the setting dagsfolder in $AIRFLOW_HOME/airflow.cfg). Create a Python file with the name airflow_tutorial.py that will contain your DAG. Your workflow will automatically be picked up and scheduled to run. First we'll configure settings that are shared by all our tasks. Settings for tasks can be passed as arguments when creating them, but we can also pass a dictionary with default values to the DAG. This allows us to share default arguments for all the tasks in our DAG is the best place to set e.g. the owner and start date of our DAG. Add the following import and dictionary to airflow_tutorial.py to specify the owner, start time, and retry settings that are shared by our tasks: Configure common settings These settings tell Airflow that this workflow is owned by 'me', that the workflow is valid since June 1st of 2017, it should not send emails and it is allowed to retry the workflow once if it fails with a delay of 5 minutes. Other common default arguments are email settings on failure and the end time. Create the DAG We'll now create a DAG object that will contain our tasks. Name it airflowtutorialv01 and pass default_args: With schedule_interval='0 0 *' we've specified a run at every hour 0; the DAG will run each day at 00:00. See crontab.guru for help deciphering cron schedule expressions. Alternatively, you can use strings like '@daily' and '@hourly'. We've used a context manager to create a DAG (new since 1.8). All the tasks for the DAG should be indented to indicate that they are part of this DAG. Without this context manager you'd have to set the dag parameter for each of your tasks. Airflow will generate DAG runs from the startdate with the specified scheduleinterval. Once a DAG is active, Airflow continuously checks in the database if all the DAG runs have successfully ran since the start_date. Any missing DAG runs are automatically scheduled. When you initialize on 2016-01-04 a DAG with a startdate at 2016-01-01 and a daily scheduleinterval, Airflow will schedule DAG runs for all the days between 2016-01-01 and 2016-01-04. A run starts after the time for the run has passed. The time for which the workflow runs is called the execution_date. The daily workflow for 2016-06-02 runs after 2016-06-02 23:59 and the hourly workflow for 2016-07-03 01:00 starts after 2016-07-03 01:59. From the ETL viewpoint this makes sense: you can only process the daily data for a day after it has passed. This can, however, ask for some juggling with date for other workflows. For Machine Learning models you may want to use all the data up to a given date, you'll have to add the scheduleinterval to your executiondate somewhere in the workflow logic. Because Airflow saves all the (scheduled) DAG runs in its database, you should not change the startdate and scheduleinterval of a DAG. Instead, up the version number of the DAG (e.g. airflowtutorialv02) and avoid running unnecessary tasks by using the web interface or command line tools Timezones and especially daylight savings can mean trouble when scheduling things, so keep your Airflow machine in UTC. You don't want to skip an hour because daylight savings kicks in (or out). Create the tasks Tasks are represented by operators that either perform an action, transfer data, or sense if something has been done. Examples of actions are running a bash script or calling a Python function; of transfers are copying tables between databases or uploading a file; and of sensors are checking if a file exists or data has been added to a database. We'll create a workflow consisting of three tasks: we'll print 'hello', wait for 10 seconds and finally print 'world'. The first two are done with the BashOperator and the latter with the PythonOperator. Give each operator an unique task ID and something to do: Note how we can pass bash commands in the BashOperator and that the PythonOperator asks for a Python function that can be called. Dependencies in tasks are added by setting other actions as upstream (or downstream). Link the operations in a chain so that sleep will be run after printhello and is followed by printworld; printhello -> sleep -> printworld: After rearranging the code your final DAG should look something like: Test the DAG First check that DAG file contains valid Python code by executing the file with Python: You can manually test a single task for a given execution_date with airflow test: This runs the task locally as if it was for 2017-07-01, ignoring other tasks and without communicating to the database. Activate the DAG Now that you're confident that your dag works, let's set it to run automatically! To do so, the scheduler needs to be turned on; the scheduler monitors all tasks and all DAGs and triggers the task instances whose dependencies have been met. Open a new terminal, activate the virtual environment and set the environment variable AIRFLOW_HOME for this terminal, and type Once the scheduler is up and running, refresh the DAGs page in the web UI. You should see airflowtutorialv01 in the list of DAGs with an on/off switch next to it. Turn on the DAG in the web UI and sit back while Airflow starts backfilling the dag runs! Tips Make your DAGs idempotent: rerunning them should give the same results. Use the the cron notation for schedule_interval instead of @daily and @hourly. @daily and @hourly always run after respectively midnight and the full hour, regardless of the hour/minute specified. Manage your connections and secrets with the Connections and/or Variables. Exercises You now know the basics of setting up Airflow, creating a DAG and turning it on; time to go deeper! Change the interval to every 30 minutes. Use a sensor to add a delay of 5 minutes before starting. Implement templating for the BashOperator: print the executiondate instead of 'hello' (check out the original tutorial and the example DAG). Implement templating for the PythonOperator: print the executiondate with one hour added in the function printworld() (check out the documentation of the PythonOperator). Resources Data Pipelines with Apache Airflow Airflow documentation ETL best practices with Airflow Airflow: Tips, Tricks, and Pitfalls Kubernetes Custom controller for deploying Airflow

ai_primer
github
LLM Vibe Score0.347
Human Vibe Score0.0036202231602591754
trokasNov 20, 2024

ai_primer

Welcome to AI primer course INTERACTIVE BOOK LINK Main aim of this course is to give you enough information so that you can start exploring field of AI on your own and maybe even start searching for DS role. We have only 5 main chapters and one bonus lecture to cover. Unsupervised learning SVD (Singular Value Decomposition) - it’s a good tool to introduce both technical tools we will be working with as well as giving us a glimpse at unsupervised learning. Supervised learning RF (Random Forests) - one of the first “silver bullets” out there. Our discussion will also cover Shannon’s work on entropy as it’s one of the key ingredients. Deep learning DNN (Deep Neural Networks) - we will build our own Perceptron from scratch, thus focusing on gradient descent and backprop on the way. By changing activation function logistic regression will be introduced and finally we will explore what a stack of layers (deep NN) can offer. CNN (Convolutional Neural Networks) - even though different techniques come and go in deep learning world I strongly believe that CNN’s will be around for quite some time to come. We will use them not only for images, but also for time series prediction. Attention - powerful idea that stands behind Transformers and one of the enablers for GPT-3, DALL-E 2 and others. Reinforcement Learning (bonus lecture) TD (Temporal Difference) - one of the core principles in reinforcement learning. We will apply it to play tic-tac-toe. Also we will cover following toolset, which hopefully will be useful for your future projects: numpy (mainly in SVD and FCN lectures) - will help us store vectors, matrices and perform operations on them. matplotlib (in all lectures) - nice and simple plotting lib. scikit-learn - ML library. pandas (mainly in RF lecture) - structured way of looking at tabular data. PyTorch (FCN and CNN lectures) - simple deep learning library based on tensorflow. git (final project) - version control tool. Toolset will be presented only in lectures, thus it’s up to you to learn them on your own if you do not plan to attend. There are a lot of resources, but I highly suggest to read intros in corresponding docs. What to expect from a single lecture? There will be no clear distinction between theory and practice, thus you should have your PC ready for small assignments that you will encounter on the way. Most important material will be listed here, but during lectures you will hear and see a lot of complementary material. Each lecture will end with a list of resources (some of them mandatory). We will start a new lecture with a recap of what was done last time and discussion regarding mentioned resources in the hope to deepen understanding in the subject and inspire you to search for sources and publications yourself. Launching notebooks You can launch notebooks while in interactive book by simply pressing the rocket logo and choosing Colab. To get faster run times click Runtime and Change runtime type, then select GPU or TPU. If necessary you can install missing packages by running !pip install [package name] directly in the notebook. NOTE: Colab will not save your changes between sessions! Download the notebook or save a copy in Google Drive before closing the browser. If you want to open notebooks locally (for a quick preview) you might find nteract useful. As an alternative you can use non free, but cheap options like Jarvislabs or Paperspace. Actually Paperspace has free GPU option, but often it is not available. (re)Sources Each chapter will have a list of resources, but for now I highly recommend to start listening/watching following resources on your spare time: Data Skeptic podcast Artificial Intelligence podcast Two Minute Papers youtube channel If I had to recommend a single book for beginner it will be this one - Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow, 2nd Edition.

airtable-api-proxy
github
LLM Vibe Score0.348
Human Vibe Score0.008293886065546695
danilocJul 10, 2023

airtable-api-proxy

node.js Airtable API Proxy by Future Fluent ================= Here's a project demonstrating the basics of an Airtable API proxy using node.js and Express. Click here to see the source and remix for your own purposes. Why does Airtable need an API Proxy? Airtable's rate limit is five requests per second per base. Anything more than that and the API will lock down for thirty seconds. By implementing an API proxy, it's possible to cache common results for quick responses and enforce a rate limit for requests. Additionally, an API proxy allows you to keep your API key a secret. Since all Airtable API keys allow full CRUD access, using the key in client-side JavaScript code would leave your data subject to outside tampering. Click here for example output. Click here to see the source data. How does it work? Three files drive the proxy: server.js An API route, /api/ai/list/:page, demonstrates how to serve JSON in response to a request. caching.js Simple, file-based caching. readCacheWithPath(path) Returns cached JSON, if it's not too stale. Use cacheInterval to adjust this. writeCacheWithPath(path, object) Writes a JavaScript object to JSON at the specified path, creating intermediate directories as needed. database-connection.js This is the meat of the project. It uses the Airtable node.js client to connect to a base and writes the results out as a JSON response. Base ID and Airtable API key are in 🗝.env. For more on accessing Airtable via the API, see the interactive Airtable documentation. Rate limiting Bottleneck handles rate limiting. The Airtable database interactions are handled using Bottleneck's wrap function.