Best Way to Learn About AI | VibeBuilders.ai Resource Directory

My Manager Thinks ML Projects Takes 5 Minutes 🤦‍♀️

My Manager Thinks ML Projects Takes 5 Minutes 🤦‍♀️

Hey, everyone! I’ve got to vent a bit because work has been something else lately. I’m a BI analyst at a bank, and I’m pretty much the only one dealing with machine learning and AI stuff. The rest of my team handles SQL and reporting—no Python, no R, no ML knowledge AT ALL. You could say I’m the only one handling data science stuff So, after I did a Python project for retail, my boss suddenly decided I’m the go-to for all things ML. Since then, I’ve been getting all the ML projects dumped on me (yay?), but here’s the kicker: my manager, who knows nothing about ML, acts like he’s some kind of expert. He keeps making suggestions that make zero sense and setting unrealistic deadlines. I swear, it’s like he read one article and thinks he’s cracked the code. And the best part? Whenever I finish a project, he’s all “we completed this” and “we came up with these insights.” Ummm, excuse me? We? I must’ve missed all those late-night coding sessions you didn’t show up for. The higher-ups know it’s my work and give me credit, but my manager just can’t help himself. Last week, he set a ridiculous deadline of 10 days for a super complex ML project. TEN DAYS! Like, does he even know that data preprocessing alone can take weeks? I’m talking about cleaning up messy datasets, handling missing values, feature engineering, and then model tuning. And that’s before even thinking about building the model! The actual model development is like the tip of the iceberg. But I just nodded and smiled because I was too exhausted to argue. 🤷‍♀️ And then, this one time, they didn’t even invite me to a meeting where they were presenting my work! The assistant manager came to me last minute, like, “Hey, can you explain these evaluation metrics to me so I can present them to the heads?” I was like, excuse me, what? Why not just invite me to the meeting to present my own work? But nooo, they wanted to play charades on me So, I gave the most complicated explanation ever, threw in all the jargon just to mess with him. He came back 10 minutes later, all flustered, and was like, “Yeah, you should probably do the presentation.” I just smiled and said, “I know… data science isn’t for everyone.” Anyway, they called me in at the last minute, and of course, I nailed it because I know my stuff. But seriously, the nerve of not including me in the first place and expecting me to swoop in like some kind of superhero. I mean, at least give me a cape if I’m going to keep saving the day! 🤦‍♀️ Honestly, I don’t know how much longer I can keep this up. I love the work, but dealing with someone who thinks they’re an ML guru when they can barely spell Python is just draining. I have built like some sort of defense mechanism to hit them with all the jargon and watch their eyes glaze over How do you deal with a manager who takes credit for your work and sets impossible deadlines? Should I keep pushing back or just let it go and keep my head down? Any advice! TL;DR: My manager thinks ML projects are plug-and-play, takes credit for my work, and expects me to clean and process data, build models, and deliver results in 10 days. How do I deal with this without snapping? #WorkDrama

#reddit #manager #thinks

I want to learn generative AI with no coding background ,can anyone guide how to from scratch

0

Prestigious_Cat2556•This week

I want to learn generative AI with no coding background ,can anyone guide how to from scratch

#reddit #want #learn

🚀 PureML - AI-Powered No-Code ML for Everyone 🚀

0

Savings-Taro-8645•This week

🚀 PureML - AI-Powered No-Code ML for Everyone 🚀

We’ve built PureML, a no-code AI platform that takes you from raw data to a fully trained ML model—no data science team required. Our MVP is live with AI-driven data preprocessing, feature engineering, and model training, and we’re actively building out deployment along with other powerful AI enhancements. We’ve been featured on the AI Accelerator Institute’s 2025 LLMOps Ecosystem Map, won the LlamaIndex RAG-a-thon, and given talks at GitHub, San Diego Python, and Austin Python. We were also featured in the blogs of LlamaIndex (article) and Box (article). We’re bringing on early adopters—want in? Join the waitlist at pureml.io 👀🔥

#reddit #pureml #ai-powered

0

1

TechPrimo•This week

How I Started Learning Machine Learning

Hello, everyone. As promised, I'll write a longer post about how I entered the world of ML, hoping it will help someone shape their path. I'll include links to all the useful materials I used alongside the story, which you can use for learning. I like to call myself an AI Research Scientist who enjoys exploring new AI trends, delving deeper into understanding their background, and applying them to real products. This way, I try to connect science and entrepreneurship because I believe everything that starts as scientific research ends up "on the shelves" as a product that solves a specific user problem. I began my journey in ML in 2016 when it wasn't such a popular field. Everyone had heard of it, but few were applying it. I have several years of development experience and want to try my hand at ML. The first problem I encountered was where to start - whether to learn mathematics, statistics, or something else. That's when I came across a name and a course that completely changed my career. Let's start You guessed it. It was Professor Andrew Ng and his globally popular Machine Learning course available on Coursera (I still have the certificate, hehe). This was also my first official online course ever. Since that course no longer exists as it's been replaced by a new one, I recommend you check out: Machine Learning (Stanford CS229) Machine Learning Specialization These two courses start from the basics of ML and all the necessary calculus you need to know. Many always ask questions like whether to learn linear algebra, statistics, or probability, but you don't need to know everything in depth. This knowledge helps if you're a scientist developing a new architecture, but as an engineer, not really. You need to know some basics to understand, such as how the backpropagation algorithm works. I know that Machine Learning (Stanford CS229) is a very long and arduous course, but it's the right start if you want to be really good at ML. In my time, I filled two thick notebooks by hand while taking the course mentioned above. TensorFlow and Keras After the course, I didn't know how to apply my knowledge because I hadn't learned specifically how to code things. Then, I was looking for ways to learn how to code it. That's when I came across a popular framework called Keras, now part of TensorFlow. I started with a new course and acquiring practical knowledge: Deep Learning Specialization Deep Learning by Ian Goodfellow Machine Learning Yearning by Andrew Ng These resources above were my next step. I must admit that I learned the most from that course and from the book Deep Learning by Ian Goodfellow because I like reading books (although this one is quite difficult to read). Learn by coding To avoid just learning, I went through various GitHub repositories that I manually retyped and learned that way. It may be an old-fashioned technique, but it helped me a lot. Now, most of those repositories don't exist, so I'll share some that I found to be good: Really good Jupyter notebooks that can teach you the basics of TensorFlow Another good repo for learning TF and Keras Master the challenge After mastering the basics in terms of programming in TF/Keras, I wanted to try solving some real problems. There's no better place for that challenge than Kaggle and the popular Titanic dataset. Here, you can really find a bunch of materials and simple examples of ML applications. Here are some of my favorites: Titanic - Machine Learning from Disaster Home Credit Default Risk House Prices - Advanced Regression Techniques Two Sigma: Using News to Predict Stock Movements I then decided to further develop my career in the direction of applying ML to the stock market, first using predictions on time series and then using natural language processing. I've remained in this field until today and will defend my doctoral dissertation soon. How to deploy models To continue, before I move on to the topic of specialization, we need to address the topic of deployment. Now that we've learned how to make some basic models in Keras and how to use them, there are many ways and services, but I'll only mention what I use today. For all my ML models, whether simple regression models or complex GPT models, I use FastAPI. It's a straightforward framework, and you can quickly create API endpoints. I'll share a few older and useful tutorials for beginners: AI as an API tutorial series A step-by-step guide Productizing an ML Model with FastAPI and Cloud Run Personally, I've deployed on various cloud providers, of which I would highlight GCP and AWS because they have everything needed for model deployment, and if you know how to use them, they can be quite cheap. Chose your specialization The next step in developing my career, besides choosing finance as the primary area, was my specialization in the field of NLP. This happened in early 2020 when I started working with models based on the Transformer architecture. The first model I worked with was BERT, and the first tasks were related to classifications. My recommendations are to master the Transformer architecture well because 99% of today's LLM models are based on it. Here are some resources: The legendary paper "Attention Is All You Need" Hugging Face Course on Transformers Illustrated Guide to Transformers - Step by Step Explanation Good repository How large language models work, a visual intro to transformers After spending years using encoder-based Transformer models, I started learning GPT models. Good open-source models like Llama 2 then appear. Then, I started fine-tuning these models using the excellent Unsloth library: How to Finetune Llama-3 and Export to Ollama Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth After that, I focused on studying various RAG techniques and developing Agent AI systems. This is now called AI engineering, and, as far as I can see, it has become quite popular. So I'll write more about that in another post, but here I'll leave what I consider to be the three most famous representatives, i.e., their tutorials: LangChain tutorial LangGraph tutorial CrewAI examples Here I am today Thanks to the knowledge I've generated over all these years in the field of ML, I've developed and worked on numerous projects. The most significant publicly available project is developing an agent AI system for well-being support, which I turned into a mobile application. Also, my entire doctoral dissertation is related to applying ML to the stock market in combination with the development of GPT models and reinforcement learning (more on that in a separate post). After long 6 years, I've completed my dissertation, and now I'm just waiting for its defense. I'll share everything I'm working on for the dissertation publicly on the project, and in tutorials I'm preparing to write. If you're interested in these topics, I announce that I'll soon start with activities of publishing content on Medium and a blog, but I'll share all of that here on Reddit as well. Now that I've gathered years of experience and knowledge in this field, I'd like to share it with others and help as much as possible. If you have any questions, feel free to ask them, and I'll try to answer all of them. Thank you for reading.

#reddit #started #learning

Are Genetic Algorithms Still Relevant in 2025?

0

1

Baby-Boss0506•This week

Are Genetic Algorithms Still Relevant in 2025?

Hey everyone, I was first introduced to Genetic Algorithms (GAs) during an Introduction to AI course at university, and I recently started reading "Genetic Algorithms in Search, Optimization, and Machine Learning" by David E. Goldberg. While I see that GAs have been historically used in optimization problems, AI, and even bioinformatics, I’m wondering about their practical relevance today. With advancements in deep learning, reinforcement learning, and modern optimization techniques, are they still widely used in research and industry?I’d love to hear from experts and practitioners: In which domains are Genetic Algorithms still useful today? Have they been replaced by more efficient approaches? If so, what are the main alternatives? Beyond Goldberg’s book, what are the best modern resources (books, papers, courses) to deeply understand and implement them in real-world applications? I’m currently working on a hands-on GA project with a friend, and we want to focus on something meaningful rather than just a toy example.

#reddit #genetic #algorithms

0

TheFanHeater•This week

What are some no code ai tool

#reddit #some #code

Understand and train your own GPT – No prior AI or coding knowledge required

0

serced_red•This week

Understand and train your own GPT – No prior AI or coding knowledge required

Hi all I wrote this "blog" Google colab notebook for friends and family who have little technical know-how or do not yet have an understanding of language models. It's based on nanoGPT (a minimal GPT implementation from Andrej Karpathy, ex-OpenAI). However, I rewrote the code from scratch, made it more explicit, and have lots of textual information such that everyone should be able to follow what is happening when, and train their own little GPT :) Hope it helps some people and let me know if you have any questions/feedback. Find it here: https://colab.research.google.com/drive/1J9qJ-JArJSORVfOcppoD4cMC3u2I7aGZ?usp=sharing Tl;dr: Blog about how LLMs work & train a mini model without prior knowledge.

#reddit #understand #train

TOP ML University Courses for beginners (FREE)

0

Attitude_Alone•This week

TOP ML University Courses for beginners (FREE)

UC Berkeley UC Berkeley Full Stack Playlist - YouTube Stanford University STEFANO ERMON's Deep Generative Model classes - (Highly Recommended) Stanford CS234: Reinforcement Learning Spring 2024 - Playlist 1 Stanford CS234: Reinforcement Learning Spring 2024 - Playlist 2 Stanford XCS224U: Natural Language Understanding Spring 2023 - Playlist Stanford CS229M: Machine Learning Theory Fall 2021 - Playlist Stanford CS25 - Transformers United ML with Graphs MIT MIT 6.S191: Introduction to Deep Learning Harvard NLP Harvard NLP Code - Official Website CU Boulder AI BY HAND Cornell Tech ML Models UMICH DL for CV

#reddit #university #courses

I'm average at math and don't enjoy it. Is the ML path right for me?

0

hustler24•This week

I'm average at math and don't enjoy it. Is the ML path right for me?

I know machine learning is the future, and as an experienced sw engineer, I’m really interested in it. However, I struggle with math and don’t particularly enjoy it. For example, I tried reading Deep Learning by Goodfellow, but the math felt too complex and hard for me to understand. I have a degree in computer science, but I’m wondering if the ML path is right for me given my challenges with math. Should I start with simpler books, such as Introduction to Statistical Learning? Or maybe at deeplearning.ai ? Can you recommend me other resources?

#reddit #average #math

How's the fast.ai Introduction to Machine Learning for Coders! Course?

0

1

AnshulPiperde•This week

How's the fast.ai Introduction to Machine Learning for Coders! Course?

It doesnt seem like the usual Machine Learning courses as it there are only two topics : Random forests and Gradient descent in which the whole course is covered. There are many topics missing like Clustering , other classification models.

#reddit #how's #fast.ai

Introduction to Large Language Models for Generative AI

0

SleekEagle•This week

Introduction to Large Language Models for Generative AI

Hey everyone! I've recently written an introduction to large language models for generative ai \- it's a comprehensive introduction to the topic for anyone that is interested! We go over: What language models are How language models are trained The abilities of large language models The scaling of language models And more! Feel free to let me know what you think :)

#reddit #introduction #large

0

1

SleekEagle•This week

Introduction to Generative AI

Hey everyone! There's been a rapidly growing interest in Generative AI recently, so I wrote this Introduction to Generative AI that provides a high-level overview of the topic. It is the starting point in a series that will go through Generative AI in different domains, so it's a good resource if you're relatively new to (Generative) AI and want to learn more! Happy to answer any questions in the comments :)

#reddit #introduction #generative

0

1

anishathalye•This week

MIT Introduction to Data-Centric AI

Announcing the first-ever course on Data-Centric AI. Learn how to train better ML models by improving the data. Course homepage | Lecture videos on YouTube | Lab Assignments The course covers: Data-Centric AI vs. Model-Centric AI Label Errors Dataset Creation and Curation Data-centric Evaluation of ML Models Class Imbalance, Outliers, and Distribution Shift Growing or Compressing Datasets Interpretability in Data-Centric ML Encoding Human Priors: Data Augmentation and Prompt Engineering Data Privacy and Security MIT, like most universities, has many courses on machine learning (6.036, 6.867, and many others). Those classes teach techniques to produce effective models for a given dataset, and the classes focus heavily on the mathematical details of models rather than practical applications. However, in real-world applications of ML, the dataset is not fixed, and focusing on improving the data often gives better results than improving the model. We’ve personally seen this time and time again in our applied ML work as well as our research. Data-Centric AI (DCAI) is an emerging science that studies techniques to improve datasets in a systematic/algorithmic way — given that this topic wasn’t covered in the standard curriculum, we (a group of PhD candidates and grads) thought that we should put together a new class! We taught this intensive 2-week course in January over MIT’s IAP term, and we’ve just published all the course material, including lecture videos, lecture notes, hands-on lab assignments, and lab solutions, in hopes that people outside the MIT community would find these resources useful. We’d be happy to answer any questions related to the class or DCAI in general, and we’d love to hear any feedback on how we can improve the course material. Introduction to Data-Centric AI is open-source opencourseware, so feel free to make improvements directly: https://github.com/dcai-course/dcai-course.

#reddit #introduction #data-centric

Free YouTube course on introduction to AI and ML

0

vinaych•This week

Free YouTube course on introduction to AI and ML

Published a course on AI and ML (beginner to intermediate level) youtube.com/watch?v=ueHIUDYfAq4&list=PLoSocS9HiyjWVjPKLIF6aduFO1XODsviS Grab a coffee and start learning now!

#reddit #free #youtube

Gentle introduction to the basics of Poker AI

0

tt293•This week

Gentle introduction to the basics of Poker AI

I have written a multi-part series on the concepts and implementation of Counterfactual Regret Minimization (CFR), with Python code and toy example. CFR is at the heart of most advanced Poker AIs including the famed Pluribus) I hope you find it interesting: Steps to building a Poker AI (Part 1) If there's enough interest I plan to continue the series in some form to talk more about the aspects specific to Texas Hold'em and some more advanced stuff that is necessary to make a strong AI. Direct links to the other parts: Part 2: Modelling Imperfect Information Games Part 3: Regrets and Minimizing Regrets in One-Shot Games Part 4: Regret matching for Rock-Paper-Scissors in Python Part 5: Sequential Games, Kuhn Poker and Counterfactual Regrets Part 6: Beating Kuhn Poker with CFR using Python Part 7: Exploitability, Multiplayer CFR and 3-player Kuhn Poker (X-post from r/poker)

#reddit #gentle #introduction

List of free educational ML resources I used to become a FAANG ML Engineer

0

1

aifordevs•This week

List of free educational ML resources I used to become a FAANG ML Engineer

Full commentary and notes here ➡️: https://www.trybackprop.com/blog/top\ml\learning\resources Used these to brush up on math and teach myself AI/ML over the course of two years. I'm now a staff ML engineer at FAANG. Hope these help. Fundamentals Linear Algebra – 3Blue1Brown's Essence of Linear Algebra series, binged all these videos on a one hour train ride visiting my parents Multivariable Calculus – Khan Academy's Multivariable Calculus lessons were a great refresher of what I had learned in college. Looking back, I just needed to have reviewed Unit 1 – intro and Unit 2 – derivatives. Calculus for ML – this amazing animated video explains calculus and backpropagation Information Theory – easy-to-understand book on information theory called Information Theory: A Tutorial Introduction. Statistics and Probability – the StatQuest YouTube channel Machine Learning Stanford Intro to Machine Learning by Andrew Ng – Stanford's CS229, the intro to machine learning course, published their lectures on YouTube for free. I watched lectures 1, 2, 3, 4, 8, 9, 11, 12, and 13, and I skipped the rest since I was eager to move onto deep learning. The course also offers a free set of course notes, which are very well written. Caltech Machine Learning – Caltech's machine learning lectures on YouTube, less mathematical and more intuition based Deep Learning Andrej Karpathy's Zero to Hero Series – Andrej Karpathy, an AI researcher who graduated with a Stanford PhD and led Tesla AI for several years, released an amazing series of hands on lectures on YouTube. highly highly recommend Neural networks – Stanford's CS231n course notes and lecture videos were my gateway drug*, so to speak, into the world of deep learning. Transformers and LLMs Transformers – watched these two lectures: lecture from the University of Waterloo and lecture from the University of Michigan. I have also heard good things about Jay Alammar's The Illustrated Transformer guide ChatGPT Explainer – Wolfram's YouTube explainer video on ChatGPT Interactive LLM Visualization – This LLM visualization that you can play with in your browser is hands down the best interactive experience with an LLM. Financial Times' Transformer Explainer – The Financial Times released a lovely interactive article that explains the transformer very well. Residual Learning – 2023 Future Science Prize Laureates Lecture on residual learning. Efficient ML and GPUs How are Microchips Made? – This YouTube video by Branch Education is one of the best free educational videos on the internet, regardless of subject, but also, it's the best video on understanding microchips. CUDA – My L8 and L9 FAANG coworkers acquired their CUDA knowledge from this series of lectures. TinyML and Efficient Deep Learning Computing – 2023 lectures on efficient ML techniques online. Chip War – Chip War is a bestselling book published in 2022 about microchip technology whose beginning chapters on the invention of the microchip actually explain CPUs very well

How I Started Learning Machine Learning

Hello, everyone. As promised, I'll write a longer post about how I entered the world of ML, hoping it will help someone shape their path. I'll include links to all the useful materials I used alongside the story, which you can use for learning. I like to call myself an AI Research Scientist who enjoys exploring new AI trends, delving deeper into understanding their background, and applying them to real products. This way, I try to connect science and entrepreneurship because I believe everything that starts as scientific research ends up "on the shelves" as a product that solves a specific user problem. I began my journey in ML in 2016 when it wasn't such a popular field. Everyone had heard of it, but few were applying it. I have several years of development experience and want to try my hand at ML. The first problem I encountered was where to start - whether to learn mathematics, statistics, or something else. That's when I came across a name and a course that completely changed my career. Let's start You guessed it. It was Professor Andrew Ng and his globally popular Machine Learning course available on Coursera (I still have the certificate, hehe). This was also my first official online course ever. Since that course no longer exists as it's been replaced by a new one, I recommend you check out: Machine Learning (Stanford CS229) Machine Learning Specialization These two courses start from the basics of ML and all the necessary calculus you need to know. Many always ask questions like whether to learn linear algebra, statistics, or probability, but you don't need to know everything in depth. This knowledge helps if you're a scientist developing a new architecture, but as an engineer, not really. You need to know some basics to understand, such as how the backpropagation algorithm works. I know that Machine Learning (Stanford CS229) is a very long and arduous course, but it's the right start if you want to be really good at ML. In my time, I filled two thick notebooks by hand while taking the course mentioned above. TensorFlow and Keras After the course, I didn't know how to apply my knowledge because I hadn't learned specifically how to code things. Then, I was looking for ways to learn how to code it. That's when I came across a popular framework called Keras, now part of TensorFlow. I started with a new course and acquiring practical knowledge: Deep Learning Specialization Deep Learning by Ian Goodfellow Machine Learning Yearning by Andrew Ng These resources above were my next step. I must admit that I learned the most from that course and from the book Deep Learning by Ian Goodfellow because I like reading books (although this one is quite difficult to read). Learn by coding To avoid just learning, I went through various GitHub repositories that I manually retyped and learned that way. It may be an old-fashioned technique, but it helped me a lot. Now, most of those repositories don't exist, so I'll share some that I found to be good: Really good Jupyter notebooks that can teach you the basics of TensorFlow Another good repo for learning TF and Keras Master the challenge After mastering the basics in terms of programming in TF/Keras, I wanted to try solving some real problems. There's no better place for that challenge than Kaggle and the popular Titanic dataset. Here, you can really find a bunch of materials and simple examples of ML applications. Here are some of my favorites: Titanic - Machine Learning from Disaster Home Credit Default Risk House Prices - Advanced Regression Techniques Two Sigma: Using News to Predict Stock Movements I then decided to further develop my career in the direction of applying ML to the stock market, first using predictions on time series and then using natural language processing. I've remained in this field until today and will defend my doctoral dissertation soon. How to deploy models To continue, before I move on to the topic of specialization, we need to address the topic of deployment. Now that we've learned how to make some basic models in Keras and how to use them, there are many ways and services, but I'll only mention what I use today. For all my ML models, whether simple regression models or complex GPT models, I use FastAPI. It's a straightforward framework, and you can quickly create API endpoints. I'll share a few older and useful tutorials for beginners: AI as an API tutorial series A step-by-step guide Productizing an ML Model with FastAPI and Cloud Run Personally, I've deployed on various cloud providers, of which I would highlight GCP and AWS because they have everything needed for model deployment, and if you know how to use them, they can be quite cheap. Chose your specialization The next step in developing my career, besides choosing finance as the primary area, was my specialization in the field of NLP. This happened in early 2020 when I started working with models based on the Transformer architecture. The first model I worked with was BERT, and the first tasks were related to classifications. My recommendations are to master the Transformer architecture well because 99% of today's LLM models are based on it. Here are some resources: The legendary paper "Attention Is All You Need" Hugging Face Course on Transformers Illustrated Guide to Transformers - Step by Step Explanation Good repository How large language models work, a visual intro to transformers After spending years using encoder-based Transformer models, I started learning GPT models. Good open-source models like Llama 2 then appear. Then, I started fine-tuning these models using the excellent Unsloth library: How to Finetune Llama-3 and Export to Ollama Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth After that, I focused on studying various RAG techniques and developing Agent AI systems. This is now called AI engineering, and, as far as I can see, it has become quite popular. So I'll write more about that in another post, but here I'll leave what I consider to be the three most famous representatives, i.e., their tutorials: LangChain tutorial LangGraph tutorial CrewAI examples Here I am today Thanks to the knowledge I've generated over all these years in the field of ML, I've developed and worked on numerous projects. The most significant publicly available project is developing an agent AI system for well-being support, which I turned into a mobile application. Also, my entire doctoral dissertation is related to applying ML to the stock market in combination with the development of GPT models and reinforcement learning (more on that in a separate post). After long 6 years, I've completed my dissertation, and now I'm just waiting for its defense. I'll share everything I'm working on for the dissertation publicly on the project, and in tutorials I'm preparing to write. If you're interested in these topics, I announce that I'll soon start with activities of publishing content on Medium and a blog, but I'll share all of that here on Reddit as well. Now that I've gathered years of experience and knowledge in this field, I'd like to share it with others and help as much as possible. If you have any questions, feel free to ask them, and I'll try to answer all of them. Thank you for reading.

#reddit #started #learning

What Reinforcement Learning Method Should I Use for Poker AI with LLMs?

0

1

godlover123451•This week

What Reinforcement Learning Method Should I Use for Poker AI with LLMs?

Hey everyone, I’m working on a poker AI project, where I’m training a large language model (LLM) to predict poker actions from given game states (check, call, bet, raise, etc.). My end goal is to create a model that can play poker at a high level, primarily by self-play and opponent modeling. However, I’m running into some challenges that I hope you can help me with! Here's the situation: Training Method: I’m using supervised fine-tuning (SFT) on real poker hand history data to initially teach the LLM how to predict poker actions from game states. This means that the model learns from examples of past games, predicting the actions that players took in various situations. Self-Play Setup: I plan to eventually move to self-play, where the LLM will play against itself (or other types of models that I create to simulate different play styles). I’ll use these self-play sessions to improve the model over time. Opponent Pool: I’m creating 6 types of poker players (Loose Aggressive, Loose Passive, Tight Aggressive, Tight Passive, Maniac, and Nit), each trained at 5 different skill levels (Novice, Beg\*nner, Intermediate, Advanced, Expert). This gives me a decent range of opponent behavior for training. The problem: Here’s the catch: The LLM I’m using only outputs discrete actions (e.g., bet 3BB, raise to 10BB, etc.) with no access to the probabilities of actions, so I can't directly use methods like policy gradients or Q-learning that rely on action probabilities or continuous action spaces. This makes applying traditional RL methods a bit tricky. My question: Given that I don't have access to action probabilities, what RL method or strategy should I pursue to improve my model? Specifically, I’m looking for a way to: Incorporate self-play with reward-based learning. Refine the model through reinforcement learning, without the need for continuous probabilities. Ensure the model doesn’t just overfit to its own prior behavior but learns to adapt and exploit different strategies in poker. I’ve considered a few approaches like reward-weighted supervised fine-tuning or using simpler RL techniques like Monte Carlo updates, but I’m not sure which would work best with the LLM setup I have. I've also considered Q-learning or Deep Q-learning. Any advice or suggestions on which RL approach I should take given my situation would be greatly appreciated! Yes I used AI to write this queston. But it captures everything I want to say, and I suck at writing.

#reddit #reinforcement #learning

How to Effectively Read Research Papers Like RCNN, Fast RCNN, etc.?

0

1

Intrepid-Purpose2151•This week

How to Effectively Read Research Papers Like RCNN, Fast RCNN, etc.?

Hey everyone, I'm starting to read research papers on object detection, beginning with RCNN, Fast RCNN, and others. Since these papers are packed with technical details, I want to make sure I'm reading them the right way and extracting the most useful insights. For those experienced in reading ML/AI research papers, what’s the best approach? Specifically: What key things should I focus on while reading these papers!? Any strategies to understand the important contributions of a paper? I’d appreciate any tips or resources that helped you when you started! Thanks in advance.

#reddit #effectively #read

Getting Back Into AI/ML After a Career Break – Seeking Advice

0

1

Little_Pangolin_164•This week

Getting Back Into AI/ML After a Career Break – Seeking Advice

Hi everyone, I’m a stay-at-home mom looking to restart my career in AI/ML. I have a Bachelor’s in AI and a Master’s in IT, along with experience teaching undergraduate courses like Fundamentals of AI and Artificial Neural Networks (ANN). However, I have a significant gap in my resume, which makes me feel a bit demotivated about re-entering the field. Despite the gap, my passion for AI hasn’t faded—I’m still fascinated by the field and would love to learn Machine Learning from scratch to build a strong foundation and transition into an ML career. I’d really appreciate any advice on: • The best way to relearn ML from scratch (courses, books, hands-on projects) • How to fill the resume gap and make myself a strong candidate again • Any communities, mentorship programs, or networking strategies that might help If anyone has been through a similar situation or has guidance on how to break back into AI/ML after a career break, I’d love to hear your thoughts! Thanks in advance.

#reddit #getting #back

0

1

Dependent_Acadia_433•This week

Guidance Needed for Project

Hi everyone, I’m working on a project to build an optimized GPT platform that takes a prompt as input and makes API calls to multiple LLMs (e.g., ChatGPT, Perplexity, etc.). The goal is to evaluate the responses from these models and return the best output to the user. As a student, I’m on a tight budget, and I need your guidance on a few key points: Cost-Effective API Calls: How can I design the API calls so they don’t turn out to be overly expensive? Are there any best practices or techniques to optimize token usage for APIs like OpenAI and Perplexity? Reducing Latency: How can I handle API calls efficiently to avoid long wait times for the user? Are there strategies for managing multiple API calls in parallel without increasing the response time significantly? Additionally, I want to experiment with building this system using open-source models like LLaMA or Mistral. However, I’m limited by my hardware: I have a Dell Inspiron 5000 laptop without a GPU. What are some lightweight open-source models I could use that run well on a CPU? Are there cloud-based solutions (preferably free or low-cost) where I could experiment with running these models? Any advice, resources, or tips would be incredibly helpful.

#reddit #guidance #needed

How do I get a job in this job market? How do I stand out from the crowd?

0

1

__proximity__•This week

How do I get a job in this job market? How do I stand out from the crowd?

About me - I am an international grad student graduating in Spring 2025. I have been applying for jobs and internships since September 2024 and so far I haven't even been able to land a single interview. I am not an absolute beginner in this field. Before coming to grad school I worked as an AI Software Engineer in a startup for more than a year. I have 2 publications one in the WACV workshop and another in ACM TALLIP. I have experience in computer vision and natural language processing, focusing on multimodal learning and real-world AI applications. My academic projects include building vision-language models, segmentation algorithms for medical imaging, and developing datasets with human attention annotations. I’ve also worked on challenging industry projects like automating AI pipelines and deploying real-time classifiers. How can I improve my chances in this competitive job market? Are there specific strategies for international students navigating U.S. tech job applications? How can I stand out, especially when competing with candidates from top schools and with more experience?

#reddit #market?#stand

Creating a strategy game AI without writing a board evaluation function?

0

1

195monke•This week

Creating a strategy game AI without writing a board evaluation function?

I've recreated a favorite obscure two player board game of mine, where the board is divided into cells and pieces move across these cells (like chess, checkers, etc...) I want to try and create an AI that will be able to play the game - but I want to avoid writing the board evaluation function myself (for a minmax algorithm). Are there any general good ways to achieve that? From the little research I've done, I found out about genetic algorithms - and they seem like they would fit but I can't fathom how would one create a dna schema that describes a board evaluation function

#reddit #creating #strategy

I share my strategy to identify the most interesting AI (audio) papers

0

diabulusInMusica•This week

I share my strategy to identify the most interesting AI (audio) papers

A ton of AI papers get published every week 🎓 🎓 How can you identify the most promising ones? In this video, I introduce my strategy to select AI (audio) papers. I also share the venues where I find new AI audio / music papers, and briefly talk about my reading workflow. Here’s the video: https://www.youtube.com/watch?v=UEdWuu8hqrg&list=PL-wATfeyAMNqwhLIBEX7lS-sYIQTcEPxY&index=2

#reddit #share #strategy

All Resources

My Manager Thinks ML Projects Takes 5 Minutes 🤦‍♀️

I want to learn generative AI with no coding background ,can anyone guide how to from scratch

🚀 PureML - AI-Powered No-Code ML for Everyone 🚀

How I Started Learning Machine Learning

Are Genetic Algorithms Still Relevant in 2025?

What are some no code ai tool

Understand and train your own GPT – No prior AI or coding knowledge required

TOP ML University Courses for beginners (FREE)

I'm average at math and don't enjoy it. Is the ML path right for me?

How's the fast.ai Introduction to Machine Learning for Coders! Course?

Introduction to Large Language Models for Generative AI

Introduction to Generative AI

MIT Introduction to Data-Centric AI

Free YouTube course on introduction to AI and ML

Gentle introduction to the basics of Poker AI

List of free educational ML resources I used to become a FAANG ML Engineer

How I Started Learning Machine Learning

What Reinforcement Learning Method Should I Use for Poker AI with LLMs?

How to Effectively Read Research Papers Like RCNN, Fast RCNN, etc.?

Getting Back Into AI/ML After a Career Break – Seeking Advice

Guidance Needed for Project

How do I get a job in this job market? How do I stand out from the crowd?

Creating a strategy game AI without writing a board evaluation function?

I share my strategy to identify the most interesting AI (audio) papers