Raspberry Pi's New Add-on Board Has 8GB of RAM For Running Gen AI Models (theverge.com) 49

An anonymous reader shares a report: Raspberry Pi is launching a new add-on board capable of running generative AI models locally on the Raspberry Pi 5. Announced on Thursday, the $130 AI HAT+ 2 is an upgraded -- and more expensive -- version of the module launched last year, now offering 8GB of RAM and a Hailo 10H chip with 40 TOPS of AI performance.

Once connected, the Raspberry Pi 5 will use the AI HAT+ 2 to handle AI-related workloads while leaving the main board's Arm CPU free for other tasks. Unlike the previous AI HAT+, which focused on image-based AI processing, the AI HAT+ 2 comes with onboard RAM and can run small gen AI models like Llama 3.2 and DeepSeek-R1-Distill, along with a series of Qwen models. You can train and fine-tune AI models using the device as well.

  • Since all shortcomings of the very large language models popular these days are much more pronounced and abundant in the not-so-large language models that fit in 8GB, I wonder what use cases this is meant for. I could understand why somebody would want to run a small AI upscaler or image recognition model on a Raspberry Pi... but LLMs?
    • Does seem a bit small for any gen AI I know of. 16GB seems to be the minimum. Can you use multiple HATs, perhaps, to expand the RAM? Perhaps useful for computer vision/audio.
      • Should be enough for Frigate, which is about the only practical use I've ever found for a TPU.

        Downside is that you're then stuck running your NVR on a Pi.

    • by blackomegax ( 807080 )
      I run deepseek on my 4060 8gb and it's been great.
    • by Xenx ( 2211586 )

      I could understand why somebody would want to run a small AI upscaler or image recognition model on a Raspberry Pi... but LLMs?

      I'm sure there are a few use cases, but the thing that comes to mind for me right now is something like Home Assistant.

      • This, among other things, is a good use case. I've got an 8GB 1070 in my home NAS/media server. It runs an Ollama instance that's used by Karakeep and Home Assistant, as well as GPU transcoding for Jellyfin and machine learning for Immich. Some of the "AI" stuff is pretty cool and useful, when it doesn't involve sending all your personal data to The Cloud.

        LLM integration in Home Assistant is really nice for building cloudless voice assistants -- HA's native pipeline works, but requires very specific phrasing.

    • I have a dream of running my own personal "google home" from my basement - I want it to turn on and off lights, maybe adjust the thermostat using voice commands. I also want it to access a few web pages and be able to answer questions (via voice) regarding their contents: local weather, stock prices, maybe Wikipedia. No reporting back to the mothership, because I am the mothership. This might finally be the right size to accomplish that.
    • LLMs don't need to be large to be useful. Large models shine at generative tasks where you ask for a story, but small LLMs find their niche in contextual search, translation, OCR, and in many cases at the *input* side of whatever it is you are trying to achieve.

      You can also get very small models if you restrict the application. E.g. if you need basic inference the model can be small. If you need reasoning the model can also be small if your source space is small.

      AI is more than LLMs, and LLMs a

      • by dfghjk ( 711126 )

        As if people are training models for the job, and will do so specifically for tasks running on a Pi.

    • by Hadlock ( 143607 )

      For voice assistants it's helpful for it to be local. It turns out that 98% of commands fall into about 10-12 commands (set a timer for 5 minutes, turn on/off the lights, what time is it, what's today's date, turn on/off the TV, turn on/off the lights in another room). The device catalogs all these requests and then makes a list of the top ~30 requests, and if a request matches something on the list with ~0.85 confidence it doesn't even go to the LLM -- it just runs the command. That's how you get the instant response.
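The fallback scheme described above -- match the utterance against a cached list of top commands and only fall through to the LLM below a confidence threshold -- can be sketched in a few lines. The command list, action names, and the 0.85 cutoff here are illustrative assumptions, with plain string similarity standing in for whatever matcher a real assistant would use:

```python
# Sketch: answer common commands from a cached list, fall back to the LLM
# otherwise. TOP_COMMANDS and the 0.85 threshold are illustrative.
from difflib import SequenceMatcher

TOP_COMMANDS = {
    "set a timer for 5 minutes": "timer.start",
    "turn on the lights": "lights.on",
    "turn off the lights": "lights.off",
    "what time is it": "clock.say_time",
}

def route(utterance: str, threshold: float = 0.85):
    """Return (action, score) for a close cached match,
    or None to signal a fallback to the LLM."""
    utterance = utterance.lower().strip()
    best_action, best_score = None, 0.0
    for cmd, action in TOP_COMMANDS.items():
        score = SequenceMatcher(None, utterance, cmd).ratio()
        if score > best_score:
            best_action, best_score = action, score
    if best_score >= threshold:
        return best_action, best_score
    return None  # below threshold: hand the request to the LLM

print(route("turn on the light"))  # near match, handled locally
print(route("write me a haiku"))   # no match, goes to the LLM
```

Keeping this matcher on-device is what makes the common case feel instant; only the long tail pays the LLM's latency.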

    • Here is an example of a project that could probably be done on this unit: https://www.youtube.com/watch?... [youtube.com]
    • by allo ( 1728082 )

      Pis were conceived as a learning platform; people just (ab)use them to build all kinds of smart devices. You can learn programming with simple Python exercises and maybe pygame on a Pi and get quick results. Now you can take your first steps with a local LLM without having to buy a $300 graphics card.
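Those first steps usually mean talking to a locally running server such as Ollama over its documented HTTP API. A minimal sketch, assuming Ollama is installed and a model like `llama3.2` has already been pulled; it only builds the request, with the actual network call left commented out:

```python
# Build a request for Ollama's /api/generate endpoint on a local machine.
# Assumes an Ollama server at the default port 11434 with llama3.2 pulled.
import json
import urllib.request

def build_request(prompt: str, model: str = "llama3.2") -> urllib.request.Request:
    payload = {"model": model, "prompt": prompt, "stream": False}
    return urllib.request.Request(
        "http://localhost:11434/api/generate",
        data=json.dumps(payload).encode(),
        headers={"Content-Type": "application/json"},
    )

req = build_request("Explain GPIO in one sentence.")
# With a server actually running you would send it like this:
#   with urllib.request.urlopen(req) as r:
#       print(json.loads(r.read())["response"])
print(req.full_url)
print(json.loads(req.data)["model"])
```

The same payload works against any model name you have pulled, which is what makes swapping between the small models mentioned in the summary a one-line change.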


  • by SubmergedInTech ( 7710960 ) on Thursday January 15, 2026 @05:17PM (#65927528)

    Given the general view of AI and LLMs (especially on /.), they should have called it the AI Supplementary Storage HAT.

    Or, ASSHAT for short.

    • People on slashdot who are luddites are hilarious to me. AI is the next stage of human evolution (as soon as we can integrate it into our brains), and yet they resist.

      Reminds me of how VR/AR is a logical step toward cybernetics, and yet they resist. The real beta tests for that Ghost in the Shell cybernetic utopia future were Google Glass etc., but the beta testers were called glassholes, when all they were was visionaries a few decades too early to a future that is coming.
      • I'd be using AR glasses right now if they were not created by the big platforms as just another way to make you, (me) the consumer, the product. To surf the net like the Major using her cyberbrain and a few virtual and physical agents we'd need a much larger leap in understanding the mammalian brain. I don't trust Elmo to develop a safe brain/computer interface.

      • People on slashdot who are luddites are hilarious to me. AI is the next stage of human evolution (as soon as we can integrate it into our brains), and yet they resist. Reminds me of how VR/AR is a logical step to cybernetics, and yet they resist. The real beta tests for that, ghost in the shell cybernetic utopia future, were google glass etc, but the beta testers were called glassholes, when all they were, were visionaries who were a few decades too early to a future that is coming.

        No. These people were glassholes because the only thing they enabled was recording people for the purpose of large companies somehow monetizing it.

        One is not a luddite for shunning shit tech.

      • Not sure there are that many Luddites here, we just want it on our terms. And in many instances that means being unwilling to move forward with a new technology if it means surrendering our privacy.
      • Oh, it's evolution alright.... Darwinism specifically.

        Seriously, the human brain already has a thing that hallucinates grand successes / benefits from crap, and that can be trained to do far more useful things for a fraction of the cost to operate. Replacing that thing with an AI is a downgrade. Although, I'm sure that for some people, the inability to say no and perfect ability to manipulate the output in a predictable way is the entire point.

        Once men turned their thinking over to machines in the hope that this would set them free. But that only permitted other men with machines to enslave them.

      • Fuck it, double post:

        ghost in the shell cybernetic utopia future

        What part of Ghost in the Shell is a utopia!?

        Is your head on straight? Seriously, this is a world in which the police can force you to smile as they throw you into a cell. A place where people's memories are constantly manipulated by viruses, any random person can suddenly start shooting government officials because they opened the wrong set of files in the correct order, and children can be abducted by the government, have their identities overwritten, and be given to a bunch of senior citizens.

        • Depending on the specific installment it is not a utopia, but its Japan is still considered a better place to live than pretty much anywhere else.

    • Funny as your comment is, the irony is that this can't run general models. The hardware will limit you to running special purpose models, and special purpose AI models are actually really frigging good at doing various things.

      They just get no love in the media because it's not fancy to hear how we solve problems with small AI models when OpenAI is in an arms race to see who reaches 10 trillion first.

  • Is there a demand for these at all? Seems like they're making the product before there is a market.
    • They're chasing a fad. Their hope is that there are enough of their customers chasing this same fad that they can make a profit off of them buying what sounds to be an essentially worthless product. (And that's even if you're willing to grant that "full-size" LLMs are worthwhile.)

      Maybe they do; maybe they don't. But as one of their customers, I personally resent this diversion of resources from more worthwhile projects in any case.

    • Is there a demand for these at all? Seems like they're making the product before there is a market.

      I can't speak to language models but for things like object detection for surveillance systems these are very popular. 40 TOPS is lots of inferencing power, considering a Google Coral has 4. Hailo already has smaller models, 12 and 25 TOPS I think. And there are Jetsons, and Memryx, and others.

      I'm just looking to get my feet wet with this shortly and am researching my options, of which there are quite a few, so I would say yes there definitely is a market.
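A back-of-envelope sense of that inferencing headroom: divide sustained ops per second by the per-frame cost of a detection model. The ~10 GOPs-per-frame figure and the 30% real-world utilization factor below are illustrative assumptions, not benchmarks:

```python
# Rough frames-per-second headroom from an accelerator's TOPS rating.
# gops_per_frame (~a small YOLO-class detector) and the utilization
# factor are illustrative assumptions, not measured numbers.
def max_fps(tops: float, gops_per_frame: float = 10.0,
            utilization: float = 0.3) -> float:
    return tops * 1e12 * utilization / (gops_per_frame * 1e9)

for name, tops in [("Hailo 10H", 40.0), ("Google Coral", 4.0)]:
    print(f"{name}: ~{max_fps(tops):.0f} frames/sec of headroom")
```

Even with conservative utilization, 40 TOPS leaves an order of magnitude more room than a Coral for multi-camera object detection.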

      • >> I would say yes there definitely is a market

        I'm interested in it but the article says:
        "Jeff Geerling found that a standalone Raspberry Pi 5 with 8GB of RAM generally outperformed the AI HAT+ 2 across the supported models."

        • Unsurprising.

          This mistake is made constantly: an NPU can't accelerate a model that is bandwidth-bound rather than compute-bound. The Pi 5 uses a single channel of LPDDR4X; the HAT uses a single channel of LPDDR4. The Pi 5 is going to outperform it unless the model is compute-bound, which basically rules out any SLM/LLM.
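The bandwidth argument above can be made concrete: generating each token streams the entire model's weights from RAM, so memory bandwidth divided by model size caps tokens per second regardless of TOPS. The bandwidth and model-size numbers below are rough assumptions for single-channel LPDDR4 vs. LPDDR4X, not measured figures:

```python
# Bandwidth-bound ceiling on autoregressive generation: every token reads
# all weights once, so tokens/sec <= bandwidth / model size. The GB/s and
# model-size values are rough assumptions, not measurements.
def max_tokens_per_sec(bandwidth_gb_s: float, model_gb: float) -> float:
    return bandwidth_gb_s / model_gb

MODEL_GB = 2.0  # e.g. a ~3B-parameter model at 4-5 bit quantization
for name, bw in [("AI HAT+ 2 (LPDDR4, est.)", 12.8),
                 ("Pi 5 (LPDDR4X, est.)", 17.1)]:
    print(f"{name}: ~{max_tokens_per_sec(bw, MODEL_GB):.1f} tokens/sec ceiling")
```

Under these assumptions the Pi 5's faster memory wins on its own, which is consistent with the benchmark result quoted above.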
  • by GoRK ( 10018 ) on Thursday January 15, 2026 @06:18PM (#65927698) Journal

    Seeing as this consumes the only PCIe port on the device, you can't use NVMe storage in conjunction with it. That makes the whole thing far less useful, since all of the other storage options for the Pi are dogshit.

    • You're using this for inference output; storage isn't the factor here. You don't need much I/O for this -- the models get loaded into RAM. The Pi isn't a general-purpose computer. If you're using this, you're doing something that very, VERY likely has no need for fast non-volatile storage.

  • How does this Pi/HAT add-on compare to the BeagleBone AI-64 (https://www.beagleboard.org/boards/beaglebone-ai-64), if anyone knows?
