<?xml version="1.0" encoding="utf-8"?>
<?xml-stylesheet type="text/xsl" href="../assets/xml/rss.xsl" media="all"?><rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>TinyComputers.io (Posts about rx 7900)</title><link>https://tinycomputers.io/</link><description></description><atom:link href="https://tinycomputers.io/categories/rx-7900.xml" rel="self" type="application/rss+xml"></atom:link><language>en</language><copyright>Contents © 2026 A.C. Jokela 
&lt;!-- div style="width: 100%" --&gt;
&lt;a rel="license" href="http://creativecommons.org/licenses/by-sa/4.0/"&gt;&lt;img alt="" style="border-width:0" src="https://i.creativecommons.org/l/by-sa/4.0/80x15.png" /&gt; Creative Commons Attribution-ShareAlike&lt;/a&gt;&amp;nbsp;|&amp;nbsp;
&lt;!-- /div --&gt;
</copyright><lastBuildDate>Wed, 11 Mar 2026 00:05:46 GMT</lastBuildDate><generator>Nikola (getnikola.com)</generator><docs>http://blogs.law.harvard.edu/tech/rss</docs><item><title>AMD GPU Comparison: Max+ 395 vs RX 7900 for LLM Inference</title><link>https://tinycomputers.io/posts/amd-gpu-comparison-max%2B-395-vs-rx-7900-xtx.html?utm_source=feed&amp;utm_medium=rss&amp;utm_campaign=rss</link><dc:creator>A.C. Jokela</dc:creator><description>&lt;p&gt;This report compares the inference performance of two GPU systems running local LLM models using Ollama. The benchmark tests were conducted using the llm-tester tool with concurrent requests set to 1, simulating single-user workload scenarios.&lt;/p&gt;
&lt;div class="audio-widget"&gt;
&lt;div class="audio-widget-header"&gt;
&lt;span class="audio-widget-icon"&gt;🎧&lt;/span&gt;
&lt;span class="audio-widget-label"&gt;Listen to this article&lt;/span&gt;
&lt;/div&gt;
&lt;audio controls preload="metadata"&gt;
&lt;source src="https://tinycomputers.io/amd-gpu-comparison-max+-395-vs-rx-7900-xtx_tts.mp3" type="audio/mpeg"&gt;
&lt;/source&gt;&lt;/audio&gt;
&lt;div class="audio-widget-footer"&gt;8 min · AI-generated narration&lt;/div&gt;
&lt;/div&gt;

&lt;h3&gt;Test Configuration&lt;/h3&gt;
&lt;h4&gt;Systems Tested&lt;/h4&gt;
&lt;ol&gt;
&lt;li&gt;
&lt;p&gt;&lt;a href="https://baud.rs/WZgnl1"&gt;AI Max+ 395&lt;/a&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Host: bosgame.localnet&lt;/li&gt;
&lt;li&gt;ROCm: Custom installation in home directory&lt;/li&gt;
&lt;li&gt;Memory: 32 GB unified memory&lt;/li&gt;
&lt;li&gt;VRAM: 96 GB&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;a href="https://baud.rs/CVlNMe"&gt;AMD Radeon RX 7900 XTX&lt;/a&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Host: rig.localnet&lt;/li&gt;
&lt;li&gt;ROCm: System default installation&lt;/li&gt;
&lt;li&gt;Memory: 96 GB&lt;/li&gt;
&lt;li&gt;VRAM: 24 GB&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;h4&gt;Models Tested&lt;/h4&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="https://baud.rs/yc2cPE"&gt;deepseek-r1:1.5b&lt;/a&gt; - Small reasoning model (1.1 GB)&lt;/li&gt;
&lt;li&gt;&lt;a href="https://baud.rs/IRySKd"&gt;qwen3:latest&lt;/a&gt; - Latest Qwen 3 model (1.1 GB)&lt;/li&gt;
&lt;/ul&gt;
&lt;h4&gt;Test Methodology&lt;/h4&gt;
&lt;ul&gt;
&lt;li&gt;Benchmark Tool: llm-tester (https://github.com/Laszlobeer/llm-tester)&lt;/li&gt;
&lt;li&gt;Concurrent Requests: 1 (single-user simulation)&lt;/li&gt;
&lt;li&gt;Tasks per Model: 5 diverse prompts&lt;/li&gt;
&lt;li&gt;Timeout: 180 seconds per task&lt;/li&gt;
&lt;li&gt;Backend: Ollama API (http://localhost:11434)&lt;/li&gt;
&lt;/ul&gt;
&lt;h3&gt;Performance Results&lt;/h3&gt;
&lt;h4&gt;deepseek-r1:1.5b Performance&lt;/h4&gt;
&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;System&lt;/th&gt;
&lt;th&gt;Avg Tokens/s&lt;/th&gt;
&lt;th&gt;Avg Latency&lt;/th&gt;
&lt;th&gt;Total Time&lt;/th&gt;
&lt;th&gt;Performance Ratio&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;AMD RX 7900&lt;/td&gt;
&lt;td&gt;197.01&lt;/td&gt;
&lt;td&gt;6.54s&lt;/td&gt;
&lt;td&gt;32.72s&lt;/td&gt;
&lt;td&gt;1.78x faster&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Max+ 395&lt;/td&gt;
&lt;td&gt;110.52&lt;/td&gt;
&lt;td&gt;21.51s&lt;/td&gt;
&lt;td&gt;107.53s&lt;/td&gt;
&lt;td&gt;baseline&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;
&lt;p&gt;Detailed Results - AMD RX 7900:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Task 1: 196.88 tokens/s, Latency: 9.81s&lt;/li&gt;
&lt;li&gt;Task 2: 185.87 tokens/s, Latency: 17.60s&lt;/li&gt;
&lt;li&gt;Task 3: 200.72 tokens/s, Latency: 1.97s&lt;/li&gt;
&lt;li&gt;Task 4: 200.89 tokens/s, Latency: 1.76s&lt;/li&gt;
&lt;li&gt;Task 5: 200.70 tokens/s, Latency: 1.57s&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Detailed Results - Max+ 395:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Task 1: 111.78 tokens/s, Latency: 13.38s&lt;/li&gt;
&lt;li&gt;Task 2: 93.81 tokens/s, Latency: 82.23s&lt;/li&gt;
&lt;li&gt;Task 3: 115.97 tokens/s, Latency: 3.83s&lt;/li&gt;
&lt;li&gt;Task 4: 114.72 tokens/s, Latency: 4.52s&lt;/li&gt;
&lt;li&gt;Task 5: 116.34 tokens/s, Latency: 3.57s&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;img alt="AMD RX 7900 XTX running deepseek-r1:1.5b benchmark" src="https://tinycomputers.io/images/llm-benchmarks/rig-deepseek-r1.png"&gt;&lt;/p&gt;
&lt;p&gt;&lt;em&gt;AMD RX 7900 XTX performance on deepseek-r1:1.5b model&lt;/em&gt;&lt;/p&gt;
&lt;p&gt;&lt;img alt="Max+ 395 running deepseek-r1:1.5b benchmark" src="https://tinycomputers.io/images/llm-benchmarks/bosgame-deepseek-r1.png"&gt;&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Max+ 395 performance on deepseek-r1:1.5b model&lt;/em&gt;&lt;/p&gt;
&lt;h4&gt;qwen3:latest Performance&lt;/h4&gt;
&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;System&lt;/th&gt;
&lt;th&gt;Avg Tokens/s&lt;/th&gt;
&lt;th&gt;Avg Latency&lt;/th&gt;
&lt;th&gt;Total Time&lt;/th&gt;
&lt;th&gt;Performance Ratio&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;AMD RX 7900&lt;/td&gt;
&lt;td&gt;86.46&lt;/td&gt;
&lt;td&gt;12.81s&lt;/td&gt;
&lt;td&gt;64.04s&lt;/td&gt;
&lt;td&gt;2.71x faster&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Max+ 395&lt;/td&gt;
&lt;td&gt;31.85&lt;/td&gt;
&lt;td&gt;41.00s&lt;/td&gt;
&lt;td&gt;204.98s&lt;/td&gt;
&lt;td&gt;baseline&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;
&lt;p&gt;Detailed Results - AMD RX 7900:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Task 1: 86.56 tokens/s, Latency: 15.07s&lt;/li&gt;
&lt;li&gt;Task 2: 85.69 tokens/s, Latency: 18.37s&lt;/li&gt;
&lt;li&gt;Task 3: 86.74 tokens/s, Latency: 7.15s&lt;/li&gt;
&lt;li&gt;Task 4: 87.91 tokens/s, Latency: 1.56s&lt;/li&gt;
&lt;li&gt;Task 5: 85.43 tokens/s, Latency: 21.90s&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Detailed Results - Max+ 395:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Task 1: 32.21 tokens/s, Latency: 33.15s&lt;/li&gt;
&lt;li&gt;Task 2: 27.53 tokens/s, Latency: 104.82s&lt;/li&gt;
&lt;li&gt;Task 3: 33.47 tokens/s, Latency: 16.79s&lt;/li&gt;
&lt;li&gt;Task 4: 34.96 tokens/s, Latency: 4.64s&lt;/li&gt;
&lt;li&gt;Task 5: 31.08 tokens/s, Latency: 45.59s&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;img alt="AMD RX 7900 XTX running qwen3:latest benchmark" src="https://tinycomputers.io/images/llm-benchmarks/rig-qwen3.png"&gt;&lt;/p&gt;
&lt;p&gt;&lt;em&gt;AMD RX 7900 XTX performance on qwen3:latest model&lt;/em&gt;&lt;/p&gt;
&lt;p&gt;&lt;img alt="Max+ 395 running qwen3:latest benchmark" src="https://tinycomputers.io/images/llm-benchmarks/bosgame-qwen3.png"&gt;&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Max+ 395 performance on qwen3:latest model&lt;/em&gt;&lt;/p&gt;
&lt;h3&gt;Comparative Analysis&lt;/h3&gt;
&lt;h4&gt;Overall Performance Summary&lt;/h4&gt;
&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Model&lt;/th&gt;
&lt;th&gt;RX 7900&lt;/th&gt;
&lt;th&gt;Max+ 395&lt;/th&gt;
&lt;th&gt;Performance Multiplier&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;deepseek-r1:1.5b&lt;/td&gt;
&lt;td&gt;197.01 tok/s&lt;/td&gt;
&lt;td&gt;110.52 tok/s&lt;/td&gt;
&lt;td&gt;1.78x&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;qwen3:latest&lt;/td&gt;
&lt;td&gt;86.46 tok/s&lt;/td&gt;
&lt;td&gt;31.85 tok/s&lt;/td&gt;
&lt;td&gt;2.71x&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;
&lt;h4&gt;Key Findings&lt;/h4&gt;
&lt;ol&gt;
&lt;li&gt;RX 7900 Dominance: The AMD RX 7900 significantly outperforms the Max+ 395 across both models&lt;/li&gt;
&lt;li&gt;78% faster on deepseek-r1:1.5b&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;171% faster on qwen3:latest&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;Model-Dependent Performance Gap: The performance difference is more pronounced with the larger/more complex model (qwen3:latest), suggesting the RX 7900 handles larger models more efficiently&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;Consistency: The RX 7900 shows more consistent performance across tasks, with lower variance in latency&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;Total Execution Time:&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;For deepseek-r1:1.5b: RX 7900 completed in 32.72s vs 107.53s (3.3x faster)&lt;/li&gt;
&lt;li&gt;For qwen3:latest: RX 7900 completed in 64.04s vs 204.98s (3.2x faster)&lt;/li&gt;
&lt;/ol&gt;
&lt;h3&gt;Comparison with Previous Results&lt;/h3&gt;
&lt;h4&gt;Desktop PC (i9-9900k + RTX 2080, 8GB VRAM)&lt;/h4&gt;
&lt;ul&gt;
&lt;li&gt;deepseek-r1:1.5b: 143 tokens/s&lt;/li&gt;
&lt;li&gt;qwen3:latest: 63 tokens/s&lt;/li&gt;
&lt;/ul&gt;
&lt;h4&gt;M4 Mac (24GB Unified Memory)&lt;/h4&gt;
&lt;ul&gt;
&lt;li&gt;deepseek-r1:1.5b: 81 tokens/s&lt;/li&gt;
&lt;li&gt;qwen3:latest: Timeout issues (needed 120s timeout)&lt;/li&gt;
&lt;/ul&gt;
&lt;h4&gt;Performance Ranking&lt;/h4&gt;
&lt;p&gt;deepseek-r1:1.5b:&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;AMD RX 7900: 197.01 tok/s ⭐&lt;/li&gt;
&lt;li&gt;RTX 2080 (CUDA): 143 tok/s&lt;/li&gt;
&lt;li&gt;Max+ 395: 110.52 tok/s&lt;/li&gt;
&lt;li&gt;M4 Mac: 81 tok/s&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;qwen3:latest:&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;AMD RX 7900: 86.46 tok/s ⭐&lt;/li&gt;
&lt;li&gt;RTX 2080 (CUDA): 63 tok/s&lt;/li&gt;
&lt;li&gt;Max+ 395: 31.85 tok/s&lt;/li&gt;
&lt;li&gt;M4 Mac: Unable to complete within timeout&lt;/li&gt;
&lt;/ol&gt;
&lt;h3&gt;Cost-Benefit Analysis&lt;/h3&gt;
&lt;h4&gt;System Pricing Context&lt;/h4&gt;
&lt;ul&gt;
&lt;li&gt;Framework Desktop with Max+ 395: ~$2,500&lt;/li&gt;
&lt;li&gt;AMD RX 7900: Available as standalone GPU (~$600-800 used, ~$900-1000 new)&lt;/li&gt;
&lt;/ul&gt;
&lt;h4&gt;Value Proposition&lt;/h4&gt;
&lt;p&gt;The AMD RX 7900 delivers:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;1.78-2.71x better performance than the Max+ 395&lt;/li&gt;
&lt;li&gt;Significantly better price-to-performance ratio (~$800 vs $2,500)&lt;/li&gt;
&lt;li&gt;Dedicated GPU VRAM vs shared unified memory&lt;/li&gt;
&lt;li&gt;Better thermal management in desktop form factor&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;The $2,500 Framework Desktop investment could alternatively fund:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;AMD RX 7900 GPU&lt;/li&gt;
&lt;li&gt;High-performance desktop motherboard&lt;/li&gt;
&lt;li&gt;AMD Ryzen CPU&lt;/li&gt;
&lt;li&gt;32-64GB DDR5 RAM&lt;/li&gt;
&lt;li&gt;Storage and cooling&lt;/li&gt;
&lt;li&gt;With budget remaining&lt;/li&gt;
&lt;/ul&gt;
&lt;h3&gt;Conclusions&lt;/h3&gt;
&lt;ol&gt;
&lt;li&gt;
&lt;p&gt;Clear Performance Winner: The AMD RX 7900 is substantially faster than the Max+ 395 for LLM inference workloads&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;Value Analysis: The Framework Desktop's $2,500 price point doesn't provide competitive performance for LLM workloads compared to desktop alternatives&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;Use Case Consideration: The Framework Desktop offers portability and unified memory benefits, but if LLM performance is the primary concern, the RX 7900 desktop configuration is superior&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;ROCm Compatibility: Both systems successfully ran ROCm workloads, demonstrating AMD's growing ecosystem for AI/ML tasks&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;Recommendation: For users prioritizing LLM inference performance per dollar, a desktop workstation with an RX 7900 provides significantly better value than the Max+ 395 Framework Desktop&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;h3&gt;Technical Notes&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;All tests used identical benchmark methodology with single concurrent requests&lt;/li&gt;
&lt;li&gt;Both systems were running similar ROCm configurations&lt;/li&gt;
&lt;li&gt;Network latency was negligible (local Ollama API)&lt;/li&gt;
&lt;li&gt;Results represent real-world single-user inference scenarios&lt;/li&gt;
&lt;/ul&gt;
&lt;h3&gt;Systems Information&lt;/h3&gt;
&lt;p&gt;Both systems are running:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Operating System: Linux&lt;/li&gt;
&lt;li&gt;LLM Runtime: Ollama&lt;/li&gt;
&lt;li&gt;Acceleration: ROCm (AMD GPU compute)&lt;/li&gt;
&lt;li&gt;Python: 3.12.3&lt;/li&gt;
&lt;/ul&gt;</description><category>amd gpu</category><category>benchmarks</category><category>deepseek</category><category>llm inference</category><category>machine learning</category><category>max+ 395</category><category>ollama</category><category>performance comparison</category><category>qwen</category><category>rocm</category><category>rx 7900</category><guid>https://tinycomputers.io/posts/amd-gpu-comparison-max%2B-395-vs-rx-7900-xtx.html</guid><pubDate>Tue, 28 Oct 2025 21:11:26 GMT</pubDate></item></channel></rss>