<?xml version='1.0' encoding='UTF-8'  ?><rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns="http://purl.org/rss/1.0/" xmlns:dc="http://purl.org/dc/elements/1.1/">	<channel rdf:about="http://www.semanlink.net/tag/reasoning_models">		<title>Reasoning models (Inference-time scaling)</title>		<link>http://semanlink.net/tag/reasoning_models</link>		<description>Documents tagged with Reasoning models (Inference-time scaling)</description>		<items>			<rdf:Seq>							<rdf:li resource="http://www.semanlink.net/doc/2025/08/the_hidden_drivers_of_hrm_s_per"/>				<rdf:li resource="http://www.semanlink.net/doc/2025/08/2506_21734_hierarchical_reaso"/>				<rdf:li resource="http://www.semanlink.net/doc/2025/03/2501_19393_s1_simple_test_ti"/>				<rdf:li resource="http://www.semanlink.net/doc/2025/02/diffuse_one_reasoning_reflectio"/>				<rdf:li resource="http://www.semanlink.net/doc/2025/02/cameron_r_wolfe_ph_d_sur_x_"/>				<rdf:li resource="http://www.semanlink.net/doc/2025/02/openai_o1_hub_%7C_openai"/>				<rdf:li resource="http://www.semanlink.net/doc/2025/02/deepseek_r1_model_by_deepseek_a"/>				<rdf:li resource="http://www.semanlink.net/doc/2025/02/diffuse_one"/>			</rdf:Seq>		</items>	</channel>		<item rdf:about="http://www.semanlink.net/doc/2025/08/the_hidden_drivers_of_hrm_s_per">		<title>The Hidden Drivers of HRM&apos;s Performance on ARC-AGI</title>		<link>http://www.semanlink.net/doc/2025/08/the_hidden_drivers_of_hrm_s_per</link>		<description>&gt; The HRM model architecture itself (the centerpiece of the paper) is not an important factor. 		</description>		<dc:date>2025-08-16T14:39:18Z</dc:date>	</item>	<item rdf:about="http://www.semanlink.net/doc/2025/08/2506_21734_hierarchical_reaso">		<title>[2506.21734&#93; Hierarchical Reasoning Model</title>		<link>http://www.semanlink.net/doc/2025/08/2506_21734_hierarchical_reaso</link>		<description>&gt; Inspired by the hierarchical and multi-timescale processing in the human brain...		</description>		<dc:date>2025-08-16T14:35:57Z</dc:date>	</item>	<item rdf:about="http://www.semanlink.net/doc/2025/03/2501_19393_s1_simple_test_ti">		<title>[2501.19393&#93; s1: Simple test-time scaling</title>		<link>http://www.semanlink.net/doc/2025/03/2501_19393_s1_simple_test_ti</link>		<description>&quot;Researchers created an open rival to OpenAI’s o1 ‘reasoning’ model for under $50&quot; [techcrunch.com&#93;(https://techcrunch.com/2025/02/05/researchers-created-an-open-rival-to-openais-o1-reasoning-model-for-under-50/)		</description>		<dc:date>2025-03-03T09:04:57Z</dc:date>	</item>	<item rdf:about="http://www.semanlink.net/doc/2025/02/diffuse_one_reasoning_reflectio">		<title>diffuse.one/reasoning_reflections: AI for science with reasoning models</title>		<link>http://www.semanlink.net/doc/2025/02/diffuse_one_reasoning_reflectio</link>		<dc:date>2025-02-24T14:08:53Z</dc:date>	</item>	<item rdf:about="http://www.semanlink.net/doc/2025/02/cameron_r_wolfe_ph_d_sur_x_">		<title>Cameron R. Wolfe, Ph.D. sur X : &quot;The trajectory of research for open LLMs and open reasoning models has been shockingly similar, but there are still many open questions…&quot;</title>		<link>http://www.semanlink.net/doc/2025/02/cameron_r_wolfe_ph_d_sur_x_</link>		<description>&gt; To me, these are pivotal questions to answer for current research on open reasoning models:
&gt; - Do the smaller / distilled models generalize well?
&gt; - Are we missing any gaps in performance?
&gt; - How do these findings relate to findings from traditional LLM research?		</description>		<dc:date>2025-02-24T13:55:04Z</dc:date>	</item>	<item rdf:about="http://www.semanlink.net/doc/2025/02/openai_o1_hub_%7C_openai">		<title>OpenAI o1 Hub</title>		<link>http://www.semanlink.net/doc/2025/02/openai_o1_hub_%7C_openai</link>		<description>&gt; a new series of AI models designed to spend more time thinking before they respond		</description>		<dc:date>2025-02-24T13:45:41Z</dc:date>	</item>	<item rdf:about="http://www.semanlink.net/doc/2025/02/deepseek_r1_model_by_deepseek_a">		<title>deepseek-r1 Model by Deepseek-ai | NVIDIA NIM</title>		<link>http://www.semanlink.net/doc/2025/02/deepseek_r1_model_by_deepseek_a</link>		<description>&gt; DeepSeek-R1 is a first-generation **reasoning model trained using large-scale reinforcement learning** (RL) to solve complex reasoning tasks across domains such as math, code, and language. The model leverages RL to develop reasoning capabilities, which are further enhanced through supervised fine-tuning (SFT) to improve readability and coherence.		</description>		<dc:date>2025-02-24T13:34:19Z</dc:date>	</item>	<item rdf:about="http://www.semanlink.net/doc/2025/02/diffuse_one">		<title>diffuse.one/reasoning_update_0</title>		<link>http://www.semanlink.net/doc/2025/02/diffuse_one</link>		<description>&gt; There is an emerging pattern of fine-tuning a small language model followed by reinforcement learning.

&gt; A reasoning model is a large language model that is trained to output both a chain of thought and a response. The chain of thought should be relatively long (
&gt; 1,000 tokens) and the reasoning should improve its performance relative to a similar-sized non-reasoning models. This is sometimes called &quot;test-time&quot; or &quot;inference-time&quot; scaling because reasoning models emit more tokens per completion and gain some performance as a result.		</description>		<dc:date>2025-02-24T13:21:09Z</dc:date>	</item></rdf:RDF>