<div dir="ltr">
<div dir="ltr"><div dir="ltr"><div dir="ltr"><div dir="ltr"><div dir="ltr"><div dir="ltr"><font color="#000000" style="font-family:times new roman,serif" size="4">Dear list members,</font><div><div><font color="#000000" style="font-family:times new roman,serif" size="4"><br></font></div><div><font color="#000000" style="font-family:times new roman,serif" size="4">As part of the Gebze Technical University (GTU) Department of Mathematics General Seminars, Dr. Berkay Anahtarcı (Özyeğin University) will give a seminar on Friday, 7 March at 14:00. The details are below, and everyone interested is welcome.</font></div><div><font color="#000000" style="font-family:times new roman,serif" size="4"><br></font></div><div><font color="#000000" style="font-family:times new roman,serif" size="4">Best regards.</font></div><div><font color="#000000" style="font-family:times new roman,serif" size="4"><br></font></div><div><font color="#000000" style="font-family:times new roman,serif" size="4">Time and place: 14:00, Department of Mathematics, Lecture Amphi 2</font></div></div><div><font color="#000000" style="font-family:times new roman,serif" size="4"><br></font></div><div><font style="font-family:times new roman,serif" size="4"><font color="#000000">Title: </font></font><font size="4"><span style="font-family:times new roman,serif">The Mathematics Behind the DeepSeek Model</span></font></div><div><font style="font-family:times new roman,serif" size="4"><font color="#000000">Abstract: </font> </font><font size="4"><span style="font-family:times new roman,serif">
This talk explores the mathematical underpinnings of DeepSeek R1, a
language model trained with reinforcement learning for complex reasoning. Unlike
conventional supervised fine-tuning approaches, DeepSeek R1 leverages
Group Relative Policy Optimization (GRPO), an innovative technique that
refines Proximal Policy Optimization (PPO) by eliminating the need for a
critic. GRPO enhances chain-of-thought reasoning by structuring
problem-solving into sequential steps. From an analytical
perspective, we will examine the theoretical properties of GRPO.
</span></font>
</div></div></div></div></div></div></div>
<br></div>