
    How Deepseek Destroyed OpenAI, and How You Can Do it Too! | by Mohit Varikuti | Mar, 2025

    By FinanceStarGate, March 8, 2025


    What’s PTX/ASM?

    Towards AI

    In the rapidly evolving world of GPU computing, performance can often be the make-or-break factor in an application’s success. One of the secret weapons behind high-performance frameworks like DeepSeek is the clever use of CUDA PTX and inline assembly (ASM). DeepSeek’s remarkable efficiency and speed didn’t come solely from high-level algorithm design; they also came from exploiting low-level CUDA PTX/ASM optimizations to squeeze every ounce of performance from modern GPUs.

    In this article, we’ll dive into CUDA’s PTX (Parallel Thread Execution) language and explore how inline assembly can be used within CUDA kernels. We’ll look at what PTX is, how it fits into the CUDA compilation pipeline, and examine some practical code examples.

    CUDA PTX is an intermediate, assembly-like language used by NVIDIA GPUs. Think of PTX as the “assembly language” for CUDA, though it’s higher-level than the actual machine code executed on the GPU. When you compile CUDA code using nvcc, your high-level C/C++ code is transformed into PTX code, which is then optimized and further compiled down to machine-specific binary code (SASS) for the target GPU. This intermediate layer provides:

    • Portability: PTX abstracts many hardware details, making it easier to write code that works across different GPU architectures.
    • Optimization: Low-level…
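To make the idea of inline PTX concrete, here is a minimal sketch of a kernel that performs an integer addition with a single PTX instruction instead of the C++ `+` operator. It is an illustration only, not code from DeepSeek or from the original article: the names `ptx_add`, `add_kernel`, and `run_add` are hypothetical, and error checking is left out for brevity.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Hypothetical helper: adds two 32-bit integers with one inline PTX instruction.
__device__ int ptx_add(int a, int b) {
    int result;
    // add.s32 d, a, b performs a signed 32-bit add.
    // The "r" constraint binds each operand to a 32-bit register;
    // "=r" marks the output operand.
    asm volatile("add.s32 %0, %1, %2;" : "=r"(result) : "r"(a), "r"(b));
    return result;
}

// Each thread adds one pair of elements.
__global__ void add_kernel(const int* a, const int* b, int* out, int n) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) {
        out[i] = ptx_add(a[i], b[i]);
    }
}

// Hypothetical host wrapper: runs the kernel on a single pair of values.
// CUDA error checking is omitted to keep the sketch short.
int run_add(int x, int y) {
    int *d_a = nullptr, *d_b = nullptr, *d_out = nullptr;
    int h_out = 0;

    cudaMalloc(&d_a, sizeof(int));
    cudaMalloc(&d_b, sizeof(int));
    cudaMalloc(&d_out, sizeof(int));

    cudaMemcpy(d_a, &x, sizeof(int), cudaMemcpyHostToDevice);
    cudaMemcpy(d_b, &y, sizeof(int), cudaMemcpyHostToDevice);

    add_kernel<<<1, 1>>>(d_a, d_b, d_out, 1);

    // This blocking copy also waits for the kernel to finish.
    cudaMemcpy(&h_out, d_out, sizeof(int), cudaMemcpyDeviceToHost);

    cudaFree(d_a);
    cudaFree(d_b);
    cudaFree(d_out);
    return h_out;
}

int main() {
    printf("2 + 3 = %d\n", run_add(2, 3));
    return 0;
}
```

Assuming the file is saved as `add.cu` (a hypothetical name), `nvcc -ptx add.cu -o add.ptx` writes out the PTX that nvcc generates, and `cuobjdump -sass` on the compiled binary shows the SASS it was lowered to. For an operation this simple the compiler would emit the same instruction from plain C++, so any benefit from hand-written PTX is likely to come from instructions or modifiers that the compiler does not choose on its own.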


