
    How Deepseek Destroyed OpenAI, and How You Can Do it Too! | by Mohit Varikuti | Mar, 2025

    By FinanceStarGate, March 8, 2025


    What’s PTX/ASM?

    Towards AI

    In the rapidly evolving world of GPU computing, performance can often be the make-or-break factor in an application's success. One of the secret weapons behind high-performance frameworks like DeepSeek is the clever use of CUDA PTX and inline assembly (ASM). DeepSeek's remarkable efficiency and speed did not come solely from high-level algorithm design; DeepSeek also exploited low-level CUDA PTX/ASM optimizations to squeeze every ounce of performance from modern GPUs.

    In this article, we'll dive into CUDA's PTX (Parallel Thread Execution) language and explore how inline assembly can be used inside CUDA kernels. We'll look at what PTX is, how it fits into the CUDA compilation pipeline, and examine some practical code examples.

    CUDA PTX is an intermediate, assembly-like virtual instruction set for NVIDIA GPUs. Think of PTX as the "assembly language" for CUDA, although it is higher-level than the actual machine code executed on the GPU. When you compile CUDA code with nvcc, your high-level C/C++ code is transformed into PTX code, which is then optimized and further compiled down to machine-specific binary code (SASS) for the target GPU. More specifically, PTX offers:

    • Portability: PTX abstracts many hardware details, making it easier to write code that works across different GPU architectures.
    • Optimization: Low-level…
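
To make the idea of inline assembly concrete, here is a minimal sketch of a CUDA kernel that issues a single PTX instruction through an `asm` statement. It is an illustration only, not DeepSeek's code: the file name `add_asm.cu` and the names `ptx_add` and `add_kernel` are hypothetical, and error checking of the CUDA runtime calls is omitted for brevity. It assumes a machine with the CUDA toolkit and an NVIDIA GPU.

```cuda
// add_asm.cu (hypothetical file name): inline PTX inside a CUDA kernel.
// Build and run:            nvcc -o add_asm add_asm.cu && ./add_asm
// Inspect the PTX instead:  nvcc -ptx add_asm.cu -o add_asm.ptx
#include <cstdio>
#include <cuda_runtime.h>

// Hypothetical helper: adds two 32-bit integers with the PTX add.s32 instruction.
__device__ __forceinline__ int ptx_add(int a, int b) {
    int result;
    // "=r" binds a 32-bit register as the output operand (%0);
    // "r" binds 32-bit registers as the input operands (%1, %2).
    asm("add.s32 %0, %1, %2;" : "=r"(result) : "r"(a), "r"(b));
    return result;
}

// Each thread adds one pair of elements using the inline-PTX helper.
__global__ void add_kernel(const int* x, const int* y, int* out, int n) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) out[i] = ptx_add(x[i], y[i]);
}

int main() {
    const int n = 256;
    int hx[n], hy[n], hout[n];
    for (int i = 0; i < n; ++i) { hx[i] = i; hy[i] = 2 * i; }

    // Allocate device buffers and copy the inputs over.
    int *dx, *dy, *dout;
    cudaMalloc(&dx, n * sizeof(int));
    cudaMalloc(&dy, n * sizeof(int));
    cudaMalloc(&dout, n * sizeof(int));
    cudaMemcpy(dx, hx, n * sizeof(int), cudaMemcpyHostToDevice);
    cudaMemcpy(dy, hy, n * sizeof(int), cudaMemcpyHostToDevice);

    // Launch enough 128-thread blocks to cover all n elements.
    add_kernel<<<(n + 127) / 128, 128>>>(dx, dy, dout, n);
    cudaMemcpy(hout, dout, n * sizeof(int), cudaMemcpyDeviceToHost);

    // Compare against the expected result computed on the host.
    int mismatches = 0;
    for (int i = 0; i < n; ++i) {
        if (hout[i] != hx[i] + hy[i]) ++mismatches;
    }
    printf("mismatches: %d\n", mismatches);

    cudaFree(dx);
    cudaFree(dy);
    cudaFree(dout);
    return mismatches != 0;
}
```

A plain `a + b` would compile to the same instruction, so this example shows only the syntax. Inline PTX is usually worth the extra effort only for instructions or modifiers that the compiler does not emit on its own.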



