Close Menu
    Trending
    • 7 AI Tools to Build a Profitable One-Person Business That Runs While You Sleep
    • Estimating Product-Level Price Elasticities Using Hierarchical Bayesian
    • The Great Workforce Reconfiguration: Navigating Career Security in the Age of Intelligent Automation | by Toni Maxx | May, 2025
    • Anthropic’s Claude Opus 4 AI Model Is Capable of Blackmail
    • New to LLMs? Start Here  | Towards Data Science
    • Predicting Customer Churn Using Machine Learning | by Venkatesh P | May, 2025
    • AI Inference: NVIDIA Reports Blackwell Surpasses 1000 TPS/User Barrier with Llama 4 Maverick
    • Why Every Company Should Have a 90-Day Cash Flow Buffer
    Finance StarGate
    • Home
    • Artificial Intelligence
    • AI Technology
    • Data Science
    • Machine Learning
    • Finance
    • Passive Income
    Finance StarGate
    Home»Machine Learning»Compression Power is Intelligence | by Andy Yu
    Machine Learning

    Compression Power is Intelligence | by Andy Yu

    FinanceStarGateBy FinanceStarGateApril 7, 2025No Comments4 Mins Read
    Share Facebook Twitter Pinterest LinkedIn Tumblr Reddit Telegram Email
    Share
    Facebook Twitter LinkedIn Pinterest Email


    Earlier than I begin, I invite you to memorize the next string:

    firstsecondthirdfourthfifthsixthseventheighthninthtenth

    Bought it? Let’s start.

    I constructed an information compressor that might scale back English textual content measurement by 80% over the previous 4 months. My project was a replication of a few of the experiments carried out by Grégoire Delétang et al. within the 2023 paper titled “Language Modeling is Compression”. It demonstrates that an correct massive language mannequin (LLM), akin to that used within the well-known ChatGPT, will also be used to compress textual content information losslessly and very successfully.

    Beforehand, I’ve all the time considered laptop science as extra just like math and engineering than science. In any case, there wasn’t a lot science in constructing 2D platformer video games. This undertaking is perhaps, strictly talking, my very first laptop science undertaking. It led me to appreciate that laptop science merely makes use of arithmetic and engineering as instruments to conduct basically scientific experiments.

    To hypothesize the conduct of my compressor below completely different fashions, arithmetic, akin to info principle and statistics, had been utilized. Software program engineering was used to construct the compressor itself. Nevertheless, as an entire, this undertaking in the end experimented with the top product of my software program engineering. I graphed information, analyzed error, and made conclusions in a manner not in contrast to faculty chemistry or statistics labs.

    In information compression, a compression algorithm converts a big file right into a smaller one. For the system to be thought of lossless, the algorithm have to be designed in such a manner that one other algorithm can completely reproduce the big file solely from trying on the smaller file.

    Compression entails figuring out from the info info that may be discarded with out being misplaced utterly. Step one of knowledge compression is to seize a predictable sample within the information. That which is predictable is discardable, as we will merely restore this information throughout decompression.

    This predictable sample could appear to be various things, relying on the info compressed:

    • Many programming languages like to finish traces with semicolons “;”.
    • English textual content tends to make use of Latin letters and comply with right English grammar.
    • Pixels in a picture usually tend to be related colors to their neighbours.

    Intelligence is compression energy, arguably.

    By info principle, you’ll be able to show that the accuracy of a language mannequin and the compression ratio (% file discount) of my information compressor can each be measured utilizing the identical metric, the cross-entropy:

    Thus, a wiser language mannequin corresponds to a tighter information compressor. A extra detailed clarification could be discovered within the undertaking repository on Github.

    I’ve additionally decided, via experiments, that giant language fashions that carry out higher throughout testing metrics are inclined to additionally carry out higher on information compression. Right here, I present that bigger, smarter GPT-2 fashions, when used to create my textual content compressor, additionally result in higher compression charges.

    Lastly, this assertion could be defined intuitively. To compress information, a system must seize its patterns, and one can argue that the acquisition of patterns is intelligence. Right here’s an instance.

    Firstly of this text, I requested you to memorize a string of characters. Most readers ought to have been in a position to establish a sample throughout the textual content. We turned the large string into an simply memorizable thought, compressing it, if you’ll, in order that it could match extra compactly in our brains. The method required you to grasp primary counting, in addition to English phrases akin to “first” and “second”. In some sense, that’s intelligence.

    So should you nonetheless recall that piece of textual content, congratulations, you might be in all probability “clever”.



    Source link

    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
    Previous ArticleCEO of 8-Figure Company Says You Don’t Need to Be an Expert for Your Business to Thrive — You Just Need This Mindset
    Next Article Let’s Call a Spade a Spade: RDF and LPG — Cousins Who Should Learn to Live Together
    FinanceStarGate

    Related Posts

    Machine Learning

    The Great Workforce Reconfiguration: Navigating Career Security in the Age of Intelligent Automation | by Toni Maxx | May, 2025

    May 23, 2025
    Machine Learning

    Predicting Customer Churn Using Machine Learning | by Venkatesh P | May, 2025

    May 23, 2025
    Machine Learning

    10 Practical SQL interview Questions I failed to answer during an interview!! | by The Analyst’s Edge | May, 2025

    May 23, 2025
    Add A Comment

    Comments are closed.

    Top Posts

    From Ethereum to Meme Coins: A Comprehensive History of Altcoins

    March 6, 2025

    Improving Cash Flow with AI-Driven Financial Forecasting

    March 5, 2025

    Younger Canadians are outsaving older ones as they enter trade war ‘survival mode’

    April 24, 2025

    MIT researchers develop an efficient way to train more reliable AI agents | MIT News

    February 16, 2025

    Sam The Concrete Man is North America’s #1 Residential Concrete Franchise

    February 19, 2025
    Categories
    • AI Technology
    • Artificial Intelligence
    • Data Science
    • Finance
    • Machine Learning
    • Passive Income
    Most Popular

    Why the world is looking to ditch US AI models

    March 25, 2025

    4 Ways to Build an Educated Workforce

    March 18, 2025

    Mastering Digital Marketing Strategies for Explosive Growth in 2025 | by Digital Biz Scope | Apr, 2025

    April 26, 2025
    Our Picks

    Top 9 Amazon Textract alternatives for data extraction

    February 3, 2025

    Jddjdkdk

    March 19, 2025

    09045080282 – شماره خاله تهران شماره خاله شیراز شماره خاله

    May 22, 2025
    Categories
    • AI Technology
    • Artificial Intelligence
    • Data Science
    • Finance
    • Machine Learning
    • Passive Income
    • Privacy Policy
    • Disclaimer
    • Terms and Conditions
    • About us
    • Contact us
    Copyright © 2025 Financestargate.com All Rights Reserved.

    Type above and press Enter to search. Press Esc to cancel.