Summarization as Compression in Search Engine Architecture
Serving a large text search index in production can be costly. These costs grow proportionally with the size of the dataset being indexed, and grow again as queries per second increase. Because of the way search works, traditional disk compression isn't a good option, leaving architects in a situation where they must balance cost against latency.
Keeping index sizes as small as possible offers performance, cost, and environmental benefits. Smaller indexes can be searched more quickly by the same hardware and can often be held in memory instead of on disk. An effective compression algorithm for search could drastically reduce the hardware requirements for many search applications.
Enter summarization as compression. The advent of AI/ML and models such as BERT, Titan, and Llama allows for the rapid and affordable summarization of large text datasets, shrinking the size of the index at the cost of some precision. Adding a compression step to a search engine indexing pipeline can dramatically alter the performance of a search application.
How does it work? Each document is submitted to a summarization model, and the summary is stored in the index rather than the original text. Queries become faster but less precise. Consider a text index of Wikipedia: compression as summarization could dramatically reduce the index size while preserving the ability to quickly direct people to articles.
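A minimal sketch of that indexing step, in Python. The `summarize` function here is a placeholder that keeps only the first two sentences so the example is self-contained; in practice it would call a real summarization model. The inverted index is then built over the summaries instead of the full text:

```python
def summarize(text: str, max_sentences: int = 2) -> str:
    """Stand-in for an ML summarization model: keep the first
    few sentences. A real pipeline would call BERT, Titan, Llama,
    or a hosted summarization endpoint here."""
    sentences = text.split(". ")
    return ". ".join(sentences[:max_sentences]).rstrip(".") + "."


def build_compressed_index(documents: dict[str, str]) -> dict[str, set[str]]:
    """Inverted index built over summaries rather than full text."""
    index: dict[str, set[str]] = {}
    for doc_id, text in documents.items():
        summary = summarize(text)  # the compression step
        for token in summary.lower().split():
            index.setdefault(token.strip(".,"), set()).add(doc_id)
    return index


def search(index: dict[str, set[str]], query: str) -> set[str]:
    """Return ids of documents whose summaries contain every query term."""
    results = None
    for term in query.lower().split():
        matches = index.get(term, set())
        results = matches if results is None else results & matches
    return results or set()
```

Note the trade-off in action: terms that only appear in the parts of a document the summarizer dropped will no longer match, which is exactly the precision loss described above.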
What if I need precise results? Keeping an index of the original text might still be necessary, used alongside the compressed indexes. The original text could be offered as an "expand your search" option. With the compressed indexes acting as a triage step that catches most queries, a low-QPS original-text index becomes feasible.
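The two-tier arrangement can be sketched as a thin routing layer. `compressed_search` and `full_search` are assumed to be the search functions for the summary index and the original-text index respectively; only the routing logic is shown:

```python
from typing import Callable

SearchFn = Callable[[str], set[str]]


def tiered_search(query: str,
                  compressed_search: SearchFn,
                  full_search: SearchFn,
                  expand: bool = False) -> set[str]:
    """Triage queries against the cheap compressed index first; only
    hit the expensive original-text index on a miss, or when the user
    explicitly asks to expand the search."""
    results = compressed_search(query)
    if results and not expand:
        return results
    # Miss or explicit expansion: fall through to the low-QPS
    # original-text index and merge whatever the summary tier found.
    return results | full_search(query)
```

Because most queries return from the first tier, the original-text index only sees the residual traffic, which is what keeps it affordable.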
What about decompression? Retrieving the original text from the database or S3 bucket where it is stored would effectively be decompression.
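In code, "decompression" is nothing more than a keyed lookup against primary storage. The in-memory dict below stands in for that store; against S3 this would be a `get_object` call instead:

```python
# Stand-in for the primary document store (a database or S3 bucket).
ORIGINALS = {
    "doc-1": "The complete, uncompressed article text, exactly as ingested.",
}


def decompress(doc_id: str) -> str:
    """Recover the full original text behind a summary-index hit."""
    return ORIGINALS[doc_id]
```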
This is a very new idea made possible by the low cost of summarization with ML models. I have done some preliminary experiments with it, the results look promising, and I am looking for an opportunity to use the technique in production.