Refiant AI, a South African firm, is developing a new type of artificial intelligence infrastructure that focuses on smaller, more economical models that can run on constrained hardware while maintaining performance.
The business has raised $5 million in seed capital to expedite the development of its compression platform, grow its engineering team, and improve enterprise integration.
At its core, Refiant AI builds software that fundamentally changes how AI models operate.
Rather than adding more processing power to existing models, the company applies compression techniques and retraining processes to reduce their size and computational load.
The goal is to enable high-performance AI systems to run on conventional hardware, such as laptops and local enterprise servers.
The company was founded in 2025 by engineers Viroshan Naicker, Siddharth Gutta, and Mathew Haswell, and it operates at the confluence of machine learning optimization and computational efficiency.
Its technology is aimed at stripping superfluous computation from large AI models while retaining most of their original intelligence and accuracy.
One of Refiant AI’s main engineering claims is its ability to compress extremely large models into lightweight systems that can run on modest hardware.
The company claims to have shrunk a model with over 120 billion parameters to run on a system with only 12 GB of RAM, as reported by TechCabal.
Models of that magnitude have historically required high-end GPUs and at least 80 GB of memory to run properly.
According to the firm, this compressed version preserves 95–99 percent of the original model’s performance while decreasing energy consumption by more than 80 percent.
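To see why fitting a 120-billion-parameter model into 12 GB is striking, a rough memory calculation helps. The sketch below only counts raw weight storage at a few numeric precisions; the precisions shown are illustrative assumptions, not Refiant AI’s published method:

```python
# Back-of-envelope weight-storage footprint for a 120B-parameter model.
# The precision levels are illustrative assumptions; Refiant AI has not
# disclosed its exact compression pipeline.

PARAMS = 120e9  # 120 billion parameters

def footprint_gb(bits_per_param: float) -> float:
    """Raw weight storage in gigabytes (1 GB = 1e9 bytes)."""
    return PARAMS * bits_per_param / 8 / 1e9

for label, bits in [("fp16", 16), ("int8", 8), ("int4", 4), ("~0.8-bit", 0.8)]:
    print(f"{label:>8}: {footprint_gb(bits):6.1f} GB")
```

The arithmetic shows that even aggressive 4-bit quantization still needs roughly 60 GB for the weights alone, so reaching 12 GB implies an effective budget of under one bit per parameter, which would require techniques beyond plain quantization, such as pruning or distillation.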
This strategy opposes the mainstream AI infrastructure development trend, in which firms such as Meta and Microsoft invest extensively in expanding data center capacity, high-performance GPUs, and advanced cooling systems to handle increasingly bigger models.
The technology is especially useful in sectors that require secure, low-latency AI systems.
Compressed models can run locally in industries such as banking, telecommunications, and government services, minimizing reliance on cloud providers while keeping sensitive data secure.
This method has the potential to drastically lower the barrier to deploying advanced AI in regions such as Africa, where data center capacity is limited.
The rapid rise of generative AI workloads and cloud-based computing demand has pushed global data center infrastructure spending into the hundreds of billions of dollars.
By allowing high-performance models to run on existing hardware, organisations can deploy AI systems without the cost and complexity of large-scale infrastructure.
