Libya Launches First National Large Language Model, LibiGPT

Libya has unveiled LibiGPT, its first nationally developed large language model (LLM). Created by Smart Co for Technology Projects and Artificial Intelligence, the model is designed to address a gap in AI support for the region's languages. It comes in three versions: LibiGPT-Base (7 billion parameters), LibiGPT-Instruct (13 billion parameters), and LibiGPT-Enterprise (34 billion parameters).

Bridging the AI Gap in Libya

Existing global AI systems such as OpenAI’s ChatGPT and Google’s Gemini struggle with the nuances of Libyan Arabic dialects and cultural context. LibiGPT is trained on a large dataset that includes Modern Standard Arabic (MSA) and North African dialects, enabling it to understand and generate text in Libyan colloquial Arabic (dārija), English, and French.

Why this matters: The launch of a localized LLM isn’t just about language processing. It addresses the broader issue of AI accessibility for populations where existing models lack cultural and linguistic accuracy. For Libyan businesses, government agencies, and educational institutions, LibiGPT offers customizable AI aligned with national priorities.

Key Features and Development

The LibiGPT project included:

  • Training Data: A multi-hundred-billion-token corpus with a substantial Arabic focus, sourced from public datasets, academic texts, Arabic Wikipedia, and licensed content.
  • Optimization Pipeline: Custom Arabic processing, including orthographic normalization, dialect filtering, and improved tokenization.
  • Synthetic Data: Creation of high-quality synthetic Arabic data to improve robustness, reasoning, and translation capabilities.
  • Translation: Accurate translation between Arabic, English, and French, tailored to local cultural contexts.
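
The article does not describe how LibiGPT's orthographic normalization works. As an illustration of what such a step commonly involves in Arabic text processing, here is a minimal Python sketch. The function name normalize_arabic and the specific rules (stripping diacritics and tatweel, unifying alef variants, mapping alef maqsura to ya) are assumptions drawn from common practice, not details confirmed by Smart Co.

```python
import re
import unicodedata

# Arabic diacritics (tashkeel): fathatan through sukun, plus superscript alef.
_DIACRITICS = re.compile("[\u064B-\u0652\u0670]")

# Tatweel (kashida) is a purely decorative elongation character.
_TATWEEL = "\u0640"

# Hypothetical rule set: fold common letter variants into one canonical form.
_LETTER_MAP = str.maketrans({
    "\u0623": "\u0627",  # alef with hamza above -> bare alef
    "\u0625": "\u0627",  # alef with hamza below -> bare alef
    "\u0622": "\u0627",  # alef with madda       -> bare alef
    "\u0671": "\u0627",  # alef wasla            -> bare alef
    "\u0649": "\u064A",  # alef maqsura          -> ya
})


def normalize_arabic(text: str) -> str:
    """Apply a simple, lossy orthographic normalization to Arabic text."""
    # NFKC folds Arabic presentation forms into their standard letters.
    text = unicodedata.normalize("NFKC", text)
    text = _DIACRITICS.sub("", text)
    text = text.replace(_TATWEEL, "")
    return text.translate(_LETTER_MAP)


if __name__ == "__main__":
    # A fully vowelled greeting; the output drops the vowel marks and the hamza.
    vowelled = "\u0623\u064E\u0647\u0652\u0644\u064B\u0627"
    print(normalize_arabic(vowelled))
```

Normalization of this kind is lossy: it reduces spelling variation so that the tokenizer sees fewer surface forms, at the cost of discarding distinctions that can matter in some contexts. Which rules are appropriate for dialectal text is a design decision the article does not address.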

The development team has also prioritized data security, storing all data locally to address data sovereignty concerns.

Future Roadmap

According to Dr. Ali Othman Al-Baji, founder and CEO of Smart Co, future plans include:

  • Extended Context Windows: Extending the context length to more than 200,000 tokens.
  • Domain-Specific Models: Development of specialized AI for legal, financial, healthcare, and government sectors.
  • Dialect Expansion: Improved understanding of Arabic dialects across the region.
  • Enterprise Solutions: Retrieval-augmented generation systems optimized for Arabic.

Regional Trend: National AI Development

LibiGPT is part of a growing trend in the Maghreb region. The lack of localized language models has historically limited AI adoption by local communities and prevented governments from fully leveraging AI for public services. National AI initiatives are now underway across the region, driven by both commercial and academic sectors.

The launch of LibiGPT marks a significant step towards greater AI accessibility and sovereignty for Libya, aligning with a broader movement to prioritize localized language models in North Africa.

The project demonstrates that AI development can be tailored to the linguistic and cultural needs of specific regions, producing tools that are more relevant and useful to local users.