Large language models (LLMs)

The BharatGPT group — led by IIT Bombay announced that it would launch its first ChatGPT-like service. The group is backed by Reliance Industries Ltd and the Department of Science and Technology.

  • The group built the ‘Hanooman’ series of Indic language models in collaboration with Seetha Mahalaxmi Healthcare (SML).
  • Actually, Hanooman is a series of large language models (LLMs) that can respond in 11 Indian languages like Hindi, Tamil, and Marathi, with plans to expand to more than 20 languages.
  • LLMs are large general-purpose language models that can be pre-trained and then fine-tuned for specific purposes.
  • It means, these models are trained to solve common language problems such as text classification, question answering, text generation across industries, document summarisation, etc.
  • Large language models use deep learning techniques to process large amounts of text. They work by processing vast amounts of text, understanding the structure and meaning, and learning from it.
  • LLMs are ‘trained’ to identify meanings and relationships between words. The greater the amount of training data a model is fed, the smarter it gets at understanding and producing text.
  • The LLMs can also be tailored to solve specific problems in a variety of domains such as finance, retail, entertainment, etc., using perhaps a relatively small size of field datasets.
  • It involves the training of artificial neural networks, which are mathematical models which are believed to be inspired by the structure and functions of the human brain.
  • These can summarizes long articles, news stories, research reports;
  • These can understand and answer questions, offer suggestions and engage in natural language conversation.
  • These can assist developers in building applications and finding errors and security-related faults.

Written by 

Leave a Reply

Your email address will not be published. Required fields are marked *