As the Artificial intelligence (AI) war between companies grows fiercer, those at the forefront are making sure they stay ahead of the curve to win the bigger game. With generative AI leader OpenAI's Chat GPT-4 already making waves, Google has launched its newest AI model Gemini Pro for developers and organisations, along with a range of other AI tools, models and infrastructure.
Gemini Pro will be available via the Gemini API to developers in Google AI Studio, which was previously known as MakerSuite. It’s also available to enterprises through Google Cloud’s Vertex AI platform.
"Gemini Pro accepts text as input and generates text as output. Gemini Pro Vision multimodal endpoint that's also available accepts text and imagery as input, with text output," says the company.
Google has also launched other models -- Imagen 2 and MedLM -- in Vertex AI, and these are specifically designed to provide developers the flexibility to build and ship applications. The upgraded 'Imagen 2' is a text-to-image diffusion tool, while MedLM is a family of foundation models fine-tuned for the healthcare industry, though it'll cater to the US market initially.
Imagen-2 is Google DeepMind's most advanced text-to-image diffusion technology to date, which provides improved image quality and features like creative and realistic logos for businesses, and rendering text in multiple languages. The company also announced the availability of 'Duet AI' for developers and 'Duet AI' in security operations.
With Duet AI, developers can build applications inside a code editor. It streamlines running applications on Google Cloud. Google says it will incorporate Gemini across ‘Duet AI’ portfolio soon. The company claims to be the first major cloud provider to make generative AI generally available to defenders in a unified SecOps platform.
The 'Gemini Pro' API can be used via AI Studio, a free web-based developer tool that enables one to quickly develop prompts and then get an API key to use in app development. Google had launched Gemini just last week. The company says it is its "largest and most capable AI model and comes in Ultra, Pro and Nano sizes. "We've already started rolling out Gemini in our products: Gemini Nano is in Android, starting with Pixel 8 Pro, and a specifically tuned version of Gemini Pro is in Bard."
Features of GeminiPro
It’s free to use right now, within limits, but Google says it will be competitively priced soon.
It comes with a 32K context window for text
Features include function calling, embeddings, semantic retrieval and custom knowledge grounding, and chat functionality.
Supports 38 languages across 180+ countries
Notably, like Google's Gemini, OpenAI's advanced system GPT-3 also allows developers to build with its API. They use GPT-3 to build interactive chatbots and virtual assistants that can carry out conversations. They can also use it to summerise, synthesise, and answer questions about the large text. It can also be fine-tuned on custom data to improve any task.