Technology

The Age of Google Gemini

Published

5 months ago

December 12, 2023

Skylar Lee

Google’s AI Gemini is the new promising AI tool suite rivaling ChatGPT and other large language models. Google believes that this new AI is their state-of-the-art model so far. What makes Google Gemini a sophisticated and more powerful AI language model?

What is Google Gemini?

Google Gemini is a multimodal language model. It understands code and outfits, detects images and emojis, and guesses a movie. General users can experience Google Gemini with Google Bard.

Google Gemini isn’t the only language model created by the search engine corporation. PaLM 2 is the other Google model, which powers the Workspace Labs AI (Duet AI) to help users work with AI in Google Workspace.

Developers

Developers can leverage Google Gemini to create other AI language models. The Gemini comes in three forms: Nano, Pro, and Ultra.

Nano is made for on-device tasks. Meanwhile, Pro helps you scale the AI across various devices. Finally, Ultra is for highly complex tasks.

Additionally, you don’t have to build a code on your own since it can generate code. Do you also need Gemini to compete with other web developers for your team? Gemini can be used for competitive programming, too! It’s adept in various coding languages and has an advanced coding system to solve coding problems.

Gemini can be used for building software systems on Google AI Studio and Google Cloud Vertex AI starting December 13, 2023.

Google Bard

Google Bard is the search engine’s answer to compete with ChatGPT-4. Since Gemini’s release, Google Bard is now powered by Google Gemini.

Like ChatGPT, you can submit a prompt, and Bard provides the answers. One advantage of Bard against ChatGPT is that it’s free and can even provide images to show you processes. However, Bard isn’t a perfect model yet. You can correct Bard if it makes a mistake. Plus, it doesn’t generate images, unlike other AI models.

An additional benefit of using Google Bard is you can upload images. OpenAI’s GPT-4 model allows you to post images, but not on the GPT-3.5 model. If you don’t want Google Bard to keep the images on your chat history, you can delete the prompt, along with the image(s) you uploaded on the prompt.

Is Google Gemini Safe to Use?

Google’s tech officers have mentioned that they are building safeguards to ensure that Google Gemini is safe to use. Plus, they ensured built-in responsibility practices, too. They collaborate with organizations to establish benchmarks and test models for safer and more responsible use as they develop AI.

Is Google Gemini a Sham?

Bloomberg published an op-ed last December 7 praising the Gemini’s remarkable features. However, it was still behind the AI giant OpenAI. What’s surprising about the op-ed was Bloomberg pointed out an issue in the Google demo. The media organization reached out to Google regarding the demo.

Parmy Olson, author of the Bloomberg op-ed, noticed a discrepancy in how the prompts and the seamlessness of Gemini were being used on the video. It appears that Gemini received text prompts from Google beforehand, and audio and narration were added afterward. It was misleading since the editing appears that a voice prompt was responsible for Gemini to provide reasoning or outputs.

There was a disclaimer regarding the latency and brevity of prompts, but it was added only to the description box on the YouTube video.

Really happy to see the interest around our “Hands-on with Gemini” video. In our developer blog yesterday, we broke down how Gemini was used to create it. https://t.co/50gjMkaVc0

We gave Gemini sequences of different modalities — image and text in this case — and had it respond… pic.twitter.com/Beba5M5dHP
— Oriol Vinyals (@OriolVinyalsML) December 7, 2023

Oriol Vinyals, VP of Research and Deep Learning Lead, had responded to the allegations about the demo on X (formerly Twitter). Vinyals emphasized the disclaimer posted in the description box and said that it illustrated possible experiences with the multimodal model. But other X users weren’t happy with his response.

If you want to inspire developers then why don’t you post factual content? The prompts can’t be “real” and shortened at the same time. It was disingenuous and misleading
— Benedict (@BenedictSlaney) December 7, 2023

Wow, very disappointed to hear this.
— earl pantone (@earlpantone) December 9, 2023

Comparisons with OpenAI and Microsoft Bing AI

OpenAI

Google has published its own comparison table with OpenAI’s ChatGPT-4 model. Based on the table, Google Gemini seems to do better in multimodal tasks (image, video, and audio) across the board. Meanwhile, Google Gemini does well in text, except for Reasoning (HellaSwag), lagging behind 7.5% from OpenAI.

OpenAI has ChatGPT and Dall-E to help users create written content and images. Like Google, OpenAI works with machine learning experts to ensure safety and responsibility. The AI company claims GPT-4 is more creative and provides more context. Additionally, it reads images and can generate captions for images.

Microsoft Bing AI

One primary feature of Microsoft Bing AI is the user’s ability to choose among three conversational styles: Creative, Balanced, and Precise. Additionally, Bing claims that their AI is an expert in generating responses for the following topics:

Advice
Recipes
Travel plans
Language translations

Microsoft Bing AI is ideal for Windows users and Microsoft Edge since they use the Copilot feature. Copilot helps users answer complex questions and get further assistance with the touch of a button.

Bing AI can also generate written content, such as blogs, emails, and ad copies. Additionally, it can generate images.

Owner's Magazine

Technology

The Age of Google Gemini