By using this site, you agree to the Privacy Policy and Terms of Use.
Accept
TechgoonduTechgoonduTechgoondu
  • Audio-visual
  • Enterprise
    • Software
    • Cybersecurity
  • Gaming
  • Imaging
  • Internet
  • Media
  • Mobile
    • Cellphones
    • Tablets
  • PC
  • Telecom
Search
© 2023 Goondu Media Pte Ltd. All Rights Reserved.
Reading: Betting on AI rush, Google Cloud connects to rival AI models, boosts GPU offering
Share
Font ResizerAa
TechgoonduTechgoondu
Font ResizerAa
  • Audio-visual
  • Enterprise
  • Gaming
  • Imaging
  • Internet
  • Media
  • Mobile
  • PC
  • Telecom
Search
  • Audio-visual
  • Enterprise
    • Software
    • Cybersecurity
  • Gaming
  • Imaging
  • Internet
  • Media
  • Mobile
    • Cellphones
    • Tablets
  • PC
  • Telecom
Follow US
© 2023 Goondu Media Pte Ltd. All Rights Reserved.
Techgoondu > Blog > Enterprise > Betting on AI rush, Google Cloud connects to rival AI models, boosts GPU offering
EnterpriseSoftware

Betting on AI rush, Google Cloud connects to rival AI models, boosts GPU offering

Alfred Siew
Last updated: September 6, 2023 at 9:32 AM
Alfred Siew
Published: September 4, 2023
8 Min Read
SHARE
Visitors at the Google Cloud Next 2023 event in San Francisco in late August 2023. PHOTO: Google Cloud website

From delivering the digital infrastructure to run AI workloads to pushing its AI collaboration assistants across its products, Google Cloud is ramping up AI efforts to make its cloud offerings more attractive to businesses.

The cloud giant is even allowing its customers to make use of AI models from rival technology companies, such as Meta, to enable more flexibility and see real results.

Google Cloud customers will be able to make use of Meta’s Llama 2 large language model (LLM), along with AI startup Anthropic’s Claude 2 chatbot, as they build AI-based apps and services.

These will be alternatives to Google’s own PaLM 2 LLM that it already offers to its cloud customers to make use of generative AI features, which power its various tools including the Bard chatbot.

In Southeast Asia, Palm 2 will now support text and chat in languages commonly used in the region, such as Simplified Chinese, Traditional Chinese, Thai and Vietnamese.

Google is also planning to host PaLM 2 for text and chat in its Singapore cloud region later this year. It will support question-answer chats and summarise and analyse large documents like research papers, books and legal briefs.

Tying all this together is the Google Cloud infrastructure, which is optimised for AI and used by more than 70 per cent of AI unicorns, including AI21, Anthropic, Cohere and more, the company said at its annual developer and customer event in San Francisco last week.

At Google Cloud Next, Google Cloud chief executive officer, Thomas Kurian, said its AI offerings come with the required security that safeguards customers’ data so it is not exposed without their consent.

“We take a snapshot of the model, allowing you to train and encapsulate it together in a private configuration, giving you complete control over your data,” he noted.

“Your prompts and data, as well as user inputs at inference time, are not used to improve our models and are not accessible to other customers,” he added.

Besides promising flexibility and security, perhaps Google Cloud’s strongest selling point to customers looking to the cloud to run AI workloads is the raw power of its digital infrastructure.

The hyperscale cloud company has worked out a deal with chipmaker Nvidia to run AI workloads, such as training of machine learning models, with supercomputers powered by Nvidia’s H100 graphics processing unit (GPU).

Available next month, each A3 super computer promises up to 26 exaFlops of AI performance, thus improving the time and costs for training large machine learning models.

And for companies moving from training to serving their machine learning models, the new new A3 offering can offer up to 30x improvement in inference performance over a previous A2 version.

With a shortage in GPUs and AI-optimised performance chips in the industry, Google Cloud’s new offering could draw more businesses in as they seek to crunch their data on hand to develop their own AI models.

This will be one advantage that it will leverage to close the gap with rivals Amazon Web Services and Microsoft Azure, which respectively command 30 per cent and 26 per cent of the cloud infrastructure market. Google Cloud has 9 per cent.

Besides cutting edge AI, Google Cloud is also looking to attract businesses that want more cost effective AI-optimised cloud infrastructure through its new Cloud TPU v5e AI accelerator.

Customers can make use of a single Cloud Tensor Processing Unit platform to run both large-scale AI training and inferencing. It promises twice the training performance per dollar and up to 2.5 times higher inference performance per dollar for LLMs and gen AI models compared to an earlier Cloud TPU v4 platform.

This means more businesses may be able to get onboard the AI bandwagon and start creating their own, more complex, generative AI models.

Cloud TPU v5e is currently available in public preview in Google Cloud’s Las Vegas and Columbus cloud regions, with plans to expand to other regions, including Google Cloud’s Singapore cloud region, later this year.

Inside a Google Cloud region running Cloud Tensor Processing Units, including custom-built chips, data centre networking, optical circuit switches, water cooling systems and biometric security verification. PHOTO: Google Cloud

To be sure, giving businesses the ability to run AI workloads is one part of the equation. Google Cloud is also offering AI features across its Google Workspace apps that business users depend on every day to get tasks done.

Here, the company has made Duet AI, the AI collaborator that is always on, available to the general public. It promises to a writing helper, a spreadsheet expert, a project manager, a note taker for meetings, and a creative visual designer.

Later this year, the AI collaborator will take on more roles, helping those managing cloud infrastructure by becoming an expert coder, a software reliability engineer, a database pro, an expert data analyst, and a cybersecurity adviser, said Google Cloud.

“Imagine you’re a financial analyst and you get an e-mail at 5pm from your boss asking for a presentation on Q3 performance by 8am tomorrow,” said Aparna Pappu, general manager and vice president for Google Workspace.

“Instead of scrambling through forecasts in Sheets, P&L docs, monthly business review slides, and reading e-mails from the regional sales leads, you’ll soon be able to simply ask Duet AI to do the heavy lifting with a prompt like “create a summary of Q3 performance”,” she noted.

Duet AI can create a whole new presentation, complete with text, charts, and images, based on a user’s relevant content in Drive and Gmail, she added.

“A last-minute request that once called for an all-nighter, can now be completed before dinner time,” she said.

In future, AI may even make dreary team meetings more tolerable. Duet AI is now available in the Google Meet video call app and it can now capture notes, action items, and video snippets in real-time with the new “take notes for me” feature. A summary can be sent to attendees after the meeting.

Duel AI can even help latecomers get up to speed with a “summary so far” option, which gives a quick snapshot of everything they have missed.

For users who cannot make a meeting but have inputs to share, the “attend for me” feature even lets the AI join the meeting on their behalf, delivering their message and ensuring they get the recap.

CORRECTION at September 6, 2023, 9:29am: An earlier version of the article used the wrong pronoun for a newsmaker. This has been corrected. We are sorry for the error.

Red Hat eyes PaaS market with OpenShift
Green tropical data centre testbed gets boost with Schneider Electric collaboration
Q&A: 5G is just starting but researchers are looking to 6G now: Keysight
SAP fuels AI innovation in Singapore with S$12 million investment
Tech remains key factor to attract foreign investments to Singapore: EDB
TAGGED:AICloud TPU v5eDuet AIGoogle CloudGoogle WorkspaceH100metaNvidiaPalm 2

Sign up for the TG newsletter

Never miss anything again. Get the latest news and analysis in your inbox.

By signing up, you agree to our Terms of Use and acknowledge the data practices in our Privacy Policy. You may unsubscribe at any time.
Share This Article
Facebook Whatsapp Whatsapp LinkedIn Copy Link Print
Avatar photo
ByAlfred Siew
Follow:
Alfred is a writer, speaker and media instructor who has covered the telecom, media and technology scene for more than 20 years. Previously the technology correspondent for The Straits Times, he now edits the Techgoondu.com blog and runs his own technology and media consultancy.
Previous Article Sennheiser Ambeo Soundbar Mini review: Immersive movie audio despite compact size
Next Article An inside look at Intel’s Malaysia chip assembly and test operations
Leave a Comment

Leave a ReplyCancel reply

This site uses Akismet to reduce spam. Learn how your comment data is processed.

Stay Connected

FacebookLike
XFollow

Latest News

Scammers are so successful they even accidentally scam themselves now
Cybersecurity Internet
June 10, 2025
Doom: The Dark Ages review: Future fantastic demon slaying
Gaming
June 10, 2025
Plaud NotePin review: Note-taking made easy with AI
Internet Mobile
June 9, 2025
Can smart grocery carts, biometric payments boost retailers like FairPrice?
Enterprise Internet
June 6, 2025

Techgoondu.com is published by Goondu Media Pte Ltd, a company registered and based in Singapore.

.

Started in June 2008 by technology journalists and ex-journalists in Singapore who share a common love for all things geeky and digital, the site now includes segments on personal computing, enterprise IT and Internet culture.

banner banner
Everyday DIY
PC needs fixing? Get your hands on with the latest tech tips
READ ON
banner banner
Leaders Q&A
What tomorrow looks like to those at the leading edge today
FIND OUT
banner banner
Advertise with us
Discover unique access and impact with TG custom content
SHOW ME

 

 

POWERED BY READYSPACE
The Techgoondu website is powered by and managed by Readyspace Web Hosting.

TechgoonduTechgoondu
© 2024 Goondu Media Pte Ltd. All Rights Reserved | Privacy | Terms of Use | Advertise | About Us | Contact
Join Us!
Never miss anything again. Get the latest news and analysis in your inbox.

Zero spam, Unsubscribe at any time.
 

Loading Comments...
 

    Welcome Back!

    Sign in to your account

    Username or Email Address
    Password

    Lost your password?