chatgpt -

What could kill the $1trn artificial-intelligence boom?

August 14, 2024 by

Mr Pichai is not alone. New Street Research, a firm of analysts, estimates that Alphabet, Amazon, Meta and Microsoft will together splurge $104bn on building AI data centres this year. Add in spending by smaller tech firms and other industries and the total AI data-centre binge between 2023 and 2027 could reach $1.4trn.

The scale of this investment, and uncertainty over if and when it will pay off, is giving shareholders the jitters. The day after Alphabet’s results the Nasdaq, a tech-heavy index,fell by 4%,the biggest one-day drop since October 2022. This week analystswill pore over the quarterly results of Amazon and Microsoft,the world’s two biggest cloud companies,for clues as to how their AI businesses are faring.

For now, the tech giants show little inclination to pare back their investments, as Mr Pichai’s remarks show. That is good news for the myriad suppliers that are benefiting from the boom. Nvidia, a maker of AI chips that in June briefly became the world’s most valuable company, has grabbed most of the headlines. But the AI supply chain is far more sprawling. It spans hundreds of firms, from Taiwanese server manufacturers and Swiss engineering outfits to American power utilities. Many have seen a surge in demand since the launch of ChatGPT in 2022, and are themselves investing accordingly. In time, supply bottlenecks or waning demand could leave them over-extended.

AI investment can broadly be split into two. Half of it goes to chipmakers, with Nvidia the main beneficiary. The rest is spent on makers of equipment that keeps the chips whirring, ranging from networking gear to cooling systems. To assess the goings-on along the ai supply chain, The Economist has examined a basket of 60-odd such companies. Since the start of 2023 the mean share price of firms in our universe has risen by 106%, compared with a 42% increase in the s&p 500 index of American stocks (see chart). Over that time their expected sales for 2025 climbed by 14%, on average. That compares with a 1% increase across non-financial firms, excluding tech companies, in the S&P 500.

The biggest gainers were chipmakers and server manufacturers (see chart). Nvidia accounted for almost a third of the rise in the group’s expected sales. It is forecast to sell $105bn of AI chips and related equipment this year, up from $48bn in its latest fiscal year. AMD, its nearest rival, will probably sell about $12bn of data-centre chips this year, up from $7bn. In June Broadcom, another chipmaker, said that its quarterly AI revenues jumped by 280%, year on year, to $3.1bn. It helps customers, including cloud providers, design their own chips, and also sells networking equipment. Two weeks later Micron, a maker of memory chips, said its data-centre revenues had also jumped, thanks to soaring AI demand.

Companies that make servers are also raking it in. Both Dell and Hewlett Packard Enterprise (HPE) said in their most recent earnings calls that sales of AI servers doubled in the past quarter. Foxconn, a Taiwanese manufacturer that assembles lots of Apple’s iPhones, also has a server business. In May it said its AI sales had tripled over the past year.

Other firms are seeing interest spike, even if new sales have not yet materialised. Eaton, an American maker of industrial machinery, said that in the past year it saw more than a four-fold increase in customer enquiries related to its AI data-centre products. AI servers can require up to ten times more power than conventional ones. Earl Austin junior, the boss of Quanta Services, a firm that builds renewable-power and transmission equipment, recently admitted that the surge in demand for its data-centre business had “caught me off guard a little bit”. Vertiv, which sells cooling systems used in data centres, noted in April that its pipeline of AI projects more than doubled within two months.

All this interest is setting off a further frenzy of investment. This year around two-thirds of firms in our sample are expected to raise their capital expenditure, relative to sales, above their five-year averages. Many companies are building new factories. They include Wiwynn, a Taiwanese server-maker, Supermicro, an American one, and Lumentum, an American seller of advanced networking cables. Many are also spending more on research and development.

Some companies are investing through acquisitions. This month AMD said it was buying Silo AI, a startup, to boost its AI capabilities. In January HPE announced that it would spend $14bn to buy Juniper Networks, a networking firm. In December Vertiv announced its purchase of CoolTera, a liquid-cooling specialist. The firm hopes this will help it scale up its production of liquid-cooling technology 40-fold.

Just as the spending ramps up, though, the threats to the ai supply chain are building. One problem is its heavy reliance on Nvidia. Baron Fung, of Dell’Oro Group, a research firm, notes that when Nvidia went from launching a new chip every two years to every year, the entire supply chain had to scramble to build new production lines and meet accelerated timelines. Future sales for lots of firms in the AI supply chain are predicated on keeping the world’s most valuable chipmaker happy.

Another threat stems from supply bottlenecks, most notably in the availability of power. An analysis by Bernstein, a broker, looks at a scenario in which by 2030 AI tools are used roughly as much as Google search is today. That would raise the growth in power demand in America to 7% a year, from 0.2% between 2010 and 2022. It would be hard to build that much power capacity swiftly. Stephen Byrd of Morgan Stanley, a bank, notes that in California, where many AI data centres could be built, it takes six to ten years to get connected to the grid.

Some companies are already trying to fill the gaps by providing off-grid power. In March Talen Energy, a power company, sold Amazon a data centre connected to a nuclear-power plant for $650m. CoreWeave, a small AI cloud provider, recently struck a deal with Bloom Energy, a fuel-cell maker, to produce on-site power. Others are repurposing sites such as bitcoin-mining locations that already have grid access and power infrastructure. Still, the energy needs for AI are so vast that the risk of a power shortage limiting activity remains.

The biggest threat to the AI supply chain would come from waning demand. In June Goldman Sachs, a bank, and Sequoia, a venture-capital firm, published reports questioning the benefits of current generative-AI tools, and—by extension—the wisdom of the cloud-computing giants’ spending bonanza. If AI profits remain elusive, the tech giants could cut capital spending, leaving the supply chain exposed.

The build-out of factories has brought higher fixed costs. Across our sample of firms the median spending on property, plants and equipment is expected to jump by 14% between 2023 and 2025. Some investments may start to look suspect if demand is slow to materialise. The price tag on HPE’s purchase of Juniper Networks was two-thirds of the acquirer’s market value when it was announced in January.

Even after the wobbles of last week, market expectations remain bullish. For our sample of firms the median price-to-earnings ratio, a measure of how investors value profits, has climbed by nine percentage points since the start of 2023. If such expectations are to be met, AI tools need to improve quickly, and businesses need to adopt them en masse. For the many companies along the AI supply chain, the stakes are getting uncomfortably high.

Source link

Why OpenAI-Google battle is not just about search. It’s also about building the most powerful AI

August 14, 2024 by

While this is the obvious part, beneath the surface, the bigger fight is also about controlling all streams of user data, including those from search engines and social media, which can help big tech companies such as Google, OpenAI, Microsoft, Meta, Nvidia and Elon Musk’s xAI build the world’s most powerful artificial intelligence (AI) model.

ChatGPT managed to garner more than 100 million users in just the first two months of its launch in December 2023, prompting many to dub it a search-engine killer. The reason was that ChatGPT allows us to write poems, articles, tweets, books, and even code like humans and is interactive, while search engines passively provide article links. Microsoft, which has a stake in OpenAI, even integrated ChatGPT with its own search engine, Bing. At that time, though, ChatGPT was still being tested and lacked knowledge of current events, having trained on data only till the end of 2021.

From September 2023, ChatGPT began accessing the internet, thus providing up-to-date information. But it started facing allegations of “verbatim”, “paraphrase”, and “idea” plagiarism and copyright violations from publishers around the world. Late last year, for instance, The New York Times initiated legal proceedings against Microsoft and OpenAI, alleging unauthorized “copying and using millions of its articles”. OpenAI did give publishers the option to block bots from crawling their content but separating AI bots from those originating from search engines such as Google or Microsoft’s Bing, which facilitate page indexing and visibility in search outcomes, is easier said than done.

OpenAI’s SearchGPT prototype, which is currently available for testing, will not only access the web but also provide “clear links to relevant sources”, the company said in a blog post on 26 July. This implies that more than targeting Google’s search engine, OpenAI appears to be trying to pacify and rebuild rapport with publishers it has antagonised. And this time around, OpenAI is “…also launching a way for publishers to manage how they appear in SearchGPT, so publishers have more choices”.

It clarifies that SearchGPT is about search and “separate from training OpenAI’s generative AI foundation models”. It adds that the search results will show sites even if they opt out of generative AI training. OpenAI explains that a webmaster can allow its “OAI-SearchBot to appear in search results while disallowing GPTbot to indicate that crawled content should not be used for training OpenAI’s generative AI foundation models”.

Equations are changing, but slowly

To be sure, ChatGPT’s success is already making a dent in Google’s worldwide lead, which makes most of its revenue from advertising. For instance, Google saw its smallest search market share on desktops registered in more than a decade. Microsoft’s Bing, which supported and integrated ChatGPT into its service, surpassed 10% of the market share on desktop devices, according to Statista.

Google, whose advertising search revenue was $279.3 billion in 2023, is taking a hit, with many users already preferring Generative AI (GenAI) for searching online information first. “Many companies heard the call and saw $13 billion invested in generative AI (GenAI) for broad usage, namely search engines and large language models (LLMs), in 2023,” according to Statista.

Yet, Google, according to Statista, continues to control more than 90% of the search-engine market worldwide across all devices, handling over 60% of all search queries in the US alone and generating over $206.5 billion in ad revenues from its search engine and YouTube. In India, too, the search-engine giant has a market share of over 92%, but in countries like Germany and France, though, online users are increasingly choosing “privacy- or sustainability-focused alternatives such as DuckDuckGo or Ecosia”, according to Statista. China, on its part, has Baidu, while South Korea favours Naver; even Russia’s Yandex now has the third-largest market share among search engines worldwide.

ChatGPT certainly did not topple Google, agrees Dan Faggella, founder of market research firm Emerj Artificial Intelligence Research. “But it (OpenAI) definitely was seemingly their strongest real competitor,” he adds. “I’m much more nervous for Perplexity in, say, the next three months than I am about Google,” says Fagella, for the lack of a “differentiator”.

“I think it’s a cool app. But I wonder if there’s enough of a context wrap for things like enterprise search. Google used to do enterprise search but no longer sees sense in it,” he adds. Perplexity, which has raised $100 million from the likes of Amazon founder Jeff Bezos and Nvidia, was valued at $520 million in its last funding round.

In a February interview with Mint, Srinivas argued that while Google will continue to have a “90-94% market share”, they will lose “a lot of the high-value traffic—from people who live in high-GDP countries and earning a lot of money, and those who value their time and are willing to pay for a service that helps them with the task”. He argued that over time, “the high-value traffic will slowly go elsewhere”, while low-value “navigational traffic” will remain on Google, making Google “a legacy platform that supports a lot of navigation services”.

“The bigger consideration is that the means and interfaces through which search occurs are evolving. These may become new interfaces other than the Chrome tab, where Google can very much get pushed aside, and I think the VR (virtual reality) ecosystem will be part of that as well. I don’t see Google dying tomorrow. But I think they should be shaking in their boots a little bit around what the future of search will be,” says Fagella.

Race to dominate the AI space

Fagella believes that “search is a subset of a much broader substrate monopoly game. It’s all about owning the streams of attention and activity—from personal and business users for things like their workflows, personal lives and conversations to help them (big tech companies) build the most powerful AI”. This, he explains, is why all big companies want you to have their chat assistant so that they can continue to economically dominate.

Fagella believes that all the moves indicate that the big tech companies, including Google, Meta, and OpenAI, “are ardently moving towards artificial general intelligence (AGI). “Apple’s a little quieter about it. I don’t know where Tim Cook stands. They’re always a little bit more standoffish. But suffice it to say, they’re probably in that same running as well, although seemingly not as overt about it,” he adds.

OpenAI, for instance, has multimodal GenAI models, including GPT-4o and GPT-4 Turbo, while Google’s Gemini 1.5 Flash is available for free in more than 40 languages. Meta recently released Llama 3.1 with 405 billion parameters, which is the largest open model to date, and Mistral Large 2 is a 128 billion-parameter multilingual LLM. Big tech companies are also marching ahead on the path to achieve AGI, which envisages AI systems that are smarter than humans.

OpenAI argues that because “…the upside of AGI is so great, we do not believe it is possible or desirable for society to stop its development forever; instead, society and the developers of AGI have to figure out how to get it right…We don’t expect the future to be an unqualified utopia, but we want to maximize the good and minimize the bad and for AGI to be an amplifier of humanity”.

And OpenAI does not mind spending a lot of money to pursue this goal. The ChatGPT maker could lose as much as $5 billion this year, according to an analysis by The Information. However, in a conversation this May with Stanford adjunct lecturer Ravi Belani, Sam Altman said, “Whether we burn $500 million a year, or $5 billion or $50 billion a year, I don’t care. I genuinely don’t (care) as long as we can, I think, stay on a trajectory where eventually we create way more value for society than that, and as long as we can figure out a way to pay the bills like we’re making AGI it’s going to be expensive it’s totally worth it,” he added.

In July, Google DeepMind proposed six levels of AGI “based on depth (performance) and breadth (generality) of capabilities”. While the ‘0’ level is no AGI, the other five levels of AGI performance are: Emerging, competent, expert, virtuoso and superhuman. Meta, too, says it’s long-term vision is to build AGI that is “open and built responsibly so that it can be widely available for everyone to benefit from”. Meanwhile, it plans to grow its AI infrastructure by the end of this year with two 24,000 graphics processing unit (GPU) clusters using its in-house designed Grand Teton open GPU hardware platform.

Elon Musk’s xAI company, too, has unveiled the Memphis Supercluster, underscoring the partnership between xAI, X and Nvidia, while firming up his plans to build a massive supercomputer and “create the world’s most powerful AI”. Musk aims to have this supercomputer—which will integrate 100,000 ‘Hopper’ H100 Nvidia graphics processing units (and not Nvidia’s H200 chips or its upcoming Blackwell-based B100 and B200 GPUs)—up and running by the fall of 2025.

What can spoil the party

No AI model to date can be said to have powers of reasoning and feelings as humans do. Even Google DeepMind underscores that other than the ‘Emerging’ level, the other four AGI levels are yet to be achieved. LLMs, too, remain highly advanced next-word prediction machines and still hallucinate a lot, prompting sceptics like Gary Marcus, professor emeritus of psychology and neural science at New York University, to predict that the GenAI “…bubble will begin to burst within the next 12 months”, leading to an “AI winter of sorts”.

“My strong intuition, having studied neural networks for over 30 years (they were part of his dissertation) and LLMs since 2019, is that LLMs are simply never going to work reliably, at least not in the general form that so many people last year seemed to be hoping. Perhaps the deepest problem is that LLMs literally can’t sanity-check their own work,” says Marcus.

I elaborated on these points in my 19 July newsletter, Misplaced enthusiasm over AI Appreciation Day. When will AI, GenAI provide RoI?, where Daron Acemoglu, institute professor at the Massachusetts Institute of Technology (MIT), argues that while GenAI “is a true human invention” and should be “celebrated”, “too much optimism and hype may lead to the premature use of technologies that are not yet ready for prime time”. His interview was published in a recent report, Gen AI: too much spend, too little benefit?, by Goldman Sachs.

There’s also the fear that all big AI models will eventually run out of finite data sources like Common Crawl, Wikipedia and even YouTube to train their AI models. However, a report in The New York Times said many of the “most important web sources used for training AI models have restricted the use of their data”, citing a study published by the Data Provenance Initiative, an MIT-led research group.

“Indeed, there is only so much Wikipedia to vacuum up. It takes billions of dollars to train this thing, and you’re going to suck that up pretty quickly. You’re also going to start sucking up all the videos pretty quickly, despite how quickly we can pump them in,” Fagella agrees.

He believes that the future of AI development will involve integrating sensory data from real-world interactions, such as through cameras, audio, infrared, and tactile inputs, along with robotics. This transition will enable AI models to gain a deeper understanding of the physical world, enhancing their capabilities beyond what is possible with current data.

Fagella points out that the competition for real-world data and the strategic deployment of AI in robotics and life sciences will shape the future economy, with major corporations investing heavily in AI infrastructure and data acquisition, even as data privacy and security will remain critical issues. He concludes, “The inevitable transition is to be touching the world.”

Source link

It’s swallowed billions of dollars, but has AI lived up to the hype?

August 14, 2024 by

Since AI’s most popular offering, OpenAI’s ChatGPT, debuted two years back and made esoteric AI tech accessible to the masses, there has been excitement over intelligent machines taking over mundane tasks or assisting humans in complex work. Geeks declared that costs would drop and productivity skyrocket, eventually leading to ‘artificial general intelligence’, when machines would run the world.

Huge sums were poured into companies focused on building AI solutions. In 2023, venture capital investments into Generative AI (a subset of AI to create text, images, video) startups totalled $21.3 billion, growing three-fold from $7.1 billion in 2022, according to consultancy EY.

But AI is a cash guzzler—Microsoft, Meta and Alphabet invested $32 billion in the first quarter of 2024 in AI development. The billions that were invested have been spent on expensive hardware, software and power-hungry data centres, totting up Big Tech valuations, but without real benefits.

Enterprises, meanwhile, have been waiting on the sidelines for the most part. With little return on investment (RoI) expected in the foreseeable future, they have been hesitant to deploy or depend entirely on AI. They also have doubts about the accuracy of AI generated results, aside from concerns over data privacy and governance.

So, while huge sums of money have been invested in AI, the rate of adoption has been slow, costs (of access) are very high, and the output is not reliable. For all the money that has been spent, AI should be able to solve complex tasks. But the only visible beneficiaries are the few big companies with a stake in AI, such as AI chipmaker Nvidia, which saw its market value jump by over $2 trillion in under two years as investors picked the stock anticipating a disruptive change. But what happened on 24 July shows that investors are running out of patience.

Inflated expectations

Goldman Sachs forecasts there will be expenditure of $1 trillion over the next few years to develop AI infrastructure.

Last month, Wall Street investment bank Goldman Sachs released a 31-page report on AI, questioning its benefits. Titled‘GenAI: Too much spend, too little benefit’ the report points out that AI’s impact on productivity and economic returns may have been overestimated. Jim Covello, head of global equity research, Goldman Sachs, asked, “What $1 trillion problem will AI solve?”

The venerable investment bank forecasts there will be expenditure of $1 trillion over the next few years to develop AI infrastructure but casts doubts over returns or breakthrough applications. In fact, the report warns that if significant AI applications fail to materialize in the next 12-18 months, investor enthusiasm may wane.

The flow of funds is already thinning, particularly in early-stage AI ventures. While investments in AI startups surged in 2023, the first quarter of 2024 saw just $3 billion invested globally, according to the EY report. The consultancy projects total global investment to be in the region of$12 billion in 2024, a little over half the level in 2023.

“GenAI was crowned very quickly to be the best new thing to have happened since sliced bread,” said Archana Jahagirdar, founder and managing partner, Rukam Capital, a Delhi-based early-stage investor which has backed three AI ventures—unScript.ai, Beatoven.ai and upliance.ai. “Now, there’s a realization that GenAI tech is exciting, but monetizable use cases are yet to emerge.”

Daron Acemoglu, institute professor at MIT, noted in the Goldman Sachs report that “truly transformative changes won’t happen quickly. Only a quarter of AI exposed tasks will be cost effective to automate in the next 10 years”.

Indeed, technology research and consulting firm Gartner, which popularized the concept of the new-technology hype cycle, says that Generative AI has passed the peak of inflated expectations (marked by overenthusiasm and unrealistic projections) and is entering the trough of disillusionment.

Poor RoI

“The RoI (return on investment) is not in tune with the high capex on AI. At the heart of GenAI is the ability to summarize, synthesize and create content. People are using ChatGPT, like they use Google search,” said Arjun Rao, partner, Speciale Invest, a venture capital firm.

Comparisons with another disruptive technology, the internet, are inevitable. The internet impacted every area of work, business, the economy, and society with tangible benefits—banks could expand without opening branches, or online retail could reach anyone without investing in physical stores. The internet led to the global IT services boom, as work could be sent online to tap affordable resources. This resulted in a $250 billion industry in India employing nearly five million. The internet offered cost effective and efficient alternatives. In contrast, AI will likely be replacing low-wage jobs with expensive technologies and lack of reliability, as of now.

“Unless there is RoI, companies will not invest. But we believe every business will be an AI business in future. Voice assistants are improving, and can also analyze conversations at scale. We do see adoption going up,” said Ganesh Gopalan, chief executive and co-founder, Gnani.ai. Set up by a group of former Texas Instruments engineers, Gnani.ai is a conversational AI platform backed by Samsung Ventures.

To be fair, technology disruptions are not easy and geeks tend to oversell ideas saying they will change the world. “A lot of people will lose money before they start making money,” Nishit Garg, partner, RTP Global Asia, an early-stage venture capital firm, toldMint. “This happens with every disruption we have seen, in cloud, internet and e-commerce. AI is going to raise the intelligence level of every organization. But before that happens it has to be affordable to use and error free.” RTP Global has invested in a few AI-led ventures, in areas such as market automation and drug development.

The internet, cloud, smartphones went through that hype cycle of lofty promises but eventually did improve and changed the way we work. Proponents argue that it takes a lot of money to set up infrastructure. For instance, it took billions of dollars to set up mobile networks before calls could be made.

Repeating history?

Back in 1905, Spanish-American philosopher George Santayana wrote: “Those who cannot remember the past are condemned to repeat it”. Geeks fervently believe that the next big tech idea will change the world. But history shows that many of the tech ideas that lured investors and enterprises like moths to light were either ahead of their time or just plain wrong.

For instance, after companies poured billions into solving the Y2K problem, the dotcom bubble started taking shape. Fuelled by investments in internet-based companies in the late 1990s, the value of equity markets grew exponentially during the dotcom bubble, with the Nasdaq rising from under 1,000 to more than 5,000 between 1995 and 2000. Everyone from autoparts sellers to the neighbourhood bakery were sold the idea that if they weren’t online they were doomed.

By the end of 2001, reality set in—companies were online but there were no users. TheNasdaq composite stock market index, which had risen almost 800% in just a few years, crashed from its peak by October 2002, giving up all its gains as the bubble burst.

More recent examples are the metaverse and non fungible tokens (NFTs). The metaverse was a vision that people flock to the 3D virtual web via their avatars. Analysts projected that the market would be worth over $1 trillion in a decade. NFTs started selling with eyepopping valuations. Both were swept away as AI mania took over and were clearly ahead of their time.

Still early days

For all its niggles, AI is a more fundamental technology shift than the metaverse or NFTs. But if it was having a meaningful impact, more people, at least in developed economies, would have been willing to pay to use ‘reliable’ premium services. But that is not quite the case. Open AI’s ChatGPT has around 180 million daily active users worldwide, but less than 5% (less than 9 million) pay to use it. And across companies, the use of AI varies, with digital startups using it more than traditional companies.

Sam Altman, chief executive officer, Open AI. **(AFP)**

“From a tech evolution standpoint, we are at the infrastructure buildout phase,” said Namit Chugh, principal W Health Ventures, a healthcare focused venture investor. “The middleware, services layer, applications layer will come on top of that. That’s when companies can start monetizing. The problem is AI infrastructure is very expensive to build.” W Health Ventures has invested in AI-focused startups such as Wysa, an AI assistant for people who need mental health support.

“There is a lot of FOMO—fear of missing out—ensuring that enterprises have an AI strategy. But at 60-65% accuracy AI won’t be good. This has to improve,” said RTP Global’s Garg.

“If you ignore AI you will be out of business. Ventures like Uber, Netflix, Amazon, Airbnb disrupted the market. If they don’t adapt with AI they will be dinosaurs. The problem is, a lot of people do not understand this animal,” said Arnab Basu, partner and leader, advisory, PwC India.

There is a lot of FOMO ensuring that enterprises have an AI strategy. But at 60-65% accuracy, AI won’t be good.
—Nishit Garg

The India reality

“India’s ambition is to…become one of the top three global economies in terms of GDP,” Rajnil Malik, partner and GenAI go-to-market leader, PwC India, said. AI services will play a big role in this. RoI is not evident yet, but building blocks are being put in place. Platforms like Uber were using AI from day 1, but there was no RoI for long, he added.

According to EY, 66% of India’s top 50 unicorns are already using AI. But only 15-20% of proof of concept AI projects (more like trials) by domestic enterprises have rolled out into production. However, among Global Capability Centres (GCCs), the back offices of global companies in India, the shift from PoC to roll out is around 40%. According to IT body Nasscom, there are around 1,600 GCCs in India and their numbers are growing.

About a third of the use cases in India are for intelligent assistants and chatbots. Another 25% relate to marketing automation enabled by text generation and other capabilities like test-to-images or text-to-videos. Document intelligence is emerging as a key opportunity with around one-fifth of the use cases focusing on document summarization, enterprise knowledge management and search, according to EY.

Tata Steel has partnered with an AI tech platform to use AI for green steel by reducing emissions. Indigo has introduced the AI chatbot 6Eskai to assist travellers. Ecommerce major Flipkart’s knowledge assistant Flippi uses GenAI and LLMs to offer customized recommendations. Reliance Industries and Tata Group inked a strategic pact with Nvidia in September last year to develop India-focused AI powered supercomputers, cloud (for AI use cases) and GenAI applications. The government of India has also made a provision of ₹10,000 crore to procure computing power for AI projects.

About a third of the use cases in India are for intelligent assistants and chatbots. Another 25% relate to marketing automation enabled by text generation and other capabilities.

Rao of Speciale Invest believes that in India, in sectors such as manufacturing, there may not be a blanket use of AI as it competes with relatively low labour costs. AI will be more cost effective in software development if it takes over some coding tasks, and decreases the need for additional manpower.

“There are productivity improvements,” said Mahesh Makhija, partner and technology consulting leader, EY India. “But with errors, hallucinations (when an AI model generates misleading or incorrect results), and the risk of data thefts, securitycompanies are cautious about using AI.”

But Makhija is bullish on AI’s long-term prospects. “Things will improve. The nature of work will change, like Excel sheets and PPTs decades back, collapsed business planning times from weeks to days. Further improvements will come with AI,” he said.

The human element

Users often find the experience of interacting with chatbots frustrating and want a human to solve their problems. **(istockphoto)**

An oft-cited example of AI success is Swedish fintech company Klarna. In 2023, Klarna partnered with OpenAI to develop a virtual assistant. This March, the fintech claimed its virtual agent helped shrink its query resolution time from 11 minutes to just two. The assistant does the work of 700 humans and Klarna expects to save $40 million this year.

Virtual assistants and chatbots are increasingly being used across enterprises to reduce the load (and save costs) on human contact centres and also improve what they can do (though this is mostly restricted to answering FAQs). But users often find the experience frustrating and want a human to solve their problems.

In the US, a Gartner survey of 5,728 customers, conducted in December 2023, underlined that people remain concerned about the use of AI in the customer service function. Of those surveyed, 64% said they would prefer that companies didn’t use AI in customer service. In addition, 53% of the customers surveyed stated that they would consider switching to a competitor if they found a company was going to use AI for customer service. The top concern? It will get more difficult to reach a human agent. Other concerns include AI displacing jobs and AI providing wrong answers.

“Once customers exhaust self-service options, they’re ready to reach out to a person. Many customers fear that GenAI will simply become another obstacle between them and an agent,” Keith McIntosh, senior principal, research, Gartner customer service and support practice, said in a media release earlier this month.

For AI to take off, its proponents will have to address high costs, build killer apps, and generate correct, error-free output for institutions and people. If this disruptive force is to become as ubiquitous as the internet is today, it has to show trustworthy results. Else it runs the risk of a further erosion in value as stakeholders grow impatient.

Source link

OpenAI takes on Google, with new AI-powered search engine ‘SearchGPT’: All we know so far

August 14, 2024 by

After months of rumours, Sam Altman’s startup OpenAI has finally unveiled a search engine competitor to Google called SearchGPT. The new feature is currently in ‘prototype’ stage and is only available via a waiting list, but is expected to be rolled out to all users in the future.

In a blogpost about new search feature, OpenAI wrote, “We’re testing SearchGPT, a prototype of new search features designed to combine the strength of our AI models with information from the web to give you fast and timely answers with clear and relevant sources.”

Also Read | Meta prioritizes open-source play, native Hindi support to rival OpenAI, Google

SearchGPT start page is akin to Google and we get a message reading, “what are you looking for?” After entering the search query, though, you get a direct answer much like Perplexity or Google’s disgraced AI overviews feature.

A query for music festivals in Boone, Northern California in August returns a list of all such festivals, along with a 2-3 line description that prominently mentions the source from which the information was taken. Users are also given a links option on the left-hand side of the page, where they can view all the links cited by OpenAI and open them for more detailed information. In addition, similar to ChatGPT, users can ask follow-up questions to get more information.

OpenAI, which is already being sued by major news publishers like The New York Times, said that it is committed to a thriving ecosystem of publishers and creators. The company said SearchGPT uses AI to highlight high quality content in a conversational interface while providing user the opportunity to connect with news publishers via the cited links.

Source link

OpenAI launches small AI model GPT-4o Mini. What is it and why is it important?

August 14, 2024 by

Sam Altman led AI startup OpenAI has announced GPT-4o, the company’s most cost effective model to date. The model is being seen as an attempt by OpenAI to stay relevant with increasing competition from more deep pocketed rivals like Google and Meta.

GPT-4o Mini (O stands for Omni), will replace GPT-3.5 Turbo and will be available to use starting today for free along with ChatGPT Plus and Team members. Meanwhile, it will be offered to enterprise users starting next week.

Also Read | Anthropic rivals GPT-4o with Claude 3.5 Sonnet model, makes it free for all users

OpenAI said that GPT-4o Mini is priced at 15 cents per million input token and 60 cents per million output tokens, making it 60% more cheaper than GPT-3.5 Turbo. The model scored 82% on Massive Multitask Language Understanding (MMLU) and outperformed GPT-4 on chat preferences in LMSYS leaderboard.

The company also claimed that GPT-4o Mini also comprehensively defeated other small models in reasoning tasks with Gemini Flash only managing a MMLU score of 77.9% and Claude Haiku a score of 73.8%.

Announcing the new model in a blog post, OpenAI wrote, “OpenAI is committed to making intelligence as broadly accessible as possible. Today, we’re announcing GPT-4o mini, our most cost-efficient small model. We expect GPT-4o mini will significantly expand the range of applications built with AI by making intelligence much more affordable.”

Small models like GPT-4o require low computational power and hence are a more affordable option for devleopers with limited resources who want to use generative AI in their applications.

GPT-4o Mini currently support text and vision in application programming interface (API) and support for text, image, video and audio outputs will be made available in the future, OpenAI said.

The latest model has a context window of 128K token, which translates to around 95,000 words, and has a cut off date of October 2023. Meanwhile, OpenAI stated that GPT-4o Mini is even more cost effective handling non-English text now owing to the improved tokenizer.

Source link

OpenAI unveils’ Five-Tier’ system to gauge AI progress towards human surpassing abilities: How it works

August 14, 2024 by

OpenAI has introduced a five-tier system to measure its progress toward developing artificial intelligence (AI) capable of surpassing human performance, reported Bloomberg. This move aims to provide clearer insight into the company’s approach to AI safety and its vision for the future. The classification system was unveiled to employees during an all-hands meeting, an OpenAI spokesperson confirmed.

Reportedly, the tiers range from the current conversational AI (Level 1) to advanced AI that can operate an entire organization (Level 5). OpenAI, widely regarded as a frontrunner in the quest for more powerful AI systems, plans to share these levels with investors and other external stakeholders.

Currently, OpenAI considers itself at the first level, but nearing the second, known as “Reasoners.” This stage refers to AI systems capable of basic problem-solving tasks comparable to a human with a doctorate, but without access to additional tools.

During the same meeting, OpenAI’s leadership showcased a research project involving its GPT-4 model, demonstrating new capabilities that exhibit human-like reasoning.

As per Bloomberg, an insider who requested anonymity, mentioned that OpenAI continually tests new functionalities internally, a standard practice in the AI industry.

OpenAI has long aimed to create artificial general intelligence (AGI), which entails developing computers that outperform humans on most tasks. Although AGI does not currently exist, CEO Sam Altman has expressed optimism that it could be achieved within this decade. The criteria for reaching AGI have been a topic of debate among AI researchers.

In November 2023, researchers at Google DeepMind proposed a five-level framework for AI, including stages such as “expert” and “superhuman,” akin to the system used in the automotive industry for self-driving cars. OpenAI’s newly introduced levels also feature five ascending stages towards AGI. The third level, “Agents,” refers to AI systems capable of performing tasks over several days on behalf of users. The fourth level involves AI that can generate new innovations, and the highest level, “Organizations,” signifies AI that can operate autonomously within an organization.

These tiers were developed by OpenAI’s executives and senior leaders and are still considered a work in progress. The company intends to collect feedback from employees, investors, and its board to refine the levels further.

(With inputs from Bloomberg)

Source link