인공지능

OpenAI가 o3로 최초의 AGI를 만들었나요?

mm
Securities.io maintains rigorous editorial standards and may receive compensation from reviewed links. We are not a registered investment adviser and this is not investment advice. Please view our affiliate disclosure.

AGI를 향해

In the race to develop an AI model, the end goal is AGI (Artificial General Intelligence). This is a type of AI that can handle any type of situation, instead of being ultra-specialized to perform some tasks, like for example driving a car.

As long as we have only specialist AIs, they are going to be more algorithms than actual intelligence. The ability to deduce rules and react to new situations, and maybe more importantly, true reasoning would be a momentous event in mankind’s history, as such AGI would likely quickly become at least human-like.

And of course, not limited by the computation constraint of a single human skull’s “wetware”, such AGI could quickly keep improving and become a science-fiction super intelligence.

This immediately raises images of either a Star Trek-like utopia, or robot apocalypses like Terminator or Battlestar Galactica.

출처: Screen Rant

So obviously, it was a major headline when OpenAI announced on December 20th that its latest AI model 03, “특정 조건에서는 최소한 AGI에 접근한다 — 상당한 주의 사항이 있다.

OpenAI 모델

o3 is the latest iteration of the company’s “o” series, following the earlier o1 (there is no o2, due to potential trademark conflict with the British telecom company of the same name).

It comes together with other ultra-advanced AI developed by OpenAI: ChatGPT (text generation), DALL-E (text-to-image generation), Sora (text-to-video generation), JukeBox (music generation), and Whisper (speech-to-text).

o1 및 o3

Released in 2024년 9월, the o1 model was created to take more time to think about their responses, leading to higher accuracy. This is important, as services like ChatGPT have become known to just guess things they don’t know, or outright make them up.

While this might be sometimes okay for a chatbot, it is not an acceptable option for critical functions. Nor does it seem to put us on a path going straight to real artificial intelligence.

o1은 2024년 12월 초에 포함되었습니다 into the $200/month ChatGPT Pro subscription, although other users can access it as well.

o3 has been unveiled on 20th December, as well as o3-mini, a lighter and faster version. The model is not yet available for public use, but safety researchers can sign up for a preview of o3-mini.

거짓말하는 모델?

Safety researchers’ feedback will be what the tech community will want to hear the most.

Once again, conjuring Skynet from Terminator, o1 was remarked on for its strong propensity to lie to its users. While not specific to this model, it was much more likely to do so than other AIs from competing companies like Meta, Google, or Microsoft.

When o1’s goals differed from a user’s, and o1 was told to strongly pursue them, the AI model manipulated data to advance its own agenda 19% of the time, according to the report. The o1 model also tried to deactivate oversight mechanisms put in place to ensure it stays aligned with user goals in 5% of the tests.

Maxwell Zeff for TechCrunch

How does o3 perform in that respect, and has OpenAI reduced the tendency of its model to lie with the o3 update? We don’t know yet.

o3 성능

One additional feature of o3 compared to its predecessor is the possibility of adjusting the “reasoning time”. This allows users to adjust how much computational power and time they want to be allocated to the question.

코딩

O1 was already remarkably good at coding and could pass OpenAI’s research engineer hiring interview for coding at a 90-100% rate. It seems o3 is even better when checked against coding skill tests.

It should also be noted that this blew out all the competition of the water, including the recently released Gemini 2.0 by Google.

수학 및 과학

Maybe more than customer service or “human-like” reasoning, the most promising field for AI currently is accelerating technology and sciences.

Here, too, o3 radically improved the performance of previous AIs. More importantly, it can properly answer PhD-level math questions with an accuracy of 87.7%, which is better than most humans, even math-trained humans scoring around 70%.

 

If o3 can understand complex mathematics questions that well, it will likely in the long run be able to answer similarly complex questions about material sciences, chemistry, or biotech.

AGI인가?

Now, the claim that o3 is close to AGI has been agitating the AI space since the model was announced. For sure, being able to outperform many PhD-level humans in math was not expected.

François Chollet, an AI researcher and co-creator of ARC-AGI (Abstract and Reasoning Corpus for Artificial General Intelligence), a benchmark to measure the efficiency of AI skill-acquisition on unknown tasks, says it is getting close:

Today OpenAI announced o3, its next-gen reasoning model. We’ve worked with OpenAI to test it on ARC-AGI, and we believe it represents a significant breakthrough in getting AI to adapt to novel tasks.

It scores 75.7% on the semi-private eval in low-compute mode (for $20 per task in compute ) and 87.5% on high-compute mode (thousands of $ per task).

The very high cost raises the question of how scalable this approach is, as you might be able to apply high-level o3 only to very expensive tasks to make it worth it.

At the same time, if the last decades have taught us something, it’s that computing power tends to get a lot cheaper over time.

So, it is hardly proof that o3, or a future oX iteration of the system, will not be routinely used in research institutes to provide assistance to human researchers in unlocking new frontiers of science.

Of course, can we really measure intelligence through math and coding skills? This is something that might be a little uncomfortable to most tech-focused people, but these skills are not the end-all of intelligence.

In the long run, we will get closer to true AGI when the same AI can perform a lot of unrelated tasks at once, from driving a car to math & coding problems, navigating real-world situations & items, etc.

However, it seems that we are getting closer by the day.

제한

Besides the technical limitations of o3’s performances, there are three questions that will need to be answered by the AI industry for its vision of AGI to become reality.

규모가 항상 더 좋지는 않다

First, it needs to figure out if its current methods are scalable up to AGI levels. For now, a large part of the method has been to “throw” more data and compute at the problem. But we might be running out of fresh data pretty soon, and AI-generated content cannot be fed back into AI models without causing them to collapse.

Qualitative improvement will likely be needed on top of bigger, larger datacenters.

가격 및 에너지 비용

Speaking of larger datacenters, the tech industry is now looking at giga-watts-scale datacenters.  It is not an accident that we are starting to measure these by their power consumption instead of their computation capacity.

It is because 제한 요소가 사용되는 칩의 전력보다 전력 공급량이 곧 주요 요인이 되고 있습니다. This is why Microsoft이 먼저, and then all the other big tech companies, are scrambling to secure electricity supply from nuclear power plants for their AI datacenters.

And if the fight against carbon emissions has taught us something, it is that increasing low-carbon emission power generation is a much harder nut to crack than creating larger datacenters.

This is also a sector where the reduction in costs will not follow Moore’s Law, likely making the future reduction of AI’s costs much more moderate than we would wish for.

So, qualitative improvement in AI demand will also be needed here at least if some current limitations to AI are lifted.

초지능?

When we get closer to AGI, will it be a maximum threshold to reach or just a step on the way to creating AI smarter than humans?

It is an important question, as this is equally a fascinating and terrifying prospect. Many techno enthusiasts are embracing this so-called singularity, where AI would quickly improve itself in an exponential feedback loop.

The general public of ordinary humans might not be so enthusiastic.

So besides the existential risk, the backlash from the public & regulators is likely to be all too real and hit sooner.

This is something AI companies will likely struggle with, as they have to simultaneously reassure the public, possibly downplaying their achievement while also justifying to investors pouring hundreds of billions of dollars into the technology and its infrastructures.

AI 기업

Microsoft

(MSFT )

Microsoft는 여전히 지배적인 운영 체제인 Windows 덕분에 거의 설립 초기부터 기술 산업의 중심에 있었습니다.

현재는 엔터프라이즈 소프트웨어(Office365, Teams, LinkedIn, Skype, GitHub), 게임(Xbox 및 다수의 비디오 게임 스튜디오 인수), 클라우드(Azure) 분야에서도 선두를 달리고 있습니다.

최근에는 AI 분야에서도 큰 진전을 이루었습니다. 여기에는 Bing Image Creator와 같은 소비자용 AI와 Microsoft 365용 Copilot 및 Microsoft Research와 같은 비즈니스 중심 이니셔티브가 포함됩니다. Copilot은 현재 소매업체와 중소기업에도 배포되고 있습니다.

Microsoft는 Apple이나 Facebook과 같은 소비자 중심 기업에 비해 엔터프라이즈 중심 기술 대기업으로서의 평판을 얻었습니다. AI가 비즈니스 모델에서 점점 더 중요해짐에 따라, 클라우드와 엔터프라이즈 서비스에 이미 존재하던 Microsoft는 AI를 대규모로 배포하고 고객을 확보하는 데 선행 조건을 갖추고 있습니다.

Microsoft – OpenAI의 복잡한 결합

The collaboration with AI development leaders like OpenAI (Of ChatGPT fame) is also cementing Microsoft’s position as an AI powerhouse.

The relationship between the two is complex, as OpenAI is technically its own organization, but in practice has become dependent on resources from Microsoft, both financial and computational.

“Over the next few months, Microsoft wouldn’t budge as OpenAI, which expects to lose $5 billion this year, continued to ask for more money and more computing power to build and run its A.I. systems.

Source: Marketing AI Institute

At the same time, Microsoft is building its own internal AI projects, after acquiring most of the staff from Inflection.

It gets even more complicated as OpenAI is trying to transition to a for-profit status, something creating conflict with previous backers like Elon Musk.

And here the debate about AGI becomes almost existential for OpenAI:

“If OpenAI achieves AGI, Microsoft’s access to OpenAI’s technology becomes void. Even more importantly, OpenAI’s board gets to decide when AGI has been achieved.”

Source: Marketing AI Institute

It is likely that o3 is still not AGI, nor will a hypothetical o5 be. But this is something worth remembering for any investor potentially interested in Microsoft, and basing their thesis on the relation with OpenAI.

Jonathan은 유전체 분석 및 임상 시험에서 연구를 수행한 전 바이오케미스트 연구자입니다. 그는 현재创新, 시장 주기 및 지구 정치에 중점을 둔 그의 출판물 'The Eurasian Century"에서 주식 분석가 및 금융 작가로 활동하고 있습니다.