Tuesday, October 3, 2023
NEVERFOMOAGAIN
en English▼
X
ar Arabicen Englishfr Frenchde Germanpt Portugueseru Russianes Spanish
  • PASSIVE INCOME
    • How to Earn Cryptocurrencies for free ?
    • Play Games & apps to earn
  • Reviews
  • BLOCKCHAIN ACADEMY
  • TOP 10
  • News
No Result
View All Result
NEVERFOMOAGAIN
NEVERFOMOAGAIN
en English▼
X
ar Arabicen Englishfr Frenchde Germanpt Portugueseru Russianes Spanish
Home CryptoCurrency News

There’s More Evidence ChatGPT Is a Good Doctor But a Bad Coder

Jose Antonio Lanz by Jose Antonio Lanz
August 11, 2023
in CryptoCurrency News
Reading Time: 6 mins read
0
There’s More Evidence ChatGPT Is a Good Doctor But a Bad Coder
74
SHARES
1.2k
VIEWS
Share on FacebookShare on TwitterShare on Reddit


In the race to develop advanced artificial intelligence, not all large language models are created equal. Two new studies reveal striking differences in the capabilities of popular systems like ChatGPT when put to the test on complex real-world tasks.

According to researchers at Purdue University, ChatGPT struggles with even basic coding challenges. The team evaluated ChatGPT’s responses to over 500 questions on Stack Overflow, an online community for developers and programmers, on topics like debugging and API usage.

“Our analysis shows that 52% of ChatGPT-generated answers are incorrect and 77% are verbose,” the researchers wrote. “However, ChatGPT answers are still preferred 39.34% of the time due to their comprehensiveness and well-articulated language style.”

In contrast, a study from UCLA and the Pepperdine University of Malibu demonstrates ChatGPT’s prowess at answering difficult medical exam questions. When quizzed on over 850 multiple-choice questions in nephrology, an advanced specialty within internal medicine, ChatGPT scored 73% —similar to the passing rate for human medical residents.

Image credit: UCLA via Arvix

“The demonstrated current superior capability of GPT-4 in accurately answering multiple-choice questions in Nephrology points to the utility of similar and more capable AI models in future medical applications,” the UCLA team concluded.

Anthropic’s Claude AI was the second best LLM with 54.4% correct answers. The team evaluated other open-source LLMs but they were far from acceptable, with the best score being 25.5% achieved by Vicuna.

So why does ChatGPT excel at medicine but flounder at coding? The machine learning models have different strengths, notes MIT computer scientist Lex Fridman. Claude, the model behind ChatGPT’s medical knowledge, received additional proprietary training data from its maker Anthropic. OpenAI’s ChatGPT relied only on publicly available data. AI models do great things if properly traiend with huge amounts of data, even better than most other models.

Image courtesy: MIT
Image courtesy: MIT

However, an AI won’t be able to act properly outside the parameters it was trained on, so it will try to create content with no prior knowledge of it, which results in what’s known as hallucinations. If the dataset of an AI model does not include a specific content, it will not be able to yield good results in that area.

You might also like

Hong Kong Adds ‘Potential Tailwind’ for East Asia Crypto Trading Volumes: Chainalysis

Bankrupt Crypto Lender Celsius Eyes Creditor Payback by Year End

Solana Extends Investment Streak to 27 Weeks of Inflows: CoinShares

As the UCLA researchers explained, “Without negating the importance of the computational power of specific LLMs, the lack of free access to training data material that is currently not in public domain will likely remain one of the obstacles to achieving further improved performance for the foreseeable future.”

ChatGPT clunking at coding aligns with other assessments. As Decrypt previously reported, researchers at Stanford and UC Berkeley found ChatGPT’s math and visual reasoning skills declined sharply between March and June 2022. Though initially adept at primes and puzzles, by summer it scored only 2% on key benchmarks.

So while ChatGPT can play doctor, it still has much to learn before becoming an ace programmer. But it’s not far from reality, after all, how many doctors do you know that are also proficient hackers?

Stay on top of crypto news, get daily updates in your inbox.



Source link

Share30Tweet19Share
Jose Antonio Lanz

Jose Antonio Lanz

Recommended For You

Hong Kong Adds ‘Potential Tailwind’ for East Asia Crypto Trading Volumes: Chainalysis

by Nivesh Rustgi
October 3, 2023
0
Hong Kong Adds ‘Potential Tailwind’ for East Asia Crypto Trading Volumes: Chainalysis

Hong Kong may offer up “a potential tailwind for East Asia” as crypto volumes plummeted in the region due to anti-crypto regulations in China, a Chainalysis report read.Hong...

Read more

Bankrupt Crypto Lender Celsius Eyes Creditor Payback by Year End

by Andrew Asmakov
October 3, 2023
0
Bankrupt Crypto Lender Celsius Eyes Creditor Payback by Year End

Celsius Network, the insolvent crypto lending company, is seeking court approval to begin making payments to its customers by the end of the year, the firm’s legal counsel...

Read more

Solana Extends Investment Streak to 27 Weeks of Inflows: CoinShares

by Pedro Solimano
October 3, 2023
0
Solana Extends Investment Streak to 27 Weeks of Inflows: CoinShares

Institutions love Solana, according to CoinShares’ latest crypto fund report, even when the rest of the crypto space is flat.“Very little activity was seen in the altcoin space,”...

Read more

Three Reasons SBF Will Be Convicted: Former SEC Lawyer

by Andrew Throuvalas
October 3, 2023
0
FTX Sues to Reclaim $700M Bankman-Fried Allegedly Spent on Celebrity Connections

Former SEC attorney John Reed Stark took to social media on Monday with an in-depth forecast of how Sam Bankman-Fried's criminal trial is likely to play out. The...

Read more

Tom Hanks and Zelda Williams Warn Fans of AI-Generated Deepfakes

by Jason Nelson
October 2, 2023
0
Tom Hanks and Zelda Williams Warn Fans of AI-Generated Deepfakes

Legendary actor Tom Hanks and Zelda Williams, daughter of the late Robin Williams, joined the chorus of voices sounding the alarm about the proliferation of AI deepfakes. Hanks...

Read more
Next Post
Are You Not Entertained? Elon Musk Says Fight With Zuck Will Be Livestreamed via Twitter, Meta

Are You Not Entertained? Elon Musk Says Fight With Zuck Will Be Livestreamed via Twitter, Meta

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

I agree to the Terms & Conditions and Privacy Policy.

1 × 4 =

Support Us.

Donate

  • Donate withMetaMask
  • Donate With MetaMask

  • Donate withNano
  • Donate Nano

    Scan to Donate Nano to nano_38oxm7kwnysjeyz1mdcp9d5rrq55wyox3gm9ejeed3uhdieurwe4r3k39ntt

Cloud

#Avoid Crypto Scam #Banano #BAT #Bitcoin #Brave Browser #Coinbase #Coinbase Earn #CoinMarketCap #CoinMarketCap Earn #Counter-Strike: Global Offensive #Crypto App #Cryptocurrency Faucet #Cryptocurrency glossary #Cryptocurrency scam #Crypto redflags #CryptoRoyale #Crypto scam #Cryptos Wallet #Do Your Own Research #DYOR #DYOR Checklist #Earn Cryptocurrencies #Earning while browsing #Earn NFT #Folding@Home #Free cryptocurrencies #Free NFT #Hi Dollar #Just cause 2 #Learn Crypto #LIKE #Low-cap cryptocurrencies #NANO #NFT #PERP #Play to earn #PRE #Princeton University #Redflags #Review #ROY #Top 10 #URUS #xMOON #XMS
NEVERFOMOAGAIN

© 2021 By NEVERFOMOAGAIN - All rights reserved.

Navigate Site

  • Best Play to Earn Crypto games and Apps
  • Contact Us
  • Content licensing
  • Cryptocurrency News
  • Cryptocurrency Rankings
  • Home
  • How to Earn Cryptocurrencies for free ?
  • How to Learn about Crypto and Blockchain ?
  • Legal Information.
  • Privacy policy
  • Reviews
  • Terms & Conditions

Follow Us

No Result
View All Result
  • PASSIVE INCOME
    • How to Earn Cryptocurrencies for free ?
    • Play Games & apps to earn
  • Reviews
  • BLOCKCHAIN ACADEMY
  • TOP 10
  • News

© 2021 By NEVERFOMOAGAIN - All rights reserved.

This website uses cookies. By continuing to use this website you are giving consent to cookies being used. Visit our Privacy and Cookie Policy.
Go to mobile version