20 February 2025 AI

AI Reflections: DeepResearch, Grok and Disappointment

By Bilal Hafeez

This article is only available to Macro Hive subscribers. Sign-up to receive world-class macro analysis with a daily curated newsletter, podcast, original content from award-winning researchers, cross market strategy, equity insights, trade ideas, crypto flow frameworks, academic paper summaries, explanation and analysis of market-moving events, community investor chat room, and more.

If you spend time on X or Twitter, you would think the release of Grok 3.0 and ChatGPT’s DeepResearch models marks the dawn of superintelligence (AGI). In reality, these advances highlight the limitations of large language models (LLMs) and how far we still are from AGI. At their core, these models are only as good as the data they are fed. They do not truly understand their responses, and the quality of your questions matters just as much as the models themselves.

Take this example: I asked, ‘What drives currencies?’ – a topic studied for decades, if not centuries. I posed the question to three cutting-edge research LLMs. Here are their abridged answers:

ChatGPT DeepResearch: Interest rates, inflation, economic growth, trade balances, political stability, central banks, traders, capital flows.
Grok 3.0 DeepSearch: All the above, plus public debt, terms of trade, and purchasing power parity (PPP).
Gemini 2.0 Thinking: Similar to ChatGPT, but adds fiscal policy, unemployment, risk appetite, FX regimes, and commodity prices.

Then, I tested a custom RAG-LLM pipeline I built, incorporating data sources tailored to financial market players. Its answer included everything from the models above, plus trend-followers (bandwagon effects), investor positioning, options flows, order flow, net foreign asset positions, productivity, and savings-investment balances.

The difference is clear: better data yields better answers. Push this further with a time-sensitive question like, ‘What market factors should I watch today?’ and these models lean on news outlets like Reuters or MarketWatch. Suddenly, your ‘AGI’ is channelling journalists’ views on markets – not quite the genius you imagined.

That is not to dismiss the progress. Grok 3.0 is lightning-fast and structures answers cleanly. ChatGPT’s DeepResearch offers unmatched depth compared to its siblings. Like calculators, spreadsheets, or Python, these tools will streamline and accelerate human tasks. But human-level intelligence? We are still a long way off.

(The commentary contained in the above article does not constitute an offer or a solicitation, or a recommendation to implement or liquidate an investment or to carry out any other transaction. It should not be used as a basis for any investment decision or other decision. Any investment decision should be based on appropriate professional advice specific to your needs.)

Your comments

Cancel reply

Please sign-up for a Macro Hive account then log in to leave your comments.

array(17) { [0]=> array(2) { ["select_cpt"]=> string(10) "hive-model" ["cpt_image"]=> int(131299) } [1]=> array(2) { ["select_cpt"]=> string(17) "weekly-investment" ["cpt_image"]=> int(132823) } [2]=> array(2) { ["select_cpt"]=> string(14) "views-analysis" ["cpt_image"]=> int(136150) } [3]=> array(2) { ["select_cpt"]=> string(13) "bilals-points" ["cpt_image"]=> int(135792) } [4]=> array(2) { ["select_cpt"]=> string(17) "bilals-macroscope" ["cpt_image"]=> int(127923) } [5]=> array(2) { ["select_cpt"]=> string(19) "global-rates-weekly" ["cpt_image"]=> int(136150) } [6]=> array(2) { ["select_cpt"]=> string(4) "post" ["cpt_image"]=> int(65996) } [7]=> array(2) { ["select_cpt"]=> string(4) "post" ["cpt_image"]=> int(134235) } [8]=> array(2) { ["select_cpt"]=> string(20) "centralbank-monitors" ["cpt_image"]=> int(136307) } [9]=> array(2) { ["select_cpt"]=> string(4) "post" ["cpt_image"]=> int(134128) } [10]=> array(2) { ["select_cpt"]=> string(15) "emerging-trends" ["cpt_image"]=> int(66000) } [11]=> array(2) { ["select_cpt"]=> string(10) "key-events" ["cpt_image"]=> int(66001) } [12]=> array(2) { ["select_cpt"]=> string(16) "asset-allocation" ["cpt_image"]=> int(131366) } [13]=> array(2) { ["select_cpt"]=> string(4) "post" ["cpt_image"]=> int(66003) } [14]=> array(2) { ["select_cpt"]=> string(18) "charts-of-the-week" ["cpt_image"]=> int(134094) } [15]=> array(2) { ["select_cpt"]=> string(11) "commodities" ["cpt_image"]=> int(66003) } [16]=> array(2) { ["select_cpt"]=> string(12) "global-rates" ["cpt_image"]=> int(136150) } }

Hive Exclusives

AI Reflections: Which LLM Performs Best? Our Benchmark Says Fine-Tuning Wins

27 March 2025

Dalvir Mandara Eric Wang Bilal Hafeez
array(17) { [0]=> array(2) { ["select_cpt"]=> string(10) "hive-model" ["cpt_image"]=> int(131299) } [1]=> array(2) { ["select_cpt"]=> string(17) "weekly-investment" ["cpt_image"]=> int(132823) } [2]=> array(2) { ["select_cpt"]=> string(14) "views-analysis" ["cpt_image"]=> int(136150) } [3]=> array(2) { ["select_cpt"]=> string(13) "bilals-points" ["cpt_image"]=> int(135792) } [4]=> array(2) { ["select_cpt"]=> string(17) "bilals-macroscope" ["cpt_image"]=> int(127923) } [5]=> array(2) { ["select_cpt"]=> string(19) "global-rates-weekly" ["cpt_image"]=> int(136150) } [6]=> array(2) { ["select_cpt"]=> string(4) "post" ["cpt_image"]=> int(65996) } [7]=> array(2) { ["select_cpt"]=> string(4) "post" ["cpt_image"]=> int(134235) } [8]=> array(2) { ["select_cpt"]=> string(20) "centralbank-monitors" ["cpt_image"]=> int(136307) } [9]=> array(2) { ["select_cpt"]=> string(4) "post" ["cpt_image"]=> int(134128) } [10]=> array(2) { ["select_cpt"]=> string(15) "emerging-trends" ["cpt_image"]=> int(66000) } [11]=> array(2) { ["select_cpt"]=> string(10) "key-events" ["cpt_image"]=> int(66001) } [12]=> array(2) { ["select_cpt"]=> string(16) "asset-allocation" ["cpt_image"]=> int(131366) } [13]=> array(2) { ["select_cpt"]=> string(4) "post" ["cpt_image"]=> int(66003) } [14]=> array(2) { ["select_cpt"]=> string(18) "charts-of-the-week" ["cpt_image"]=> int(134094) } [15]=> array(2) { ["select_cpt"]=> string(11) "commodities" ["cpt_image"]=> int(66003) } [16]=> array(2) { ["select_cpt"]=> string(12) "global-rates" ["cpt_image"]=> int(136150) } }

Hive Exclusives

AI reflections: Think Before You Speak!

21 March 2025

Viresh Kanabar
array(17) { [0]=> array(2) { ["select_cpt"]=> string(10) "hive-model" ["cpt_image"]=> int(131299) } [1]=> array(2) { ["select_cpt"]=> string(17) "weekly-investment" ["cpt_image"]=> int(132823) } [2]=> array(2) { ["select_cpt"]=> string(14) "views-analysis" ["cpt_image"]=> int(136150) } [3]=> array(2) { ["select_cpt"]=> string(13) "bilals-points" ["cpt_image"]=> int(135792) } [4]=> array(2) { ["select_cpt"]=> string(17) "bilals-macroscope" ["cpt_image"]=> int(127923) } [5]=> array(2) { ["select_cpt"]=> string(19) "global-rates-weekly" ["cpt_image"]=> int(136150) } [6]=> array(2) { ["select_cpt"]=> string(4) "post" ["cpt_image"]=> int(65996) } [7]=> array(2) { ["select_cpt"]=> string(4) "post" ["cpt_image"]=> int(134235) } [8]=> array(2) { ["select_cpt"]=> string(20) "centralbank-monitors" ["cpt_image"]=> int(136307) } [9]=> array(2) { ["select_cpt"]=> string(4) "post" ["cpt_image"]=> int(134128) } [10]=> array(2) { ["select_cpt"]=> string(15) "emerging-trends" ["cpt_image"]=> int(66000) } [11]=> array(2) { ["select_cpt"]=> string(10) "key-events" ["cpt_image"]=> int(66001) } [12]=> array(2) { ["select_cpt"]=> string(16) "asset-allocation" ["cpt_image"]=> int(131366) } [13]=> array(2) { ["select_cpt"]=> string(4) "post" ["cpt_image"]=> int(66003) } [14]=> array(2) { ["select_cpt"]=> string(18) "charts-of-the-week" ["cpt_image"]=> int(134094) } [15]=> array(2) { ["select_cpt"]=> string(11) "commodities" ["cpt_image"]=> int(66003) } [16]=> array(2) { ["select_cpt"]=> string(12) "global-rates" ["cpt_image"]=> int(136150) } }

Hive Exclusives

8 Cool New AI Papers: From Anxious LLMs to Fake News Detection

07 March 2025

Bilal Hafeez
array(17) { [0]=> array(2) { ["select_cpt"]=> string(10) "hive-model" ["cpt_image"]=> int(131299) } [1]=> array(2) { ["select_cpt"]=> string(17) "weekly-investment" ["cpt_image"]=> int(132823) } [2]=> array(2) { ["select_cpt"]=> string(14) "views-analysis" ["cpt_image"]=> int(136150) } [3]=> array(2) { ["select_cpt"]=> string(13) "bilals-points" ["cpt_image"]=> int(135792) } [4]=> array(2) { ["select_cpt"]=> string(17) "bilals-macroscope" ["cpt_image"]=> int(127923) } [5]=> array(2) { ["select_cpt"]=> string(19) "global-rates-weekly" ["cpt_image"]=> int(136150) } [6]=> array(2) { ["select_cpt"]=> string(4) "post" ["cpt_image"]=> int(65996) } [7]=> array(2) { ["select_cpt"]=> string(4) "post" ["cpt_image"]=> int(134235) } [8]=> array(2) { ["select_cpt"]=> string(20) "centralbank-monitors" ["cpt_image"]=> int(136307) } [9]=> array(2) { ["select_cpt"]=> string(4) "post" ["cpt_image"]=> int(134128) } [10]=> array(2) { ["select_cpt"]=> string(15) "emerging-trends" ["cpt_image"]=> int(66000) } [11]=> array(2) { ["select_cpt"]=> string(10) "key-events" ["cpt_image"]=> int(66001) } [12]=> array(2) { ["select_cpt"]=> string(16) "asset-allocation" ["cpt_image"]=> int(131366) } [13]=> array(2) { ["select_cpt"]=> string(4) "post" ["cpt_image"]=> int(66003) } [14]=> array(2) { ["select_cpt"]=> string(18) "charts-of-the-week" ["cpt_image"]=> int(134094) } [15]=> array(2) { ["select_cpt"]=> string(11) "commodities" ["cpt_image"]=> int(66003) } [16]=> array(2) { ["select_cpt"]=> string(12) "global-rates" ["cpt_image"]=> int(136150) } }

Hive Exclusives

AI Reflections: Smarter, Faster, Cheaper – But Still Not Ready

07 March 2025

Viresh Kanabar

Spring sale - Prime Membership only £3 for 3 months! Get trade ideas and macro insights now

AI Reflections: DeepResearch, Grok and Disappointment

Your comments

Cancel reply

Enter your email to read this Macro Hive Exclusive

Like these insights? Join our free newsletter for regular updates and trade ideas.

Like these insights? Join our free newsletter for regular updates and trade ideas.

AI Reflections: DeepResearch, Grok and Disappointment

Your comments

Cancel reply

Enter your email to read this Macro Hive Exclusive

Like these insights? Join our free newsletter for regular updates and trade ideas.

Like these insights? Join our free newsletter for regular updates and trade ideas.

Cancellation Confirmed

Discount Applied