

- Assessing and Alleviating State Anxiety in LLMs: Finds that trauma-related prompts raise the measured 'state' anxiety of LLMs, while relaxation prompts reduce it!
- Structured Outputs Enable General-Purpose LLMs to Be Medical Experts: Rather than fine-tuning an LLM on medical data, the paper introduces a seven-step cognitive chain-of-thought process for answering questions, e.g., 'understand the question', 'recall relevant medical knowledge', and so on. It performs well on medical benchmarks (a sketch of the scaffold follows this list).
- Expert Prompting: Instructing Large Language Models to Be Distinguished Experts: A clever use of in-context learning to generate the expert prompt automatically, rather than having to describe the expert yourself, which leads to much better answers (see the expert-prompting sketch after this list).
- Agentic AI Needs a Systems Theory: This thought-provoking paper argues for a systems-level approach to understanding agent behaviour, rather than analysing each agent in isolation.
- Unseen Fake News Detection Through Causal Debiasing: I like the use of causal reasoning to remove fake-news bias from training sets; the approach could also be used more generally for finding errors in training sets.
- ToolFuzz – Automated Agent Tool Testing: We are working on agentic LLMs, which require accurate documentation for the tools in the codebase. This ToolFuzz approach looks promising for automatically testing for documentation errors.
- How Diversely Can Language Models Solve Problems? Exploring the Algorithmic Diversity of Model-Generated Code: Finds that LLMs show little algorithmic diversity in their solutions compared with humans, and that raising the sampling temperature beyond 1.0 can increase diversity (see the high-temperature sampling sketch after this list).
- ChroKnowledge: Unveiling Chronological Knowledge of Language Models in Multiple Domains: A great attempt at assessing how the chronological knowledge of LLMs evolves. Most LLMs struggle to capture time.
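
On the structured-outputs paper: below is a minimal sketch of the core idea of walking a general-purpose LLM through a fixed cognitive scaffold instead of fine-tuning it. It assumes an OpenAI-style chat API; the step names beyond the two quoted above, and the model name, are illustrative rather than the paper's exact choices.

```python
# Minimal sketch of a fixed cognitive scaffold for medical QA.
# Assumes an OpenAI-style chat API (pip install openai); step names
# beyond the first two are illustrative, not the paper's exact list.
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

# Illustrative scaffold; the paper prescribes seven such steps.
STEPS = [
    "Understand the question",
    "Recall relevant medical knowledge",
    "Relate that knowledge to each answer option",
    "Rule out inconsistent options",
    "State the final answer",
]

PROMPT_TEMPLATE = (
    "Answer the medical question by working through these steps, "
    "labelling each step's output explicitly:\n"
    + "\n".join(f"{i + 1}. {step}" for i, step in enumerate(STEPS))
    + "\n\nQuestion:\n{question}"
)

def structured_medical_answer(question: str, model: str = "gpt-4o-mini") -> str:
    """Run a general-purpose LLM through the fixed step scaffold."""
    response = client.chat.completions.create(
        model=model,  # model choice is an assumption, not from the paper
        messages=[{"role": "user",
                   "content": PROMPT_TEMPLATE.format(question=question)}],
        temperature=0,
    )
    return response.choices[0].message.content
```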
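On ExpertPrompting: a minimal two-stage sketch, again assuming an OpenAI-style chat API. The model first writes the expert identity itself, then answers conditioned on it. The paper additionally conditions the identity writer on in-context exemplars, omitted here for brevity, and the prompt wording below is ours, not the paper's template.

```python
# Two-stage expert-prompting sketch, assuming an OpenAI-style chat API.
# Prompt wording and model name are illustrative assumptions.
from openai import OpenAI

client = OpenAI()
MODEL = "gpt-4o-mini"  # assumption; any chat model works

def ask(prompt: str, system: str | None = None) -> str:
    """Single chat completion, optionally with a system message."""
    messages = ([{"role": "system", "content": system}] if system else []) + [
        {"role": "user", "content": prompt}
    ]
    response = client.chat.completions.create(model=MODEL, messages=messages)
    return response.choices[0].message.content

def expert_answer(question: str) -> str:
    # Stage 1: have the model write the expert identity itself,
    # instead of the user hand-crafting one.
    identity = ask(
        "Write a short description of the ideal expert to answer the "
        "question below: their field, credentials and experience.\n\n"
        + question
    )
    # Stage 2: answer the question while conditioned on that identity.
    return ask(question, system=f"You are the following expert:\n{identity}")
```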
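On algorithmic diversity: a crude sketch of the high-temperature sampling idea, drawing several code solutions at temperature above 1.0. Deduplicating whitespace-normalised source text is only a rough proxy for the paper's algorithmic-diversity measures, and the model name is an assumption.

```python
# Crude diversity probe: sample n solutions at temperature > 1.0 and
# count textually distinct ones. Assumes an OpenAI-style chat API;
# this proxy is far weaker than the paper's algorithmic measures.
from openai import OpenAI

client = OpenAI()

def sample_solutions(task: str, n: int = 10, temperature: float = 1.3) -> list[str]:
    """Draw n independent code solutions at the given temperature."""
    response = client.chat.completions.create(
        model="gpt-4o-mini",  # assumption
        messages=[{"role": "user",
                   "content": f"Write a Python function that {task}. "
                              "Return only code."}],
        temperature=temperature,
        n=n,  # n independent samples in one request
    )
    return [choice.message.content for choice in response.choices]

def distinct_ratio(solutions: list[str]) -> float:
    """Share of samples that remain distinct after whitespace stripping."""
    normalised = {"".join(s.split()) for s in solutions}
    return len(normalised) / len(solutions)

if __name__ == "__main__":
    samples = sample_solutions("sorts a list of integers without using sort()")
    print(f"distinct share at T=1.3: {distinct_ratio(samples):.2f}")
```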
(The commentary contained in the above article does not constitute an offer or a solicitation, or a recommendation to implement or liquidate an investment or to carry out any other transaction. It should not be used as a basis for any investment decision or other decision. Any investment decision should be based on appropriate professional advice specific to your needs. You are not permitted to publish, transmit, or otherwise reproduce this information, in whole or in part, in any format to any third party without the express written consent of Macro Hive. This includes providing or reproducing this information, in whole or in part, as a prompt.)