Funding & Investment in Travel
Sakana AI’s TreeQuest: Deploy multi-model teams that outperform individual LLMs by 30%
Want smarter insights in your inbox? Sign up for our weekly newsletters to get only what matters to enterprise AI, data, and security leaders. Subscribe Now
Japanese AI lab Sakana AI has introduced a new technique that allows multiple large language models (LLMs) to cooperate on a single task, effectively creating a “dream team” of AI agents. The method, called Multi-LLM AB-MCTS, enables models to perform trial-and-error and combine their unique strengths to solve problems that are too complex for any individual model.
For enterprises, this approach provides a means to develop more robust and capable AI systems. Instead of being locked into a single provider or model, businesses could dynamically leverage the best aspects of different frontier models, assigning the right AI for the right part of a task to achieve superior results.
The power of collective intelligence
Frontier AI models are evolving rapidly. However, each model has its own distinct strengths and weaknesses derived from its unique training data and architecture. One might excel at coding, while another excels at creative writing. Sakana AI’s researchers argue that these differences are not a bug, but a feature.
“We see these biases and varied aptitudes not as limitations, but as precious resources for creating collective intelligence,” the researchers state in their blog post. They believe that just as humanity’s greatest achievements come from diverse teams, AI systems can also achieve more by working together. “By pooling their intelligence, AI systems can solve problems that are insurmountable for any single model.”
Thinking longer at inference time
Sakana AI’s new algorithm is an “inference-time scaling” technique (also referred to as “test-time scaling”), an area of research that has become very popular in the past year. While most of the focus in AI has been on “training-time scaling” (making models bigger and training them on larger datasets), inference-time scaling improves performance by allocating more computational resources after a model is already trained.
One common approach involves using reinforcement learning to prompt models to generate longer, more detailed chain-of-thought (CoT) sequences, as seen in popular models such as OpenAI o3 and DeepSeek-R1. Another, simpler method is repeated sampling, where the model is given the same prompt multiple times to generate a variety of potential solutions, similar to a brainstorming session. Sakana AI’s work combines and advances these ideas.
“Our framework offers a smarter, more strategic version of Best-of-N (aka repeated sampling),” Takuya Akiba, research scientist at Sakana AI and co-author of the paper, told VentureBeat. “It complements reasoning techniques like long CoT through RL. By dynamically selecting the search strategy and the appropriate LLM, this approach maximizes performance within a limited number of LLM calls, delivering better results on complex tasks.”
How adaptive branching search works
The core of the new method is an algorithm called Adaptive Branching Monte Carlo Tree Search (AB-MCTS). It enables an LLM to effectively perform trial-and-error by intelligently balancing two different search strategies: “searching deeper” and “searching wider.” Searching deeper involves taking a promising answer and repeatedly refining it, while searching wider means generating completely new solutions from scratch. AB-MCTS combines these approaches, allowing the system to improve a good idea but also to pivot and try something new if it hits a dead end or discovers another promising direction.
To accomplish this, the system uses Monte Carlo Tree Search (MCTS), a decision-making algorithm famously used by DeepMind’s AlphaGo. At each step, AB-MCTS uses probability models to decide whether it’s more strategic to refine an existing solution or generate a new one.
The researchers took this a step further with Multi-LLM AB-MCTS, which not only decides “what” to do (refine vs. generate) but also “which” LLM should do it. At the start of a task, the system doesn’t know which model is best suited for the problem. It begins by trying a balanced mix of available LLMs and, as it progresses, learns which models are more effective, allocating more of the workload to them over time.
Putting the AI ‘dream team’ to the test
The researchers tested their Multi-LLM AB-MCTS system on the ARC-AGI-2 benchmark. ARC (Abstraction and Reasoning Corpus) is designed to test a human-like ability to solve novel visual reasoning problems, making it notoriously difficult for AI.
The team used a combination of frontier models, including o4-mini, Gemini 2.5 Pro, and DeepSeek-R1.
The collective of models was able to find correct solutions for over 30% of the 120 test problems, a score that significantly outperformed any of the models working alone. The system demonstrated the ability to dynamically assign the best model for a given problem. On tasks where a clear path to a solution existed, the algorithm quickly identified the most effective LLM and used it more frequently.
More impressively, the team observed instances where the models solved problems that were previously impossible for any single one of them. In one case, a solution generated by the o4-mini model was incorrect. However, the system passed this flawed attempt to DeepSeek-R1 and Gemini-2.5 Pro, which were able to analyze the error, correct it, and ultimately produce the right answer.
“This demonstrates that Multi-LLM AB-MCTS can flexibly combine frontier models to solve previously unsolvable problems, pushing the limits of what is achievable by using LLMs as a collective intelligence,” the researchers write.
“In addition to the individual pros and cons of each model, the tendency to hallucinate can vary significantly among them,” Akiba said. “By creating an ensemble with a model that is less likely to hallucinate, it could be possible to achieve the best of both worlds: powerful logical capabilities and strong groundedness. Since hallucination is a major issue in a business context, this approach could be valuable for its mitigation.”
From research to real-world applications
To help developers and businesses apply this technique, Sakana AI has released the underlying algorithm as an open-source framework called TreeQuest, available under an Apache 2.0 license (usable for commercial purposes). TreeQuest provides a flexible API, allowing users to implement Multi-LLM AB-MCTS for their own tasks with custom scoring and logic.
“While we are in the early stages of applying AB-MCTS to specific business-oriented problems, our research reveals significant potential in several areas,” Akiba said.
Beyond the ARC-AGI-2 benchmark, the team was able to successfully apply AB-MCTS to tasks like complex algorithmic coding and improving the accuracy of machine learning models.
“AB-MCTS could also be highly effective for problems that require iterative trial-and-error, such as optimizing performance metrics of existing software,” Akiba said. “For example, it could be used to automatically find ways to improve the response latency of a web service.”
The release of a practical, open-source tool could pave the way for a new class of more powerful and reliable enterprise AI applications.
Source link
Funding & Investment in Travel
North Korea’s ‘Benidorm’ resort bans foreign visitors – despite bid to bring tourists | World | News
International visitors have been banned from North Korea‘s massive new beach resort following its grand opening. The Wonsan Kalma complex, unveiled by leader Kim Jong-un at the end of June and dubbed the North Korean Benidorm, boasts a capacity for nearly 20,000 guests and includes accommodation, a shoreline, sporting venues, and restaurants.
Kim declared it would be remembered as “one of the greatest successes this year” and hailed the location as “the proud first step” towards advancing tourism. However, only North Koreans can experience the facilities. DPR Korea Tour, a platform operated by the nation’s tourism officials, announced that the resort “is temporarily not receiving foreign tourists”.
No additional information was provided regarding the reasons behind the ban or how long it would last. Shortly after its launch, a limited number of Russians were the only foreign tourists to visit.
North Korea may have stopped international visitor access after a Russian journalist wrote a damning story about the Wonsan Kalma resort.
Accompanied the Russian foreign minister, the journalist suggested the people at the resort were government operatives rather than genuine guests.
Kim has been pushing to make North Korea a tourist destination as part of efforts to revive the isolated country’s struggling economy.
Wonsan Kalma, with a 2.5 mile beach, is one of Kim’s most-discussed tourism projects, and state media reported North Korea will also confirm plans to build large tourism areas in other parts of the country.
Photos shared by state media show the leader taking in the views and watching someone go down a slide.
Despite the dangers Westerners may face if allowed to visit, Brits have voiced their desire to go.
Holiday planners On The Beach opened a link for people to express their interest, which racked up more than 250 sign-ups from Brits within a month.
Funding & Investment in Travel
Amazon has a dual voltage hot air brush on sale for 15% off
If you’ve ever landed in a dreamy European Airbnb only to realize your blow dryer won’t work (or worse, fries itself in the outlet — #ToAlltheDysonsWeveLostBefore), you know the struggle is real.
Voltage mismatches are the silent saboteurs of great hair days abroad, so if you want to avoid them at all costs, a versatile blowout brush that adapts to both U.S. and international power standards is a great tool to have.
Amazon has the Aima Beauty Hot Air Brush on sale for 15% off with a clickable coupon, and if you’re a Prime member you can get it as early as tomorrow with free overnight shipping.
Aima Beauty Worldwide Travel Hair Dryer Brush
$42 NOW FOR $36
Apply the clickable 15% off coupon by checking the box next to the orange “Coupon” banner beneath the regular retail price.
The lightweight, oval-shaped hot air brush is designed to do it all: heat up fast, dry, straighten, volumize, and smooth, so you don’t have to pack multiple tools or gamble on the hotel hair dryer.
According to Aima and dozens of happy Amazon customers, the compact, 10-inch x 3.4-inch brush — which weighs less than a pound — is said to fit easily into a suitcase or backpack without hogging space or adding bulk.
It features 360-degree airflow and negative ion technology, which the company says helps reduce frizz, boosts shine, and protects against heat damage — all of which are ideal for bouncing back from long flights or humid climates.
The nylon pins and tufted bristles are built for detangling and gripping hair without tugging, and the ergonomic handle contributes to more effortless styling, even in a cramped hotel bathroom.
With two heat and speed settings, you can tailor the airflow to your hair type — lower temp and slower speed for fine strands, higher temp and faster speed for thicker textures. And thanks to the 360-degree swivel cord, you won’t get tangled up mid-style.
But the real flex with this dryer is its dual voltage, which means you can adjust its voltage from 110-240V (110-120V for the U.S., Canada, Japan, etc. 220-240V for the E.U., U.K., Australia, etc.) with a simple screwdriver turn. So, whether you’re in Paris, Tokyo, or a remote Greek island, you can plug in with confidence.
Aima even includes a free European plug adapter in the box. Depending on your destination, you may need to purchase other adapters separately.
If you’ve ever been burned by hair drying across the pond, literally or figuratively, needed to buy a last-minute hair dryer abroad (and then leave it behind), or gone days without styling because your tools just didn’t work, this brush is your redemption arc.
Grab it with the 15% off coupon and saves yourself time, money, and suitcase space.
Other travel deals to shop at Amazon right now
Our journalism needs your support. Please subscribe today to NJ.com.
Danielle Halibey can be reached at dhalibey@njadvancemedia.com. Have a tip? Tell us at nj.com/tips.
If you purchase a product or register for an account through a link on our site, we may receive compensation. By using this site, you consent to our User Agreement and agree that your clicks, interactions, and personal information may be collected, recorded, and/or stored by us and social media and other third-party partners in accordance with our Privacy Policy.
Funding & Investment in Travel
Dozens dead after tourist boat capsizes in Vietnam
At least 34 people have died and several are still missing after a tourist boat capsized in Vietnam during bad weather.
The incident took place in Ha Long Bay, a popular tourist destination in the north of the country.
Most of the passengers were reportedly Vietnamese families visiting from the capital Hanoi.
Heavy rain has been hindering the search for survivors, rescuers say, but so far 11 people have been pulled from the water alive.
The vessel, named Wonder Seas, was carrying 53 people when it capsized after encountering a sudden storm, a statement from the Vietnamese Border Guards and navy said.
An eyewitness told AFP news agency that the sky darkened around 14:00 local time on Saturday (07:00 GMT).
There were “hailstones as big as toes with torrential rain, thunderstorm and lightning”, he said.
A 10-year-old boy was rescued after being trapped in an air pocket in the upturned hull, local media say.
“I took a deep breath… dived, then swam up. I even shouted for help, then I was pulled up by a boat”, the boy – who had been travelling with his parents – told state media outlet VietnamNet.
Of the bodies so far recovered, at least eight were children, VNExpress reports.
Rescue efforts are set to continue into the night to find the many still missing.
Prime Minister Pham Minh Chinh sent his condolences to the families of the dead.
Authorities will investigate the cause of the accident and “strictly handle violations”, a government statement said.
Ha Long Bay in Quang Ninh province is dotted with hundreds of tiny islets, attracting 4 million tourists in 2019, and is a Unesco World Heritage site.
-
Mergers & Acquisitions1 week ago
Amazon weighs further investment in Anthropic to deepen AI alliance
-
Mergers & Acquisitions1 week ago
How Elon Musk’s rogue Grok chatbot became a cautionary AI tale
-
Mergers & Acquisitions1 week ago
UK crime agency arrests 4 people over cyber attacks on retailers
-
Brand Stories2 weeks ago
Voice AI Startup ElevenLabs Plans to Add Hubs Around the World
-
Asia Travel Pulse2 weeks ago
Looking For Adventure In Asia? Here Are 7 Epic Destinations You Need To Experience At Least Once – Zee News
-
AI in Travel2 weeks ago
‘Will AI take my job?’ A trip to a Beijing fortune-telling bar to see what lies ahead | China
-
Mergers & Acquisitions1 week ago
EU pushes ahead with AI code of practice
-
Mergers & Acquisitions2 weeks ago
ChatGPT — the last of the great romantics
-
Mergers & Acquisitions1 week ago
Humans must remain at the heart of the AI story
-
The Travel Revolution of Our Era1 month ago
CheQin.ai Redefines Hotel Booking with Zero-Commission Model