Ask HN: How to Use "Deep Research"?

17 points by muddi900 a day ago

The issue I have had with llms since the start has been the slop. Contrary to popular belief, the open web was mostly slop before ChatGPT. We just used to call it SEO blogspam. And all llms are trained on them.

So when I tried google's Deep Research in Gemini, I ran into the same problem. It was basically a typical llm chat response, but a lot more verbose and citations to the same blogspam listicles that makes regular human research difficult.

How do I avoid this pitfall? So what I am asking is, how do I use deep research?

leonidasv 4 hours ago

For market research, Exa.ai may be a better option. It's a search engine that crawls the web and index pages in LLM embeddings. For my searches, it delivers good results that are not found in Google's first page because the SEO spam is burying it. You should give it a try.

Boristoledano 3 hours ago

Hey, co-founder of linkup.so here. Did you try our API? I'd love to understand why you chose one option over another

addaon 20 hours ago

Use it for what? If your goal is understanding… probably don’t. If you’re trying to produce a document for an audience that doesn’t really care and will just use an LLM to summarize it without any fact checking, then give the minimal prompting to get the token count you need. This is where LLMs excel —- sequences of tokens that are plausible use of human languages where the cost for verifying if they have semantic value exceeds five or ten minutes.

Boristoledano 3 hours ago

Integrate our web search agent (linkup.so) which has been trained to cut through the noise :)

GianFabien 19 hours ago

My naive, yet practical, view is that all LLMs produce output that reflects what was most commonly found on the internet. So if your area of interest has a lot of slop, then that is exactly what Deep Research will return.

So if you want to do "real deep research" into some topic, then you start with the various summaries and drill down. I use Google Scholar to locate papers, then I search for the cited papers. It is tedious, but the only way that I know which avoids the pitfalls of random regurgitation of easily found materials.

Currently AGI is marketing hype. Critical thinking with real understanding, i.e. knowledge building hasn't yet been automated. You need to put in the intellectual effort.

muzani 4 hours ago

That sounds almost exactly what ChatGPT's Deep Research does. (Not to be confused with Grok's which seems to be doing mostly busywork)
It's a bit like a crawler, goes through papers, then follows down to other relevant papers. One advantage I see it seems to be able to search through movies as well.
But in the end you get a stack of reading materials and still have to do the work of reading them. I did try to get it to summarize them into instructions but it drops the main points.

solardev 20 hours ago

What topic or field are you researching? Is it something academic, programming related, or...?

It generally helps if you ask it to focus on specific kinds sources (gov sites, academic, etc.) but it really depends on the topic. Easy for some topics (the general sciences) but not so easy for other ones (popular culture stuff or current events).

muddi900 10 hours ago

Market research.
I guess I should have included the purpose of my research.
- solardev an hour ago
  
  I think it's going to be terrible at that :( Separating SEO spam from useful products isn't a strength of LLMs.
  Probably a normal prompt without deep research would be better for that (but less recent), especially for topics that humans previously discussed a lot in the past (like on pre-AI reddit). Signal to noise ratio is only going to go down, down, down from here... :(
  I don't think you'll get useful results from deep research in a case like this, trying to distinguish between all the contemporary advertisers and cut through the spam. Deep research is good at summarizing sources, and it works well if you can tell it what kind of trustworthy sources to focus on. It's not so good at judging the relative merit of each source. And in this case, there are no trustworthy sources... it's all just spam. If you manage to get good results out of it for market research, you should probably just start a business and become a billionaire selling that functionality instead of whatever you were researching :)

transformi 8 hours ago

Searching for free coupons :)

beefnugs 17 hours ago

Hire an expert in the field who can create 100 sentences of "dont do this... dont do that..." then create your own business as a middleman who keeps this list a top secret and supplies the service to others.

Then in a couple months go out of business when someone tricks the AI to spill its prompt

muddi900 10 hours ago

I am actually trying to use it for the purpose it is built for; Commercial research.

dangus 17 hours ago

Do actual research like a researcher instead of being a lazy bum with LLMs?

muddi900 10 hours ago

That seems to be only real course of action, but I need to be sure if I am doing anything wrong.