Re: [DIYbio] Repurposing Bio Paper webcrawls

All sorts of "A.I." software implementations are incredibly useful, period. But human users benefit from being somewhat careful with the facts they extract from them.

Being 'careful' is hard for the same reason that autonomous vehicle hardware and software gone bad usually generates accidents. A failure rate in the low single-digit percents (absolute max) is a tough go for human awareness.


Misuse of things, in the general case, creates hardship. This is not an exclusive property of some sorts of fancy software!

Endless social engineer wannabes want to inflict an endless parade of limitations on A.I. generally. As always, they yowl up some singular sad story of "this thing did a bad thing", not too interested in the 100:1 of utility generated.

Shame on you, you low-output-loving do-gooders.

So: YES! Programs of all sorts, including machine learning, AI, etc., will help you deliver things that are desirable, but on the way you have to be careful.

The problem is not the implementations, it's the expectations leaning into A.I. as new-wave magic. Complaining, however, has some utility: it warns 'us all' to remember what we already know about advice and synopses of all sorts from humans. Sometimes they look right, and aren't.

Regs to all,
Daniel B. Kolis ; 15 Apr 2026


 

--
-- You received this message because you are subscribed to the Google Groups DIYbio group. To post to this group, send email to diybio@googlegroups.com. To unsubscribe from this group, send email to diybio+unsubscribe@googlegroups.com. For more options, visit this group at https://groups.google.com/d/forum/diybio?hl=en
Learn more at www.diybio.org
---
You received this message because you are subscribed to the Google Groups "DIYbio" group.
To unsubscribe from this group and stop receiving emails from it, send an email to diybio+unsubscribe@googlegroups.com.
To view this discussion visit https://groups.google.com/d/msgid/diybio/4421489a-3f36-4832-ab77-80dc8d3236b9n%40googlegroups.com.


Re: [DIYbio] Repurposing Bio Paper webcrawls

I find it maddening when research papers use the idiom "AI" to refer to more reputable uses of machine learning, when doing so just conflates them with, and empowers, the overcapitalised plagiarism engines we hear about ad nauseam, day in, day out.

"Machine Learning": huge potential, if potentially hazardous. Has yielded fantastic research and technology in careful hands. Needs tight regulation or a rolled-up newspaper any time someone suggests using it to govern society.

LLMs and Chatbots: All rolled-up-newspaper, all the time. Go directly to the sea, do not pass go, do not collect your vested stock.

--
Are you at all interested in Irish Mythology? You might like my newsletter, The Gods and their Croziers:



15 Apr 2026, 13:20 by dcrookston@gmail.com:
Unfortunately, LLMs are essentially lie machines. They may get lucky and find some useful output, but I would think twice before trusting an LLM with real research.

I understand that I am in the minority here. You do not need to shower me with examples of "reputable" researchers using LLMs. I already know they exist, and you can probably infer what I think of them.

-DTC



Re: [DIYbio] Repurposing Bio Paper webcrawls

Unfortunately, LLMs are essentially lie machines. They may get lucky and find some useful output, but I would think twice before trusting an LLM with real research.

I understand that I am in the minority here. You do not need to shower me with examples of "reputable" researchers using LLMs. I already know they exist, and you can probably infer what I think of them.

-DTC

On Sat, Mar 14, 2026, 8:47 PM Jonathan Cline <jncline@gmail.com> wrote:
A couple paragraphs in a recent Scientific American article caught my eye:

"Mathematicians find one pi formula to rule them all.
A mixture of AI and algorithms uncovered a hidden structure spanning 2,000 years of equations for pi"

"The group, who also have backgrounds in areas such as physics and math, approached the problem like experimentalists and decided to gather a dataset. Tomer Raz, then a master’s student at Technion, wrote code to download every math paper that had ever been uploaded to the preprint server arXiv.org, running his laptop seven days a week, 24 hours a day, for six weeks to download 455,050 papers at a slow enough rate to respect the website’s limit.

The group then deployed GPT-4o in combination with specialized algorithms to detect pi-related equations, translate them into executable code, and remove trivial duplicates. From nearly half a million papers, they extracted 385 unique formulas, including about 10 percent that originated from the Ramanujan Machine."


Some of you, having written spidering code long ago and already downloaded every Bio paper published as a PDF by every major publisher since the 1970s, might want to ponder what new things to do with those Bio PDFs.
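
For anyone who has not yet written the spidering half, the "slow enough rate" part of the quoted article can be sketched roughly as below. This is only an illustration, not the Technion group's code: `polite_fetch_all`, `seconds_per_paper`, and the `fetch` callable are made-up names, and the actual download function (and the site's real rate limit) is left for you to supply. The only numbers taken from the article are 455,050 papers and six weeks, which works out to roughly one paper every 8 seconds.

```python
import time
from typing import Callable, Dict, Iterable, Optional


def seconds_per_paper(papers: int, weeks: float) -> float:
    """Average spacing between downloads for a crawl running around the clock."""
    return weeks * 7 * 24 * 3600 / papers


def polite_fetch_all(
    urls: Iterable[str],
    fetch: Callable[[str], bytes],  # hypothetical: supply your own downloader
    min_interval_s: float = 8.0,    # assumption: check the site's published limit
    sleep: Callable[[float], None] = time.sleep,
    clock: Callable[[], float] = time.monotonic,
) -> Dict[str, bytes]:
    """Fetch URLs one at a time, leaving at least min_interval_s between request starts."""
    results: Dict[str, bytes] = {}
    last_start: Optional[float] = None
    for url in urls:
        if last_start is not None:
            # Wait out whatever remains of the minimum interval.
            wait = min_interval_s - (clock() - last_start)
            if wait > 0:
                sleep(wait)
        last_start = clock()
        results[url] = fetch(url)
    return results


# Figures quoted in the article: 455,050 papers in six weeks.
avg_spacing = seconds_per_paper(455_050, 6)
```

The `sleep` and `clock` parameters are there only so the pacing can be checked without waiting in real time; in normal use you would leave them at their defaults.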



-- 
## Jonathan Cline
## jcline@ieee.org
## Mobile: +1-805-617-0223
########################

