[DIYbio] Re: Targeting Compounds for Biosynthesis using Data

To those so gracious enough to invest your valuable time in contributing your thoughts towards my original post, I extend my sincerest gratitude. 

I'll add my replies on a few of the responses below and then leave the thread as it sits, less there is a direct question or response that warrants my return. The responses have given me ample information to sit with and explore.

>  I would have to say the most expeditious route would best be to leave expert biology to the expert biologists who have individually studied the topic rigorously for at least a decade via graduate and/or postdoc studies

I'd say your reply was the most practical and would likely be the same advice I'd give to someone else in my shoes. This approach would certainly optimize my time and skills for success (success being defined as making some sort of valuable contribution or progress towards the end objective of creating a synthetically engineered compound of high value).

It does not; however, address other personal goals and needs in the venture that correlate to my desire for meaningful work. The details are personal in nature but I'll just say that my definition of success is not contingent upon some end state, but rather to embark on the journey itself.

Thus far, the journey of self-learning synthetic biology and reaching towards an arguably unobtainable goal has already yielded experiences, connections and knowledge that I couldn't have imagined and provided a level of satisfaction in my life that I had yet to obtain from my many years grinding it out in business + tech.

This conversation reminds me of something I share with folks from time-to-time.

I grew up playing a brass instrument for many years for a while, I really struggled hitting the high C. My band director once told me that instead of aiming for the high C, aim higher - aim for the  E whenever I practiced. Lo-and-behold, after a few tries, I broke past the barrier and not only did hitting the high C become a non-issue, I found myself playing in the upper registers with relative ease.

I have embraced this idea of aiming higher than what I think I'm capable of in life and it has yielded great results. I can't say that I've always hit the upper register in all attempts, but aiming for something beyond my reach has made me a better person.

> Hi Josh, let me start off by wishing you the best of luck. I know the high-value compound you are looking for, its synthetic "FBS"

I did a little digging into this and I think this is simply just beyond the capabilities of any one man, certainly this one! There are many fine folks working on this, some of which who've made  progress, but the shear diversity of uses for FBS across a variety of cell-types makes a wholesale replacement infeasible for what I'm after. That said, there was enormous personal value in researching FBS, it's uses, challenges, etc - I learned a ton. So thanks for the recommendation and of course, the well-wishes.
--

Sean, you've presented several interesting ideas and resources and you're onto something on the software side that perhaps we'll need to dig into further when we next chat.

Re: saffranol

It's a great idea. I did find some relatively recent work on safranal biosythesis, in particular the identification of genes and metabolic pathways for the production of the various apo-carotenoid compounds found in saffron (crocins, crocetin, picrocrocin, safranal, etc)

As recently as June, this paper demonstrated how crispr/cas9 was used to transform yeast to produce crocetin:

This could be a great jumping off point for further research.

Your software ideas are much aligned with where my original question was headed and I could ask a million follow-on questions so I'll need to dig into these a bit more. The idea of creating a bioengineering platform that sort of provides a library of components and a tool for linking them together is quite intriguing. That said, I believe there are already some biotech startups right now working to develop ML for doing some variation of this at-scale. Your idea is more of a self-serve whereas I feel like the investment community is more interested in large-scale automated approaches to same problem space and I can see why.

> Also an amazing thing would be enabling better natural language searching of the primary literature. Right now you have to know what keywords you are looking for and go through lots of papers to find relevant information.

There're really two parts to this idea in my mind. One is the NLP component, which is challenging given the varied and unstructured way in which research is published. Some papers leave out or obfuscate important data too which also makes it challenging. I'm also quite surprised that there isn't already some sort of structured way in which research data is shared and published back with the broader community.

> Being able to query something like, "What gene expression changes happen when yeast is placed in a hyperosmotic solution?" and get papers that focus on that question would be so helpful.

Now this is actually a more tractable problem that could be solved with existing search engine technology. I suspect the challenge here will be in gaining access to the research itself as most of it is behind paywall. Presuming access to the research could be addressed, it would then be a matter of identifying metadata or annotating the research to augment the search index. Once that is done, existing search engines are already adept at responding to question-based queries.

-- 
OK folks. I'll leave it at that. Thank you again to all who responded and expressed support. The journey continues and I look forward to sharing updates on the site, social and my YouTube channel (under construction) in the coming weeks. 

On Monday, August 3, 2020 at 1:09:31 PM UTC-7 Sean Sullivan wrote:

I know you are interested in finding economically attractive targets to synthesize, so I wanted to provide links for a few resources that the government has published where they analyze what chemicals are most attractive to be produced biotechnologically:

 

The caveat to these is that these chemicals would need to be produced at scale, so maybe not the type of target that you need.

 

The margin on a biotech product tends to be proportional to the difficulty it is to synthesize and inversely proportional to the volume that can be produced. Biological therapeutics are low volume/high margin but there's a lot of regulations around this space. Something I haven't looked too much into but I know the margins are good and the regulatory regime much softer is the Flavors and Fragrances (F&F) industry, especially for fragrances.

 

One idea I had would be to produce the active ingredients in saffron, the most prominent of which is safranal - https://en.wikipedia.org/wiki/Safranal

 

Saffron is currently painstakingly harvested from wild-growing flowers and has very low yield which results in a very high price. Check out the following video - Why Saffron Is The World's Most Expensive Spice

 

You would need to figure out what subset of the compounds that the saffron plant makes that you would need to make to recapitulate the flavor profile, but this can often be surprisingly few of the hundreds of compounds a plan makes. Researchers associated with Jay Keasling found they could make a delicious hoppy beer using no hops by having the brewing yeast make just two (I think) of the natural oils that the hop plant makes - https://www.nature.com/articles/s41467-018-03293-x

 

My (not current) understanding is that there's a bit of a hop shortage world-wide, could be an opportunity to recreate different hop flavors by having the yeast make their most important chemical components during the brewing process…

 

In terms of really cool software tools you could develop, the first big idea I came up with would be a single software package or web interface that made it very easy to collect data from disparate biological databases to answer the types of questions a bioengineer might have such as:

  • Upon seeing the name of a gene/protein that they haven't heard of before - "What does it do? Has it been characterized in vitro or in vivo?"
    • This could involve pulling information on the substrates that the protein acts on from KEGG, then cross-referencing the BRENDA enzyme database to see if is any published activity data, maybe for a mammalian protein you would be very interested in knowing where the enzyme localizes in the cell which could be extracted from UniProt…
  • Having a target molecule they want to biosynthesize - "What reactions are necessary to synthesize this metabolite in my chosen organism? What different homologous enzymes are known that I could express to catalyze these reactions? Have any been characterized?"

 

Also an amazing thing would be enabling better natural language searching of the primary literature. Right now you have to know what keywords you are looking for and go through lots of papers to find relevant information. Being able to query something like, "What gene expression changes happen when yeast is placed in a hyperosmotic solution?" and get papers that focus on that question would be so helpful.



On Wednesday, July 22, 2020 at 3:28:24 PM UTC-4, everymanbio.com wrote:
Hi friends. First and foremost, I want to wish everyone reading this my best wishes during these challenging times.

After spending 15 years working in tech as a software engineer + exec and eye-opening psychedelic experience, I've decided to pivot my life into a new direction: documenting my journey to self-learn genetic engineering and the development of a new product that helps mankind.

It's early days, but I have a working site and social media up here if you'd like to learn more: https://everymanbio.com/ + https://www.instagram.com/everymanbio/ 

One of the projects I'm using to concentrate and advance my learning is to engineer an organism and use it to produce a valuable compound en masse. The idea isn't new; a couple of youngsters recently utilized this technique to create a $200M company that produces hydrogen peroxide with yeast.

I'm enthralled by the idea of using something like Saccharomyces cerevisiae or e.coli to produce a highly valued compound at low-cost and reasonable biological efficiency. 

My question is this: how can I use public data sources to identify potential targets for biosynthesis? Given I can code and things like crawling, big data collection and analysis are well within my wheelhouse, I can't help but to think of the value it might offer to me and the community in narrowing in on which compounds would be best suited for biosynthesis from am GMO'd organism.

But! I don't know what I don't know and I could use some insights into the following:
  • What features or attributes make a compound a feasible target for biosynthesis? What makes it completely infeasible?
  • What data sources can be used to search and assess compound targets? I'm looking for ways to both quantify high-demand /valuable compounds and compounds that are suited for biosynthesis.
Ideally, I'd like the end result of my analysis to capture something akin to:
Compound Name | E.Coli Synthesis Feasible? | Yeast Synth Feasible? | Market Value | Production Difficulty | Metabolic Complexity | Compound Use (Medical, Industrial, etc)

I realize this is probably fairly wide in scope of a question, but I hope to at least get the conversation going and to be pointed in the right direction.

If you'd like to reach out to me directly to discuss the idea or explore a collaboration, please feel free to email me jo...@everymanbio.com

With much gratitude,

Josh McGinnis, Founder of EverymanBio




  

--
-- You received this message because you are subscribed to the Google Groups DIYbio group. To post to this group, send email to diybio@googlegroups.com. To unsubscribe from this group, send email to diybio+unsubscribe@googlegroups.com. For more options, visit this group at https://groups.google.com/d/forum/diybio?hl=en
Learn more at www.diybio.org
---
You received this message because you are subscribed to the Google Groups "DIYbio" group.
To unsubscribe from this group and stop receiving emails from it, send an email to diybio+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/diybio/e762282b-cbc4-457c-9fee-12e1c4d4e758n%40googlegroups.com.

  • Digg
  • Del.icio.us
  • StumbleUpon
  • Reddit
  • RSS

0 comments:

Post a Comment