|
and the Theory of Molecular Machines |
Bits measure biological conservation! |
|
Brief overview: Biological Information Theory (BIT) is the application of Shannon's information theory to all of biology. Tom Schneider is best known for inventing sequence logos, a computer graphic depicting patterns in DNA, RNA or protein that is now widely used by molecular biologists. Logos are only the beginning, however, as the information theory measure used to compute them gives results in bits. But why would a binding site have some number of bits? This led to a simple theory: the number of bits in the DNA binding site of a protein is the number needed to find the sites in the genome. Click on the dinosaur to see how these evolve! Next Tom asked how are bits related to binding energy? He solved this problem by using a version of the second law of thermodynamics to convert the bits to the energy needed to select them. Dividing the bits used to define a binding site by the bits that could have been selected for the given energy, he then discovered that the efficiency of DNA binding site selections is near 70% and he constructed a theory to explain this result. Schneider has a number of nanotechnology patents derived in part from this theory. |
--- Richard Dawkins, The Blind Watchmaker, 1986, Norton, p. 112. |
|
Note: These pages use two addional windows, a glossary and references. You can use frames in the bar to the left or launch them separately by clicking on these links. Once you have launched the windows, you can instantly look up papers or get glossary definitions. You can try this with the icons to the right in the green bar. |
![]() Sequence logos were invented here!
|
|
![]() Sequence walkers were invented here!
|
![]() Evolution - how DNA gets information!
|
![]() Sphere packing - biological states!
|
![]() Molecular efficiency - a measure of biological states |
|
|
Recommendations for Making Sequence Logos |
Publications using this server (PDF)
Publications using this server (text)
 
(https://tinyurl.com/splice-server-citations)
This server is recommended by the American College of Medical Genetics. | |
|
![]() |
What's New? |
![]() |
paper:
![]() ![]() |
![]() |
talk:
Biological Information Theory (BIT) gives a natural binding site cutoff
A Cancer Data Science Laboratory (CDSL) Zoom talk: Monday August 13 3:00 PM EST https://nih.zoomgov.com/j/1614867690 Slides: cutofftalk.pdf Video (at google drive): cutofftalk.mp4 Video (from this website): cutofftalk.mp4 ![]() |
![]() |
![]() |
talk:
![]() ![]() |
![]() |
paper:
Escherichia coli σ38 promoters use two UP elements instead of a -35 element: resolution of a paradox and discovery that σ38 transcribes ribosomal promoters Kevin S. Franco, Zhe Sun, Yixiong Chen, Cedric Cagliero, Yuhong Zuo, Yan Ning Zhou, Mikhail Kashlev, Ding Jun Jin and Thomas D. Schneider
![]() |
![]() |
paper:
Elements in the λ Immunity Region Regulate Phage Development: Beyond the `Genetic Switch' L. C. Thomason, K. Morrill, Murray, C. Court, B. Shafer, T. D. Schneider and D. L. Court",
![]() |
![]() |
paper:
Restriction enzymes use a 24 dimensional coding space to recognize 6 base long DNA sequences. Thomas D. Schneider and Vishnu Jejjala
![]() |
![]() |
paper:
osmotic: Density of σ70 promoter-like sites in the intergenic regions dictates the redistribution of RNA polymerase during osmotic stress in Escherichia coli Zhe Sun, Cedric Cagliero, Jerome Izard, Yixiong Chen, Yan Ning Zhou, William F Heinz, Thomas D Schneider and Ding Jun Jin
![]() |
![]() |
Presentation:
Evolution in a Nutshell
at Science UNRESTRICTED
2019 May 07.
![]() |
![]() |
web page:
What is a Preprint? ![]() |
![]() |
preprint:
Restriction enzymes use a 24 dimensional coding space to recognize 6 base long DNA sequences. Thomas D. Schneider and Vishnu Jejjala
|
![]() |
Presentation:
Evolution in a Nutshell
at Science UNRESTRICTED
2018 May 03.
![]() |
![]() |
Video and slides for my
2016
"Information Theory in Biology"
talk in Kanpur India
are now available on the Videos page
![]() |
![]() |
Glossary:
Reviews of the book: A Mind at Play
by Jimmy Soni and Rob Goodman, 2017
![]() |
![]() |
Binary Release:
Mac OS X Binary versions
of the Delila system programs were released in a single zip directory:
delila_binaries.zip
![]() |
![]() |
Presentation:
Evolution in a Nutshell
at Science UNRESTRICTED
2017 May 11.
![]() |
![]() |
talk: "Three Principles of Biological States: Ecology and Cancer" at the University of Missouri Life Sciences Week, 1:15 p.m. April 11 in the Monsanto Auditorium at Bond Life Sciences Center. |
![]() |
paper:
Qian.Adhya2016
Z. Qian, A. Trostel, D. E. A. Lewis, S. Jun Lee, X. He, A. M. Stringer, J. T.
Wade, T. D. Schneider, T. Durfee, and S. Adhya.
Genome-Wide Transcriptional Regulation and Chromosome Structural
Arrangement by GalR in E. coli
Front Mol Biosci, 3:74, 2016.
http://dx.doi.org/10.3389/fmolb.2016.00074
![]() |
![]() |
program upgrade:
The
lister program
has been upgraded
so that multi-part sequence walkers are now completely connected
using vertical lines along with the rainbow horizontal connections:
![]() ![]() |
![]() |
talk:
"Information Theory in Biology"
at the meeting
Shannon Centenary
Department of Electrical Engineering
at the
Indian Institute of Technology, Kanpur, India,
Wednesday, October 19th, 2016.
Speaker: Thomas Schneider, National Institute of Health, USA In this talk I will sweep across the major ideas I have developed using information theory to understand biology (see https://alum.mit.edu/www/toms/). We will begin with measuring the information of protein or RNA binding sites on DNA or RNA (Rsequence, bits per site) using Claude Shannon's information theory. The resulting information curve can be displayed by the now-popular graphical method of sequence logos which we invented. The total information of binding sites (area under a logo) is predicted by the genome size and number of sites (Rfrequency, bits per site), and this leads to a model for the evolution of binding sites which you can run (https://alum.mit.edu/www/toms/papers/ev/). I will then introduce how to apply the same theory to individual binding sites, sequence walkers. An important question is the relationship between binding site information and the binding energy. This lead to my discovery that many molecular systems are 70% efficient. Surprisingly, the mathematics that explains 70% efficiency applies to all biological systems that have distinct states. ![]() |
![]() |
Presentation:
Evolution in a Nutshell
at Science UNRESTRICTED
2015 Apr 09.
![]() |
![]() |
talk:
Three Principles of Biological States: Ecology and Cancer
by Tom Schneider
2014 Oct 29 Wednesday 09:04-10:13
at the meeting
Biological and Bio-Inspired Information Theory (14w5170)
at the
Banff International Research Station (BIRS),
Banff, Canada.
![]() |
![]() |
poster:
Escherichia coli sigma38 promoters use two UP elements
instead of a -35
KS Franco
CNL Cagliero
YN Zhou
DJ Jin
TD Schneider
NIH Research Festival 2014. Wednesday, September 24, 2014 - Poster Session IV ![]() |
![]() |
talk:
Three Universal Principles of Biological States
Tom Schneider,
Bibliography: suggested reading
Frederick Faculty Seminar Series, Campus: NCI at Frederick, Location: Building 549 Auditorium 2014 June 11 12:00 PM to 1:30 PM. ![]() |
![]() |
Patent:
Nanoprobe,
US Patent No. 8,703,734 Issued April 22, 2014,
Ilya Lyakhov, Thomas D. Schneider, and Danielle Needle.
![]() |
![]() |
Paper:
An important paper was published confirming for the first time
my 70% efficiency discovery:
Analysis of DevR regulated genes in Mycobacterium tuberculosis.
Bandyopadhyay A, Biswas S, Maity AK, Banik SK.
Syst Synth Biol. 2014 Mar;8(1):3-20.
(PubMed link,
preprint at arxiv.)
They observed that the DevR protein has 66% isothermal efficiency.
![]() |
![]() |
talk:
Why is the Genetic Code Degenerate?
at
Bits <-> Biology
2014 May 1, Building E14, 6th floor, MIT
My presentation is available as a PDF
created by
PowerDot.
VIDEO LINK
![]() |
![]() |
paper:
Average of Fractions
a trivial but surprising result.
![]() |
![]() |
talk:
Why is the Genetic Code Degenerate?
at the meeting
Mathematical and Statistical Models for Genetic Coding
September 26th to 28th 2013, Mannheim, Germany
![]() |
![]() |
paper:
Confirmation of Base Flipping in Transcriptional Initiation
Predicted by Information Theory
in our flexprom paper!
![]() |
![]() |
talk:
Sequence Logos and the Helix of DNA (PDF)
![]() |
![]() |
Patent:
Nanoprobe,
by
Ilya Lyakhov, Thomas D. Schneider, and Danielle Needle.
Allowed 2012 Sep 24
![]() |
![]() |
video:
TechAlliance presents National Biotechnology Week - Cytognomix Inc.
![]() |
![]() |
paper:
Promoter variants in the MSMB gene associated with prostate
cancer regulate MSMB/NCOA4 fusion transcripts
![]() |
![]() |
Presentation:
Evolution in a Nutshell
at Science UNRESTRICTED
2012 Apr 24.
![]() |
![]() |
paper:
An unusual feature associated with LEE1 P1 promoters in
enteropathogenic Escherichia coli (EPEC)
![]() |
![]() |
talk:
Evolution of Binding Sites
by Tom Schneider.
A new
PDF version of the talk is available.
This talk emphasizes the Shannon uncertainty formula.
See Information Theory Primer
to learn more.
![]() |
![]() |
Patent:
Molecular motor,
United States Patent: 8,086,432
by
Thomas D. Schneider and Ilya Gennadiyevich Lyakhov.
Issued 2011 Dec 27.
Freepatentsonline for 8086432.
![]() |
![]() |
talk:
Why Do Restriction Enzymes Prefer 4 and 6 Base DNA Sequences?
Tom Schneider,
Bibliography: suggested reading
Frederick Faculty Seminar Series, Campus: NCI at Frederick, Location: Building 549 Auditorium 2012 Jan 11 12:00 PM to 1:30 PM. ![]() |
![]() |
paper:
A Brief Review of Molecular Information Theory
is now available at PubMed, PubMedCentral
![]() |
![]() |
New pages:
Mathematical Terms of Biological Information Theory.
![]() |
![]() |
Patent:
by
Peter K. Rogan.
Rapid and comprehensive identification of prokaryotic organisms.
In this patent Pete Rogan made a clever use of sequence logos.
See figure 2: this is a logo of 16S rRNAs. Pete
used the logo to
chose well conserved parts (into which to put PCR primers)
surrounding less conserved parts (which are different for
different species).
Thus the sequence of the fragment obtained can rapidly
identify a bacterial species.
The patent will issue on December 13th as US Pat 8,076,104.
![]() |
![]() |
Article:
Gaining Skills That Last a Lifetime (PDF)
an article about
Elaine (Bucheimer) Cagnina,
former
Werner H. Kirsten student intern
working with Tom Schneider
at NCI at Frederick in 1995-1996
in the March 2011
NCI at Frederick Poster, page 13
(whole issue, PDF)
![]() |
![]() |
Program upgrade:
MakeLogo now has Form Control.
The forms are:
normal logo,
varlogo,
equallogo,
rarelogo
and
rareequallogo.
![]() |
![]() |
Review of James Gleick's interview with NPR
![]() |
![]() |
Presentation:
Evolution in a Nutshell
at Science UNRESTRICTED
2011 May 4.
![]() |
![]() |
Talk:
by
Tom Schneider.
70% efficiency of bistate molecular machines explained by information
theory, high dimensional geometry and evolutionary convergence,
at the
Electrical & Computer Engineering
Colloquium Series
Rutgers Department of Electrical and Computer Engineering,
February 23, 2011.
Host:
Dr. Athina Petropulu
and
Dr. Christopher Rose.
Slides from Tom's talk.
![]() |
![]() |
Review of Tillman.Russel1961
![]() |
![]() |
US Patent 7,871,777 Issues:
MedusaTM Sequencing
by
Thomas D. Schneider, Ilya Lyakhov and Danielle Needle.
![]() |
Thanks for visiting!
For comments, problems, questions or suggestions
please contact
Tom Schneider.
This is my personal website.
I have set up a permanent
URL that will always point to wherever this page is:
|
A tiny URL for this web page is tinyurl.com/tomschneider; it also points to the MIT web page and hence is permanent.
You can view earlier pages with the incredible: Internet Archive WaybackMachine:
What's that icon?
origin: 1995 Feb 24
updated:
version = 6.84 of home.html 2024 Aug 26
unique visitors since 2005 Sep 4.
statcounter