Archive for the ‘teaching’ Category.

Against the Copenhagen Interpretation

I’ll get around to quantum mechanics eventually. Bear with me.

Biology is autonomous from physics. Any change to quantum mechanics will have at most cosmetic implications for biology. Quantum mechanics contributes nothing more than the existence of atoms and molecules, which are necessary for the lossless transmission of information. However, anything that replaces present day quantum mechanics must also predict atoms and molecules. The interactions between the two subjects a minimized because their interface is pinned by experiment.

Within physics, thermodynamics, fluid mechanics, elasticity, and the other macroscopic theories bear the same relation to quantum mechanics. The interfaces between the subjects are experimentally pinned.

A similar pattern appears within biology. Population genetics is built on molecular biology, but their interface is pinned by the existence of genes. We can replace our understanding of an allele’s molecular character, but that allele’s propagation in a population isn’t going to dramatically change.

This pinning is a form of damage control. All fields of science depend on other fields to be able to bootstrap themselves. The only way to keep the structure from falling to shreds is to pin the interfaces. A particle physicist uses macroscopic equipment, which experimentally obeys classical mechanics and electromagnetism. If the relation between his equipment and what he studies weren’t experimentally fixed in the intervening scales, his experiments would be impossible.

It is worth keeping your field as self contained as possible. Black boxes that reach into the heart of entirely different fields are a recipe for disaster.

This is why I dislike the Copenhagen interpretation of quantum mechanics. It posits an observer, whose observation collapses the wave function to an eigenstate of the observation in question. In so doing, it reaches directly to the heart of neuroscience, and builds quantum mechanics on the hardest, most central questions of an even larger area of science.

If the neuroscientists come up with the answer that consciousness isn’t anything special, just a pattern of spikes in neurons, then we have an insoluble problem. Further, there isn’t likely to be a physically robust structure of these firings which is consciousness, so what’s to stop other random patterns of particles from observing and collapsing wave functions? These difficulties are so great, that any interpretation — that is, a connection of the undisputed mathematical structure to reality — which is properly pinned at its boundaries are immediately preferable.

Yet most physicists aren’t willing to accept a new interpretation which requires conceptual gimmicks. Visualizations are tools with local use, not integral parts of theories. Since Heisenberg, we want our theories only to relate observable quantities, not enforce a particular picture. This was the great downfall of the Bohm-de Broglie pilot wave mechanics.

Thankfully, there is an interpretation which is both properly pinned and satisfies the Heisenberg aesthetic. There’s a beautifully written, absolutely simple book on it (Consistent Quantum Mechanics by Robert Griffiths). The book is available for free online.

Physical intuition in biology

To start, a question: why are the expositions of Landau and Lifshitz compelling? Here are ten volumes, without data. Compare them to any number of disastrous texts with an equal amount of data. Why does Landau & Lifshitz leave us credulous and so many others make us peery?

I think the answer is physical intuition. Chapters one and two of Mechanics set forth are a reference on physical intuition. A physicist believe the world has this shape, or one near to it, no matter what object we embed. For the next four volumes we get physics justified only from this intuition, and then the first chapter of volume 5 - Statistical Physics - hammers in the next piece of intuition: here is how many particles behave. The road forward is probabilistic.

These are the preconceptions a physicist carries with him into biology, where at first they seem to fail him. Some physicists become cynical and jettison this baggage (often at the behest of some biologists). Others refuse and build models that fail, fail, and fail again.

Step back for a moment. Imagine a physicist who has internalized the intuition in Mechanics but not in Statistical Physics. What is his fate? Feynman tells us (Feynman Lectures on Physics, vol.1, p.39-2):

“Anyone who wants to analyze the properties of matter in a real problem might want to start by writing down the fundamental equations and then try to solve them mathematically. Although there are people who try to use such an approach, these people are the failures of the field; the real successes come to those who start from a physical point of view, people who have a rough idea where they are going and then begin by making the right kind of approximations, knowing what is big and what is small in a given complicated situation.”

In other words, don’t neglect the first chapter of Landau and Lifshitz, vol.5.

What must we bolt onto our intuition to handle biology? The key is in Dobzhansky’s statement “Nothing in biology makes sense except in the light of evolution.”

A biologist tells a physicist, “When you have enough of X, this happens.” The naive physicist takes this literally: there exists a concentration of X at which something physically happens. But the mechanism in question probably exists in several related species, from different environments, all of which rely on similar versions of X. X probably is transcribed at different levels, into a different environment, and yet the system functions roughly the same.

More importantly, the system had to evolve, and the system had to function in all the environments leading up to the present. This is what we must add to our intuition. Exact fixed points and thresholds don’t happen. Now when we hear, “When you have enough of X, this happens,” from a biologist we filter it to, “In a bunch of organisms and places, some level of increase of X causes this to happen.” We do this with (as-is) absurd statements about mechanics and ensembles of particles every day.

This can be made rigorous in the same way as the pieces of Landau and Lifshitz. I suspect that evolutionary game theory of some form may be the proper language to express this.

What I call myself

What follows is abject navel gazing.

I originally studied physics. I did a lot of mathematics. I dabbled in computer science (and programmed a lot, which is something else). Now I’m immersed in biology. What do I call myself when someone asks?

These days, a mathematician.

This strikes me as odd: I don’t define my occupation by what I work on. But I think I finally understand why.

Here is the best definition I know of a physicist: someone who has been through Jackson’s E&M. If you survived it, you cemented a mindset shared by a community. In physics, everything is about time series. Anything not about time series is a statistical property of a time series.

In biology — not biophysics — trees are the natural unit. The logic of genetics deals in trees. Time series are awkward and dangerous. If we go to behavior and psychology, even the tree is inadequate.

After I settled into biology, I found myself with two deep mental frameworks, one next to the other. I spend a lot of thought trying to mesh them, and I have not yet managed it.

But wait! This is half of mathematics. Gian-Carlo Rota pointed out that axiomatic frameworks in mathematics are a sort of “hardware” for a “software” of mathematical facts. Lisp wizards who think of programs as abstract, nigh Platonic objects will interpret this the way I mean it; I hesitate to throw this analogy to anyone used to a FORTRAN descended language. A new mathematical fact is desirable; a new framework that brings known facts closer to trivial is just as novel. It itself is a mathematical fact. This way lies category theory.

The scientific equivalent of a stateless expatriate is a mathematician.

How to teach Haskell

The authors of Real World Haskell just posted the first drafts for reviewers, including me. Much of the book is spent trying to explain simple concepts, while addressing the subtleties that result from Haskell’s shorthands and abstracctions. Something occurred to me as I was reading: the easiest way to learn Haskell is to learn some Lisp.

But not traditional Lisp. This Lisp is lazy, has nothing but symbols, lists, and tuples, and enforces the type differences among them. There is a let form but no define, certainly no setq. It’s lambdas are curryable. The whole realm can be explored, its lessons learned, in a couple of days.

It’s a useless language, except for its purpose. It is Limbo to Haskell’s Inferno — not Haskell itself, but sitting on the edge, where the virtuous of the past go who did not survive to the monadic revolution…

After developing the patterns of lists and lambda calculus, and the basic habits of strong typing, in this realm, the first descent into Haskell is fast. In ten minutes, you discuss the slightly changed notation, point out that we have entirely hidden the underlying cons structure of the list, and arrive at the type system.

Now the type system is an obvious extension of a habit to handle this new, more complicated world. The first unexpected structure is the algebraic data type, a transformation of a piece of BNF. I think this is the only place in computer science where the BNF is the clearest way to teach something.

Then there’s a bumpy half an hour about the syntax for defining functions, a period of clarity in general but confusion in detail about type classes, and then you arrive at

MONADS

And I don’t know how to make this one easy. sigfpe has made the beginning clear, but the rest is still obscure.

On the other hand, if a newcomer picked up a book on Haskell and was told that they were going to spend two chapters learning a dialect of Lisp they would never use again, it might not sell well.

Programmer Virtue in Molecular Biology

Larry Wall taught us the three great virtues: laziness, hubris, and ego. I have long cultivated this in my life, and continue to do so now that I am a biologist.

My latest act of virtue (or perhaps virtù) involves selecting restriction enzymes for some cloning. Starting with a plasmid and a sequence I want to replicate in it, I need enzymes A, B, C, D, E, and X with the following properties:

  • A and B have compatible ends, but when those two ends are ligated, none of the enzymes can recut them.
  • None of the others have compatible ends.
  • A must appear one time in the plasmid without the insert.
  • B and C must not appear in the plasmid.
  • D, E, and X must appear one time in the plasmid.

Don’t worry about where these rules come from for now. I’ll explain another time. Instead, let us focus on my tool for laziness: Prolog. In Prolog, you write down a set of logical propositions and rules, and then query them. This sounds fairly trivial, but it’s in fact really, really useful.

I typed in NEB’s compatible cohesive ends table for their restriction enzymes, which consisted of many lines that looked like this:

compatible(spei, [avrii,nhei,styi,xbai], [bfai]).

This is an example of a fact in prolog. It consists of a label for the fact (compatible), and a series of arguments. The first argument is the enzyme, the second is the list of compatible enzymes (lists are enclosed in [], and the entries are separated by commas), and the third is the list of enzymes that recleave such a ligation. I can create as many facts as I like with the same label and different arguments. In this case, it’s one for every line of that NEB table. Then I can define rules which let Prolog derive more facts from the simple facts I have given it:

is_compatible(X,X) :- true.
is_compatible(X,Y) :- compatible(X,F,_), member(Y,F).


The first line says that any sites cleaved by the same enzyme are compatible. The second says two enzymes are compatible is the second is a member of the list of compatible enzymes of the first.

is_incompatible(X,Y) :- \+ is_compatible(X,Y).

And to be incompatible, they must not be compatible (\+ is ‘not’ in Prolog). Here’s another predicate which is fairly straightforwards:

is_recleaved_by(A,A,A) :- true.
is_recleaved_by(X,Y,Z) :- compatible(X,G,F),member(Y,G),member(Z,F).

Then I put in the list of the restriction enzymes I have on hand: ncutter(aatii). and so forth. I also declare two lists for sites in the plasmid: restriction sites which appear once in the plasmid, and sites that do not appear in the plasmid (among those enzymes I have on hand):

m361_1([aatii, ecori, hindiii, hpai, kpni, mlui, ndei, nhei, noti, pvuii, spei, sphi, xhoi, xmni]).
m361_0([asci, ecorv, ncoi, paci, sbfi, scai, sfoi, snabi]).
And some convenience predicates so I don’t have to dig into the lists all the time:

singleM361(A) :- m361_1(B), member(A,B).
zeroM361(A) :- m361_0(B), member(A,B).

As an exercise, try to figure out what these do:

unrecleavable_pair(A,B) :-
is_compatible(A,B),
\+ is_recleaved_by(A,B,A),
\+ is_recleaved_by(A,B,B).

incompatible_with(_,[]) :- true.
incompatible_with(X,[Y|Ys]) :-
is_incompatible(X,Y),
incompatible_with(X,Ys).

incompatible_set([_]) :- true.
incompatible_set([X,Y|Xs]) :-
incompatible_with(X,[Y|Xs]),
incompatible_set([Y|Xs]).

Then I encode most of my rules into one query:

enzyme_set(A,B,C,X) :-
ncutter(A), ncutter(B),
unrecleavable_pair(A,B),
ncutter(C), ncutter(X),
incompatible_set([A,C,X]),
incompatible_set([B,C,X]).

Now I ask for sets of A, B, C, and X which work:

?- enzyme_set(A,B,C,X), singleM361(A), singleM361(X), zeroM361(B), zeroM361(C).

Prolog gives me every possible set. I look at my plasmid map and pick one that seems reasonable (A=MluI, B=AscI, C=EcoRV, X=AatII). Then I cheat a bit, look for a pair of enzymes in the polylinker that only appear once in the plasmid, and which are incompatible with everything else (which I check in Prolog as well).

It’s not as automated as it could be, nor is it elegant, but compare this to searching through enzymes descriptions by hand! Nor should you be intimidated by the language. I began this with a bare minimum of Prolog, which I picked up in about two hours by reading the first three chapters of Clocksin and Mellish’s Programming in Prolog.

How to learn to program

Peter Norvig has addressed this question. Eric Raymond has his nice essay on the subject. But these don’t provide much guidance for the beginner who wants to know “which books do I sit down and churn through?”

The short answer: The Haskell Road to Maths, Logic, and Programming by Doets and van Eijck (what appears to be an earlier draft is available in PDF here), and then Abelson and Sussman’s Structure and Interpretation of Computer Programs (also available online).

The long answer:

Until you understand what Abelson and Sussman are about in their book, you’re missing the whole point of computer science. Peter Norvig wrote about it,

“…if you’re like me, you’re not looking for one more trick, rather you’re looking for a way of synthesizing what you already know and building a rich framework onto which you can add new learning over a career. That’s what SICP has done for me. I read a draft version of the book around 1982 and it changed the way I think about my profession.”

Unfortunately, SICP requires a fair degree of intellectual, indeed mathematical, maturity. That’s where Doets and van Eijck comes in. Hopefully it will guide you far enough up the slope where SICP makes sense.

After this, there remains only one development in programming languages which you won’t be able to take completely in stride: monads and all the apparatus around them. But that’s a story for another day.

BIO2010 (Part 3)

In my previous post on the Bio2010 report I gave some principles to guide curriculum design. Remember, the goal is a mature and sophisticated mind, not a knowledge of any particular subfield. I’m not qualified to select the content for such a curriculum, so please suggest changes. We’ll begin with a survey of the advanced undergraduate courses.

Neurobiology: I give neurobiology its own place because certain aspects of it are very mature and therefore a prime field for training minds. For similar reasons, immunology makes no appearance in this list, as the field is too contorted, at least as represented in the textbooks, to usefully train anyone.

The first milestone of the course is a physical understanding of the Hodgkin-Huxley model, and how it came to be. This will probably take half a semester.

After Hodgkin-Huxley, I am out of my depth. The two obvious directions are the essential features of the synapse, and the basics of neural architecture, probably using vision and hearing as the models. This brief discussion (warning: PDF) has some interesting references, but I have not had time to follow them up.

Molecular biology has two aspects. The first is structural understanding of the three major biopolymers (DNA, RNA, and protein); the second is designing protocols to manipulate them. Students should not only understand the structural differences between GC rich vs. AT rich helices (at the level of something like Calladine’s Understanding DNA), but also why each step in a miniprep is there, and how to calculate what it should be in the absence of a protocol. This means they need quantitative descriptions of sedimentation and separation by centrifugation, chromatography, and amplification by various kinds of PCR. Then move on to models of transcription and translation, and mechanics of molecular motors. This course will probably take about a year.

Evolution and ecology begins with population genetics (Gillespie’s Population Genetics seems about right), epidemiology (some selection from Diekmann and Heesterbeek’s Mathematical Epidemiology of Infectious Diseases), something about species interactions and evolution of behavior (a couple bits and pieces of Gintis’s Game Theory Evolving), bioenergetics and flows of energy carrying molecules both in organisms, which gives a chance to discuss metabolism, and in ecosystems. These selected topics constitute a semester.

The next semester is genetic mapping and screening (design of screens and selections, RFLP and other techniques) and phylogeny (constructing the universal phylogenetic tree, with visits to representative areas such as evolution of horses, gene transfer in bacteria, and parallel evolution of fluorescent proteins in corals).

Biomechanics
is the successor to anatomy, merged with mechanical engineering and developmental biology. It begins with a couple weeks of statics such as Galileo’s argument on the scaling of bones, pressures on bacterial membranes and cell walls, DNA pressure and packing in viral capsids.

Then it moves to dynamics: swimming of microscopic and macroscopic creatures, motility of cells by actin, circulation in the body, and basic models of cell migration and patterning in development.

What about supporting courses?

Single and multivariable calculus are necessary in all the courses, as are ordinary differential equations, with an emphasis on phase space analysis and qualitative features, but definitely including the Fourier transform. Partial differential equations are used only in biomechanics and evolution and ecology, and in neither place in really intricate ways, so they can be handled in those classes. Probability and stochastic processes appear in all of the courses except biomechanics, and possibly there as well depending on the exact selection of topics. It is also necessary to statistics.

Chemistry of ions in solutions is necessary for neurobiology, chemical thermodynamics for evolution and ecology and molecular biology. Organic chemistry is necessary to make sense of the pieces in molecular biology. A general chemistry course with a really physical bent is probably enough as a starting point.

Physics of point particles and rigid bodies, thermodynamics, and basic electromagnetism are all that need to be covered outside of the courses. I expect most science students to take introductory physics, chemistry, and biology as part of choosing their field of study, so these courses cannot be altered too much.

All of the classes can provide opportunities for students to dig into real data, so a separate data analysis and statistics course makes sense. I think a selection from Tukey’s Exploratory Data Analysis and Data Analysis and Regression and Kiefer’s Introduction to Statistical Inference would form a good basis for such a course. All science students should have such a course, so there is no reason to specialize it for biology.

The data analysis course and the evolution and ecology course both involve an understanding of programming and computational cost of algorithms. A computer science course on basic numerical analysis and search, sorting, and alignment algorithms would be a useful companion to both. Students should write all their own code for the assignments in this class, and use almost no libraries and certainly no algorithmic black boxes. About the only way to actually do this is in a semester is to teach the course in Scheme or a similar language.

What goes in the introductory course? My inclination is the first two lectures of material from each major section of the four courses, rearranged into a more unified year-long presentation.

What laboratory classes should students take? I would propose a separate introductory laboratory class of experiments that can be done in one or two sessions, separate from the introductory biology course, but covering what of that material is checkable in the lab in reasonable time.

Molecular biology is an obvious for a candidate for lab, learning how to prepare DNA, digest and religate it, run western blots, and do genetic engineering in E. coli.

Almost everyone uses a microscope, so a one semester laboratory on optical microscopy covering material something like Shinya Inoué’s Video Microscopy would be a good time investment. Then an intermediate lab where students carry out three or four longer experiments in small groups throughout the semester.

The Bio2010 report recommends a research seminar, which I think is a good idea. The Rockefeller University requires such a course of its graduate students. The class reads two of the classic papers in biology each week and meets with two faculty members over lunch to go over them.

In the end we have a schedule that looks like this:

First year:
Introductory biology (all year)
Introductory chemistry (all year)
Introductory physics (all year)
Single and multivariable calculus (all year)

Second year:
Introductory biology laboratory (all year)
Differential equations (fall) / Probability and stochastic processes (spring)
Neurobiology (fall) / Biomechanics (spring)
Numerical analysis (fall) / Data analysis and statistics (spring)

Third year:
Evolution and Ecology (all year)
Molecular biology (all year)
Microscopy lab (fall) / Intermediate lab (spring)
Research seminar (all year)

Fourth year:
Molecular biology laboratory (fall)

This leaves plenty of space for electives and general requirements. Now everyone have at and rip this to shreds.

BIO2010 (Part 2)

In an earlier post I gave my quibbles with the language of the BIO2010 report. I promised to lay out an alternative curriculum proposal, but first I’m going to set forth my underlying philosophy. First a few principles:

Repeat a canon several times.
Choose a core set of techniques and ideas which will properly shape the students’ minds. The shape of their mind when they come out is far more important than any particular collection of facts.

My sister once noted in surprise, the night before a calculus exam, "I can’t study for this class! I can only practice." There is no body of things that a science student should know, only a body of things he should be able to do. Skills and mindsets take much longer to teach than facts, so it is important to by stingy with what gets space in the curriculum. For example, given familiarity with the simple SIR epidemiology model and with partial differential equations from mechanics, there is no conceptual difficulty with adding spatial effects to the SIR model. Unless it is a step to some other compelling mental skill, ditch it.

If math is your language, don’t work in translation.
You would not expect students of Italian literature to have everything taught in English translation except for one course where they played with Italian translation. Just so, if mathematics isn’t the language in which you teach your classes, adding a random course will not fix the situation. Having students take carefully designed courses in math, computer science, and physics is not the right approach. They should only take those courses which impart tools they will use in every biology class from there on out. If no biology class in your department uses complex analysis, then don’t require the class of your students. If almost every class needs stochastic processes, then abstract that out and require a course in it. If no biology class in your department uses math, then if you find this state of affairs unacceptable, it falls upon you to correct it, not to delegate the task to a mathematician who spends his days worrying about category theory.
Prerequisite courses are for the universal, not the potentially useful.
Physicists have separate calculus courses not because it’s a good idea for students to know calculus, but because the tools are used in every physics course the students take, so it is more efficient to deal with the tools once, uniformly, and assume them from there on out. Topics like Laguerre polynomials which show up in only some of the following physics classes are simply taught alongside the physics, not abstracted out.

Does every course require the students to know what a calcium ion is, what it does in solution, and how it combines with things? Then general chemistry seems like a compelling prerequisite.

A course is the minimal mental pattern of a practitioner.
Students taking a course shouldn’t have to memorize facts. They are there to pick up a core of tools and structures which shapes their mind for research in a field. I propose a rule of thumb: if a practitioner uses something weekly, it should be in the introductory course; monthly, in the advanced undergraduate course; annually, in the graduate course.

Finally, the report wants physics and chemistry departments to modify their introductory courses for the needs of the biology department, or to offer special introductory courses. No. Many students are undecided as to their major in their first year, and take the introductory courses in all the sciences. I think each of these courses should be taught as if to a class full of majors in the department teaching the course. For instance, why should physics departments throw out relativity? And if an ostensible biology student seeing relativity decides that they like that more, shouldn’t they become a physics major?

I’ll post the curriculum proposal next.

(Update: Talking to my adviser, his comment was that the single most important thing that could be done would be to cut the administrative burden (not including teaching) of principal investigators from half to two thirds of their time to something much, much smaller. I think perhaps five to ten percent should be the upper limit.)

BIO2010 (Part 1)

The NIH and HHMI organized a much trumpeted report entitled BIO2010: Transforming Undergraduate Education for Future Research Biologists. It came out in 2003 to much acclaim. I reach the crux of it (chapter 2, "A New Biology Curriculum") last night. I don’t like it. I have some basic philosophical objections to their approach, and a lot of quibbles with the details as they stand. In this post I’ll go through the quibbles, and in later ones I’ll attack the underlying philosophy, and propose a curriculum in my turn.

The chapter is organized into a list of core concepts to be taught from biology, physics, chemistry, engineering, and math and computer science, then a set of suggested curricula. I’ve broken my gripes down by section:

Biology: Some of the concepts listed are important ("Biological systems obey the laws of chemistry and physics." and "Lipids assemble with proteins to form membranes, which surround cells to separate them from their environment. Membranes also form distinct compartments within eukaryotic cells.") and I dearly hope they are being taught already. Others are sloppy, tautological, or espouse absurdities such as "holistic" science.

Chemistry: I know almost know chemistry and am not willing to comment on this section except for the fact that "computational methods and modeling" is a bullet point right alongside Lewis structures.

Physics: The physics recommendations largely consist of a standard intro physics course for hard science majors with a few topics moved here or there. This isn’t a bad thing. I think that the best approach would be to take an 1960’s edition of Halliday and Resnick and go through that in a year (for those who don’t know it, the early editions of Halliday and Resnick were clear, comprehensive, and had a doable number of extremely well selected problems).

The report also wants the students to do learn by interacting with simulations. I can tell you from being on the receiving end of such an attempt and from anecdotal evidence in the community that such approaches are dismal failures. Let the course be analytical and experimental. Leave the computer out of it.

Engineering: Aside from the incredibly broad heading, this has about the only interesting, concrete recommendation in the report. The introduction was written by the biologists who are convinced that systems engineering is all that they might really want. Then someone wrote a curriculum outline for the first third of a really solid neurobiology course. Whoever wrote that, bravo!

Mathematics and Computer Science: Some of the topics confuse me (computability theory? why?). The gist of this section is to give the biology students special math classes where the computer can do all the work for them. Hamming said, "The computer is an extension of the body, not the mind." (I found this quote in the preface to McNeil’s Interactive Data Analysis) I agree that biologists should be able to program, and should really understand what BLAST does. This would be better taught as a one or two semester course on numerical analysis and relevant search methods, plus data structures such as ropes to handle long sequences. And it should be taught in Scheme (or this for the ambitious), not in "higher-level languages such as Matlab, Perl, or C" as the report recommends. Everyone who knows programming languages just stopped taking the computing recommendations of the report seriously with that list.

Apparently students should be taught data analysis. I completely agree. Astonishingly, John Tukey wrote some great books for this back in the ’70’s because he was teaching people data analysis. And it’s not just biologists who need this. Every scientist would benefit. Call up your local statistics department and get them to institute an "interdisciplinary" course based on John Tukey’s two classics Exploratory Data Analysis and Data Analysis and Regression, or whatever modernized version they want to teach.

The recommended math comes down to the following classes in the math department: single and multivariable calculus, linear algebra, differential equations (more like Arnol’d’s book than what the engineers are taught), and probability and stochastic processes (this would be a great place to use Nelson’s Radically Elementary Probability Theory). There, three years of math, a semester of computer science, and a semester of data analysis. That’s not so bad, is it?

The report also mentions that medical school admissions requirements govern a lot of what biology departments cover. My approach: ignore them. Physics departments do. If the biologists ignore the MCATs completely, then they will change to follow the biology.

The recommended solutions for dealing with not having the expertise to teach a course are frightening: "…taught…by a collaborating team of faculty from multiple departments" or "A mathematician or computer scientist might also be invited to give a guest lecture or two." Two lectures isn’t long enough to have any real effect on students’ mental processes, and team teaching makes the course scattered and disorderly. I speak from experience. Biology courses are often taught this way, and it’s absolutely useless.

There is an obsession throughout with having a course on modeling and simulation. I believe the rationale is that all the ugly mathematics stuff can be put into this, taught by one of those egg-headed math people, and the biology professors can go on doing exactly as they have been.

At one point it says, "Opportunities to learn mathematical skills in a rich content context will enhance conceptual understanding and procedural fluency." No it won’t. Math professors should (and usually do not) ask students, "Why is this theorem interesting?" If a student has not reached a level of mental abstraction sufficient to answer such a question in the context of pure mathematics, the mathematics education has failed. Later it even recommends remedial math courses before calculus! What are you doing in college if you need that?

Math for biologists

Here’s a post which poses some questions about mathematics training for biologists. I’m close enough where I figured I’d try to answer:

  1. Are you a biologist, if so what kind? Sort of. My PhD will be in biology, on the response of mycobacteria to antibiotics. My undergraduate degrees were in physics and math.
  2. What math did you take in college? Calculus through multivariable, ordinary differential equations, a two semester course on math methods for physics students covering special functions, complex analysis through contour integration, and some partial differential equations, real analysis through the last course they shove the doctoral students through, the probability course that comes just after, a bit of functional analysis, and a two semester graduate sequence in algebra.
  3. What math do you or have you used? My image analysis requires some partial differential equations, a lot of calculus, and a bit of differential geometry. I do some theoretical biology on the side which particularly takes the probability and random processes, and partial differential equations as the limiting cases of the former. My bench work itself doesn’t take more than highschool algebra, but designing and analyzing my experiments requires statistics, which I’ve been teaching myself. Before this, when I was training to be a mathematical physicist, I obviously used everything I knew and it wasn’t enough.
  4. What math do you wish you’d studied? I wish I had actually endured the second year of graduate algebra, and a graduate course on combinatorics. A couple semesters of statistics would have saved me a lot of time now, but I can fill in the gaps. I really wish I had taken serious courses on differential geometry and topology.
  5. How do you use math in your job (or research)? See above.

I think a case could be made that biologists should take all the math required to take a full scale mathematical statistics course including multivariate and nonparametric methods, and a heavy dose of experimental design. Let’s see: real analysis through measure theory and Lebesgue integration and Stokes’s theorem on manifolds, a hefty course on probability from the axiomatic basis to the beginnings of random processes, and then blast through statistics because a t-test would only require about ten minutes and a homework problem, and so would a U-test. This is four years (two of analysis, one of probability, one of statistics), but it’s only one course a term.

Admittedly, biology departments would suddenly have as many undergraduates as physics departments.