Systems and Signals Group: 2012

Wednesday, 3 October 2012

Exploring noise in cellular biology

We're used to thinking about machines as robust, hard-wearing objects made from solid materials like metal and plastics. If they crack, split or overheat they are liable to malfunction, and if we subject them to too much jostling and shaking we're asking for trouble. However, the biochemical machines responsible for keeping us alive work in a rather different world -- they're made from soft, organic materials, and contained in a disorganised bag (the cell) that is constantly shaken, bumping our machines against each other and other cellular inhabitants. How can the delicate processes required by living organisms take place in this chaotic environment? And how can scientific progress be made in such a tumultuous, unpredictable world?

Extrinsic factors can modulate the stability of essential, but noisy, cellular circuits

Iain recently wrote an article, targeted at a broad audience, looking at some of these questions. One of the most important cellular processes that has to take place in this chaotic world is that of 'gene expression': the interpretation of genetic blueprints which describe how to build cellular machinery, and the subsequent construction process. Gene expression can be likened to using a bad photocopier to copy books from a library that opens and closes randomly, then using these photocopies (which are prone to decay) to construct machines. This problematic environment gives rise to many medically important random effects, including bacterial resistance to antibiotics and differing responses to anti-cancer drugs. We are particularly interested in how fluctuating power supplies (see our other blog articles here, here and here!) influence the cell's ability to produce these machines, and what effects this unreliable power has on medically important processes. The article -- available here and appearing in the expository magazine Significance -- takes a look at how cellular noise arises, current techniques for its detection and analysis, and its influence on important biological phenomena. Iain

Sunday, 23 September 2012

Organizing networks using their dense regions

Many systems in fields ranging from biology to sociology, to politics and finance can be represented as networks. For example, in protein interaction networks each node represents a protein and each link, connecting a pair of nodes, quantifies the strength of the interaction between those proteins. Similarly, in political voting networks nodes represent politicians and the edges connecting pairs of politicians represent the similarity of their legislative voting records. Despite the significant differences in the underlying systems, the common network representation enables researchers in different fields to ask questions that can be surprisingly similar. Given this, it would be useful to have a systematic method to highlight similarities in networks from different fields to identify problems that might be tackled using the same techniques. For example, if a biological network representing covariation in neural activity in different regions of the brain could be shown to be structurally similar to a financial network representing correlation of stock returns, certain analytical tools and models might be applicable to both problems.

A taxonomy of networks

In our paper, we tackle this problem by first developing a method to quantify the similarity of different networks based on their community structure. A community in a network, loosely put, is a set of nodes which are more connected to each other than they are to the rest of the network (like a group of friends who have the majority of the social interactions with each-other). We introduce the idea of “mesoscopic response functions” which are curves that summarize the community structure of each network at different scales and enable us to define a single number that quantifies the similarity of network pairs. Importantly, this approach allows us to compare networks with different numbers of nodes and different link densities. We then use this similarity measure to construct taxonomies of networks. From an historical perspective, classification of objects in this way has been central to the progress of science, as demonstrated by the periodic table of elements in chemistry and the phylogenetic tree of organisms in biology.

The taxonomies constructed using our approach are successful at grouping networks that are known to be similar. For example, political voting networks for the US Congress, UK House of Commons and United Nation are clustered together in the same group. Perhaps more importantly, the method also identifies networks that are not grouped with members of the same class and are therefore unusual in some way. For example, a Facebook network for Caltech is not grouped with the Facebook networks of other universities. We also used the technique to detect historically significant financial and political changes in temporal sequences of networks; we found the stock market network corresponding to the 1987 crash and the voting network corresponding to the American Civil War to stand out from their respective sequences of networks.

You can read the full story in our paper “Taxonomies of networks from community structure” in Physical Review E 86, 036104 (2012) In the paper, we demonstrate the range of fields in which this approach can be usefully applied using a set of 746 networks and case studies that include US Congressional voting, Facebook friendship, fungal growth, United Nations voting, and stock market return correlation networks. Dan, JP and Nick

Friday, 21 September 2012

How conserved are protein-protein interactions? And why would you want to know?

A comparison of biological sequences from multiple species shows a great deal of evolutionary conservation. An overall question of interest is the following: what is the connection between similarities in biological sequence between species and similarities in the function of their components and cells?

We started to think about this question in the specific context of proteins: the construction blocks of cells which are specified by a sequence (of amino acids). If two human proteins are known to physically interact (stick to each other) will their equivalent (homologous) proteins in mouse physically interact? (To say that two proteins are homologous means that they are similar through common evolutionary descent: in some sense, they are the 'same' protein).

The answer, it is often assumed, is 'yes': a fairly similar sequence that specifies the protein makes for a fairly similar function of that protein. Indeed, partly because new sequence data is being generated at a much faster rate than any other type of data, it is common practice to 'transfer' functional knowledge (such as interaction partners) from a functionally characterised protein to its unstudied matches in other species. But is this legitimate?

If we know the interaction between proteins in the network of the green organism what can we say about the interactions of similar proteins in the blue organism's network?

An answer could also shed light on more theoretical questions.
- If only small changes in sequence can lead to new interactions between proteins, then this could be a fast evolutionary mechanism to generate new functions.
- Homologs are also found within a species - do these maintain the same interactions (a form of robustness), or rapidly lose them (release from evolutionary selection)?

Our paper just appeared in PLoS Computational Biology with the title "What Evidence is There for the Homology of Protein-Protein Interactions?". Our results returned some expected conclusions: more closely related species have more conserved interactions; the more stringent the definition to consider two proteins as homologs, the more conservation observed. An overall conclusion was that, at definitions of homology/similarity frequently used in the community, conservation of interactions is low, and hence 'transferred' functional annotations should be used with care. We also compared the transfer of interactions between and within species, and found within-species transfers were less reliable than between-species transfers. Using our method we also made some guesses as to the rate at which protein-protein interactions are lost through evolutionary time and about the total number of interactions that are present between all the proteins in an organism.

Our work is preliminary in many ways: better attempts at dealing with interaction data errors could be made; we treated all the interactions as independent of each other, which of course they are not; we didn't compare our derived rate of loss of interactions with other evolutionary rates; and much else besides! Hopefully someone else will pick up where we have left off... Anna and Nick

Predicting network flows

Many biological, geophysical and technological systems involve the transport of material over a network by bulk fluid flow (advection) and diffusion within that fluid. The analogy is that ink spilled in the middle of a river both spreads out symmetrically by diffusion (even if the river were stationary) and also gets transported bodily with the flow of the river (advection). Bulk fluid transport systems are found in the vast majority of multi-cellular organisms, as the component cells of such organisms require resources for metabolism and growth, and the speed of diffusion alone is often such that it is only an effective means of exchange at microscopic length scales. Molecules of interest are carried by advection and diffusion through the networks that make up fungi, the blood vessel networks of animals, the xylem and phloem elements of plants, and various body cavities of many different animals. Advection and diffusion are also fundamental to transport in geological and technological systems, such as rivers and drainage networks, gas pipelines, sewer systems and ventilation systems.

In all of these cases the particles of interest diffuse within a moving fluid, which is constrained to flow within a given network. Furthermore, the molecules that are carried through the network may be consumed or delivered out of the network at a particular rate. For example, glucose molecules are carried through the blood, and at each point in the network there is some probability that a given glucose molecule will be transported out of the vascular system and into the surrounding tissue. We have recently developed an algorithm for predicting how the spatial distribution of nutrients in a network will vary over space and time, when the resource in question is subject to given rates of advection, diffusion and delivery. We explain the algorithm in our paper "Advection, diffusion, and delivery over a network" that recently appeared in Physical Review E.

(a) Phanerochaete velutina in a 24cm x 24cm microcosm, photographed just before radio-tracer was dripped onto the inoculum.

(b) Data from the photon counting camera. The brightness of the image reflects the concentration of the tracer in each part of the network.

(c) Digitized network, coloured to indicate the tracer concentration. Concentration is measured in arbitrary units, and edges that could not be measured are coloured black.

(d) Predicted concentration measured in arbitrary units, under the assumption that the tracer enters the network at the inoculum at a constant rate, each edge in the final network continues to grow (or shrink) at the same rate that was observed over the final time step, and 10% of each edge is occupied by transport vessels.

(e) Predicted intensity in arbitrary units under the same assumptions as diagram d), except that in this case we assume that 20% of each edge is occupied by transport vessels.

We are particularly interested in modeling the movement of radio-labelled tracers in growing fungal networks. As mentioned in a previous post, we hypothesize that within fungal networks, there is a bulk movement of fluid from the sites of water uptake to the sites of growth. To test this hypothesis, we allowed the fungi Phanerochaete velutina to grow on a dish for a four week period, taking photographs every three days. An image analysis program was then used to convert the sequence of photographs into a sequence of networks, comprised of edges of measured length and volume.

After taking a final photograph of our fungi, we added a radio-labelled tracer, placed a scintillation screen over the network, and used a photon counting camera to see where the tracer moved. This experiment gave us empirical data which we could use to evaluate our model of transport in fungal networks. Our model has one free parameter, corresponding to the fraction of each edge that is occupied by transport vessels. We found that our model (see Fig. d) of growth-induced mass flows was remarkably good at predicting where the tracers would spread (compare to Fig c), if we make the biologically plausible assumption that the fluid flows occur within transport vessels that occupy 10% of each edge in the network. Luke and Nick

Thursday, 5 April 2012

Educated guesses about ancestral shapes and sounds: what do dead languages like Latin and Greek sound like?

What might ancient creatures have looked like? What would dead languages have sounded like? And what are the evolutionary relationships between currently observed shapes and sounds? While we have widely accepted methods that allow us to speculate (in an educated fashion) about ancestral genetic sequences we don't have well developed approaches for shapes and functions.

I proposed to John Moriarty that we attempt to extend sequence inference to functions, and we got a grant with some most excellent colleagues John Aston, Dorothy Buck and Vincent Macaulay. John and I wrote a paper in which we investigated this using the versatile mathematical tools that are Gaussian Processes. We showed that in some controlled settings we could take (functional) observations from the world and make sensible guesses about what their ancestors might have been. If you want to see a video of us implementing an experiment with the help of some school children then click here (a blog specifically about our school engagement is here). More or less, our task is to take the game of telephone and run it backwards to identify the original sound (a sound can be viewed as a curve, or function on the line, or as, e.g., a spectrogram, a function on the plane).

But once you suppose you can reconstruct original sounds from mutated versions then one might hope to engage with some big and old questions: what do dead languages like Latin and Greek sound like? Can we use observations of contemporary speech sounds made at different leaves of linguistic trees (see the picture below) "to put probability distributions over" (make educated guesses about) possible ancestral speech sounds? One takes an audio recording of the same (sufficiently homologous) word in multiple different languages and attempts to make (probabilistic) inferences about the corresponding ancestral sounds.

On the left is Schleicher's original tree of Indo-European languages, from 1860. On the right is a numerical experiment where, given knowledge of the three black curves at the bottom and the evolutionary tree (thick black object) we can put a probability distribution over possible ancestral curves and sample from that distribution (red curve is the mean, blue a measure of standard deviation and dotted black is a sample from that distribution).

We just wrote a relatively non-technical paper in Trends in Ecology and Evolution "Phylogenetic inference for function-valued traits: speech sound evolution" (free version here and not free version here) with authors: John Aston (Warwick Stats) Dorothy Buck (Imperial Maths), John Coleman (Oxford Phonetics), Colin Cotter (Imperial Aero), NJ, Vincent Macaulay (Glasgow Stats), Norman Macleod (Natural History Museum), John Moriarty (Manchester Maths), Andrew Nevins (UCL Linguistics). In this we suggest that we have all the tools to try to reconstruct ancient speech (we also have lots of people with strong opinions about what ancient speech might have sounded like). We also use the paper to emphasise that this approach could allow us to reconstruct evolutionary trees from (functional) data. John Coleman says that they're (informally) calling this activity of reconstructing past speech sounds, necro-phonetics. I think that's neat. Nick

Tuesday, 20 March 2012

Taking the pulse of cellular power stations

Discharge and recharge: why cellular power stations might pulse

An abstract representation (acrylic on canvas) of a single
mitochondrion undergoing a 'pulse'. Its change in energy status
is shown by the change in colour that we have also observed by
microscopy using fluorescent sensors. Artist: Markus S

We've just written an article in the journal Plant Cell about pulsing cellular power-stations and will motivate it by an analogy. Imagine we have a reservoir of water, and this water flows downhill through an outlet pipe, turning a turbine and producing energy. In this thought experiment, we're faced with a problem: the only way we can get water into our reservoir is by pumping it into the bottom of the reservoir. The higher up a reservoir is, the harder it is to pump water up there and the higher the risk of pumps overheating and getting damaged.

The problem can be solved by allowing the height of our reservoir to vary. If we lower our reservoir, it will become easier to fill, and the higher water pressure that arises from an increasingly filled reservoir will partly compensate for the fact that turbine-turning water will flow downhill from a lowered height, while allowing the pumps to relax and cool.

This model is a crude representation of mitochondria, the power stations of the cell, which use energy from respiration to create an energetic gradient across their membranes -- like a natural version of an AA battery. In our picture, this corresponds to the pumps feeding into our reservoir -- and in the cell, these pumps produce dangerous chemicals when they are overworked. The gradient they produce imbues protons with energy that is part electrical -- which we picture as the height of our reservoir -- and part chemical -- which we picture as the amount of water in our reservoir. These protons then flow through a protein complex -- the turbine -- to produce ATP, the universal cellular fuel.

When mitochondria pump many protons, their "reservoirs" rise, with the increase in height forcing the pumps to work harder to pump water into the reservoir. This work produces dangerous chemicals which can damage the cell and the mitochondria themselves (called reactive oxygen species - they're what antioxidants try to combat). We have found a new mechanism by which this risk is decreased: if mitochondria are having to work hard, they "pulse", spontaneously lowering the height of their reservoir. This decreases the amount of work that the mitochondrial pumps have to do to fill the reservoir. The amount of turbine-turning energy per unit of water decreases, but as it becomes easier to fill the reservoir, more water gets pumped into it, partly compensating for the loss of height by an increase in volume. The pulsing process thus lowers the reservoir but fills it with more water, allowing the mitochondrial pumps to relax and reducing the production of dangerous chemicals.

We observed these pulses, spontaneous decreases of mitochondrial membrane potential, in Arabidopsis thaliana, a model plant species used in many biological contexts. Treating plant mitochondria with a variety of chemicals and observing the effects on pulsing, we deduced a biochemical mechanism by which pulsing occurs: a controlled influx of cations such as calcium ions into the mitochondrial matrix decreases membrane potential. We also found that pulsing is increased when plants face stressful environments: if they are suddenly heated, for example, or exposed to toxic chemicals. This novel mechanism may help explain some of the variability that our cellular engines exhibit and may be an important discovery in considering how mitochondria react to dangerous cellular conditions. You'll find the article free by following this link. This article about single mitochondria complements our work on the mitochondrial population of the cell - we blogged about that here and here. Iain, Markus & Nick.

Tuesday, 24 January 2012

How cellular power stations might fluctuate

Mitochondria are the tiny engines of eukaryotic cells (the cells of animals, plants and fungi) -- responsible for producing the energy vital for fundamental processes of life. A recent explosion of experimental results has shown that their behaviour is far more dynamic and rich than any man-made engine. They move through cells, fuse into large networks, break apart, replicate and get degraded if they don't perform adequately.

Mitochondrial populations are also observed to differ significantly between otherwise similar cells: one cell may possess many efficient engines, while another must make do with a small number of inefficient ones (see our blog entry here). As well as explaining why the error-bars in biology can be so big, this cell-to-cell variability in mitochondria can lead to profound medical consequences: many diseases are known to result from low quality mitochondria, unable to produce enough energy for a cell to remain healthy. Mitochondria (and mechanisms to keep their quality high) have been implicated in ageing and diseases like Parkinson's, Alzheimer's, diabetes and cancer.

We have recently produced a mathematical model (coupled to some new experimental data) to give an explanation as to how experimentally observed variability in mitochondrial populations arises and explore its potential consequences: ranging from differences in the rate at which fundamental biochemical elements like mRNAs and proteins are produced, to differences in the stability and ultimate fate of stem cells. Our model is physically simple (we suggest several future experimental directions which would help in further development) and can mathematically be combined with other descriptions of cellular processes, providing a general framework to investigate the biologically and medically important effects of mitochondrial variability. It appeared in PLoS Computational Biology (an open-access journal) under the title "Mitochondrial Variability as a Source of Extrinsic Cellular Noise" but you can also find it here. Iain and Nick.

Systems and Signals Group