{"id":83238,"date":"2016-06-13T14:00:49","date_gmt":"2016-06-13T14:00:49","guid":{"rendered":"http:\/\/healthmedicinet.com\/i\/classification-of-polyhedral-shapes-from-individual-anisotropically-resolved-cryo-electron-tomography-reconstructions\/"},"modified":"2016-06-13T14:00:49","modified_gmt":"2016-06-13T14:00:49","slug":"classification-of-polyhedral-shapes-from-individual-anisotropically-resolved-cryo-electron-tomography-reconstructions","status":"publish","type":"post","link":"http:\/\/healthmedicinet.com\/i\/classification-of-polyhedral-shapes-from-individual-anisotropically-resolved-cryo-electron-tomography-reconstructions\/","title":{"rendered":"Classification of polyhedral shapes from individual anisotropically resolved cryo-electron tomography reconstructions"},"content":{"rendered":"<p id=\"Par4\" class=\"Para\">Cryo electron microscopy (cryo-EM) involves imaging biological samples flash frozen at cryogenic temperatures using a transmission electron microscope (TEM). Cryogenic freezing in a frozen-hydrated state prevents the biological sample from structurally deforming during sample preparation [<span class=\"CitationRef\">1<\/a><\/span>]. Unlike traditional TEM or X-ray crystallography, which also offer molecular to atomic level resolution, cryo-EM thus enables the imaging of macromolecular complexes, assemblies, cells and even tissues in a near native state [<span class=\"CitationRef\">2<\/a><\/span>].<\/p>\n<p>Cryo-electron tomography (cryo-ET) collects data by exposing the sample to an electron beam over multiple tilting angles (Fig.\u00a0<span class=\"InternalRef\">1<\/a><\/span> a), enabling 3-D reconstruction of individual objects from the resulting 2-D projections. This reconstruction, which involves inversion of the 3-D Radon transform, does not require a priori assumptions about the objects structure [<span class=\"CitationRef\">2<\/a><\/span>]. However, there are a number of factors which limit the resolution of cryo-ET reconstructions. First, due to limitations in the degree of tilt of the mounting stage, the incomplete range of view angles causes a \u201cmissing wedge\u201d in the Fourier (projection) domain data [<span class=\"CitationRef\">2<\/a><\/span>]. This in turn causes resolution of the 3-D reconstruction perpendicular to the sample surface to be worse than in the plane of the sample surface (Fig.\u00a0<span class=\"InternalRef\">1b<\/a><\/span>). Secondly, the total amount of radiation damage is proportional to the number of view angles times the radiation dose of the incident beam at a single view angle [<span class=\"CitationRef\">2<\/a><\/span>]. This multiplicative effect imposes strict constraints on intensity of the incident beam [<span class=\"CitationRef\">3<\/a><\/span>]. The consequently low and uneven resolution means that it can be difficult to identify the structure of individual objects from their cryo-ET reconstructions.<\/p>\n<figure class=\"Figure\" id=\"Fig1\">\n                    <img decoding=\"async\" src=\"https:\/\/static-content.springer.com\/image\/art%3A10.1186%2Fs12859-016-1107-5\/MediaObjects\/12859_2016_1107_Fig1_HTML.gif\" alt=\"https:\/\/static-content.springer.com\/image\/art%3A10.1186%2Fs12859-016-1107-5\/MediaObjects\/12859_2016_1107_Fig1_HTML.gif\" \/><figcaption class=\"Caption\" lang=\"en\"><span class=\"CaptionNumber\">Fig. 1<\/span><\/p>\n<p class=\"SimplePara\">Cryo-EM tomography: <strong class=\"EmphasisTypeBold\">a<\/strong> Schematic of single axis cryo-EM tomography. The (small) sample placed on a stage which is progressively tilted along a plane in a typical range of?\u00b1?60\u00b0, while exposed to an electron beam. Each tilt angle generates a planar projection image, which are collectively processed using an algorithm such as filtered back projection to generate a 3-d reconstruction of the original object. Due to the limited range of tilt angles, the reconstruction doesn\u2019t have uniform resolution across the object. Typically, opposite ends (top and bottom) of the 3-d reconstruction in one direction have poor contrast resolution. <strong class=\"EmphasisTypeBold\">b<\/strong> Mid-level cross-section from a 3-d cryo-EM reconstruction of an <em class=\"EmphasisTypeItalic\">E. Coli<\/em> cell. The small enclosures (black arrows) are micro-compartments (MC). The red arrow shows the cell membrane. Note that part of the cell membrane is missing. The scale bar is 200\u00a0nm<\/p>\n<\/figcaption><\/figure>\n<p>Here we investigate the structure of bacterial micro-compartments (BMC): thin walled protein enclosures inside bacterial cells which separate certain metabolic pathways from the remaining cytoplasm [<span class=\"CitationRef\">4<\/a><\/span>]. Previous cryo-ET analyses of a class of BMCs known as carboxysomes, in two other strains of bacteria suggest that they have a polyhedral, specifically icosahedral, external structure [<span class=\"CitationRef\">5<\/a><\/span>, <span class=\"CitationRef\">6<\/a><\/span>]. Visual inspection of reconstructed slices (Fig.\u00a0<span class=\"InternalRef\">1b<\/a><\/span>) and 3-d volume rendering (Fig.\u00a0<span class=\"InternalRef\">2f<\/a><\/span>) for our BMCs also suggest a convex polyhedral structure. In recombinant BMCs in <em class=\"EmphasisTypeItalic\">E.coli<\/em>, we demonstrate large variation in size and shape across copies within the same bacteria.<\/p>\n<figure class=\"Figure\" id=\"Fig2\">\n                    <img decoding=\"async\" src=\"https:\/\/static-content.springer.com\/image\/art%3A10.1186%2Fs12859-016-1107-5\/MediaObjects\/12859_2016_1107_Fig2_HTML.gif\" alt=\"https:\/\/static-content.springer.com\/image\/art%3A10.1186%2Fs12859-016-1107-5\/MediaObjects\/12859_2016_1107_Fig2_HTML.gif\" \/><figcaption class=\"Caption\" lang=\"en\"><span class=\"CaptionNumber\">Fig. 2<\/span><\/p>\n<p class=\"SimplePara\">Extraction of polyhedral graph from cryo-EM reconstructions: <strong class=\"EmphasisTypeBold\">a<\/strong> Upper level cross-sectional slice showing an MC with boundary partially visible due to poorer resolution. The scale bar is 50\u00a0nm. <strong class=\"EmphasisTypeBold\">b<\/strong> Hand-drawn segmentation showing interior (deep green), exterior (yellow). Light green indicates a conservatively drawn region of uncertainty (caused by poor resolution) in which we suspect the object boundary lies. This region of uncertainty is used to constrain the 3-d reconstruction. <strong class=\"EmphasisTypeBold\">c<\/strong> Stacked hand drawn boundaries from slices along the z-axis. Note that boundary information is completely missing for slices above and below this stack. <strong class=\"EmphasisTypeBold\">d<\/strong> Volume rendering of regularized least squares reconstruction of object using data from stack in (<strong class=\"EmphasisTypeBold\">b<\/strong>). Note missing wedge on right hand side. <strong class=\"EmphasisTypeBold\">e<\/strong> Volume rendering of regularized least squares reconstruction of object using data from stacks of slices along x, y and z-axis. <strong class=\"EmphasisTypeBold\">f<\/strong> Ball and stick diagram of polyhedral graph (PG) for object in (<strong class=\"EmphasisTypeBold\">e<\/strong>), drawn using Chimera. Blue balls are observed vertices. Red lines are completed edges. Yellow lines are incomplete edges. Green balls are ends of incomplete edges<\/p>\n<\/figcaption><\/figure>\n<p id=\"Par7\" class=\"Para\">Previous methods identifying structure from cryo-ET reconstructions with missing wedge involves extracting multiple subvolumes (subtomograms) of the structure of interest, and then \u2018averaging\u2019 them, after appropriate alignment, to improve the resolution [<span class=\"CitationRef\">7<\/a><\/span>]. Another approach is by matching subtomograms against a high resolution template [<span class=\"CitationRef\">8<\/a><\/span>]. The limitations of present methods are thus: i) the structure of the template needs to be known\/guessed in advance or ii) subtomogram averaging fails to capture variation of shapes across multiple copies of the object. Instead, we propose to realize the full potential of cryo-ET by identifying shapes using data from individual objects, without averaging of any sort or a priori assumptions about its shape. Further, we examine how accurately shapes can be identified from reconstructions of poor resolution using this methodology.<\/p>\n<p id=\"Par8\" class=\"Para\">With a polyhedral structure in mind, we represent object shapes as incomplete polyhedral graphs (PG), i.e. a set of vertices connected by edges. We have developed a pipeline for extracting the PG from cryo-ET reconstructions. We also identify a library of reference polyhedra to which these objects should belong. Shape identification can thus be achieved by classifying the observed incomplete PG (which can be subject to measurement error) to one of the reference polyhedra. Apart from BMCs, these techniques could potentially also be applied to other biological objects exhibiting polyhedral structure, such as several types of viruses [<span class=\"CitationRef\">1<\/a><\/span>, <span class=\"CitationRef\">7<\/a><\/span>, <span class=\"CitationRef\">9<\/a><\/span>, <span class=\"CitationRef\">10<\/a><\/span>] and protein complexes such as clathrin [<span class=\"CitationRef\">11<\/a><\/span>].<\/p>\n<p id=\"Par9\" class=\"Para\">Developing optimal classification rules in this setting raises a number of methodological questions. Firstly, we need an appropriate stochastic model for PGs. Stochastic models for graphs typically assume that their edges are generated by a random process, e.g. Gaussian graphical models [<span class=\"CitationRef\">12<\/a><\/span>], exponentially generated random graphs [<span class=\"CitationRef\">13<\/a><\/span>] or stochastic block models [<span class=\"CitationRef\">14<\/a><\/span>]. In our library, two regular polyhedra can differ from each other in only a face or two. It would be complicated to define a stochastic model at edge level which could capture such small differences. Instead, we propose to model the observed PG as an incompletely sampled version of an underlying deterministic complete PG. Based on this model, we propose a method for estimating the sampling distribution of an incomplete PG for cryo-ET reconstructed images. Given that PGs are typically high dimensional, a general non-parametric density estimate would appear to suffer from the curse of dimensionality [<span class=\"CitationRef\">15<\/a><\/span>]. However, the highly structured form of PGs allows us to treat the sampling distribution like a discrete random variable with limited support, enabling convergence of the proposed density estimate at the parametric rate.<\/p>\n<p id=\"Par10\" class=\"Para\">A second issue relates to how to incorporate information from edges that are only partially visible due to poor resolution (e.g. Fig.\u00a0<span class=\"InternalRef\">2a<\/a><\/span>): we can only identify one vertex of such an edge. Such edges cannot be incorporated into the adjacency matrix, which is commonly used to encode and analyse graphs [<span class=\"CitationRef\">16<\/a><\/span>]. To address this, we develop statistics for incomplete PGs, somewhat analogous to those used for right censored data.<\/p>\n<p id=\"Par11\" class=\"Para\">Finally, previous approaches to classification with incomplete data involve a strategy of data augmentation, writing the posterior density as: <em class=\"EmphasisTypeItalic\">p<\/em>(<em class=\"EmphasisTypeItalic\">y<\/em><br \/>\n                        <sub><em class=\"EmphasisTypeItalic\">i<\/em><br \/>\n                <\/sub>|<em class=\"EmphasisTypeItalic\">x<\/em><br \/>\n                        <span class=\"stack\"><br \/>\n                  <sub><em class=\"EmphasisTypeItalic\">i<\/em><br \/>\n                  <\/sub><sup><em class=\"EmphasisTypeItalic\">o<\/em><br \/>\n                  <\/sup><\/span>)?=??<em class=\"EmphasisTypeItalic\">p<\/em>(<em class=\"EmphasisTypeItalic\">y<\/em><br \/>\n                        <sub><em class=\"EmphasisTypeItalic\">i<\/em><br \/>\n                <\/sub>|<em class=\"EmphasisTypeItalic\">x<\/em><br \/>\n                        <span class=\"stack\"><br \/>\n                  <sub><em class=\"EmphasisTypeItalic\">i<\/em><br \/>\n                  <\/sub><sup><em class=\"EmphasisTypeItalic\">o<\/em><br \/>\n                  <\/sup><\/span>,?<em class=\"EmphasisTypeItalic\">x<\/em><br \/>\n                        <span class=\"stack\"><br \/>\n                  <sub><em class=\"EmphasisTypeItalic\">i<\/em><br \/>\n                  <\/sub><sup><em class=\"EmphasisTypeItalic\">m<\/em><br \/>\n                  <\/sup><\/span>)<em class=\"EmphasisTypeItalic\">p<\/em>(<em class=\"EmphasisTypeItalic\">x<\/em><br \/>\n                        <span class=\"stack\"><br \/>\n                  <sub><em class=\"EmphasisTypeItalic\">i<\/em><br \/>\n                  <\/sub><sup><em class=\"EmphasisTypeItalic\">m<\/em><br \/>\n                  <\/sup><\/span>|<em class=\"EmphasisTypeItalic\">x<\/em><br \/>\n                        <span class=\"stack\"><br \/>\n                  <sub><em class=\"EmphasisTypeItalic\">i<\/em><br \/>\n                  <\/sub><sup><em class=\"EmphasisTypeItalic\">o<\/em><br \/>\n                  <\/sup><\/span>)<em class=\"EmphasisTypeItalic\">dx<\/em><br \/>\n                        <span class=\"stack\"><br \/>\n                  <sub><em class=\"EmphasisTypeItalic\">i<\/em><br \/>\n                  <\/sub><sup><em class=\"EmphasisTypeItalic\">m<\/em><br \/>\n                  <\/sup><\/span>, where <em class=\"EmphasisTypeItalic\">y<\/em><br \/>\n                        <sub><em class=\"EmphasisTypeItalic\">i<\/em><br \/>\n                <\/sub> is the <em class=\"EmphasisTypeItalic\">i<\/em>-th class label, <em class=\"EmphasisTypeItalic\">x<\/em><br \/>\n                        <span class=\"stack\"><br \/>\n                  <sub><em class=\"EmphasisTypeItalic\">i<\/em><br \/>\n                  <\/sub><sup><em class=\"EmphasisTypeItalic\">o<\/em><br \/>\n                  <\/sup><\/span> and <em class=\"EmphasisTypeItalic\">x<\/em><br \/>\n                        <span class=\"stack\"><br \/>\n                  <sub><em class=\"EmphasisTypeItalic\">i<\/em><br \/>\n                  <\/sub><sup><em class=\"EmphasisTypeItalic\">m<\/em><br \/>\n                  <\/sup><\/span> are the observed and missing features respectively [<span class=\"CitationRef\">17<\/a><\/span>]. The difficulty in implementing this approach lies in constructing an appropriate model for <em class=\"EmphasisTypeItalic\">p<\/em>(<em class=\"EmphasisTypeItalic\">x<\/em><br \/>\n                        <span class=\"stack\"><br \/>\n                  <sub><em class=\"EmphasisTypeItalic\">i<\/em><br \/>\n                  <\/sub><sup><em class=\"EmphasisTypeItalic\">m<\/em><br \/>\n                  <\/sup><\/span>|<em class=\"EmphasisTypeItalic\">x<\/em><br \/>\n                        <span class=\"stack\"><br \/>\n                  <sub><em class=\"EmphasisTypeItalic\">i<\/em><br \/>\n                  <\/sub><sup><em class=\"EmphasisTypeItalic\">o<\/em><br \/>\n                  <\/sup><\/span>). Because the PG is uniquely specified by the polyhedron type, it is natural to first condition on <em class=\"EmphasisTypeItalic\">y<\/em><br \/>\n                        <sub><em class=\"EmphasisTypeItalic\">i<\/em><br \/>\n                <\/sub>, i.e. obtain <em class=\"EmphasisTypeItalic\">p<\/em>(<em class=\"EmphasisTypeItalic\">x<\/em><br \/>\n                        <span class=\"stack\"><br \/>\n                  <sub><em class=\"EmphasisTypeItalic\">i<\/em><br \/>\n                  <\/sub><sup><em class=\"EmphasisTypeItalic\">m<\/em><br \/>\n                  <\/sup><\/span>|<em class=\"EmphasisTypeItalic\">y<\/em><br \/>\n                        <sub><em class=\"EmphasisTypeItalic\">i<\/em><br \/>\n                <\/sub>,?<em class=\"EmphasisTypeItalic\">x<\/em><br \/>\n                        <span class=\"stack\"><br \/>\n                  <sub><em class=\"EmphasisTypeItalic\">i<\/em><br \/>\n                  <\/sub><sup><em class=\"EmphasisTypeItalic\">o<\/em><br \/>\n                  <\/sup><\/span>) and then take an expectation over the polyhedron class, i.e. <em class=\"EmphasisTypeItalic\">p<\/em>(<em class=\"EmphasisTypeItalic\">x<\/em><br \/>\n                        <span class=\"stack\"><br \/>\n                  <sub><em class=\"EmphasisTypeItalic\">i<\/em><br \/>\n                  <\/sub><sup><em class=\"EmphasisTypeItalic\">m<\/em><br \/>\n                  <\/sup><\/span>|<em class=\"EmphasisTypeItalic\">x<\/em><br \/>\n                        <span class=\"stack\"><br \/>\n                  <sub><em class=\"EmphasisTypeItalic\">i<\/em><br \/>\n                  <\/sub><sup><em class=\"EmphasisTypeItalic\">o<\/em><br \/>\n                  <\/sup><\/span>)?=??<em class=\"EmphasisTypeItalic\">p<\/em>(<em class=\"EmphasisTypeItalic\">x<\/em><br \/>\n                        <span class=\"stack\"><br \/>\n                  <sub><em class=\"EmphasisTypeItalic\">i<\/em><br \/>\n                  <\/sub><sup><em class=\"EmphasisTypeItalic\">m<\/em><br \/>\n                  <\/sup><\/span>|<em class=\"EmphasisTypeItalic\">y<\/em><br \/>\n                        <sub><em class=\"EmphasisTypeItalic\">i<\/em><br \/>\n                <\/sub>,?<em class=\"EmphasisTypeItalic\">x<\/em><br \/>\n                        <span class=\"stack\"><br \/>\n                  <sub><em class=\"EmphasisTypeItalic\">i<\/em><br \/>\n                  <\/sub><sup><em class=\"EmphasisTypeItalic\">o<\/em><br \/>\n                  <\/sup><\/span>)<em class=\"EmphasisTypeItalic\">p<\/em>(<em class=\"EmphasisTypeItalic\">y<\/em><br \/>\n                        <sub><em class=\"EmphasisTypeItalic\">i<\/em><br \/>\n                <\/sub>). The marginal probability <em class=\"EmphasisTypeItalic\">p<\/em>(<em class=\"EmphasisTypeItalic\">x<\/em><br \/>\n                        <span class=\"stack\"><br \/>\n                  <sub><em class=\"EmphasisTypeItalic\">i<\/em><br \/>\n                  <\/sub><sup><em class=\"EmphasisTypeItalic\">m<\/em><br \/>\n                  <\/sup><\/span>|<em class=\"EmphasisTypeItalic\">x<\/em><br \/>\n                        <span class=\"stack\"><br \/>\n                  <sub><em class=\"EmphasisTypeItalic\">i<\/em><br \/>\n                  <\/sub><sup><em class=\"EmphasisTypeItalic\">o<\/em><br \/>\n                  <\/sup><\/span>) thus becomes dependent on the class of polyhedra chosen, making it a circular formulation. Instead, we propose a simpler procedure based solely on observed (incomplete) data. By modelling the incompleteness as a censoring mechanism, we propose a simulation based estimate of the probability density <em class=\"EmphasisTypeItalic\">p<\/em>(<em class=\"EmphasisTypeItalic\">x<\/em><br \/>\n                        <span class=\"stack\"><br \/>\n                  <sub><em class=\"EmphasisTypeItalic\">i<\/em><br \/>\n                  <\/sub><sup><em class=\"EmphasisTypeItalic\">o<\/em><br \/>\n                  <\/sup><\/span>|<em class=\"EmphasisTypeItalic\">y<\/em><br \/>\n                        <sub><em class=\"EmphasisTypeItalic\">i<\/em><br \/>\n                <\/sub>). We construct the Bayes classifier using this density estimate and demonstrate that this classifier is accurate for most polyhedra.<\/p>\n<p id=\"Par12\" class=\"Para\">Extraction of the PG from tomographic reconstructions involves a number of processing steps such as vertex and edge identification, which are potentially liable to error, e.g. missing vertices and edges. We show the accuracy of the Bayes classifier seriously deteriorates in the presence of such errors. We propose two strategies for robust inference in this setting: i) selection of PG features, such as local topology, which are nearly preserved despite random missing edges or vertices ii) use of distance based classification methods, such as support vector machines (SVM), which can recognize near preservation. The methodology is illustrated by application to a set of <em class=\"EmphasisTypeItalic\">E. coli<\/em> MCs and the results are compared to those obtained for other types of bacteria.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Cryo electron microscopy (cryo-EM) involves imaging biological samples flash frozen at cryogenic temperatures using a transmission electron microscope (TEM). Cryogenic freezing in a frozen-hydrated state prevents the biological sample from structurally deforming during sample preparation [1]. Unlike traditional TEM or X-ray crystallography, which also offer molecular to atomic level resolution, cryo-EM thus enables the imaging <a class=\"read-more-link\" href=\"http:\/\/healthmedicinet.com\/i\/classification-of-polyhedral-shapes-from-individual-anisotropically-resolved-cryo-electron-tomography-reconstructions\/\">Read More<\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[],"tags":[],"class_list":["post-83238","post","type-post","status-publish","format-standard","hentry"],"_links":{"self":[{"href":"http:\/\/healthmedicinet.com\/i\/wp-json\/wp\/v2\/posts\/83238","targetHints":{"allow":["GET"]}}],"collection":[{"href":"http:\/\/healthmedicinet.com\/i\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"http:\/\/healthmedicinet.com\/i\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"http:\/\/healthmedicinet.com\/i\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"http:\/\/healthmedicinet.com\/i\/wp-json\/wp\/v2\/comments?post=83238"}],"version-history":[{"count":0,"href":"http:\/\/healthmedicinet.com\/i\/wp-json\/wp\/v2\/posts\/83238\/revisions"}],"wp:attachment":[{"href":"http:\/\/healthmedicinet.com\/i\/wp-json\/wp\/v2\/media?parent=83238"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"http:\/\/healthmedicinet.com\/i\/wp-json\/wp\/v2\/categories?post=83238"},{"taxonomy":"post_tag","embeddable":true,"href":"http:\/\/healthmedicinet.com\/i\/wp-json\/wp\/v2\/tags?post=83238"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}