Professional Documents
Culture Documents
Sequence analysis
Fig. 1. Multiple alignment of the DOMON superfamily. Proteins are represented by their gene names, species abbreviations and GIs. The coloring
reflects the consensus at 80% conservation calculated from a more extensive alignment. Ligand interacting residues are shaded red. Consensus
abbreviations are h: hydrophobic, s: small, p: polar, h: hydrophobic and b: big. Refer to the Supplementary Material for species abbreviations.
scop.mrc-lmb.cam.ac.uk/scop/). All the retrieved sequences strands of the classical Ig-like fold, including two additional
were classified into distinct families by first clustering with N-terminal strands, and an extra strand in the ligand binding
the BLASTCLUST program (Supplementary Material) and sheet and (3) a characteristic long loop between strands 5 and 6
then further combining the clusters using uniquely shared of the conserved core that folds against the ligand-binding
sequence features and shared domain architectures. By this we -sheet and provides an interface for ligand contact. Several
obtained at least nine distinct protein families: classical other -sandwich domains have been previously identified as
DOMON (of which the fungal CDH is a member), EDH
- carbohydrate (e.g. Cellulose binding domain II and III)- or
cytochrome domain, CbsA/cytochrome b558/556, Hajella heme (e.g. Cytochrome f)-binding modules (http://scop.mrc-
HCH_03667-like, NirT C-terminal-like, Shewanella SO2192- lmb.cam.ac.uk/scop/). However, the DOMON domain differs
like, CBD9-like, Gibberella zeae FG07921.1-like and from all of them, both in terms of the position and specific
Glucodextranase C-terminal domain-like families (Table 1 in mode of ligand interaction, and number of strands in the
Supplementary Material). Hereinafter, we refer to this unified -sandwich. These suggest innovation of specific ligand-binding
monophyletic assemblage of domains as the DOMON features in the DOMON superfamily after their divergence
superfamily. from the generic group of ligand-binding -sandwich domains.
A comprehensive multiple alignment of the superfamily The defining conserved sequence features of the DOMON
shows that the Ig-like -sandwich DOMON superfamily has superfamily (Figs 1 and 2) include: (1) multiple hydrophobic
10–11 strands, and shares several unique structural features residues that contribute to the hydrophobic core of the strands
(Figs 1 and 2). These include: (1) a common ligand-binding of the -sandwich, and small residues found at the boundaries
interface. (2) Several additional strands beyond the core seven of strands and loops. (2) A strongly conserved charged residue
2661
L.M.Iyer et al.
(usually arginine/lysine) at the end of strand 9. The strong As suggested by previous studies, the two heme-binding
conservation of this non-ligand-binding residue suggests that it versions EDH
and CDH contain a conserved methionine in
may have a structural role, such as stabilizing the loop between the curved loop between conserved strands 5 and 6, which is
strands 9 and 10 or mediating conformational changes and directly linked to the heme (Hallberg et al., 2000; Kloer et al.,
(3) a polar residue (usually histidine, lysine or arginine), that 2006). These heme-binding versions also share a histidine or
interacts or coordinates ligands. lysine residue present in the beginning of the terminal strand
that directly contacts the ligand. The primary ligand-contacting
residue in the sugar-binding CBD9 family is a conserved
2.2 Deciphering the DOMON domain’s function: arginine that occupies the same position as the heme-interacting
evidence from sequence and structure H/K residue of the above versions, and makes contacts with the
The above characterization of the DOMON superfamily polar groups of the sugar moiety. The CBD9 family also
resulted in identification of previously experimentally char- contains a unique conserved tryptophan in a large insert in the
acterized versions binding different ligands. Of these EDH
last strand, and was previously shown to stack against the sugar
and CDH bind a single heme moiety, NirT-C binds a di-heme (Notenboom et al., 2001; Figs 1 and 2). At least six of the
cofactor and the CBD9 either glucose or cellobiose (Devreese nine families, namely DOMON, EDH
-like, cytochrome b558/
et al., 2000; Hallberg et al., 2000; Kloer et al., 2006; 556-like, HCH_03667-like, NirT C-terminus-like and Gibberella
Notenboom et al., 2001). Four of these, representing diverse zeae FG07921.1-like families, have the conserved ligand-
families of this domain, have crystal structures, with three of contacting histidine or lysine at the base of the terminal
them containing a bound soluble ligand (Fig. 2). In all these strand. Of these, most members of the DOMON, EDH
-like
structures the ligand is bound in a strikingly similar fashion, and cytochrome b558/556-like families also contain the
in a comparable pocket formed by one of the sheets of the methionine residue in the insert between strands 5 and 6,
-sandwich (Fig. 2). The presence of a similarly bound ligand, strongly supporting a heme-binding function for these versions.
and conservation of the essential structural features (Figs 1 Despite lacking the methionine, the corresponding inserts of the
and 2) required for the maintenance of the binding pocket, NirT C-terminus and HCH_03667 families contain conserved
suggest that soluble ligand binding is likely to be the conserved histidine residues which could provide an alternative ligand for
function of this domain. To understand the shared ligand- heme (Fig. 1). Some members of the former family also contain
binding features of the superfamily, we extracted all ligand- a conserved insert between strands 1 and 2 with two cysteine
interacting residues from the available structures and compared and histidine residues that might contribute to binding a second
them with the conservation pattern seen in the multiple heme. The functionally obscure Gibberella zeae FG07921.1-like
alignment. proteins have a conserved tyrosine in the insert in place of the
2662
The DOMON domains
methionine. Given that tyrosine has been observed as a heme- Experimental studies on the fungal CDH have shown that the
ligand in unrelated heme-binding Ig-fold proteins, it is possible heme bound by the DOMON domain transfers electrons to the
that it functionally substitutes the methionine (Fig. 1). flavin ligand of the oxidoreductase domain during oxidation
The Glucodextranase C-terminal domain family shares the of cellobiose or cello-oligosaccharides (Stoica et al., 2006).
conserved arginine at the base of the terminal strand and the The animal SDR2-like proteins that are fused to a cytochrome
distinctive insert in the terminal strand with the conserved B561 domain have been shown to function as ferric reductases
tryptophan with the CBD9 family, suggesting a similar sugar- (Vargas et al., 2003). Together, these observations suggest that
binding function. The Shewanella SO2192-like family is also the heme-binding versions of the DOMON domains are
related to the sugar-binding versions. In spite of sharing cytochromes mediating electron transfers in redox reactions.
a common ligand, very few of the other residues lining the This prediction offers an important functional clue regarding the
binding site in the DOMON domains of CDH and EDH
are several animal extracellular matrix proteins in which the
conserved across both the known and predicted heme-binding DOMON domain is fused to other adhesion modules such as
versions. Likewise few residues beyond those described above EGF, reelins, trypsin-inhibitor and SEA domains and the poly-
are shared by binding sites of the known or predicted sugar- DOMON domain proteins (Aravind, 2001). An examination of
2663
L.M.Iyer et al.
bacteria followed by possible dispersion through lateral transfer Conflict of Interest: none declared.
to eukaryotes and certain archaea. Divergence of the domain
into heme- and sugar-binding versions also appears to have
occurred in the bacteria. At least three distinct families of the REFERENCES
DOMON superfamily have been transferred to eukaryotes Aravind,L. (2001) DOMON: an ancient extracellular domain in dopamine
of which the classical DOMON and the Gibberella zeae beta-monooxygenase and other proteins. Trends Biochem. Sci., 26, 524–526.
Devreese,B. et al. (2000) Primary structure characterization of a Rhodocyclus
FG07921.1-like families are present in a wide range of
tenuis diheme cytochrome c reveals the existence of two different classes of
eukaryotes suggesting an early transfer. Likewise, we were low-potential diheme cytochromes c in purple phototropic bacteria. Arch.
also able to identify several bacterial versions (Supplementary Biochem. Biophys., 381, 53–60.
Material) of the DM13 domain, previously known only from Gross,R. et al. (2005) Site-directed modifications indicate differences in axial
haem c iron ligation between the related NrfH and NapC families of
animals. Thus, the DM13 domain and the classical family of
multihaem c-type cytochromes. Biochem. J., 390, 689–693.
the DOMON domain, both ultimately of bacterial origin, Hallberg,B.M. et al. (2000) A new scaffold for binding haem in the cytochrome
appear to have proliferated in animal extracellular proteins. domain of the extracellular flavocytochrome cellobiose dehydrogenase.
We believe the identification of the DOMON domain as Structure, 8, 79–88.
2664