Type I Restriction-Modification enzymes are composed of three
subunits, encoded by the three host specificity
determinant (hsd) genes; the hsdR gene is
absolutely required for restriction activity; the hsdS gene is responsible for DNA specificity and together
with hsdM is the minimal genetic requirement for methylation
activity.
However, the holoenzyme can also behave as a methylase,
the activity of the enzyme is dependent upon the
methylation status of the DNA substrate. Therefore, the
enzyme must control the
opposing functions of restriction and modification.

hsdR - restriction gene, phenotype of mutations = r-m+.
hsdM - modification gene, phenotype of mutations = r-m-.
hsdS - specificity gene, phenotype of mutations = r-m-.
The DNA-binding subunit of a Type I R-M system consists of two target recognition domains separated by conserved domains, which are highly conserved between family members.

For the EcoR124I R-M system the recognition sequence (sR124) is 5'-GAAnnnnnnRTCG-3'. TRD1 is responsible for recognition of the 5'-end of the recognition sequence (GAA), while TRD2 recognises the 3'-end (RTCG).
The conserved regions contain two highly conserved repeats (shown by the arrows) that were originally described by Argos (1983) as the regions of the protein likely to recognise the target DNA sequence. However, this has now been shown not to be the case (e.g. Gubler, 1992).
Careful analysis of the arrangement of these repeated units in all Type I HsdS subunits indicates that all the subunits consist of a similar set of sub-repeats and this has led to a prediction of a circular structure for the HsdS subunit (Kneale 1994). To test this model, Janscak & Bickle (1998) circularly permuted the HsdS subunit of the Type IB R-M enzyme EcoAI, at the DNA level, by direct linkage of codons for the original termini and introduction of new termini elsewhere along the N-terminal and central conserved regions. By analysing the activity of these mutant enzymes, two circularly permuted variants of HsdS, which had termini located at equivalent positions in the N-terminal and central repeats, respectively, were found to fold into a functional DNA recognition subunit with wild-type specificity. This result suggests that the N- and C-termini of the native protein are in close proximity.
Recently, two crystal structures have been solved for HsdS
proteins from the uncharacterised Type I R-M systems MjaXIP (Kim
et al., 2005) and MgeORF438 (Calisto
et al., 2005) respectively. While these HsdS subunits are not known to be
components of functional restriction enzymes, and their DNA specificity is
unknown, the structure has confirmed the circular nature of the HsdS subunit and
the close proximity of the N- and C-termini as indicated by
Janscak & Bickle.
The central conserved region has been subjected to intensive
mutagenesis using deletion mutagenesis, site-directed mutagenesis and
PCR-misincorporation mutagenesis. This has allowed us to show that this region
is critical for protein-protein interactions with the other subunits of the
endonuclease (Abadjieva et
al., 1994). TRD1 and the central conserved region are sufficient to
assemble an active enzyme (Abadjieva
et al., 1993;
Macwilliams & Bickle, 1996). This reflects the circular structure of the
HsdS subunit and the deletion mutants assemble through
dimerisation of a M1S½
subunit. Since the recognition sequence of this deletion mutant is
GAAnnnnnnnTTC, and TRD1 recognises the GAA component of the sR124
sequence, it is apparent that each half subunit recognises the opposite strand
on the DNA strand.
The Type IC DNA methyltransferase M.EcoR124I
recognises the nonpalindromic DNA sequence GAAn6RTCG.
Taylor et al. (1994)
have used small angle X-ray scattering to investigate the solution structure of
the methyltransferase and of complexes of the enzyme with unmethylated and
hemimethylated 30 bp DNA duplexes containing the specific recognition sequence.
A major change in the quaternary structure of the enzyme was observed following
DNA binding, based on a decrease in the radius of gyration and a reduction in
the maximum dimension of the enzyme. The structural transition observed is
independent of the methylation state of the DNA. CD shows that there is no
change in the secondary structure of the protein subunits when DNA is bound. In
contrast, there is a large increase in the CD signal arising from the DNA,
suggesting considerable structural distortion which may allow access to the
bases targeted for methylation. We propose that DNA binding induces a large
rotation of the two HsdM subunits towards the DNA, mediated by hinge bending
domains in the specificity subunit HsdS.
van Noort et al. (2004) have shown that the large structural distortion in the DNA is due to DNA bending and that the MTase induces a DNA bend of 49º. This bend is comparable to that observed with EcoKI, which is also the angle of the 'mimic' protein OCR (Walkinshaw et al 2002), which acts an antirestriction protein (Atanasiu et al 2002), and is presumed compete for the DNA target site within the enzyme.
Type I restriction-modification enzymes recognise bipartite sequences, which consist of a 5' specific region of 3-4 bp and a 3' specific region of, usually, 4 bp, separated by a non-specific spacer of 6-8 bp (e.g. EcoR124I recognises GAAn6RTCG - where R = purine and n = any nucleotide). DNA cleavage by Type I R-M enzymes occurs at non-specific sites far from their recognition sequence, which is mediated by ATP-dependent DNA translocation past the enzyme (Yuan et al., 1980; Studier & Banyopadhyay 1888; Dryden et al., 1997; Szczelkun et al., 1997).
| Enzyme Name |
Family |
Recognition Sequence |
| CfrI |
Type IB |
GCANNNNNNNNGTGG |
| Eco377I | - | GGANNNNNNNNATGC |
| Eco585I | - | GCCNNNNNNTGCG |
| Eco646I | - | CCANNNNNNNCTTC |
| Eco777I | - | GGANNNNNNTATC |
| Eco826I | - | GCANNNNNNCTGA |
| Eco851I | - | GTCANNNNNNTGAY |
| Eco912I | - | CACNNNNNTGGC |
| Eco1158I | - | TGANNNNNNNNTGCT |
| Eco1265I | - | TGANNNNNNNNTGCT |
| Eco1323I | - | GGANNNNNNNNATGC |
| Eco1341I | - | CCANNNNNNNCTTC |
| Eco1342I | AACNNNNNNGTGC | |
| Eco1344I | AACNNNNNNGTGC | |
| Eco1344II | GGANNNNNNNNATGC | |
| Eco1348I | GGANNNNNNTATC | |
| Eco1383I | CCANNNNNNNCTTC | |
| Eco1386I | GGANNNNNNNNATGC | |
| Eco1394I | AACNNNNNNGTGC | |
| Eco1412I | GGANNNNNNTATC | |
| Eco1413I | CCANNNNNNNCTTC | |
| Eco1422I | CCANNNNNNNCTTC | |
| Eco1424I | CCANNNNNNNCTTC | |
| Eco1427I | GGANNNNNNNNATGC | |
| Eco1430I | GGANNNNNNNNATGC | |
| Eco1432I | CCANNNNNNNCTTC | |
| Eco1441I | TGANNNNNNNNTGCT | |
| Eco1443I | TGANNNNNNNNTGCT | |
| Eco1446I | GAGNNNNNNNGTCA | |
| Eco1447I | TGANNNNNNNNTGCT | |
| Eco1455I | GCANNNNNNCTGA | |
| Eco1456I | GGANNNNNNNNATGC | |
| Eco1476I | GGANNNNNNNNATGC | |
| Eco14444I | TGANNNNNNNNTGCT | |
| EcoAI | Type IB | GAGNNNNNNNGTCA |
| EcoA0ORF42P | GGANNNNNNNNATGC | |
| EcoBI | Type IA | TGANNNNNNNNTGCT |
| EcoDI | Type IA | TTANNNNNNNGTCY |
| EcoDR2 | TCANNNNNNGTCG | |
| EcoDR3 | TCANNNNNNNATCG | |
| EcoDXXI | Type IC | TCANNNNNNNRTTC |
| EcoEI | GAGNNNNNNNATGC | |
| EcoKI | Type IA | AACNNNNNNGTGC |
| EcoR124I | Type IC | GAANNNNNNRTCG |
| EcoR124II | Type IC | GAANNNNNNNRTCG |
| EcoRD2 | GAANNNNNNRTTC | |
| EcoRD3 | GAANNNNNNNRTTC | |
| EcoW3110ORF4339P | AACNNNNNNGTGC | |
| EcoprrI | Type IC | CCANNNNNNNRTGC |
| EsaSS290P | GAGNNNNNNNATGC | |
| KpnAI | Type ID | GAANNNNNNTGCC |
| KpnBI | CAAANNNNNNRTCA | |
| NgoAV | GCANNNNNNNNTGC | |
| StyLTIII | GAGNNNNNNRTAYG | |
| StySBLI | CGANNNNNNTACC | |
| StySEAI | ACANNNNNNTYCA | |
| StySENI | CGANNNNNNTACC | |
| StySGI | TAANNNNNNRTCG | |
| StySJI | GAGNNNNNNGTRC | |
| StySKI | CGATNNNNNNNGTTA | |
| StySPI | Type IA | AACNNNNNNGTRC |
| StySQI | AACNNNNNNRTAYG |
Last modified on
24 July 2010
© University of Portsmouth
Author Dr Keith Firman.