THE STRUCTURE AND ORGANIZATION OF A MOUSE PROLINE-RICH PROTEIN GENE (REGULATION, DNA, CLONING)

DAVID KONG ANN, Purdue University

Abstract

One of the family of mouse proline-rich protein genes was isolated on a cloned 3.6 kilobase pair EcoR1/Bg1II-generated DNA fragment from a (partial) Sau3A bacteriophage "library" of CD-1 mouse chromosomal DNA. Phage harboring the gene were identified by plaque hybridization using ('32)P-labeled cDNA inserts from clones pRP33 and pMP1, obtained from rat and mouse, respectively. The transcriptional unit includes three exonic sequences separated by 1434 base pairs (Intron I) and 450 base pairs (Intron II). The complete primary structure of the gene and the 5' and 3' flanking regions were determined by the Maxam and Gilbert sequencing method. Analysis of the nucleotide sequence 5' to the first exon indicates several sequences that might be involved in the induction and expression of a mouse proline-rich protein. The derived amino acid sequences revealed that this protein contains thirteen tandemly repeat regions of fourteen amino acids in length with prototype sequence PPPPGGPQPRPPQG. Each amino acid within the repeat has its favorite codon, and the consensus DNA sequence for each repeat is CCACCACCACCAGGAGGCCCACAGCCGAGACCCCCTCAAGGC. The high(' )conservation in both nucleotide and amino acid sequences in the repeat region suggests that this gene may have evolved by gene duplication of a 42 base pair internal repeat and by gene conversion.

Degree

Ph.D.

Subject Area

Biology

Off-Campus Purdue Users:
To access this dissertation, please log in to our
proxy server
.

Share

COinS