LOCUS AAAGLOBIN 3775 bp DNA INV 15-APR-1994 DEFINITION Anadara trapezia globin gene, complete cds. ACCESSION L20699 NID g472794 KEYWORDS globin. SOURCE Anadara trapezia blood DNA. ORGANISM Anadara trapezia Eukaryotae; mitochondrial eukaryotes; Metazoa; Mollusca; Bivalvia; Pteriomorphia; Arcoida; Arcidae; Anadara. REFERENCE 1 (bases 1 to 3775) AUTHORS Titchen,D.A., Glenn,W.K., Nassif,N.T., Thompson,A.R. and Thompson,E.O.P. TITLE A minor globin gene of the bivalve mollusc Anadara trapezia JOURNAL Biochim. Biophys. Acta 1089, 61-67 (1991) MEDLINE 91223099 REFERENCE 2 (bases 1 to 3775) AUTHORS Glenn,W.K., Titchen,D.A., Thompson,E.O.P. and Mackinlay,A.G. TITLE Characterization of a repeated sequence occurring within an intron of a globin gene in the bivalve mollusc Anadara trapezia JOURNAL Unpublished (1993) FEATURES Location/Qualifiers source 1..3775 /organism="Anadara trapezia" /cell_type="erythrocyte" /tissue_type="blood" GC_signal 115..122 /note="This consensus sequence is displaced from its usual distance from the start point of transcription (cap site) by approximately 150bp." CAAT_signal 265..>273 TATA_signal 354..369 prim_transcript 383..3758 /note="putative" 5'UTR 383..446 exon <447..571 /number=1 /codon_start=1 CDS join(447..571,1786..2005,3441..3554) /codon_start=1 /db_xref="PID:g472795" /translation="MSTFGELANEVVNNSYHKDLLRLSWGVLSDDMEGTGLMLMANLF NMSPESRLKFGRLGHLSTGRDNSKLRGHSITLMYALKNFVDALDDVDRLKCVVEKFAV NHINRQISAEEFGKIVGPFRAVLRIRMGDYFDEEIVAAWAALIAVVQAAL" intron 572..1785 /number=1 exon 1786..2005 /number=2 /codon_start=1 intron 2005..3439 /number=2 repeat_region 2160..2263 /note="consists of 3 copies of a 35bp sequence" /citation=[2] /rpt_type=direct repeat_region 2762..3384 /note="dimeric repeat of approximately 311bp repeat units" /rpt_family="Alu-like" exon 3440..>3553 /number=3 /codon_start=1 3'UTR 3554..>3731 polyA_signal 3730..3735 polyA_site 3758 /note="putative" BASE COUNT 1178 a 601 c 649 g 1347 t ORIGIN 1 ctattaattt tgtcctattt ccaagaaaat aaattaagtt taaatactcc ttttatatgc 61 cttattatta tttgcttata tgtagatcgt ccaggcgtga atgttctagt aattcggggc 121 ccaaatggtt acctggaatc aaatttacta tgacaaatgt tttttttatt ctatatacag 181 atgttagctt gaaacacaac aaagacaaga aatacttttg cgttggatgt ctgaataatg 241 aatatatctg aacgattttt ttttcaatca attctttata aatgttatta gcactatgtc 301 atgttcaaaa tagcttttat gcaaagtaga tgtaaaatgc ctgtacatgc atgatttaaa 361 aaaaaattac gttgaaaata agacttcata tgtagatata cctttataaa gatgatgttg 421 aatgtagtgc ccttaaatat ttcagaatga gtacatttgg tgagttagct aacgaagtcg 481 taaacaactc ttaccacaaa gacctactga ggctaagctg gggagtatta tctgacgaca 541 tggaaggtac tggattgatg cttatggcga agtaagtgtg ataataaaaa aacaaactaa 601 ctaatacatg tccgaattct cgtatttctt tatcactgca ttttgttcta tctcgtcgcc 661 tataattcaa ataaagttta aatcatataa tcagtctcat tatttttata gtcacatatc 721 gccatggaaa cgaggcctgg tgtttcagga gggtaagcaa ctccttttag gtaacgaatt 781 ttgcaatgac attgtatttt aatgacataa ataagatgtt caatgacgtt gataatacta 841 tgagttatta ttgacgtgca aaacaaacgt ataaagtgtg ttaattcaaa tatgcataaa 901 atgttgcatt ggttatttta ttctaaacat tgaattacgg ttttatctcg gaatgccaaa 961 ctttacagag tttgggtttt tatttgtatt tgtgtttact gcatttgata aaagatgttt 1021 ttctgtactt catttttaaa agattatttt atttcattct ttaattatcg catctagtat 1081 aataaatgtt atccaaaata ctcattttta tgtttgaagt ttatttaatc ggatcagttt 1141 tttctctttg tctgccattg ctgccattca ggctgacatc tattttagat atgcatgtat 1201 ttgcagtctt tttaatatga gcctagaatc caattaaaat ttggtcgtca ggaacatttt 1261 ttcttttttg tgtgtcattg ctgatattca aacttacaac tttttaatta tattaccgtc 1321 agtttgatac aagcccagtc gagtaatagt gtttaacgtc acagtcaaga gtatttcact 1381 tatatcgaga cgtcgccagc tatagccaaa gatgaggaaa ttttgcctgt gcgttgagct 1441 gatagcatta tacaacaggg ttctttagcg tgccaaaccc acaatgacac agaacctcgt 1501 ttattttaag tgcccatccg taagaacctt cactttaatt tcgtaatgcc gagtgcttgg 1561 cgaaagaaca gccactaccc actattaaat cttgggtcag acggcggcaa gggatcgaac 1621 acacgacctc ccacttacgg agcgaggcat tctaccaact gagcttcagc agtggttgat 1681 ataagactag tatccagatt aaaatttggt cgtattgaac atttttctct tggtccatca 1741 taactgttat tcaaactaac atcgcaattc tttatatatt cgtagtctgt ttaatatgag 1801 tccagagtcc aggttgaagt tcggtcgtct gggacattta tcaactggaa gagataatag 1861 taaattaagg ggacattcca ttaccttgat gtacgccctc aagaacttcg ttgatgctct 1921 ggatgacgta gacagactaa agtgtgttgt agaaaaattt gccgtcaacc atatcaacag 1981 acaaatctct gctgaagaat ttggggtaag tgatattgca actacattaa aatataataa 2041 aaactacttt aattatatcc tcttttattt caatttcaca cgtgtcaagc gtagttagct 2101 aggcttaatt gcctactctt cttaagcacc gacgtctttt gccgttttgt agattgtgaa 2161 ttgaatattg acctgtatgg tagtgtttaa tttcattgaa tattgacctg tatggtagtg 2221 tttaatttca ttgaatattg acctgtatgg tagtgtttaa tttcattgaa tatcaaaata 2281 acaaacgatg gctggtttag ggcatgaaca gaagtgattt agctcgttta atgacgtagt 2341 ttatcgttaa aatgagacac ttttcgttat tattagaaac tatcttctta taacgaccta 2401 gtattacgaa attaattctt gttgtaacga gatagtttct tgttataatg aaaaagggtc 2461 tagtttttac ggcgtatgat ttcgttataa cgatataaat aaaaacattc aagtccttta 2521 ccagccacca aacaaaccat atttattggg ttttttttta ctttctgatg aatgacttcc 2581 ataaaaactc tgtttatatt aattactcta ccgagttcta agcacaaata gtttatacat 2641 gtatattcga aggaagtaca aacttatatt ccattttatt agctcaactg gctatagcca 2701 gcttagctta tggcatagtg aagcgtgcgt tcgtgcatct gtctgtgcgt tatctttgat 2761 cttcttctca gaaaccactg agccaattga aatcaaattt ggcgtgaaaa gtccttaggt 2821 ggagtagatc aaagttcgtg tatggcaacc ttgtatgaaa tccaaaatga ctgccgtaac 2881 tataaatagc aatatcacta aaatggacat tatgcattaa attcgtcacc cattctcttc 2941 atttttaaaa cagagaaagt atatagctag ctatttttga tatgtaacgg gtttttccga 3001 caattttttc attgactaat ttttgctcat ttatgcacat gaggtctgat ttttgaccaa 3061 atctttaaaa atttcttctc agaaaccaca atgccaactg caatcaaatt tggtgtggaa 3121 ggtccctagg tggagaaaat caaacaagct tgtgcaaggt tatcatgtat gaaatcaaag 3181 atgaccgctg ttactataaa tagcaaaatc actaaaatgg cccttgtgta tttatttttt 3241 caccgatttt ctccatattt gaaacttata aagtaaaaat cctagccatt taatatatgc 3301 gagttttccg agaaattttt ctttgacctt attttagcaa atgagttcaa attatgccag 3361 ttgagcatta caggcctttt gtttatattt tagctggatt cagtgagcgt atagtaaatt 3421 caagaatctt tatattttag aaaatagttg gaccatttag agcggtacta aggattagaa 3481 tgggagacta tttcgatgaa gaaattgttg ccgcatgggc tgcactcatt gctgttgttc 3541 aagctgcatt atgacttgct agattatata agtaaaaatt gaagtatgac tgaactagca 3601 ctcagtgttt tatgcattaa acattttggt ataataaatg atatcctgat gtttaatatt 3661 gatagtagtt cattatgtaa tgatctgtta aaacttcttg attaacttgt ctatattctt 3721 gcaaaatcaa aataaaacca gtttgtaatt agctatgcag tgttgtttgt ttcaa //