GENSCAN 1.0 Date run: 3-Nov-116 Time: 14:11:52 Sequence gi568815581f:72021392_72224384 : 202993 bp : 43.80% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.00 Prom + 5958 5997 40 -1.86 1.01 Init + 15151 15200 50 0 2 83 103 70 0.986 8.42 1.02 Intr + 17146 17228 83 1 2 71 85 43 0.321 1.58 1.03 Intr + 23750 23863 114 0 0 71 65 56 0.063 2.02 1.04 Intr + 30670 30753 84 2 0 85 47 57 0.027 1.09 1.05 Term + 46125 46252 128 1 2 60 48 78 0.022 -0.56 1.06 PlyA + 47479 47484 6 1.05 2.05 PlyA - 48562 48557 6 1.05 2.04 Term - 50399 50244 156 2 0 54 41 161 0.789 5.93 2.03 Intr - 53299 53158 142 0 1 80 100 67 0.861 7.46 2.02 Intr - 68090 67930 161 2 2 77 41 104 0.104 3.39 2.01 Init - 73523 73485 39 2 0 90 72 -6 0.051 -1.73 2.00 Prom - 76222 76183 40 -3.86 3.00 Prom + 76854 76893 40 -4.56 3.01 Init + 82286 82360 75 1 0 67 61 33 0.121 -0.41 3.02 Intr + 85972 86106 135 0 0 22 80 71 0.231 0.46 3.03 Intr + 95480 95603 124 1 1 98 41 105 0.870 6.96 3.04 Intr + 95664 95941 278 1 2 90 40 136 0.659 6.24 3.05 Intr + 99911 100431 521 1 2 43 75 1054 0.747 91.35 3.06 Intr + 101328 101581 254 0 2 44 100 538 0.776 47.68 3.07 Term + 102152 102996 845 0 2 112 55 1472 0.999 139.37 3.08 PlyA + 103739 103744 6 1.05 4.00 Prom + 104784 104823 40 -4.56 4.01 Init + 108212 108284 73 1 1 55 77 25 0.249 -0.67 4.02 Intr + 110293 110369 77 2 2 92 111 -7 0.187 1.23 4.03 Term + 125282 125914 633 1 0 -55 38 1719 0.999 146.89 4.04 PlyA + 126224 126229 6 1.05 5.00 Prom + 126566 126605 40 -9.46 5.01 Init + 127233 127320 88 2 1 67 96 84 0.688 7.90 5.02 Term + 137232 137374 143 1 2 104 42 89 0.770 3.99 5.03 PlyA + 138845 138850 6 1.05 6.00 Prom + 146330 146369 40 -0.86 6.01 Init + 151086 151122 37 2 1 71 119 -17 0.140 0.03 6.02 Term + 159072 159187 116 1 2 93 46 92 0.393 4.23 6.03 PlyA + 160337 160342 6 1.05 7.03 PlyA - 160546 160541 6 1.05 7.02 Term - 168220 167196 1025 2 2 -29 36 2364 0.782 211.78 7.01 Init - 174352 174343 10 1 1 86 62 2 0.516 -1.84 7.00 Prom - 175684 175645 40 -7.46 8.00 Prom + 175731 175770 40 -2.66 8.01 Init + 176018 176023 6 1 0 73 105 17 0.382 1.78 8.02 Intr + 176683 176734 52 0 1 141 75 23 0.948 4.78 8.03 Intr + 183679 183837 159 2 0 86 23 105 0.772 3.76 8.04 Term + 185899 186029 131 2 2 72 52 112 0.691 4.34 8.05 PlyA + 188855 188860 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581f:72021392_72224384|GENSCAN_predicted_peptide_1|152_aa MAEGKEKQVTSYMGSSSLYALKKQAAVLERPMWQGNEGGLQSMAGTKAHMEHFTFLIEEF AFDISVTSVSQFGDHIEPSLSSGTWMTLEAIILSKLPQGQKARYCMFSLIECTVPLCHAP WQYSRKFPPESQNMDDNDKRAITSASGDRYRV >gi568815581f:72021392_72224384|GENSCAN_predicted_CDS_1|459_bp atggcagaaggcaaggagaagcaagttacatcttacatgggcagcagcagcttgtatgct ttgaagaagcaagctgcagtgttggagaggcccatgtggcaaggaaatgagggtggcctc cagtcaatggctggaacaaaagctcatatggagcatttcacattcttaatagaagagttt gcctttgacatctcagtgaccagtgtgagtcaatttggagaccacattgaaccatctcta tcttctgggacatggatgacgctggaagccatcattctgagcaaactaccacaaggacag aaagccagatactgcatgttctcactcatagaatgcactgtgcccctctgccatgctccc tggcaatattccagaaagttcccgcctgagtcccagaatatggatgacaatgacaagcgg gctatcacctcagcttctggtgatagatacagagtgtaa >gi568815581f:72021392_72224384|GENSCAN_predicted_peptide_2|165_aa MKVVVGCSSGVIKPLVDQSISQFSFVLISLEEKVVAEAAPCVRKTTVPGLWLQPLFPLAF PYGSVVSVRNGKEVTKRLAMDVITGCRNSGSEGNVHDYFIPPAALENQTQEPPKNPSFFM ALANATNFSACSGSSQLMKGHPLVCFLAVSGPQEEAAARKKHVAK >gi568815581f:72021392_72224384|GENSCAN_predicted_CDS_2|498_bp atgaaggtggtggttggatgctcttcaggagtgatcaagcctctggtggaccagagcatc agccagttttcatttgtgttgatttccttggaagaaaaagttgtggcagaggctgcccca tgtgtgcgaaagaccaccgttcctgggctgtggctgcagcctctgtttccgttagccttt ccgtacggcagtgttgtcagtgtgagaaatggaaaggaggtcactaagagacttgcaatg gatgtaataactggatgcagaaactctggatctgaagggaatgttcatgactatttcatc cctcctgctgccttggagaaccaaacgcaggagccacccaagaatccttccttcttcatg gcattggcaaatgccaccaacttctcagcctgctcagggtcaagccagctcatgaaagga caccctctggtgtgctttctggctgtttcaggaccacaagaagaagcagcagctcgcaag aaacatgttgctaaatag >gi568815581f:72021392_72224384|GENSCAN_predicted_peptide_3|743_aa MKFQIHQDRTLYESDKLCIQNKNEEPRPPETAEPPVSGNVSFVVLADGMFAVWDCFMQLI LSKSLRFRVWLHDKPAGLVSDLGSGPYPRGALLCYSRPTPPDPARSPGQIGGKGRKLKRA SWSAVLAESGFHSPEWHMPVNFPKSGSRVGRNTHAGMHKHRVFATVAGRSGGCDGTCTHS GNTSTGDIYSISQQIFIAHLRRYLPRGEEPAPRVPEPPRLLAFPGHQPPAPGPRMNLLDP FMKMTDEQEKGLSGAPSPTMSEDSAGSPCPSGSGSDTENTRPQENTFPKGEPDLKKESEE DKFPVCIREAVSQVLKGYDWTLVPMPVRVNGSSKNKPHVKRPMNAFMVWAQAARRKLADQ YPHLHNAELSKTLGKLWRLLNESEKRPFVEEAERLRVQHKKDHPDYKYQPRRRKSVKNGQ AEAEEATEQTHISPNAIFKALQADSPHSSSGMSEVHSPGEHSGQSQGPPTPPTTPKTDVQ PGKADLKREGRPLPEGGRQPPIDFRDVDIGELSSDVISNIETFDVNEFDQYLPPNGHPGV PATHGQVTYTGSYGISSTAATPASAGHVWMSKQQAPPPPPQQPPQAPPAPQAPPQPQAAP PQQPAAPPQQPQAHTLTTLSSEPGQSQRTHIKTEQLSPSHYSEQQQHSPQQIAYSPFNLP HYSPSYPPITRSQYDYTDHQNSSSYYSHAAGQGTGLYSTFTYMNPAQRPMYTPIADTSGV PSIPQTHSPQHWEQPVYTQLTRP >gi568815581f:72021392_72224384|GENSCAN_predicted_CDS_3|2232_bp atgaagtttcagatacaccaagacaggacactttatgagtcggataaattgtgtatccag aataagaacgaagagccaaggcctccagagactgcagaaccaccagtctctggaaacgtc tccttcgtcgtcctggctgacggcatgtttgcagtttgggactgttttatgcagctgatt ctttccaagagtctgaggtttagggtttggctccacgacaagccagctggtctggtctct gacttgggctccggtccgtacccccggggcgccctgctgtgttacagccgcccgacgccc ccagacccggccaggtcaccagggcagattggaggaaagggaagaaaactgaaacgggcc tcttggtctgcagttttagcggagtcgggattccacagccctgagtggcacatgccggtc aacttcccaaagtcgggctcccgtgtggggagaaatacacacgcaggaatgcacaagcat cgcgtgttcgcaactgtcgctgggaggtctggcggctgtgatgggacatgcactcactcg ggcaacacgtccacaggtgacatctattcgatcagtcaacagatatttattgcgcaccta cgacgctatctgccgaggggagaggagcccgcgcctcgagtccccgagccgccgcggctt ctcgcctttcccggccaccagccccctgccccgggcccgcgtatgaatctcctggacccc ttcatgaagatgaccgacgagcaggagaagggcctgtccggcgcccccagccccaccatg tccgaggactccgcgggctcgccctgcccgtcgggctccggctcggacaccgagaacacg cggccccaggagaacacgttccccaagggcgagcccgatctgaagaaggagagcgaggag gacaagttccccgtgtgcatccgcgaggcggtcagccaggtgctcaaaggctacgactgg acgctggtgcccatgccggtgcgcgtcaacggctccagcaagaacaagccgcacgtcaag cggcccatgaacgccttcatggtgtgggcgcaggcggcgcgcaggaagctcgcggaccag tacccgcacttgcacaacgccgagctcagcaagacgctgggcaagctctggagacttctg aacgagagcgagaagcggcccttcgtggaggaggcggagcggctgcgcgtgcagcacaag aaggaccacccggattacaagtaccagccgcggcggaggaagtcggtgaagaacgggcag gcggaggcagaggaggccacggagcagacgcacatctcccccaacgccatcttcaaggcg ctgcaggccgactcgccacactcctcctccggcatgagcgaggtgcactcccccggcgag cactcggggcaatcccagggcccaccgaccccacccaccacccccaaaaccgacgtgcag ccgggcaaggctgacctgaagcgagaggggcgccccttgccagaggggggcagacagccc cctatcgacttccgcgacgtggacatcggcgagctgagcagcgacgtcatctccaacatc gagaccttcgatgtcaacgagtttgaccagtacctgccgcccaacggccacccgggggtg ccggccacgcacggccaggtcacctacacgggcagctacggcatcagcagcaccgcggcc accccggcgagcgcgggccacgtgtggatgtccaagcagcaggcgccgccgccacccccg cagcagcccccacaggccccgccggccccgcaggcgcccccgcagccgcaggcggcgccc ccacagcagccggcggcacccccgcagcagccacaggcgcacacgctgaccacgctgagc agcgagccgggccagtcccagcgaacgcacatcaagacggagcagctgagccccagccac tacagcgagcagcagcagcactcgccccaacagatcgcctacagccccttcaacctccca cactacagcccctcctacccgcccatcacccgctcacagtacgactacaccgaccaccag aactccagctcctactacagccacgcggcaggccagggcaccggcctctactccaccttc acctacatgaaccccgctcagcgccccatgtacacccccatcgccgacacctctggggtc ccttccatcccgcagacccacagcccccagcactgggaacaacccgtctacacacagctc actcgaccttga >gi568815581f:72021392_72224384|GENSCAN_predicted_peptide_4|260_aa MQLIRANWENRHREGGVDGFGNGFAKALNVVLFSIYLHYLGKCLTHNRSSKVKNKKKKKK EKEEEEEKKKEKEEEKKKEEEEEEGKKKKKKKKKKKKKKEKEKEKEKEKEKEKEKKKKKK KKEKEKEKEKEKEKKKKKKKKKKKKKKKKKEEEEEEEEEEEEKEKKKKKKEKEEEEKKKK EEEEEGKKKKKKKKKKKKKEEEEEEEEKKQKQKQKKKKKKKKKKEEEEEGRKEGKGRRKE EGRRRRRRRRRTRRRRKRRK >gi568815581f:72021392_72224384|GENSCAN_predicted_CDS_4|783_bp atgcaactcattagagccaactgggagaacaggcacagggaaggaggagttgatggattt ggaaatggttttgctaaggctttgaacgtcgtcttgttctcaatttatctccattacctg ggaaagtgtttgacacataacaggagctcgaaagtaaaaaataagaagaagaagaagaag gaaaaggaggaggaggaggagaagaagaaggaaaaggaggaggagaagaagaaggaggag gaagaggaggaggggaagaagaagaagaagaagaagaagaagaagaagaagaagaaggag aaggagaaggagaaggagaaggagaaggagaaggagaaggagaagaagaagaagaagaag aagaaggagaaggagaaggagaaggagaaggagaaggagaagaagaagaagaagaagaag aagaagaagaagaagaagaagaagaagaaggaggaggaggaggaggaggaggaggaggag gaggagaaggagaagaagaagaagaagaaggaaaaggaggaggaggagaagaagaagaag gaggaagaggaggaggggaagaagaagaagaagaagaagaagaagaagaagaagaaggag gaggaggaggaggaggaggagaagaagcagaagcagaagcagaagaagaagaagaagaag aagaagaagaaggaggaggaggaggagggaaggaaggaaggaaaaggaagaaggaaggaa gaaggaagaaggaggagaagaagaagaaggaggacaaggaggagaagaaagaggagaaaa taa >gi568815581f:72021392_72224384|GENSCAN_predicted_peptide_5|76_aa MGTNQVLSKRGPNPDTKRGFLDLAQERIQDAPLCHQQRRMVGPSGYLPYDFICKIQEKGL GVALEPLINVEMGGED >gi568815581f:72021392_72224384|GENSCAN_predicted_CDS_5|231_bp atgggtaccaaccaggtgttatcgaaaaggggtcccaatccagacaccaagagagggttc ttggatcttgcacaagaaagaattcaggatgcccccttgtgccatcaacaaagacggatg gttggaccctctggctaccttccatacgactttatttgcaaaatccaggaaaaaggactg ggcgtggcattagaaccccttatcaatgtggagatgggcggggaagactaa >gi568815581f:72021392_72224384|GENSCAN_predicted_peptide_6|50_aa MELESQAPAPVSDFSVFTCNAKSCYGLKNKAAFFFVGLISGTLRKEHLGF >gi568815581f:72021392_72224384|GENSCAN_predicted_CDS_6|153_bp atggagctagagagccaggcaccagctcctgtctctgactttagtgtcttcacctgcaat gccaagagctgctatggcctgaaaaacaaggctgctttcttcttcgttggcctcatttca ggcactctcaggaaagaacatctggggttctga >gi568815581f:72021392_72224384|GENSCAN_predicted_peptide_7|344_aa MVEEHASITTTLITIIITTTITTVIITTTINTTITMNTTTSIITAITITTTATTITTTII TTIATTMNTITTTPTIITIMITINTVTITTTTTTTTITTTTTIITTTTITTIMITINTTT TNTNTSIITTTIITTNTTTTIIITTITITTTITTIIITIIATIIITTIATTTNTTTTIIT STTITTTTTITTIMVTIITTTTITINTTTITTIMITIITITMNITITMNTTTTIITTTIT ITTTITTVIITTIATIIITTIATTNTTTIITTTIATIVITINTTITITTTTTTITITTTT INTTTIVTTTTSTITTITTTTITMTATTTATFGPTLSTTITTSK >gi568815581f:72021392_72224384|GENSCAN_predicted_CDS_7|1035_bp atggtggaagagcatgcctccattactaccaccctcatcaccatcatcattactaccact atcaccaccgtcatcatcaccaccaccatcaacaccaccatcaccatgaacactaccacc agcatcattactgccatcaccatcactactactgccaccaccatcactaccaccatcatc accaccattgccaccaccatgaacaccatcaccaccacccccaccatcattaccatcatg atcaccatcaacaccgtcaccatcaccaccaccactaccaccaccaccatcaccaccacc accaccatcatcaccaccaccaccatcactactatcatgatcaccatcaacaccaccacc accaacaccaacacctccatcatcactaccaccatcatcaccacgaacactaccaccacc atcatcattaccactatcaccatcactactactattaccaccattatcatcaccatcatt gccaccatcatcatcaccactattgccaccaccaccaacacaaccaccaccatcatcacc agcaccaccatcaccaccaccaccaccatcactactatcatggtcaccattatcaccacc accaccatcaccatcaacaccaccaccatcactactatcatgatcaccatcatcaccatc accatgaacatcaccatcaccatgaacactaccaccaccatcatcactaccactatcacc atcactactactattacgaccgttatcatcaccaccattgccaccatcatcatcaccacc attgccaccaccaacaccaccaccatcatcaccaccaccatcgctaccatagtgatcacc atcaacactaccatcaccatcaccactaccaccaccaccatcaccattaccactaccacc atcaataccaccaccatcgttactaccaccacctccactatcactaccatcaccaccacc actatcaccatgaccgccaccaccactgccacttttggtcccactctctctaccacaatt accacctccaagtag >gi568815581f:72021392_72224384|GENSCAN_predicted_peptide_8|115_aa MLDTIESTHQAAQNSGNWPDQKKPQSVSTEDPCPSVSCRFCQAEKKFTRAPGWTVNGHRL AAFTAGKVELHGVLVKNGTRSNYHQQRITLTVVEHRAWIQVLTQSDPKLTLLLRD >gi568815581f:72021392_72224384|GENSCAN_predicted_CDS_8|348_bp atgctggacacaattgagagtacacaccaggcggcacaaaacagtggcaactggccagat cagaagaagcctcagagcgtgagcactgaagatccctgtccctctgtctcctgtcgcttc tgccaggccgaaaaaaagtttacccgagcacctgggtggacggtgaatggtcatcgtttg gcagcctttactgctggcaaagtggaattgcatggggtgctagtgaagaatggaacaaga tccaactaccatcagcagaggataaccctcactgtcgtggagcacagggcatggattcag gtgctgacccaatcagatccaaagctaacactgctgctaagagactga