GENSCAN 1.0    Date run: 7-Nov-116    Time: 19:56:15

Sequence gi568815591r:122894762_123095634 : 200873 bp : 37.07% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +  12614  12768  155  1  2   65   88    55 0.249   1.15
 1.02 Intr +  18737  19077  341  2  2   71   80   187 0.334  10.39
 1.03 Intr +  19222  19301   80  2  2   76  -11    83 0.036  -4.45
 1.04 Intr +  31346  31454  109  1  1   66  116   105 0.399  10.04
 1.05 Term +  37678  37820  143  2  2   59   47    93 0.249  -0.59
 1.06 PlyA +  38623  38628    6                               1.05

 2.05 PlyA -  38783  38778    6                               1.05
 2.04 Term -  41922  41801  122  1  2   69   55    67 0.731  -0.84
 2.03 Intr -  47316  47150  167  2  2   81  106    49 0.121   4.68
 2.02 Intr -  54760  54664   97  2  1  103   52    43 0.064   0.35
 2.01 Init -  61339  61297   43  1  1   93  115    12 0.345   5.04
 2.00 Prom -  68144  68105   40                              -5.35

 3.03 PlyA -  69734  69729    6                               1.05
 3.02 Term -  75401  75256  146  0  2   61   43   127 0.381   2.59
 3.01 Init -  89510  89378  133  2  1   66   77    93 0.438   6.35
 3.00 Prom -  91455  91416   40                              -3.65

 4.00 Prom +  95395  95434   40                              -4.05
 4.01 Init + 100538 100574   37  1  1   53  115    42 0.091   3.62
 4.02 Intr + 117470 117556   87  0  0   69  106    64 0.127   5.32
 4.03 Intr + 120575 120711  137  0  2   65   46    50 0.051  -2.13
 4.04 Intr + 126611 126757  147  1  0   74   68    78 0.048   3.91
 4.05 Term + 139752 139808   57  2  0   87   44    82 0.221   0.41
 4.06 PlyA + 142051 142056    6                               1.05

 5.02 PlyA - 142607 142602    6                               1.05
 5.01 Sngl - 156142 155987  156  1  0   54   43   194 0.786   5.65
 5.00 Prom - 161067 161028   40                              -3.05

 6.04 PlyA - 161628 161623    6                               1.05
 6.03 Term - 174621 174232  390  0  0   34   49   283 0.304  13.00
 6.02 Intr - 177934 177878   57  1  0  108  103   -16 0.168   0.06
 6.01 Init - 181819 181739   81  1  0   89  110     3 0.386   3.62
 6.00 Prom - 185693 185654   40                              -4.85

 7.02 PlyA - 185810 185805    6                               1.05
 7.01 Sngl - 186407 185943  465  2  0   42   43   268 0.920  13.59
 7.00 Prom - 186655 186616   40                             -11.74

 8.02 PlyA - 186756 186751    6                               1.05
 8.01 Sngl - 188385 187294 1092  0  0   41   45   466 0.659  34.36
 8.00 Prom - 188572 188533   40                             -11.84

 9.02 PlyA - 188810 188805    6                               1.05
 9.01 Sngl - 189419 188817  603  2  0   58   49   405 0.529  29.54

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815591r:122894762_123095634|GENSCAN_predicted_peptide_1|275_aa
MKKLRLGKMFSKLLQGYSAAELKFECRYSNSGMQSFYHYELQIFAIYRGSERRKPPFPQV
HVRLPDNDSNVAEFLPLSSAKSGFLCHDQKNWARRHIEGYGERNLLGERKKGKIILSKER
ESPARRFPASPIELQVTTLELKRPGSSPLQKARTSRSSTPSSQRTVINTQVQENCEYERH
PLRAVHNQPPALGNEDPVTMKRLQNFMTFCYEKLLIVLSVLNRKVPTEGCPKESLPHPHL
PWLQAAAATWLKPQLLAATLSTKQEDCSIAKVTLT
>gi568815591r:122894762_123095634|GENSCAN_predicted_CDS_1|828_bp
atgaagaaattgagacttgggaagatgttcagtaaacttcttcaaggttactcagcagca
gagttgaaatttgaatgcagatattcaaactctggaatgcagtctttttaccactatgaa
ttgcaaatatttgcaatatatagagggagtgaaaggagaaagcctccttttcctcaagta
cacgtacggctccctgacaatgatagtaatgttgcagaattcctgccccttagttcagct
aaatctgggttcctgtgtcacgaccagaaaaattgggcacgcagacacattgaagggtat
ggggaacggaatttattgggtgaaaggaaaaaaggaaaaataattctcagcaaagagaga
gagagtcctgctaggaggtttcccgcctcaccgattgaattgcaggtcaccaccctggaa
ctgaagaggccaggctcctccccgctgcaaaaggcgcgaacttcccgaagctccaccccg
tcttcccagcgaacagtaataaatacacaagtccaggagaactgtgaatatgaacggcat
ccccttagagctgtacataaccaaccacctgccctgggtaatgaggacccagtgaccatg
aagaggctccagaacttcatgaccttctgttatgaaaagttacttatagtactatctgtg
ctcaacagaaaagtgcctactgaagggtgccctaaggaaagcctacctcacccacatctg
ccatggcttcaggctgctgctgccacgtggctgaaaccccagctactggcagcaaccctg
tccaccaagcaggaagactgcagcatagccaaagtcactctcacctaa
>gi568815591r:122894762_123095634|GENSCAN_predicted_peptide_2|142_aa
MNIMKYAHEYGTLHALMTTFTLENFCCPGNVRVGKAKNQVVSEKSVRTPVFLQNSLCGSL
TGRQLKKWKCSLRSPRPTKTRKNTRVCLKLSDNGLKTSTYDKNYDKPNKGEWRCPSLEST
PCTKYGNEALLDHPVPEMAIAA
>gi568815591r:122894762_123095634|GENSCAN_predicted_CDS_2|429_bp
atgaatataatgaaatatgcccatgaatatggcacattgcatgcactaatgaccacattt
actttagagaatttctgctgccctggaaatgttagagtagggaaagccaagaaccaagtg
gtgagtgaaaaatctgttaggactccagtcttccttcagaattcactatgtggaagccta
acaggaaggcagctgaaaaagtggaaatgtagtttgcggagccccagacctactaagaca
aggaaaaatacaagagtgtgtttgaaactaagtgataatggcttaaaaacaagcacgtat
gataaaaattatgataagccaaataaaggagaatggagatgtccaagtctagaatcaaca
ccatgtaccaaatatgggaatgaggccctcttggaccatccagtcccagagatggccata
gctgcatga
>gi568815591r:122894762_123095634|GENSCAN_predicted_peptide_3|92_aa
MAQIQIKHLYALARGKKLHEGEIRSIYNKMRDEHSPPCFSVILKGPKRRIKRREEGGEGK
KEIKRRNREEEEKEGEGEQERRKKGISTNNIQ
>gi568815591r:122894762_123095634|GENSCAN_predicted_CDS_3|279_bp
atggcacagattcaaatcaaacatctgtatgctcttgccagaggcaaaaagctacatgaa
ggagaaataaggagcatctacaacaagatgcgtgacgaacacagcccaccttgtttttct
gtaattctcaaaggaccaaaaagaaggataaagaggagagaagaaggtggagaaggaaag
aaagaaatcaagaggaggaatagagaggaggaagagaaggaaggagaaggagaacaagaa
agaagaaagaagggaatttccacaaataatatccagtga
>gi568815591r:122894762_123095634|GENSCAN_predicted_peptide_4|154_aa
MGERRDLDAVEHAYDLQVFKNMMQVIEIIEIPILLPKILFKINGYLGYFNFLAIVNKPAI
NLMYNISAILLSDPLGIYPEVELLDYMHFDYIGYCLPASKVSDKKSADNLIKDLLYVMRC
FYFAAFKILFEFILLEHLSQHCDLGTYGLPDPQA
>gi568815591r:122894762_123095634|GENSCAN_predicted_CDS_4|465_bp
atgggtgaaagaagagaccttgatgcagtagaacacgcctatgacctgcaagtatttaag
aatatgatgcaagttattgaaattattgaaatacccattcttctgccaaaaatcctcttt
aaaatcaatggatacttgggttactttaactttttagctattgtaaataaacctgctatc
aacctgatgtacaatatctctgcaatcctgctttcagatcctttgggtatatacccagaa
gtggaattgctggattatatgcactttgactatattggctactgccttcctgcctccaaa
gtttctgataagaaatctgctgataatcttattaaggatctcttatatgtgatgaggtgc
ttctattttgctgcttttaagattctctttgagttcatcttacttgagcacctcagccag
cactgtgatctaggaacttacggacttcctgatccacaagcatga
>gi568815591r:122894762_123095634|GENSCAN_predicted_peptide_5|51_aa
MWESLELHRHLMNGFDQNADSDMDNKVQVDMVSDGDEELVDTGVNVTLVMF
>gi568815591r:122894762_123095634|GENSCAN_predicted_CDS_5|156_bp
atgtgggaaagtttagaacttcatagacacttgatgaatggctttgatcaaaatgctgat
agcgatatggacaataaagtccaggttgacatggtctcagatggagatgaggaacttgtt
gacactggagtaaatgtgactcttgttatgttttag
>gi568815591r:122894762_123095634|GENSCAN_predicted_peptide_6|175_aa
MQSISVKAKIPPKLNRQGRLYSRLLQQKHKLKQVQKKQNKTKKTKVDLEEKIKKISKSDK
GTLQLYHLSSEHTPAHDHQPQHLQIGKASELILTPSQHQPHPNSLCREKGHPDHPATMSF
FFNFEKLEGAKMANWMQPGGTSPTKGPGHQEDWHTEQTFGEKALKVDGGRMQTLD
>gi568815591r:122894762_123095634|GENSCAN_predicted_CDS_6|528_bp
atgcagtccataagtgtcaaagcaaaaattccaccaaagttaaacagacaaggaagactt
tattcaaggctattgcaacagaaacacaaactaaaacaagttcaaaagaaacaaaacaaa
acaaaaaaaactaaagtggatttggaagaaaaaattaaaaaaatttccaagagtgataag
gggactctccaactgtaccacctgtcatctgagcatactccagcccatgaccatcagccc
cagcacctccagattggcaaagcatcagaactgatcctaacaccttcacagcaccaacct
cacccaaacagcctttgcagagaaaaaggccacccagaccatcctgctactatgagtttt
ttctttaattttgaaaagctggagggagccaagatggccaactggatgcagccaggagga
acatctcccaccaagggaccaggacatcaggaagactggcacactgagcagacctttgga
gagaaggcattgaaagtggatggagggaggatgcagacactggactga
>gi568815591r:122894762_123095634|GENSCAN_predicted_peptide_7|154_aa
MCKNHKHSSTPIADKQESQIMSELPFTIASKRIKRLGIQLTRDMKGLFKENYKPLLSEIK
QDTNKWNNIPCSWIGRINIVKMVILPKVIYRFNTIPIKLPMTFFTELEKTTLKFIWNQKR
ARIAKTILSQKNKAGGIMLPDFKLYYKATVTKTA
>gi568815591r:122894762_123095634|GENSCAN_predicted_CDS_7|465_bp
atgtgcaaaaatcacaagcattcttctacaccaatagcagacaaacaagagagccaaatc
atgagtgaactcccattcacaattgcttcaaagagaataaaacgcctaggaatccaactt
acaagggatatgaagggcctcttcaaggagaactacaaaccattgctcagcgaaataaaa
caggacacaaacaaatggaataacattccatgctcatggataggaagaatcaatatcgtg
aaaatggtcatactgcccaaggtaatttatagattcaataccatccccatcaagctacca
atgactttcttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaaga
gcccgcattgccaagacaatcctaagccaaaagaacaaagctggaggcatcatgttacct
gacttcaaactatactacaaggctacagtcaccaaaacagcatga
>gi568815591r:122894762_123095634|GENSCAN_predicted_peptide_8|363_aa
MLVSDKTDFKPTKVKRDKEGHYIMVKESTQQEELTILNIYALNTGAPRFVKQVLRDLQRD
IDSHTIITGDFNTPLSTLDRSMRQKVNKDIQELNSALHQVDLIDIYRTLHPKSTEYTFFS
APHHTYSKTDHIVGSKALLSKCKRTEIITNCISDHSAIKLELRIKKLTQNRSTTWKLNNL
LLNDYWVHNEMKAEIKMFFETNESKDTTYQNLWDTFKAVRREKFIALNAHKRKQERSKID
TLISQLKELEKQEQTHSKASRRQEITKIRAELKEIETQNALQKINESRSWFFEKINKIDR
PLVRLIKKKREKNQIDAIKNDKGDITTDPTEIQTTIGEYYKHLYANKLENLEEMDKFLNT
YTL
>gi568815591r:122894762_123095634|GENSCAN_predicted_CDS_8|1092_bp
atgctagtctctgataaaacagactttaaaccaacaaaggtcaaaagagacaaagaaggc
cattacataatggtaaaggaatcaacgcaacaagaagagctaactatcctaaatatatat
gcactcaatacaggagcacccagattcgtaaagcaagtccttagagacctacaaagagac
atagactcccacacgataataactggagactttaacaccccactgtcaacattagacaga
tcaatgagacagaaagttaacaaggatatccaggaattgaactcagctctgcaccaagtg
gacctaatagacatctatagaactctccaccccaaatcaacagaatatacatttttctca
gcaccacatcacacttattccaaaactgaccacatagttggaagtaaagcactcctcagc
aaatgtaaaagaacagaaattataacaaactgtatctcagaccacagtgcaatcaaacta
gaactcaggattaagaaactcactcaaaaccgctcaactacatggaaactgaacaacctg
ctcctgaatgactactgggtacataacgaaatgaaggcagaaataaagatgttctttgaa
accaatgagagcaaagacacaacataccagaatctctgggacacatttaaagcagtgcgt
agagagaaatttatagcactaaatgcccacaagagaaagcaggaaagatctaaaattgac
accctaatatcacaattaaaagaactagagaagcaagagcaaacacattcaaaagctagc
agaaggcaagaaataactaagatcagagcagaactgaaggagatagagacacaaaacgca
cttcaaaaaatcaatgaatccaggagctggttttttgaaaagatcaacaaaattgataga
ccactagtaagactaataaagaagaaaagagagaagaatcaaatagacgcaataaaaaat
gacaaaggggatatcaccactgatcccacagaaatacaaactaccattggagaatactat
aaacacctctacgcaaataaactagaaaatctagaagaaatggataaattcctcaacaca
tacaccctctga
>gi568815591r:122894762_123095634|GENSCAN_predicted_peptide_9|200_aa
MELKTMARELRDECTSFSSRFDQLEERVSVMEDEMNEMKQEEKFREKRVKRNEQSLQKIW
DYVKRPNLRLIGVPESDGENGTKLENTLKDIIQENLPNLARQANIQIQEIQRTLQRYSSR
RATPRRIIVRFTKVEMKEKILRAAREKGWVTHKGKPIRLTADLSAETLQARREWGPIFNV
LKEKEFSTQNFISSQTKLHK
>gi568815591r:122894762_123095634|GENSCAN_predicted_CDS_9|603_bp
atggagctgaaaacaatggcacgagaactacgtgacgaatgcacaagcttcagtagccga
tttgatcaactggaagaaagggtatcagtgatggaagatgaaatgaatgaaatgaagcag
gaagagaagtttagagaaaaaagagtgaaaagaaacgaacaaagcctccaaaaaatatgg
gactatgtgaaaagaccaaatctacgtctgattggtgtacctgaaagtgacggggagaat
ggaaccaagttggaaaacactctgaaggatattatccaggagaacttacccaacctcgca
aggcaggccaacattcaaattcaggaaatacagagaacgctacaaagatactcctcgaga
agagcaactccaagacgcataattgtcagattcaccaaagttgaaatgaaggaaaaaata
ttaagggcagccagagagaaaggttgggttacccacaaagggaagcccatcagattaaca
gcggatctctcagcagaaactctacaagccagaagagagtgggggccaatattcaacgtt
cttaaagaaaaagaattttcaacccagaatttcatatccagccaaactaagcttcataag
tga