GENSCAN 1.0 Date run: 5-Nov-116 Time: 00:34:25

Sequence gi568815580f:46005216_46228122 : 222907 bp : 43.48% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Term +  23157  23497  341  1  2   39   40   529 0.132  37.90
 1.02 PlyA +  25121  25126    6                               1.05

 2.05 PlyA -  25365  25360    6                               1.05
 2.04 Term -  34611  34593   19  2  1  107   54    20 0.806  -1.71
 2.03 Intr -  34832  34732  101  2  2  107  102   197 0.847  21.91
 2.02 Intr -  35253  35211   43  2  1   58   94    46 0.108   0.44
 2.01 Init -  41173  41079   95  1  2   71   20   109 0.088   2.26
 2.00 Prom -  53076  53037   40                              -5.46

 3.00 Prom +  55716  55755   40                              -4.46
 3.01 Init +  61314  61385   72  2  0   60  105    10 0.310   0.97
 3.02 Intr +  65721  65855  135  2  0   53   96    68 0.343   4.86
 3.03 Intr +  66635  66710   76  1  1  112   64    37 0.875   2.79
 3.04 Term +  67062  67318  257  1  2   89   36   161 0.821   6.75
 3.05 PlyA +  67787  67792    6                               1.05

 4.11 PlyA -  67976  67971    6                               1.05
 4.10 Term -  79148  79067   82  1  1   90   37    77 0.954  -0.03
 4.09 Intr -  79439  79289  151  0  1  101   80   139 0.999  13.62
 4.08 Intr -  81042  80898  145  0  1   79   87   112 0.969  10.06
 4.07 Intr -  82017  81793  225  0  0   86   67   212 0.985  17.08
 4.06 Intr -  82277  82126  152  0  2   48   48   212 0.999  13.08
 4.05 Intr -  83042  82894  149  1  2   59   67   148 0.999   9.68
 4.04 Intr -  84517  84351  167  1  2   83   38   223 0.998  15.56
 4.03 Intr -  84781  84608  174  1  0   81  100   235 0.999  24.14
 4.02 Intr -  86636  86467  170  0  2   77   94   105 0.189   9.67
 4.01 Init -  93016  92800  217  1  1   84   49   268 0.109  19.36
 4.00 Prom -  93217  93178   40                              -7.96

 5.00 Prom +  93710  93749   40                              -6.76
 5.01 Init +  99011  99064   54  1  0   43   98    49 0.872   2.69
 5.02 Intr +  99979 100153  175  0  1   66  100   142 0.469  12.71
 5.03 Intr + 117441 117490   50  1  2  100  111    -4 0.358   1.50
 5.04 Intr + 120529 120576   48  0  0   80   94    28 0.671   1.48
 5.05 Intr + 121123 121258  136  0  1   66   64    73 0.296   2.84
 5.06 Term + 152275 152564  290  2  2   87   55   100 0.290   2.14
 5.07 PlyA + 154480 154485    6                               1.05

 6.04 PlyA - 155883 155878    6                               1.05
 6.03 Term - 158818 158692  127  0  1   48   49   116 0.726   1.46
 6.02 Intr - 159327 159290   38  0  2  104  113    56 0.719   6.76
 6.01 Init - 160652 160605   48  2  0   70   58    38 0.366  -2.03
 6.00 Prom - 166588 166549   40                              -4.46

 7.02 PlyA - 167785 167780    6                               1.05
 7.01 Sngl - 168566 167985  582  2  0   22   41   454 0.554  30.28
 7.00 Prom - 192664 192625   40                              -2.56

 8.00 Prom + 198315 198354   40                              -4.26
 8.01 Init + 210672 211380  709  2  1   79   80   556 0.651  48.99
 8.02 Term + 215564 215628   65  0  2   75   42    31 0.352  -4.75
 8.03 PlyA + 217430 217435    6                               1.05

 9.02 PlyA - 218500 218495    6                               1.05
 9.01 Term - 221203 221127   77  2  2  106   32    98 0.826   4.00

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term +  23057  23497  441  1  0  -55   40   607 0.847  36.86
S.002 Term +  92688  93121  434  1  2   92   46   373 0.818  28.96

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815580f:46005216_46228122|GENSCAN_predicted_peptide_1|113_aa
XKTAGQAKRGEGRGETAPAAEPPDAGEDKPAVEWCLEELVFGDVEDDEDTLLRRLRGPWV
QGHEDLGDSEAENEAKGNCAPQKKPIWVDEEDEDEEMVDMMNNRFRKDNDEKC

>gi568815580f:46005216_46228122|GENSCAN_predicted_CDS_1|342_bp
ncgaaaaccgccggccaggccaagcgcggcgagggaagaggagagacggctccggcagcg
gaaccgcccgacgctggagaagacaaaccggccgtggagtggtgcctggaggagctggtc
ttcggcgatgtcgaggacgacgaggacacgctgctgcggcgtctgcggggtccgtgggtt
caaggacatgaagacttgggagactcagaagcggagaatgaagcaaaaggtaattgtgca
cctcaaaagaagccaatttgggtggatgaagaagatgaagatgaggaaatggttgacatg
atgaacaatcggtttcggaaagataatgatgaaaaatgctag

>gi568815580f:46005216_46228122|GENSCAN_predicted_peptide_2|85_aa
MRGTGWRCSTELDVRMLSSGTRNPVGGFGDCGLSKVNMTVISICYQSADILSTIGYDNII
QHLNNGRKNCKEFEDFLKERLWNDT

>gi568815580f:46005216_46228122|GENSCAN_predicted_CDS_2|258_bp
atgagaggaacgggctggagatgcagcacagaactagatgtgcggatgctcagcagcggg
acccgtaaccctgttggtggctttggagactgcggtctgtcgaaggtcaacatgactgtg
atcagcatttgttatcagagtgcagacatcctcagcaccatcggctatgacaacattatc
caacatctgaacaatggccgcaagaactgcaaagagtttgaagactttctaaaagaaagg
ttatggaatgatacctga
>gi568815580f:46005216_46228122|GENSCAN_predicted_peptide_3|179_aa
MEKKFLSSTSCCVFSVGHKVSHQKSLFPTVPAMGSPSPPFYLKNSQASRESHLWCPWFLC
EAATTLHPREHALLGARAAEGGPWSPSPRDSRARDSTSAFHTRAGKPPPGPKPLPIATAA
VGALRSLRETWAGPSALTRPGAAARTRTQAHLKPHSVGPRSIYQPEASCLLLEVFGYWK

>gi568815580f:46005216_46228122|GENSCAN_predicted_CDS_3|540_bp
atggagaagaagttcctctcttcaacaagttgctgtgttttctccgttggtcacaaggtc
tctcaccagaagagcctttttcccacagtgcctgccatgggatcccccagtcctccattc
tacctgaaaaactcgcaagcctcccgtgagagccatctgtggtgcccttggttcctctgt
gaggctgccaccaccctgcaccctagggagcacgcgctgctgggggcgcgggcggcagag
ggtgggccctggagtccgagcccccgggactcgcgggcacgcgactcaacttctgcgttc
cacactcgcgcagggaagccgcccccgggtcctaaaccactcccgattgcgacggccgcc
gtcggcgctctgcgctccctgcgcgagacctgggcgggaccctcggcgctcacccggcca
ggggctgcagcccgcacccggacccaggcccatttaaagccccacagcgtcgggccacga
tcaatatatcagcccgaagctagctgcctccttctggaagtctttgggtattggaaataa

>gi568815580f:46005216_46228122|GENSCAN_predicted_peptide_4|543_aa
MLSVRVAAAVVRALPRRAGLVSTEGRHDAGGRVGLQGGGAPARALSAGGRGAVANAAILH
PWLLRLDRAGDTGTAEMSSILEERILGADTSVDLEETGRVLSIGDGIARVHGLRNVQAEE
MVEFSSGLKGMSLNLEPDNVGVVVFGNDKLIKEGDIVKRTGAIVDVPVGEELLGRVVDAL
GNAIDGKGPIGSKTRRRVGLKAPGIIPRISVREPMQTGIKAVDSLVPIGRGQRELIIGDR
QTGKTSIAIDTIINQKRFNDGSDEKKKLYCIYVAIGQKRSTVAQLVKRLTDADAMKYTIV
VSATASDAAPLQYLAPYSGCSMGEYFRDNGKHALIIYDDLSKQAVAYRQMSLLLRRPPGR
EAYPGDVFYLHSRLLERAAKMNDAFGGGSLTALPVIETQAGDVSAYIPTNVISITDGQVA
GTMKLELAQYREVAAFAQFGSDLDAATQQLLSRGVRLTELLKQGQYSPMAIEEQVAVIYA
GVRGYLDKLEPSKITKFENAFLSHVVSQHQALLGTIRADGKISEQSDAKLKEIVTNFLAG
FEA

>gi568815580f:46005216_46228122|GENSCAN_predicted_CDS_4|1632_bp
atgctgtccgtgcgcgttgctgcggccgtggtccgcgcccttcctcggcgggccggactg
gtgagcaccgaaggccggcatgatgcaggcggccgggtggggctgcagggtggtggtgcg
ccggctcgggcgctctctgcaggagggcgaggggctgtggcgaatgccgccatcttgcac
ccgtggcttctccggctggacagagcaggcgacacagggactgctgagatgtcctctatt
cttgaagagcgtattcttggagctgatacctctgttgatcttgaagaaactgggcgtgtc
ttaagtattggtgatggtattgcccgcgtacatgggctgaggaatgttcaagcagaagaa
atggtagagttttcttcaggcttaaagggtatgtccttgaacttggaacctgacaatgtt
ggtgttgtcgtgtttggaaatgataaactaattaaggaaggagatatagtgaagaggaca
ggagccattgtggacgttccagttggtgaggagctgttgggtcgtgtagttgatgccctt
ggtaatgctattgatggaaagggtccaattggttccaagacgcgtaggcgagttggtctg
aaagcccccggtatcattcctcgaatttcagtgcgggaaccaatgcagactggcattaag
gctgtggatagcttggtgccaattggtcgtggtcagcgtgaactgattattggtgaccga
cagactgggaaaacctcaattgctattgacacaatcattaaccagaaacgtttcaatgat
ggatctgatgaaaagaagaagctgtactgtatttatgttgctattggtcaaaagagatcc
actgttgcccagttggtgaagagacttacagatgcagatgccatgaagtacaccattgtg
gtgtcggctacggcctcggatgctgccccacttcagtacctggctccttactctggctgt
tccatgggagagtattttagagacaatggcaaacatgctttgatcatctatgacgactta
tccaaacaggctgttgcttaccgtcagatgtctctgttgctccgccgaccccctggtcgt
gaggcctatcctggtgatgtgttctacctacactcccggttgctggagagagcagccaaa
atgaacgatgcttttggtggtggctccttgactgctttgccagtcatagaaacacaggct
ggtgatgtgtctgcttacattccaacaaatgtcatttccatcactgacggacaggtagca
ggtaccatgaagctggaattggctcagtatcgtgaggttgctgcttttgcccagttcggt
tctgacctcgatgctgccactcaacaacttttgagtcgtggcgtgcgtctaactgagttg
ctgaagcaaggacagtattctcccatggctattgaagaacaagtggctgttatctatgcg
ggtgtaaggggatatcttgataaactggagcccagcaagattacaaagtttgagaatgct
ttcttgtctcatgtcgtcagccagcaccaagccttgttgggcactatcagggctgatgga
aagatctcagaacaatcagatgcaaagctgaaagagattgtaacaaatttcttggctgga
tttgaagcttaa

>gi568815580f:46005216_46228122|GENSCAN_predicted_peptide_5|250_aa
MPEPSTSLAQGLNGQESQVAAWLKKIFGDHPIPQYEVNPRTTEILHHLSERNRVRDRDVY
LVIEDLKQKASEYESEARHYYSLFTDERTIEKLNPSLAQVKIEEAKRELWKKGNILRDIE
YSRQKEFAKAYKLKHGGLTQAVASDKVRLLRNMKCLSTCNPECPRRETPGQVIPTGNCNN
DWRVNVDQGVSWGTIQKRRVRKKLQHKMEIRGKNEVSGAIIICLPDAHSKSICRDTGLKQ
RKRFNCSVTE

>gi568815580f:46005216_46228122|GENSCAN_predicted_CDS_5|753_bp
atgccagagccttcgaccagccttgcacaggggctcaacggccaggagtctcaggttgct
gcgtggttaaaaaaaatatttggagatcatcctattccacagtatgaggtgaacccacgg
accacagagattttacatcacctttcagaacgcaacagggtccgggacagggatgtctac
ctggtaatagaggacttgaagcagaaagcaagtgaatacgagtcagaagctaggcattat
tattccctttttacagatgagaggactatagagaagttgaatccgtctcttgctcaagtg
aaaattgaagaagcaaagcgagaactatggaaaaaggggaatatattgagagatatagaa
tattctaggcaaaaagaatttgcaaaggcatataagctgaaacatggaggacttactcaa
gctgtagcaagtgataaagtccggctgcttagaaacatgaaatgtttgtccacatgcaat
ccagagtgtcccaggagggaaaccccggggcaggtgatacctacaggcaactgcaacaat
gactggagggtaaatgtggaccaaggtgtgagctggggcaccatacaaaagagaagggtg
aggaagaagctgcagcacaaaatggagattagagggaaaaacgaggttagtggagccatc
atcatctgcttgcccgatgcacacagcaagtcaatatgccgagacaccgggttgaagcag
agaaagaggtttaactgtagtgtcactgaatga

>gi568815580f:46005216_46228122|GENSCAN_predicted_peptide_6|70_aa
MGFSMLVRLVLSSRPQVAVKAAIPPDTRRTGLNLETHHNSRGGRCRIGTCRMEKLIPKST
KDFLEEEAFD

>gi568815580f:46005216_46228122|GENSCAN_predicted_CDS_6|213_bp
atggggttctccatgttggtcaggctggtcttgagctcccgacctcaggtggcagtcaaa
gcggccattcccccagacaccagaagaacagggctcaacctagaaacacatcacaatagc
agagggggtcgatgcagaatagggacatgtcgcatggagaaattaattcccaagagcacc
aaggacttcttggaagaggaagcttttgactag

>gi568815580f:46005216_46228122|GENSCAN_predicted_peptide_7|193_aa
MQAKDNAFSKGQRSLFCALLSTHPLLKPFRKQLAVSESQSRSSGEPVPFSPRFFADADSW
ARGDSAELFSGSGTNICSSFCPGTLQALQIDRHLLVGRGPLGRPRSAPNSASAVALRFAV
SPGEGGGNEVDPNPEEGEEGAGPLAGGAFPKEEQEMLKTYLKTQDLAKDPRSPKAKQCFE
KKNNNNKALRRKM

>gi568815580f:46005216_46228122|GENSCAN_predicted_CDS_7|582_bp
atgcaagccaaggacaatgccttcagcaaaggacagcgctcccttttctgtgcgctgctc
agtactcaccccctcctgaagcccttccgaaagcagcttgcggtgtctgaatcccagagt
cgctccagcggagagcccgtgcccttctctccccgctttttcgcggatgccgatagctgg
gctcggggagactcggctgagctcttctcgggcagcggcaccaatatctgctcaagtttc
tgtccgggcaccttgcaggccttgcagatcgaccgccacctgctggtgggacgtggcccg
ctggggagaccccgctctgccccgaattccgcctctgcagtcgccctccgatttgcagtc
agccctggggagggtgggggaaacgaggtggacccgaaccccgaagaaggggaggagggc
gcaggcccactggcgggtggagcgtttcccaaggaggagcaggagatgctgaagacatat
ctcaagactcaagatctggccaaggacccgcgctcccccaaagcaaagcagtgttttgag
aagaaaaataataataataaagctttgaggaggaaaatgtaa

>gi568815580f:46005216_46228122|GENSCAN_predicted_peptide_8|257_aa
MEEAVGKVEELIESEAPPKASEQETAKEEDGSVELESQVQKDGVADSTVISSMPCLLMEL
RRDSSESQLASTESDKPTTGRVYESDSSNHCMLSPSSSGHLADSDTLSSAEENEPSQAET
AVEGDPSGVSGATVGRKSRRSRSESETSTMAAKKNRQSSDKQNGRVAKVKGHRSQKHKER
IRLLRQKREAAARKKYNLLQDSSTSDSDLTCDSSTSSSDDDEEVSGSSKTITAEIPGYEH
LRLLTTYCHTAILPYCF

>gi568815580f:46005216_46228122|GENSCAN_predicted_CDS_8|774_bp
atggaggaggcagtgggaaaagttgaagaactcattgagtccgaagccccaccaaaagca
tctgaacaagagacagccaaggaggaagatggatctgtagaactggaatctcaagttcag
aaagatggtgtagcggattctacagttatttcttcaatgccctgcttgttgatggaactg
agaagggactcttctgagtctcagttagcatccacagagagtgacaagcctacaactggc
cgagtttatgagagtgactcctctaatcactgcatgctttccccttcctctagtggtcac
ctggctgattcagatacgttgtcttccgcagaagagaatgaaccctctcaggcagaaacg
gcggtagaaggagacccttcaggagtgtctggtgccacagttgggcgcaagtctaggcgg
tcccgatctgaaagtgaaacttccactatggctgccaagaaaaaccggcaatccagtgat
aaacagaatggccgagtcgccaaggttaaaggtcatcggagccaaaagcacaaggagagg
atcaggctactgaggcagaaacgggaggctgctgcaaggaagaaatataacctgctgcag
gacagtagtaccagtgatagtgacctgacttgtgactcaagcacgagctcatcagatgat
gatgaagaggtttcagggagcagcaagacaatcactgcagagataccagggtatgaacac
ttaaggctcttgaccacatactgccatactgccatactgccatactgcttttaa

>gi568815580f:46005216_46228122|GENSCAN_predicted_peptide_9|25_aa
XILYNRVPEIPLADAYQIIALGTEL

>gi568815580f:46005216_46228122|GENSCAN_predicted_CDS_9|78_bp
ngtatcctttacaacagagtaccagaaattccactggctgatgcctaccagattattgcc
ttgggcactgagctttaa