GENSCAN 1.0 Date run: 6-Nov-116 Time: 16:59:45 Sequence gi568815592r:13205104_13426898 : 221795 bp : 44.45% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 712 1033 322 2 1 115 98 260 0.810 24.62 1.02 Intr + 8647 8720 74 1 2 68 46 88 0.204 1.65 1.03 Intr + 13652 13815 164 0 2 -96 70 267 0.036 6.19 1.04 Intr + 17605 17669 65 0 2 51 56 54 0.046 -3.98 1.05 Intr + 22752 22960 209 0 2 75 43 208 0.346 13.72 1.06 Intr + 24934 25090 157 2 1 120 91 198 0.853 22.37 1.07 Term + 27794 27893 100 2 1 82 43 42 0.245 -3.30 1.08 PlyA + 29654 29659 6 1.05 2.03 PlyA - 30246 30241 6 1.05 2.02 Term - 33940 33870 71 2 2 120 47 60 0.607 3.20 2.01 Init - 38857 38725 133 1 1 78 21 92 0.490 1.80 2.00 Prom - 40681 40642 40 0.24 3.00 Prom + 47345 47384 40 -2.86 3.01 Init + 55996 56107 112 0 1 53 94 63 0.272 3.78 3.02 Intr + 67687 67812 126 2 0 45 111 67 0.395 5.35 3.03 Intr + 73165 73226 62 2 2 55 100 61 0.375 2.45 3.04 Intr + 78319 78459 141 0 0 125 80 259 0.999 29.35 3.05 Intr + 81043 81119 77 0 2 74 96 79 0.695 5.61 3.06 Intr + 89336 89448 113 2 2 52 115 58 0.068 4.92 3.07 Term + 94515 94630 116 1 2 27 38 92 0.028 -3.17 3.08 PlyA + 95343 95348 6 1.05 4.07 PlyA - 98741 98736 6 1.05 4.06 Term - 100084 99998 87 1 0 77 47 43 0.179 -3.24 4.05 Intr - 102221 102084 138 2 0 53 50 84 0.371 1.76 4.04 Intr - 111605 111468 138 2 0 74 73 195 0.998 17.26 4.03 Intr - 115992 115805 188 1 2 121 53 153 0.991 14.51 4.02 Intr - 123323 123213 111 0 0 123 65 -11 0.458 0.45 4.01 Init - 123440 123353 88 2 1 11 80 169 0.814 7.20 4.00 Prom - 123966 123927 40 -4.86 5.00 Prom + 127000 127039 40 -2.56 5.01 Init + 128330 128344 15 1 0 65 115 4 0.375 1.09 5.02 Intr + 138916 139021 106 0 1 108 58 82 0.114 6.99 5.03 Term + 149724 149785 62 1 2 117 36 46 0.472 0.27 5.04 PlyA + 153039 153044 6 1.05 6.00 Prom + 153335 153374 40 -0.96 6.01 Init + 155552 155666 115 1 1 26 92 86 0.432 3.18 6.02 Term + 156252 156352 101 1 2 41 41 93 0.849 -1.81 6.03 PlyA + 159099 159104 6 1.05 7.03 PlyA - 159518 159513 6 1.05 7.02 Term - 160559 159640 920 0 2 136 42 2303 0.998 222.88 7.01 Init - 175389 175293 97 0 1 71 98 -4 0.158 -0.53 7.00 Prom - 177322 177283 40 -5.66 8.00 Prom + 179735 179774 40 -5.06 8.01 Sngl + 182240 182689 450 1 0 43 43 240 0.800 11.22 8.02 PlyA + 182924 182929 6 1.05 9.00 Prom + 183304 183343 40 -4.76 9.01 Init + 183596 184058 463 1 1 91 40 137 0.113 5.06 9.02 Intr + 204925 204978 54 2 0 102 92 3 0.040 1.05 9.03 Term + 205797 205990 194 1 2 69 48 74 0.028 -0.92 9.04 PlyA + 209523 209528 6 1.05 10.04 PlyA - 212344 212339 6 1.05 10.03 Term - 212741 212595 147 2 0 68 42 97 0.325 1.00 10.02 Intr - 219300 219105 196 2 1 49 38 229 0.421 13.42 10.01 Intr - 221117 220940 178 0 1 61 43 79 0.216 -0.32 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815592r:13205104_13426898|GENSCAN_predicted_peptide_1|363_aa XPGAPVKLPCLPVKLSPPLPPKKVMICMPVGGPDLSLVSYTAQKSGQQGVAQHHHTVLPS QIQHQLQYGSHGQHLPSTTGSLPMHPSGCRMIDELNKTLAMTMQRLESQRLQPSPDSQTC QRLDLGLSSPDNCCNELFKKKKKKKEEEKKKKEEEEEKKRRRRRRKEKRREEKRREEKRR EEKRREKDIWQCLKIVLAVTVGGSAAGICSGLHSGDGVTKAGPMGLPEIRQVPTVVIECD DNKENVPHESDYEDSSCLYTREEEEEEEDEDDDSSLYTSSLAMKVCRKDSLAIKLSNRPS KRELEEKNILPRQTDEERLELRQQIGTKLTSSATVQGEPQVCEGQMKIYLQMGVRCQSIW LPQ >gi568815592r:13205104_13426898|GENSCAN_predicted_CDS_1|1092_bp natcctggcgcccctgtgaaattgccttgtctgccagtgaaactgtcgcctccgctacct ccaaagaaagtcatgatctgtatgcccgtgggggggccagacctctcactggtgtcctac acagcccagaagagtggccagcagggtgtggcccagcaccaccacactgtcctgccctcc cagatccagcaccagctgcagtacggcagccacggccagcacctcccctccaccaccggc tccctccccatgcacccctcgggctgcagaatgatagacgagctcaacaaaacgctggcc atgaccatgcagaggctggaaagccagaggctgcagccctcaccagacagccaaacctgc cagcgccttgatcttggactttccagccccgacaactgttgcaatgagctatttaaaaag aagaagaagaagaaggaggaggagaagaagaagaaggaggaggaggaggagaaaaagagg aggaggaggaggaggaaagagaagagaagagaagagaagagaagagaagagaagagaaga gaagagaagagaagagaaaaggacatttggcaatgtctgaagattgttttggctgtcaca gttggaggaagtgctgctggcatctgctctgggttgcactcgggtgatggggtcaccaaa gcaggacctatgggccttccagaaataagacaagtgccaactgttgtgattgaatgtgat gacaataaagaaaatgtgcctcatgagtcagactacgaagactcttcttgcctgtataca agagaagaggaggaagaggaggaggacgaagacgacgacagctcattatacaccagctcc ctggccatgaaggtctgcaggaaggactccttagccatcaaactcagcaacaggccctcc aagcgagagctggaagaaaagaacatccttcccaggcagacggatgaggagcggctggag ctgaggcaacagattggcaccaagctcaccagctctgctacagtacaaggagagccccag gtatgtgaggggcaaatgaagatttatctgcagatgggagtgagatgccagagcatctgg ctgccccagtga >gi568815592r:13205104_13426898|GENSCAN_predicted_peptide_2|67_aa MEYYAAIKKNEIMFFAVTLMELEAIILSKLTQEQKTKYHMFSLTTEMKMRMRRRMMMVMI MAHNLKG >gi568815592r:13205104_13426898|GENSCAN_predicted_CDS_2|204_bp atggaatactatgcagccataaaaaagaatgagatcatgttctttgcagtaacactgatg gagctggaggccatcatccttagcaaactaacacaggaacagaaaaccaaataccacatg ttctcacttacaactgagatgaagatgaggatgaggaggaggatgatgatggtgatgata atggcgcacaacctcaagggctag >gi568815592r:13205104_13426898|GENSCAN_predicted_peptide_3|248_aa MLRTSTVKLCEQKPDWRVTETSSLDHPFERSGSQEQTGPRKLMTQGGYDMVQKLFLDFFR RRLSQRPTAEELEQRNILKPRNEQEEQEEKREIKRRLTRKLSQRPTVEELRERKILIRFS DYVEVADAQDYDRRADKPWTRLTAADKAAIRKELNEFKSTEMEVHELSRHLTRHSVRPSS CTIFFNPYDTLVTDEETQVPKQQRGQSHTDGKLYTLLSNQRQINTAAASEAPQLTQVEGK ELGKVDFL >gi568815592r:13205104_13426898|GENSCAN_predicted_CDS_3|747_bp atgctgagaaccagcacagtaaagctgtgtgaacagaagccagactggagagttactgag acctcgagcctggaccaccctttcgagaggtctggcagccaagagcagacagggccgagg aaattaatgacccaaggaggctatgatatggtccaaaaacttttcctggattttttccgt aggcggctgagccagaggccaactgcagaggaactggaacagaggaacattttgaaacct cggaatgaacaagaggaacaggaggagaagagagagatcaagaggaggctaacccgaaag ctcagtcaaaggcccacggtggaagagcttcgggaaagaaagatcctcatccgcttcagt gactacgtggaggtggctgacgctcaggactatgaccgcagggcagataagccgtggacc cgcctcaccgctgcagacaaagctgccatccgaaaggagctcaatgaattcaaaagcact gagatggaagttcatgaattgagtagacacttaacaaggcactctgttaggccatcttca tgcaccattttctttaacccttacgacaccttggtcaccgatgaggaaactcaggtcccc aaacaacaacgtgggcaaagtcacacagatgggaaactatacaccctcctgagcaatcag agacaaataaacacagctgctgccagcgaagccccccagttgacccaggtggaggggaag gagctgggcaaagtcgactttctgtag >gi568815592r:13205104_13426898|GENSCAN_predicted_peptide_4|249_aa MSRPAAAAAAAAAALSFVSRAVWARAGVPGLVPPGPGPQRCPPARRSVSAQGTASSASPG PLSLNPGILPPHHESHAKVMMYRKEQYLDVLHALKVVRFVSDATPQAEVYLRMYQLESGK LPRSPSFPLEPDDEVFLAIAKAMEEMVEDSVDCYWITRRFVNQLNTKYRDSLPQLKCDKS SSSRNLSVSFIMISLKKLKAVKDLTKFFQSVSDMSGYFGVQIPQDSSDAIVSKAIDLWHK HCGTPVHSS >gi568815592r:13205104_13426898|GENSCAN_predicted_CDS_4|750_bp atgtcacgtccggcggcggcggcagcggcagcggcagcggcgctgagttttgtctcccgg gccgtctgggcgcgcgcgggtgtcccaggactcgtccctcctggccctgggccccaacgg tgcccgcctgcccggcggtcggtcagcgcacagggcacggcttcctctgcttctcccgga ccacttagtctcaacccgggaatcttgcctccacaccacgagtcccatgccaaggtgatg atgtatcgtaaggagcagtacttggatgtccttcatgccctgaaagtcgttcgctttgtt agtgatgccacacctcaggctgaagtctatctccgcatgtatcagctggagtctgggaag ttacctcgaagtccctcttttccactggagccagatgatgaagtgtttcttgccatagct aaagccatggaggaaatggtggaagatagtgtcgactgttactggatcacccgacgcttt gtgaaccaattaaataccaagtaccgggattccttgccccagttgaaatgtgacaaatct tcctccagtagaaatcttagtgttagtttcatcatgatatctttaaagaagctgaaagca gtcaaagatttgactaagttcttccagtcagtttcagacatgtctgggtatttcggagtc cagattccccaggacagctcagacgcgatcgtgagcaaggccattgacttgtggcacaaa cactgtgggaccccggtccattcaagctga >gi568815592r:13205104_13426898|GENSCAN_predicted_peptide_5|60_aa MEQHQASEVPRSPFAFLMVEGFLRPPQKQMPVCLLYSLQNRRCRVLALHSRSDTELLFPG >gi568815592r:13205104_13426898|GENSCAN_predicted_CDS_5|183_bp atggaacagcaccaggcttctgaggtgcctcgctccccctttgccttcctcatggttgaa ggcttcctgaggcctccccagaagcagatgcctgtatgcctcctgtacagcctgcagaac cggagatgtcgagttttagcacttcacagccggagtgatacagagctgctctttccaggc tag >gi568815592r:13205104_13426898|GENSCAN_predicted_peptide_6|71_aa MGRNTGYFYVQLYSLLAEHQMLHPAEQEGADSRLSGAAGHSHIDGNETGMQELNGKTLKA LPPCIEALETH >gi568815592r:13205104_13426898|GENSCAN_predicted_CDS_6|216_bp atgggaagaaacactggctacttctatgtgcagctctacagcctcctggcagaacatcag atgttgcatcctgctgaacaggagggtgctgacagcaggctgagtggagccgcaggccat tcgcacattgatgggaacgaaacaggcatgcaggaactaaatgggaagactcttaaagca ctgcctccttgcatcgaggcccttgaaacacattaa >gi568815592r:13205104_13426898|GENSCAN_predicted_peptide_7|338_aa MIFTWADLGAAVLHFVFIHKSICPGIPGREPYSIGKNVICDRTATPLDAFRMTSAAHYYP KLMSIMGNVLRFLPAFVRMKQLIEEGYVGEPLVCEVQVHGGSLLGKKYNWSCDDLMGGGG LHSVGTYIIDLLTFLTGQKAVKVHGLLKTFVKQTDHIKGIRQITSDDFCTFQMVLEGGVC CTVTLNFNVPGEFKQDVTVVGSAGRLLAVGTDLYGQRNSAPEQELLVQDATPVSNSLLPE KAFSDIPSPYLRGTIKMMQAVRQAFQDQDDRRTWDGRPLTMAATFDDCLYALCVVDTIKR SSQTGEWQNIAIMTEEPELSPAYLISEAMRRSRMSLYC >gi568815592r:13205104_13426898|GENSCAN_predicted_CDS_7|1017_bp atgatattcacgtgggcagatctgggggctgcagttttgcattttgttttcattcataag agcatctgcccagggatccctggaagagagccctacagcatcggcaagaacgtcatctgc gaccgcacggccacgccgctggacgctttccgcatgacctcggccgcccactactacccc aagctcatgagcatcatgggcaacgtgctgcgcttcctgccggctttcgtgcgcatgaag cagctgatcgaggagggctacgtgggcgagccgctggtgtgtgaggtgcaggtgcacggc ggcagcctgctgggcaagaagtacaactggagctgcgacgacttgatgggcggcggcggc ctgcactccgtgggcacctacatcatcgacctgctcaccttcctcaccggccaaaaggcc gtcaaggtccacgggctgctcaagaccttcgtgaagcagactgaccacatcaagggcatc cgacagatcaccagcgatgacttctgcaccttccagatggtgctggagggcggggtgtgc tgcaccgtcaccctcaacttcaacgtgcccggcgagttcaagcaggatgtcactgtggtg ggctcagccgggcgcctgctggccgtgggcaccgacctgtacgggcagcgcaacagcgcc ccggagcaggagctgctggtgcaggacgccacgccggtgagcaactccctgcttccggag aaggccttcagcgacatcccctcgccctacctgcgcggcaccatcaagatgatgcaggcg gtgcgccaggccttccaggaccaggacgaccggcgcacgtgggatgggcggcccctcacc atggccgccaccttcgacgactgcctgtatgccttgtgcgtggtggacaccatcaagagg tccagccagacgggcgagtggcagaacattgccatcatgaccgaggagccggagctgagc cccgcctacctgatcagcgaggccatgcgccgcagcaggatgtccctctactgttag >gi568815592r:13205104_13426898|GENSCAN_predicted_peptide_8|149_aa MNIDAKILNKILANRIQQHIKKLIHHDQVSFIPRMQGWFNIRKSINVIHHINRTNNKNHM IISIDAEKAFDKIQQPFMLKTLNKLGIDGTYLKIIRAIYDKPTANIILNGHQLEAFPLKT GTRQGCPLSPLLFNTVLEVLARASGKKKK >gi568815592r:13205104_13426898|GENSCAN_predicted_CDS_8|450_bp atgaacatcgatgcaaaaatcctcaataaaatactggcaaaccgaatccagcagcacatc aaaaagcttatccaccatgatcaagtcagcttcatccctcggatgcaaggctggttcaac atacgcaaatcaataaacgtaatccatcacataaacagaaccaacaacaaaaaccacatg attatctcaatagatgcagaaaaggccttcgacaaaatccaacagcccttcatgctaaaa actctcaataaactgggtattgatggaacgtatctcaaaataataagagctatttatgac aaacctacagccaatatcatactgaatgggcaccaactggaagcattccctttgaaaact ggcacaagacaaggatgccctctctcaccactcctattcaacacagtgttggaagttctg gccagggcatcaggcaagaaaaagaaataa >gi568815592r:13205104_13426898|GENSCAN_predicted_peptide_9|236_aa MGKDFMTKTPKATATKANIDKWDLIKLKSFCTAKETIIRMNRQPTEWEKIFEIYPSDKGL IARNYKELKQIYKKKTNNPIKKWAKDMNRHFSKEDINAANRHMKKCSSSLVIREMQIKTT MRYHLTPVRMAIIKKSGNNKCWRRCGEIGNFYTVDWQGKPLRPTEVNIIRKAAQKGPIIT HTAIQSLISRMQKESFQSNKEASHNCMVNLSFHQEEPSSCKQPACMQTFEYAVIAL >gi568815592r:13205104_13426898|GENSCAN_predicted_CDS_9|711_bp atgggcaaagacttcatgactaaaacaccaaaagcaacggcaacaaaagccaacattgac aaatgggatctaattaaactgaaaagtttctgcacagcaaaagaaactatcatcagaatg aacaggcaacctacagaatgggaaaaaatttttgaaatctacccatctgacaaagggcta atagccagaaactacaaagaacttaaacaaatttacaagaaaaaaacaaacaaccccatc aaaaagtgggcaaaggacatgaacagacacttctcaaaagaagacattaatgcagccaac agacacatgaaaaaatgctcatcatcactggtcatcagagaaatgcaaatcaaaaccaca atgagataccatctcacaccagttagaatggcgatcattaaaaagtcaggaaacaacaaa tgctggagaagatgtggagaaataggaaacttttacactgttgattggcagggtaaacct ctaaggccaactgaggtaaacatcatcagaaaagcagcacagaagggccccataattact cacacagcaatacagagtttaatttcacggatgcagaaggaaagctttcaaagtaacaag gaagcctcacataattgtatggtaaacctgagctttcaccaggaggagcccagctcctgc aaacagccagcatgtatgcagacctttgagtatgctgtcattgcgttatag >gi568815592r:13205104_13426898|GENSCAN_predicted_peptide_10|173_aa XGTAKPGEKGESWRLNHSLTNLLSTSAGCLESSVIFASRIVCPGDLEVELLQKPNWHCLA GFKISVDVITVVTIIIIIIIFLMFGKSVKEMSIPRTNASGLCEGSLQPIVEAKFLATRPE LSNGVVLNPDMHCCFDKNMATDIVTHAMVERIHIPGIILLLHLREAIPGLKAL >gi568815592r:13205104_13426898|GENSCAN_predicted_CDS_10|522_bp ngaggcacagccaagccaggggagaagggagagagctggagactgaatcatagcctgact aacctgctttccacgtctgctggctgcctagaaagctctgtgatatttgcatctcgcatc gtgtgccctggagacttagaggtagagctacttcagaaacccaactggcactgcctggca ggatttaagataagcgttgatgtgatcaccgtggtcaccatcatcatcatcatcatcatc ttcctaatgtttggcaaatctgttaaagaaatgagcatacccagaaccaatgcatcaggg ttatgtgagggcagcctccaacccattgtagaggcaaaatttttggccactagacctgag ctcagcaatggagtggttctgaacccagacatgcattgttgtttcgacaaaaacatggca actgatattgttactcatgctatggttgaaagaatccatattccgggaattattctcctg ctgcacttacgagaagccattcctgggctcaaggctctgtga