GENSCAN 1.0 Date run: 8-Nov-116 Time: 03:44:17 Sequence gi568815592f:1510446_1712104 : 201659 bp : 47.21% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 4795 4888 94 0 1 81 101 64 0.123 7.59 1.02 Intr + 13178 13364 187 0 1 66 58 153 0.427 8.85 1.03 Term + 26242 26332 91 1 1 87 47 71 0.019 0.09 1.04 PlyA + 26862 26867 6 1.05 2.04 PlyA - 27338 27333 6 1.05 2.03 Term - 27919 27792 128 2 2 40 46 162 0.880 5.64 2.02 Intr - 36190 36133 58 1 1 53 92 58 0.491 1.16 2.01 Init - 37533 37435 99 0 0 116 45 47 0.276 3.55 2.00 Prom - 54951 54912 40 -4.16 3.00 Prom + 65034 65073 40 -4.06 3.01 Init + 66280 66418 139 0 1 87 41 54 0.180 0.91 3.02 Term + 70971 71341 371 1 2 63 39 303 0.440 17.71 3.03 PlyA + 72823 72828 6 1.05 4.00 Prom + 79998 80037 40 -4.46 4.01 Init + 84920 85132 213 1 0 89 72 93 0.251 6.34 4.02 Term + 88344 88553 210 2 0 76 48 175 0.964 9.59 4.03 PlyA + 89750 89755 6 1.05 5.00 Prom + 90800 90839 40 -4.86 5.01 Init + 91922 91935 14 1 2 90 83 5 0.078 -0.73 5.02 Intr + 94326 94530 205 0 1 27 74 138 0.168 5.50 5.03 Intr + 96124 96252 129 0 0 134 94 7 0.176 6.79 5.04 Term + 98635 98715 81 0 0 75 47 77 0.050 -0.01 5.05 PlyA + 99425 99430 6 -0.45 6.00 Prom + 99435 99474 40 -9.46 6.01 Sngl + 100001 101662 1662 1 0 95 47 2816 0.999 273.43 6.02 PlyA + 101897 101902 6 1.05 7.02 PlyA - 101950 101945 6 1.05 7.01 Sngl - 108817 108404 414 1 0 79 43 229 0.714 13.80 7.00 Prom - 111585 111546 40 -4.06 8.05 PlyA - 112460 112455 6 1.05 8.04 Term - 113786 113724 63 2 0 97 53 118 0.099 7.09 8.03 Intr - 114095 114027 69 2 0 132 76 77 0.152 10.28 8.02 Intr - 114270 114177 94 2 1 5 56 77 0.069 -3.83 8.01 Init - 115793 115729 65 2 2 72 13 142 0.064 5.72 8.00 Prom - 122771 122732 40 -2.46 9.00 Prom + 125367 125406 40 -7.36 9.01 Init + 126233 126411 179 1 2 94 78 189 0.962 15.27 9.02 Intr + 127380 127423 44 0 2 80 119 -1 0.432 0.08 9.03 Intr + 130850 131017 168 0 0 67 77 86 0.529 5.32 9.04 Intr + 147868 148011 144 2 0 52 94 76 0.016 4.85 9.05 Term + 151174 151292 119 2 2 70 53 38 0.004 -2.80 9.06 PlyA + 152434 152439 6 1.05 10.00 Prom + 156112 156151 40 -0.66 10.01 Init + 160368 160377 10 2 1 93 114 9 0.787 3.73 10.02 Term + 165600 166177 578 1 2 -14 37 260 0.097 5.33 10.03 PlyA + 166252 166257 6 1.05 11.00 Prom + 167029 167068 40 -2.46 11.01 Init + 167153 167285 133 1 1 66 47 84 0.325 2.40 11.02 Intr + 170799 170903 105 1 0 78 44 66 0.341 1.39 11.03 Intr + 170944 171063 120 2 0 78 94 53 0.593 5.37 11.04 Intr + 180765 180844 80 1 2 64 99 62 0.068 4.17 11.05 Intr + 189228 189320 93 2 0 120 75 12 0.515 3.26 11.06 Intr + 198511 198620 110 0 2 89 96 -28 0.006 -2.82 11.07 Term + 199800 199938 139 0 1 68 34 112 0.015 1.24 11.08 PlyA + 201106 201111 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr - 113786 113561 226 0 1 97 68 188 0.839 14.74 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815592f:1510446_1712104|GENSCAN_predicted_peptide_1|123_aa MSRSSWFMGPSCRDVATFIPGLELLLHTDLHGVSSLNKKRNVLDGWSWALTGNEREPLFR TQATLRQPLSGCPEVMSWWSVEHTGKSSRQTSSRCSFHLTAADSRAVSVDPSDIPLQPLW DLP >gi568815592f:1510446_1712104|GENSCAN_predicted_CDS_1|372_bp atgagcagatccagctggttcatgggtccctcatgcagagacgttgccacgttcatcccg gggctggagctgctgcttcacactgacttacacggggtgtctagtttgaacaagaaaagg aatgttctggatggctggtcctgggcgctcaccgggaatgaacgtgagccactctttcgc acacaggccactctgcgtcagccactctcgggctgccccgaagtgatgtcctggtggtcc gtggagcacacaggaaagtccagcagacaaacatcatcaaggtgcagctttcacctcaca gcagccgacagccgggctgtctctgtagacccgtctgacattcctctgcagccgctgtgg gacttgccctga >gi568815592f:1510446_1712104|GENSCAN_predicted_peptide_2|94_aa MALSRVGFRFPVTCKGSHTIHVPVEYNLAGSEKLQSQQQQEFNEGRLCARYYSQIIDEYG MDVGHGGSEKTNPTNCLTNKGKLILGNESTIHIC >gi568815592f:1510446_1712104|GENSCAN_predicted_CDS_2|285_bp atggcgctgagcagggtggggtttagattcccagtcacgtgtaaaggcagtcacaccatc cacgtgccagtggagtacaacttggcaggcagtgagaagctacaatctcaacagcaacaa gaatttaatgagggccgattatgtgcaagatactatagccagattattgatgagtatggg atggatgtgggccacggaggctctgagaagaccaacccaaccaactgtttgaccaacaaa gggaaactcatcttaggaaatgaatccacgattcacatctgctaa >gi568815592f:1510446_1712104|GENSCAN_predicted_peptide_3|169_aa MGKQTDPKWSCLAFREFCWFFEKGELVATIESPGFLLLSVDWKMRLQPCLKPLSLTTKLL ADRKHDSTGALHWKRDRPEKKTAHRDRELGHTLVKVYEGGITAMWSHYSGRTWATRSSVT VLPGVYIVGNVHDSVPACDPIISHPGTHFTEMHAQAQQHRMGELHRASS >gi568815592f:1510446_1712104|GENSCAN_predicted_CDS_3|510_bp atgggaaagcaaacagacccaaaatggtcgtgtttggccttcagggaattttgttggttc tttgaaaaaggtgaattagtcgccaccattgaaagtcccggtttcctgctcctctccgtg gattggaaaatgaggcttcagccatgcctgaagccactgtccctgaccacaaagcttttg gctgataggaagcatgacagcactggggccctacactggaagcgggaccgtccagagaag aagactgcgcacagggatcgggagctgggacacacgttagtcaaggtgtacgagggagga atcaccgccatgtggagccactactcggggaggacgtgggccacccggagctcagtgaca gtactcccgggagtgtacatcgttggtaatgtccacgacagtgtccctgcctgtgaccca ataatttcccatccagggacacacttcacagaaatgcacgcacaggcacaacagcatcga atgggagagctgcacagagcctcttcatag >gi568815592f:1510446_1712104|GENSCAN_predicted_peptide_4|140_aa MVGKCLPADWGSPGAPSAQGCSSGWPRGRAGRGTPGQGGRLFLDPSGRLPPLQRVRAPRV YFQFRSAALSRASCLHFIVCIILDLLQKEKWPFTAFTECGQKLDRADIELRTEMENIQGE HAHSTPTLPQRSEPQRWPHT >gi568815592f:1510446_1712104|GENSCAN_predicted_CDS_4|423_bp atggtcggaaagtgtctgccagcagactggggatccccaggggccccatcagcccagggc tgctcctcaggatggccgcggggccgtgccggccggggcacaccaggccagggtgggcgg ctgttcctggaccccagcgggcgtctgcctcccctacagcgggtccgggcaccccgggtc tacttccagttcagaagcgcagcactgagtagggcatcctgcctgcacttcatcgtgtgt attattttggatttgcttcagaaagaaaaatggcctttcacggcgtttacggaatgtggg cagaaattggaccgggcagacatagagctccgcacagagatggaaaatattcagggtgaa cacgcacactcgacacccacgctcccgcagagaagcgagccccagcgctggccccacacc tga >gi568815592f:1510446_1712104|GENSCAN_predicted_peptide_5|142_aa MVSWRRRPTREDQRFPEASALRSSGEAGCTEPGVGLVPVLRGANTGPGPGPRGSCPEGSE LGADRALPAGGPALLGQVEPPLKLVSTCPRLPTAAHSSAFRAEPRAQRMEKPESGEAAKG RPPGEGGEKQGGPQRGGLILRG >gi568815592f:1510446_1712104|GENSCAN_predicted_CDS_5|429_bp atggtgtcctggaggcgccgacccactcgcgaggaccagcggttcccggaggcgtcggcc ctcaggtcctcgggggaggccggctgcaccgagccgggtgtcggccttgtccctgtcctg agaggtgcaaacaccggccccggcccaggcccccggggctcctgcccagaaggctcagag ctgggggccgaccgcgccttacccgcaggaggcccggcgctcctgggccaggtggagccc cccctgaagctggtttctacctgccctcgcctccccactgcagcccacagttcggctttc agggctgaacccagagcccagaggatggagaagccagaaagcggggaggccgccaagggc cggccccccggggaggggggcgagaagcagggcggcccgcagcggggcgggctcatcctt cgcgggtga >gi568815592f:1510446_1712104|GENSCAN_predicted_peptide_6|553_aa MQARYSVSSPNSLGVVPYLGGEQSYYRAAAAAAGGGYTAMPAPMSVYSHPAHAEQYPGGM ARAYGPYTPQPQPKDMVKPPYSYIALITMAIQNAPDKKITLNGIYQFIMDRFPFYRDNKQ GWQNSIRHNLSLNECFVKVPRDDKKPGKGSYWTLDPDSYNMFENGSFLRRRRRFKKKDAV KDKEEKDRLHLKEPPPPGRQPPPAPPEQADGNAPGPQPPPVRIQDIKTENGTCPSPPQPL SPAAALGSGSAAAVPKIESPDSSSSSLSSGSSPPGSLPSARPLSLDGADSAPPPPAPSAP PPHHSQGFSVDNIMTSLRGSPQSAAAELSSGLLASAAASSRAGIAPPLALGAYSPGQSSL YSSPCSQTSSAGSSGGGGGGAGAAGGAGGAGTYHCNLQAMSLYAAGERGGHLQGAPGGAG GSAVDDPLPDYSLPPVTSSSSSSLSHGGGGGGGGGGQEAGHHPAAHQGRLTSWYLNQAGG DLGHLASAAAAAAAAGYPGQQQNFHSVREMFESQRIGLNNSPVNGNSSCQMAFPSSQSLY RTSGAFVYDCSKF >gi568815592f:1510446_1712104|GENSCAN_predicted_CDS_6|1662_bp atgcaggcgcgctactccgtgtccagccccaactccctgggagtggtgccctacctcggc ggcgagcagagctactaccgcgcggcggccgcggcggccgggggcggctacaccgccatg ccggcccccatgagcgtgtactcgcaccctgcgcacgccgagcagtacccgggcggcatg gcccgcgcctacgggccctacacgccgcagccgcagcccaaggacatggtgaagccgccc tatagctacatcgcgctcatcaccatggccatccagaacgccccggacaagaagatcacc ctgaacggcatctaccagttcatcatggaccgcttccccttctaccgggacaacaagcag ggctggcagaacagcatccgccacaacctctcgctcaacgagtgcttcgtcaaggtgccg cgcgacgacaagaagccgggcaagggcagctactggacgctggacccggactcctacaac atgttcgagaacggcagcttcctgcggcggcggcggcgcttcaagaagaaggacgcggtg aaggacaaggaggagaaggacaggctgcacctcaaggagccgcccccgcccggccgccag cccccgcccgcgccgccggagcaggccgacggcaacgcgcccggtccgcagccgccgccc gtgcgcatccaggacatcaagaccgagaacggtacgtgcccctcgccgccccagcccctg tccccggccgccgccctgggcagcggcagcgccgccgcggtgcccaagatcgagagcccc gacagcagcagcagcagcctgtccagcgggagcagccccccgggcagcctgccgtcggcg cggccgctcagcctggacggtgcggattccgcgccgccgccgcccgcgccctccgccccg ccgccgcaccatagccagggcttcagcgtggacaacatcatgacgtcgctgcgggggtcg ccgcagagcgcggccgcggagctcagctccggccttctggcctcggcggccgcgtcctcg cgcgcggggatcgcacccccgctggcgctcggcgcctactcgcccggccagagctccctc tacagctccccctgcagccagacctccagcgcgggcagctcgggcggcggcggcggcggc gcgggggccgcggggggcgcgggcggcgccgggacctaccactgcaacctgcaagccatg agcctgtacgcggccggcgagcgcgggggccacttgcagggcgcgcccgggggcgcgggc ggctcggccgtggacgaccccctgcccgactactctctgcctccggtcaccagcagcagc tcgtcgtccctgagtcacggcggcggcggcggcggcggcgggggaggccaggaggccggc caccaccctgcggcccaccaaggccgcctcacctcgtggtacctgaaccaggcgggcgga gacctgggccacttggcgagcgcggcggcggcggcggcggccgcaggctacccgggccag cagcagaacttccactcggtgcgggagatgttcgagtcacagaggatcggcttgaacaac tctccagtgaacgggaatagtagctgtcaaatggccttcccttccagccagtctctgtac cgcacgtccggagctttcgtctacgactgtagcaagttttga >gi568815592f:1510446_1712104|GENSCAN_predicted_peptide_7|137_aa MTSTATVRPGLGRVCPLRVIIIKIIGIARKPLAPAPQPLLSWVSRLRKDPAAGPDPRVEH RAADPAGPTGLARGSGDRPRRALLRTVWDADFANLEDNFCFHSTSRFLKIQSPLKRASHP SGLFVAFDEPRHSLRDQ >gi568815592f:1510446_1712104|GENSCAN_predicted_CDS_7|414_bp atgaccagcacagccactgtgaggcccggccttggccgcgtttgtcctttgcgagtgata ataataaaaatcattggaattgcaaggaagcccctggccccggctccccagccgctgctg tcctgggtgtcccggctgcggaaagaccccgcagccgggcccgacccacgagtggaacac cgggctgcggacccagccgggcctacaggactggcgcggggctccggggacaggcctcgc agggccctcctgcgcaccgtctgggacgccgattttgccaacttggaggacaatttttgt tttcactctaccagcagatttttaaaaatccagtcgccgctgaaacgtgcttcccacccg tccggtttgtttgtagcctttgacgagccccggcacagcctccgggaccaatga >gi568815592f:1510446_1712104|GENSCAN_predicted_peptide_8|96_aa MDDENDSESRCVYCCSPIASPRDLRRREPQLNAHSRDRKQGNPEPCAPRGSGTDFLQGDC TKAKQKLNWKPRVAFDELVREMVHADVELMRTNPNA >gi568815592f:1510446_1712104|GENSCAN_predicted_CDS_8|291_bp atggatgatgagaacgacagcgagagtcgctgcgtgtactgctgcagccccattgcaagc ccacgtgatttacggcgccgagagccgcaactgaacgcgcacagcagggaccgaaagcag ggaaacccggagccttgtgcgcccagagggagcggaacggactttctgcagggcgactgc accaaagcgaaacagaagctgaactggaagccccgggtcgctttcgatgagctggtgagg gagatggtgcacgccgacgtggagctcatgaggacaaaccccaatgcctga >gi568815592f:1510446_1712104|GENSCAN_predicted_peptide_9|217_aa MSLGLGALAATQAALTTLRLCPCKFDSYPYQWAPEGLPMPHLPKVLCRNGNGVEGSREES LLTVLRGRQGNGGAGTIYVTVTLVKINGEKYALPAGGTQKVPWSSTACVSEWTGKKLYLL PVKGGKYLVEAEAAICEVAFSRRLSHLGLAEEHKSIIPPTAGIRPCAHTAQRSAYLCPAS GTVAAGDLLTGCPRTLPGSSSTTFQPKRVSKSLTESE >gi568815592f:1510446_1712104|GENSCAN_predicted_CDS_9|654_bp atgagcctaggattgggggcactggctgccacacaggcagcccttaccacgctacggtta tgtccttgcaagtttgattcttacccgtaccagtgggccccggaaggtctgcccatgcct cacctgcccaaggttctgtgcaggaatgggaatggcgtggaaggcagcagggaggagagc cttctaacagtcctacgaggtaggcaaggaaacggcggtgctgggacaatctatgtcact gtcacgttagtgaagataaatggagagaaatatgccctgccagctggtggcactcagaaa gtgccatggagcagcacagcctgcgtctcagaatggacaggaaaaaagctttatcttctt cctgtcaaaggagggaaataccttgtggaagctgaagctgcaatttgcgaagtcgcattt tcccgaaggctttcccacctgggccttgctgaggaacataagagtatcatcccaccgact gctggcattcggccctgcgcccacacagcccagcgctctgcctacctgtgtccagcaagt gggactgtggctgcaggagatctgctgacaggctgtccgaggaccctgcccggatcttcc tcaaccacgttccaacccaaaagagttagtaagagtctgacagagagcgagtga >gi568815592f:1510446_1712104|GENSCAN_predicted_peptide_10|195_aa MASAQNLLKLISNCSKVSGYKINVQKSQAFLYTNNRQTESQIMSELPFTIASKRIKYLGI QLTRNVKDLFKENYKPLLSKIKEDTNKWKNIPCSWIGRINIVKMATLPKVIYRFNAIPIK LPMTFFIELEKTTLNFIWNQKRARIAKTILSKKNKAGGITLPDFKLYCKATVTKTACYWY QNRDIDQWNRIESLK >gi568815592f:1510446_1712104|GENSCAN_predicted_CDS_10|588_bp atggcctcggcccaaaatctcctcaagctgataagcaactgcagcaaagtctcaggatac aaaatcaatgtgcaaaaatcacaagcattcttatacaccaataacagacaaacagagagc caaatcatgagtgaactcccattcacaattgcttcaaagagaataaaatacttaggaatc caacttacaaggaatgtgaaggacctcttcaaggagaactacaaaccactgctcagcaaa ataaaagaggacacaaacaaatggaagaacattccatgctcatggataggaagaatcaat atcgtgaaaatggccacactgcccaaggtaatttatagattcaatgccatccccatcaag ctaccaatgactttcttcatagaattggaaaaaactactttaaacttcatatggaaccaa aaaagagcccgcattgctaagacaatcctaagcaaaaagaacaaagctggaggcatcaca ctacctgacttcaaactatactgcaaggctacagtaaccaaaacagcatgctactggtac caaaacagagatatagaccaatggaacagaatagagtccctgaaataa >gi568815592f:1510446_1712104|GENSCAN_predicted_peptide_11|259_aa MEYFAAIKKDEFMSFAGTWMKLETIILSKRSQGQKTKHRMFSLIVSCHPAAKTRWYLLGC DKSFCLAAMGDILLQILLHASTVPKWLKKKVVLQRSESEVVRNYESSAIQDKMEISVAPG TSMKLEAITLSKLMQEQKTKHHMFSLDLVVTIRAPWKIQDHLQTLHSIPFAKSLDYHKIP QSLHTWCLCHWKRGCSCLMSPDCREPCTLTQLHRPGKTFVYVYIPEKLNHITAENTADLL INLVKKPITLIDQMPKTAS >gi568815592f:1510446_1712104|GENSCAN_predicted_CDS_11|780_bp atggaatactttgcagccataaaaaaggatgagttcatgtcttttgcagggacatggatg aagctggaaaccatcattctgagcaaacgatcgcaaggacagaaaaccaagcaccgcatg ttctcactcatagtgtcctgccaccctgccgccaagacacgctggtacttgttgggctgt gataaaagcttctgcttggctgccatgggtgacatcttactacagattctgttacatgca tctacagtcccaaaatggctcaagaagaaagtggtcctacagaggtcagaatctgaggtt gtaagaaattatgaaagctctgccattcaggacaagatggaaatcagtgtggctccaggg acatcgatgaagctggaagccattaccctcagcaaactaatgcaggaacagaaaaccaaa caccacatgttctcactcgaccttgtggttaccattagggctccctggaaaatccaggac catctccagacccttcactcaatcccatttgcaaagtctctggattatcataaaatccct caaagccttcatacctggtgcctgtgtcactggaaaagagggtgcagctgtctgatgtcc cctgattgccgagaaccgtgtacactgactcagctccacagaccggggaaaacatttgtg tatgtgtatattcctgagaagcttaaccacattactgctgagaacacagcagatctactc atcaacttggtaaagaaaccaatcacactgattgatcagatgcctaaaacagctagttga