GENSCAN 1.0 Date run: 3-Nov-116 Time: 01:18:35 Sequence gi568815591r:83267589_83748542 : 480954 bp : 35.20% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Term + 15873 15973 101 1 2 66 48 158 0.640 6.91 1.02 PlyA + 17370 17375 6 1.05 2.08 PlyA - 17891 17886 6 1.05 2.07 Term - 21722 21671 52 1 1 71 54 72 0.069 -1.98 2.06 Intr - 51363 51306 58 1 1 91 69 80 0.235 3.42 2.05 Intr - 54860 54772 89 1 2 35 58 103 0.070 0.70 2.04 Intr - 57726 57526 201 2 0 32 43 141 0.072 1.48 2.03 Intr - 59625 59522 104 0 2 108 66 39 0.162 1.75 2.02 Intr - 74512 74155 358 0 1 72 55 243 0.088 13.73 2.01 Init - 81798 81722 77 0 2 70 100 -6 0.056 -0.59 2.00 Prom - 89534 89495 40 -5.65 3.11 PlyA - 90931 90926 6 1.05 3.10 Term - 100450 99998 453 1 0 122 45 332 0.955 26.47 3.09 Intr - 117845 117706 140 0 2 42 76 106 0.564 4.06 3.08 Intr - 119462 119395 68 1 2 79 17 114 0.637 1.03 3.07 Intr - 125133 124967 167 0 2 76 100 87 0.968 6.54 3.06 Intr - 129141 129050 92 1 2 100 63 82 0.721 5.69 3.05 Intr - 132662 132440 223 2 1 47 103 215 0.530 15.68 3.04 Intr - 135188 135044 145 1 1 65 84 138 0.885 10.46 3.03 Intr - 137931 137862 70 1 1 49 94 -4 0.280 -6.38 3.02 Intr - 140899 140780 120 2 0 84 116 156 0.872 17.55 3.01 Init - 146085 145989 97 0 1 39 73 89 0.489 3.02 3.00 Prom - 174205 174166 40 -4.45 4.06 PlyA - 174301 174296 6 1.05 4.05 Term - 192847 192726 122 2 2 22 55 134 0.585 1.16 4.04 Intr - 194574 194460 115 0 1 20 94 121 0.650 5.00 4.03 Intr - 199013 198894 120 2 0 85 94 85 0.982 8.57 4.02 Intr - 222649 222526 124 0 1 23 65 100 0.003 0.87 4.01 Init - 272086 272010 77 1 2 97 119 31 0.713 7.71 4.00 Prom - 277579 277540 40 -3.05 5.04 PlyA - 277880 277875 6 1.05 5.03 Term - 295918 295802 117 1 0 62 42 126 0.187 2.96 5.02 Intr - 326832 326659 174 0 0 92 87 18 0.183 1.31 5.01 Init - 337404 337291 114 0 0 78 -13 131 0.511 2.26 5.00 Prom - 337528 337489 40 -2.95 6.00 Prom + 354286 354325 40 -3.05 6.01 Sngl + 355044 355187 144 2 0 78 48 137 0.668 2.36 6.02 PlyA + 355397 355402 6 1.05 7.05 PlyA - 355561 355556 6 1.05 7.04 Term - 358397 358178 220 1 1 51 43 189 0.978 6.23 7.03 Intr - 361689 361342 348 2 0 64 69 215 0.723 10.75 7.02 Intr - 395279 395131 149 2 2 16 54 172 0.004 4.71 7.01 Init - 406115 405924 192 2 0 79 68 142 0.036 8.33 7.00 Prom - 444317 444278 40 -3.65 8.02 PlyA - 444684 444679 6 1.05 8.01 Sngl - 445415 444720 696 2 0 86 48 192 0.978 10.96 8.00 Prom - 447512 447473 40 -6.15 9.03 PlyA - 447681 447676 6 1.05 9.02 Term - 448356 447908 449 1 2 22 43 199 0.948 3.19 9.01 Init - 449181 448938 244 0 1 71 71 161 0.787 10.64 9.00 Prom - 457277 457238 40 -3.65 10.02 PlyA - 457296 457291 6 1.05 10.01 Sngl - 478601 477837 765 2 0 43 44 338 0.983 20.74 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 380954 380840 115 2 1 95 100 106 0.834 11.03 S.002 Init - 461124 461100 25 0 1 97 115 40 0.806 7.34 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815591r:83267589_83748542|GENSCAN_predicted_peptide_1|33_aa XDDSSVLVIAPEDLPVGQDVEVENIDIDDSDPL >gi568815591r:83267589_83748542|GENSCAN_predicted_CDS_1|102_bp ngagatgacagttccgtgcttgttattgcccctgaagaccttccagtgggacaagatgtg gaagtggaaaatattgatattgatgactctgaccctctgtag >gi568815591r:83267589_83748542|GENSCAN_predicted_peptide_2|312_aa MHAISHLMFPPTPSGRTFTTHFTAEKDALPRKKESREVVWLQWLCRAAVGLAQFELPGSF VYTVRGKSPTQASVMADVPPCTKLEHPRSISDCRAGSENFKPVDLSLLGSVGVGSAELDH LAPRLQPPFQGSERFCLTGILGVTRMNSVVSLPCLSSSWVRMMVGKKRRSEKGNCEIQGS LFSRTLTVSTGRTQRTLFDACRTSYSEWNPSIQESSELSVLCILDMKGDHHEIGFPGKNE NNSQWSSAVEMRRSQRQEQGSKEKDLVNRGMGFRIDGQFRKRDSESENTGQMADLSLGRL TNARVYFVHTEA >gi568815591r:83267589_83748542|GENSCAN_predicted_CDS_2|939_bp atgcatgcaatatctcatttaatgttccctccaaccccaagtgggaggacttttaccacc cattttactgcagagaaagatgccctgcccagaaagaaggaatctagagaggtagtgtgg ctacagtggctttgccgagctgcagtgggcttggcccagtttgaacttcctggaagcttt gtttacactgtgaggggaaaatcccctactcaagcctcagtaatggcagatgtccctccc tgcaccaagctcgagcatcccaggtcaatttccgactgccgtgctggcagtgagaatttc aagccagtggatcttagcttactgggctctgtgggagtgggatctgctgagctagaccac ttggctcccaggcttcagccccctttccaggggagtgaacggttctgtctcactggcatt ctaggtgtcactaggatgaacagtgtggtttctctgccctgtctgtcttccagttgggtt cggatgatggtgggcaaaaaaaggagatcagagaaagggaactgtgagattcagggcagc ttgttcagccgcacgctaaccgtgtccactgggcggactcaaagaaccctctttgacgca tgtagaacttcatatagtgagtggaaccctagcatccaggaaagctctgagttgtctgtc ctctgtatactggatatgaaaggagatcaccatgaaattggtttccctggtaaaaatgag aataatagccaatggtcaagtgctgtggaaatgagaagatctcagagacaagagcagggc agtaaagagaaagaccttgtgaaccgaggtatgggcttcaggatcgatggacaatttagg aagcgggattctgagtcagaaaataccgggcagatggcagacctcagtctgggtcgatta acaaatgcccgtgtgtattttgtgcacactgaagcttga >gi568815591r:83267589_83748542|GENSCAN_predicted_peptide_3|524_aa MPNKITHSESHILPGFFQHPKQNVSVNRDVEICSELFAGLYSDYWSRDAAIFRSMGRLAH IRTEHDDERLLKEDVFLLPTRDHKNPVIFGLFNTTSNIFRGHAICVYHMSSIRAAFNGPY AHKEGPEYHWSVYEGKVPYPRPGSCASKVNGGRYGTTKDYPDDAIRFARSHPLMYQAIKP AHKKPILVKTDGKYNLKQIAVDRVEAEDGQYDVLFIGTDNGIVLKVITIYNQEMESMEEV ILEELQIFKQQLYIGSASAVAQVRFHHCDMYGSACADCCLARDPYCAWDGISCSRYYPTG THAKRRFRRQDVRHGNAAQQCFGQQFVGDALDKTEEHLAYGIENNSTLLECTPRSLQAKV IWFVQKGRETRKEEVKTDDRVVKMDLGLLFLRLHKSDAGTYFCQTVEHSFVHTVRKITLE VVEEEKVEDMFNKDDEEDRHHRMPCPAQSSISQGAKPWYKEFLQLIGYSNFQRVEEYCEK VWCTDRKRKKLKMSPSKWKYANPQEKKLRSKPEHYRLPRHTLDS >gi568815591r:83267589_83748542|GENSCAN_predicted_CDS_3|1575_bp atgcctaataaaatcacacattcagaatctcacattcttcctggtttctttcagcatcct aagcagaatgtgagtgtgaaccgggatgttgaaatatgtagtgaattgtttgctggactc tacagtgactactggagcagagacgctgcgatcttccgcagcatggggcgactggcccat atccgcactgagcatgacgatgagcgtctgttgaaagaggacgtttttttgctacctacc agagatcataagaatccagtgatatttggactctttaacactaccagtaatatttttcga gggcatgctatatgtgtctatcacatgtctagcattcgggcagccttcaacggaccatat gcacataaggaaggacctgaataccactggtcagtctatgaaggaaaagtcccttatcca aggcctggttcttgtgccagcaaagtaaatggagggagatacggaaccaccaaggactat cctgatgatgccatccgatttgcaagaagtcatccactaatgtaccaggccataaaacct gcccataaaaaaccaatattggtaaaaacagatggaaaatataacctgaaacaaatagca gtagatcgagtggaagctgaggatggccaatatgacgtcttgtttattgggacagataat ggaattgtgctgaaagtaatcacaatttacaaccaagaaatggaatcaatggaagaagta attctagaagaacttcagatattcaagcaacagctgtatattggatctgcttctgctgtg gctcaagtcagattccatcactgtgacatgtatggaagtgcttgtgctgactgctgcctg gctcgagacccttactgtgcctgggatggcatatcctgctcccggtattacccaacaggc acacatgcaaaaaggcgtttccggagacaagatgttcgacatggaaatgcagctcagcag tgctttggacaacagtttgttggggatgctttggataagactgaagaacatctggcttat ggcatagagaacaacagtactttgctggaatgtaccccacgatctttacaagcgaaagtt atctggtttgtacagaaaggacgtgagacaagaaaagaggaggtgaagacagatgacaga gtggttaagatggaccttggtttactcttcctaaggttacacaaatcagatgctgggacc tatttttgccagacagtagagcatagctttgtccatacggtccgtaaaatcaccttggag gtagtggaagaggagaaagtcgaggatatgtttaacaaggacgatgaggaggacaggcat cacaggatgccttgtcctgctcagagtagcatctcgcagggagcaaaaccatggtacaag gaattcttgcagctgatcggttatagcaacttccagagagtggaagaatactgcgagaaa gtatggtgcacagatagaaagaggaaaaagcttaaaatgtcaccctccaagtggaagtat gccaaccctcaggaaaagaagctccgttccaaacctgagcattaccgcctgcccaggcac acgctggactcctga >gi568815591r:83267589_83748542|GENSCAN_predicted_peptide_4|185_aa MAVMTLGKSIRKDFAAKTVWKAERNCPFGFLDLHTMLLDEYQERLFVGGRDLVYSLSLER ISDGYKEGECANYVRVLHHYNRTHLLTCGTGAFDPVCAFIRVGYHLEGASEVIGQCRSSA AKPRRSGKESESLGPEFQGLWEWLPLHDPCRSFGSTDKMCLLCLSQKMKGIEIKGEIEEW KGESG >gi568815591r:83267589_83748542|GENSCAN_predicted_CDS_4|558_bp atggcagtgatgacattaggaaaaagcataagaaaggatttcgcagcaaaaacggtttgg aaagctgaaagaaactgcccttttggatttcttgatctccatacaatgctgctggatgaa tatcaagagaggctcttcgtgggaggcagggaccttgtatattccctcagcttggagaga atcagtgacggctataaagagggtgaatgtgcaaattatgttcgggttttgcatcactat aacaggacacaccttctgacctgtggtactggagcttttgatccagtttgtgccttcatc agagttggatatcatttggagggggcttccgaggtgatcgggcagtgtcggtcttcagcc gctaagccgagaagatctgggaaggagtcagagagccttgggccagagttccaggggctc tgggaatggctgccactccatgacccgtgccggagttttgggtccacggataaaatgtgt ctcctttgtctctcccagaaaatgaaaggaattgaaattaagggagagattgaagagtgg aaaggagaaagtggttga >gi568815591r:83267589_83748542|GENSCAN_predicted_peptide_5|134_aa MESYAAKKNEIMSFSGTWMKLKAIILSRQTQEQKTKHHHSWKWFVCLHIFGIFPIPNLIT CTWKILVISFENSGALLWNKALNKLPAGCTNLNCSHLLLLLLLLLVQRPPVRGINMEYVW LKVAAFEYKRSSEQ >gi568815591r:83267589_83748542|GENSCAN_predicted_CDS_5|405_bp atggaatcctatgcagccaaaaagaatgagatcatgtccttttcagggacgtggatgaaa ctgaaagccatcatcctcagcagacaaacacaggaacagaaaaccaaacaccaccattct tggaaatggttcgtttgtttacatatatttggcatatttcccatccctaatctgatcacc tgcacctggaaaatcctggtaatttcatttgaaaattcaggtgccctcttatggaacaaa gcgctaaataaacttcctgccggctgcacaaatttaaattgttctcacctgctgctgctg ctgctgctgttgctggtccagagaccacctgtgagaggcattaacatggagtatgtctgg ttaaaagtagcagcatttgagtataaaaggtcatcagaacaataa >gi568815591r:83267589_83748542|GENSCAN_predicted_peptide_6|47_aa MEYYAAVKRNEIMSFAGTWMELEAIILGKLTQEQKTKHSVFSLISGN >gi568815591r:83267589_83748542|GENSCAN_predicted_CDS_6|144_bp atggaatactatgcagccgtaaaaaggaatgagatcatgtcctttgcagggacatggatg gagctggaagccattatcctcggcaaactaacacaggaacagaaaaccaagcacagcgtg ttctcacttataagtgggaactga >gi568815591r:83267589_83748542|GENSCAN_predicted_peptide_7|302_aa MRSHSSALGWSMGLGAVEQGAALLGEAAQEPTEVGEGSGMAGCSPQACPEGRQLRPGEKS STAPETRGYVGEPHITRISGRPVGLRAAFGVTGYKKRNPLVLQLQEAEFNQRLEPPDGKI PPNRGQQTPHTGELRLATGRCPSQMKLPEEGAGSNLCCFAASTGDTQANRVWSGTPVNSS ELQQTPADLEKRSLTVRKKTPKQEAIASTSTERMTTQKLHLKVTNLKDKRTNYNNHMIIS IDAEKAFDKIQHPFMLKTLNRRGIDGMYLKIIRAVYDKPTASIIMDGQKLEAFPLKIGTR QR >gi568815591r:83267589_83748542|GENSCAN_predicted_CDS_7|909_bp atgcgctcgcactcctcagcccttgggtggtcaatgggactgggcgccgtggagcagggg gcggcgctcctcggggaggctgcacaggaacccacggaggtgggggaaggctcaggcatg gcgggctgcagtccccaggcctgccccgagggaaggcagctaaggcccggcgagaaatct agcacagcgccggaaactagaggctatgttggagaaccccacataacaagaatctctggg agacctgtaggactgagggcagcctttggagtgacaggctataagaaacggaatccctta gtcctacaattacaagaagctgaatttaaccagcgacttgagcctcctgatgggaagata cctcccaacaggggccaacagacacctcatacgggagagctccggctagcaactggcagg tgcccctctcagatgaagcttccagaggaaggagcaggcagtaatctttgctgttttgca gcctccactggtgatacccaggcaaacagggtctggagtggaactccagtgaactccagt gaactccagcaaactccagcagacctggagaagaggagcctgactgttagaaagaaaact cccaagcaggaagcaatagcatcaacatcaactgaaaggatgactactcaaaaactccat ctgaaggtcaccaacctcaaagacaaaagaaccaattacaacaaccacatgattatctca atagatgcagaaaaggcctttgataaaattcaacaccccttcatgctaaaaacactcaat agacgaggtattgatggaatgtatctcaaaataataagagctgtttatgacaaacccaca gccagtatcatcatggatgggcaaaagctggaagcattccctttgaaaatcggcacaaga caaagatga >gi568815591r:83267589_83748542|GENSCAN_predicted_peptide_8|231_aa MAILPKVIYRFNAIPIKLPITFFRELEKTTLRFIWNQKRARIAKIILSQKNKAGGIMLPD FKLYYKATVTKTAWYQYQNRDIDQWNRTEPSEIIPHIYNHLIFDKPDRNKQWGKDSLFNK WCWENWLAICRKLKLDPFLTSNTKINLRWIKDLHVRPKTIKTLEENLDDTIQDIGMGKDF MSKTAKPMATKSKIDKWDLIKLKSFCTAKRNYHQGEQATYRMGENFCNLHI >gi568815591r:83267589_83748542|GENSCAN_predicted_CDS_8|696_bp atggccatactgcccaaggtaatttatagattcaatgccatccccatcaagctgccaatt actttcttcagagaattggaaaaaactactttaaggttcatatggaaccaaaaaagagcc cgtatagccaagataatcctaagccaaaagaacaaagctggcggcatcatgctacctgac ttcaaactatactacaaggctacagtaaccaaaacagcatggtaccagtaccaaaacaga gatatagatcaatggaacagaacagagccctcagaaataataccacacatctacaaccat ctgatctttgacaaacctgacagaaacaagcaatggggaaaggattccctatttaataaa tggtgctgggaaaactggctagccatatgtagaaagctgaaactggatcccttccttaca tctaacacaaaaattaatttaagatggattaaagacttacatgttagacctaaaaccata aaaaccctagaagagaacctagacgataccattcaggacataggcatgggcaaggacttc atgtctaaaacagcaaaaccaatggcaacaaaatccaaaattgacaaatgggatctaatt aagctaaagagcttctgcacagcaaaaagaaactaccatcagggtgaacaggcaacctac agaatgggagaaaatttttgcaatctacacatctga >gi568815591r:83267589_83748542|GENSCAN_predicted_peptide_9|230_aa MCPSEMKLPEERSGSNICCSAIFAVLQPLLVIPRQTWSGVDLQQTPTDLQLRVLFVRRKT NKQKGHPHQNPICMSQSSKTKARQASIQIQEIQRMPQRYSSSRATPRHIIVRFTKVRMKE KMLRAAREKGRVTHKGKPIRLTVDLSAETLQARREWGPIFNILKEKNFQPRISYPAKLSF ISEGEIKSFTNKQMLRDFVTTRPALQELLKEALNMERNNQYQPLQKHAKL >gi568815591r:83267589_83748542|GENSCAN_predicted_CDS_9|693_bp atgtgcccctctgagatgaagcttccagaggaacgatcaggcagcaacatttgctgttct gcaatatttgctgttctgcagcccctgctggtgatacccaggcaaacatggtctggagtg gacctccagcaaactccaacagacctgcagctgagggtcctgtttgttagaaggaaaact aacaaacagaaaggacatccacaccaaaaccccatctgtatgtcacaatcatcaaagacc aaagcaaggcaggccagcattcaaattcaggaaatacagagaatgccacaaagatactcc tcgagcagagcaactccaagacacataattgtcagattcaccaaagttcgaatgaaggaa aaaatgttaagggcagccagagagaaaggtcgggttacccacaaagggaagcccatcaga ctaacagtggatctctcggcagaaactctacaagccagaagagagtgggggccaatattc aacattcttaaagaaaagaattttcaacccagaatttcatatccagccaaactaagcttc ataagtgaaggagaaataaaatcctttacaaacaagcaaatgctgagagattttgtcacc accaggcctgccttacaagagctcctgaaggaagcactaaacatggaaaggaacaaccag taccagccacttcaaaaacatgcaaaattgtaa >gi568815591r:83267589_83748542|GENSCAN_predicted_peptide_10|254_aa MNIDAKILNKILANRIQQHIKKLIHHDQVGFIPGMQGWFNTRKSINVIQHINRTKDKNHM IISIDAEKAFDKIQQCFMLKTLNKLGIDGTYLKIIRAIYDKPTANIILNGQKLEAFPLKT GTRQGCPLSPPLFNIVLEVLARAIRQEKEIKGIQLGKEEVKLSLFAEDMIVYLENPIVSA QNLLKLISNFSKVSGYKINVQKSQAFLYTNNRQTERQIMSELPFTIASENKIPRNLTYKG REGPLQGELQTTAQ >gi568815591r:83267589_83748542|GENSCAN_predicted_CDS_10|765_bp atgaacattgatgcaaaaatcctcaataaaatactggcaaaccgaatccagcagcacatc aaaaagcttatccaccatgatcaagtgggcttcatccctgggatgcaaggctggttcaat acacgcaaatcaataaatgtaatccagcatataaacagaaccaaagacaaaaaccacatg attatctcaatagatgcagaaaaggcctttgacaaaattcaacaatgcttcatgctaaaa actctcaataaattaggtattgatgggacgtatctcaaaataatacgagctatctatgac aaacccacagccaatatcatactgaatgggcaaaaactggaagcattccctttgaaaact ggcacaagacagggatgccctctctcaccacccttattcaacatagtgttggaagttctg gccagggcaattaggcaggagaaggaaataaagggtattcaattaggaaaagaggaagtc aaattgtccctgtttgcagaagacatgattgtatatctagaaaaccccattgtctcagcc caaaatctccttaagctgataagcaacttcagcaaagtctcaggatacaaaatcaatgta caaaaatcacaagcattcttatacaccaacaacagacaaacagagagacaaatcatgagt gaactcccattcacaattgcttcagagaataaaatacctaggaatctaacttacaaggga cgtgaaggacctcttcaaggagaactacaaaccactgctcaatga