GENSCAN 1.0    Date run: 8-Nov-116    Time: 06:13:34

Sequence gi568815583f:78339792_78598140 : 258349 bp : 43.12% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +    638    707   70  1  1  114   94   199 0.845  24.31
 1.02 Intr +   1252   1430  179  2  2   76   72   372 0.641  34.04
 1.03 Intr +   3708   3821  114  2  0   98   58   142 0.866  12.84
 1.04 Term +   8136   8186   51  2  0  113   49   101 0.728   6.13
 1.05 PlyA +  12312  12317    6                               1.05

 2.06 PlyA -  13861  13856    6                               1.05
 2.05 Term -  21927  21756  172  2  1   97   43    47 0.061  -1.60
 2.04 Intr -  34853  34715  139  0  1   87   53    72 0.359   3.12
 2.03 Intr -  36602  36400  203  1  2    8   47   125 0.505  -0.67
 2.02 Intr -  37907  36830 1078  0  1  -43   53   418 0.119  14.43
 2.01 Init -  38931  38355  577  0  1   62   40   239 0.208  11.81
 2.00 Prom -  39555  39516   40                              -2.16

 3.08 PlyA -  40041  40036    6                               1.05
 3.07 Term -  45094  44996   99  1  0   61   37    77 0.082  -1.97
 3.06 Intr -  49370  49263  108  2  0    5   60   136 0.210   2.98
 3.05 Intr -  57616  57463  154  0  1  115   64     8 0.062   1.17
 3.04 Intr -  65507  65432   76  0  1  102   65    56 0.110   3.27
 3.03 Intr -  70981  70888   94  1  1   69   50   135 0.920   7.34
 3.02 Intr - 112692 112551  142  2  1   96   84    73 0.744   8.06
 3.01 Init - 118792 118716   77  1  2   72   78    60 0.511   1.96
 3.00 Prom - 130302 130263   40                              -4.26

 4.00 Prom + 135339 135378   40                              -2.26
 4.01 Init + 136493 136568   76  1  1   78  110    22 0.988   4.65
 4.02 Intr + 138506 138606  101  0  2   94   76   104 0.991   9.63
 4.03 Intr + 143527 143643  117  0  0  110  100     6 0.960   4.66
 4.04 Intr + 144970 145129  160  0  1   11   94   106 0.998   3.06
 4.05 Intr + 145914 146049  136  1  1   80  106    52 0.960   5.83
 4.06 Intr + 154118 154265  148  2  1   82   93    46 0.827   4.74
 4.07 Intr + 167805 167880   76  2  1   63  105    52 0.548   3.49
 4.08 Intr + 173293 173634  342  2  0  123   82   243 0.967  22.60
 4.09 Intr + 175231 175316   86  2  2   44  110    69 0.692   4.24
 4.10 Intr + 187607 187772  166  1  1   15   82   163 0.843   7.93
 4.11 Term + 193419 193879  461  1  2  123   42   279 0.994  22.05
 4.12 PlyA + 194831 194836    6                               1.05

 5.00 Prom + 195336 195375   40                              -4.76
 5.01 Init + 199547 199619   73  1  1   85   66    65 0.787   5.23
 5.02 Intr + 200629 200748  120  2  0   57   93   136 0.906  11.47
 5.03 Intr + 202692 202854  163  1  1   83   80   107 0.974   8.43
 5.04 Intr + 205078 205166   89  1  2  100   53    33 0.990   0.61
 5.05 Intr + 205843 205973  131  2  2   67   68    89 0.996   5.21
 5.06 Intr + 206784 206907  124  2  1   71   86    57 0.995   3.96
 5.07 Term + 208999 209153  155  2  2   71   39   137 0.903   5.18
 5.08 PlyA + 209380 209385    6                               1.05

 6.00 Prom + 221373 221412   40                              -4.26
 6.01 Init + 225929 226034  106  1  1  102   99   200 0.965  20.88
 6.02 Intr + 241020 241171  152  1  2   47   66    83 0.038   1.88
 6.03 Intr + 248523 248632  110  2  2   50   97    47 0.867   0.88
 6.04 Intr + 250014 250845  832  0  1   63   58   424 0.362  28.30
 6.05 Term + 253301 253462  162  1  0  121   48    11 0.290  -1.66
 6.06 PlyA + 254309 254314    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Intr +  81077  81196  120  1  0  103   85    43 0.917   6.19
S.002 Term +  83156  83287  132  1  0   80   43   113 0.962   4.09
S.003 Term +  91459  91549   91  1  1   96   45    96 0.822   3.29

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815583f:78339792_78598140|GENSCAN_predicted_peptide_1|137_aa
MPNFAGTWKMRSSENFDELLKALGVNAMLRKVAVAAASKPHVEIRQDGDQFYIKTSTTVR
TTEINFKVGEGFEEETVDGRKCRSLATWENENKIHCTQTLLEGDGPKTYWTRELANDELI
LTFGADDVVCTRIYVRE

>gi568815583f:78339792_78598140|GENSCAN_predicted_CDS_1|414_bp
atgcccaacttcgccggcacctggaagatgcgcagcagcgagaatttcgacgagctgctc
aaggcactgggtgtgaacgccatgctgaggaaagtggccgtagcggctgcgtccaagccg
cacgtggagatccgccaggacggggatcagttctacatcaagacatccaccacggtgcgc
accactgagatcaacttcaaggtcggagaaggctttgaggaggagaccgtggacggacgc
aagtgcaggagtttagccacttgggagaatgagaacaagatccactgcacgcaaactctt
cttgaaggggacggccccaaaacctactggacccgtgagctggccaacgatgaacttatc
ctgacgtttggcgccgatgacgtggtctgcaccagaatttatgtccgagagtga

>gi568815583f:78339792_78598140|GENSCAN_predicted_peptide_2|722_aa
MDKFLDTYTLPRLNQEEVESLNRPITGSEIEAIIKSLPTKKSPGPDGFTAEFYQRYNEEL
VPFLLKLFQSIEKEGILSNSFYEASIILIPKPGRDTTKKENFRPISLMNIDAKILNKILA
NRIQQHIKKLIHHNQVSFIPGMQGWFNIHKSMKVIQHINRTKDKNYMIISIDAEKAFDKI
QQPFMLKTLKKLGIQLTKDVKDLFKENYKALLNEIKEDTNKWKNIPCSWIGRINIMKMAI
LPKVIYRFNAIPIKPPMTFFTELEKTTLKFIWNQKRVCIAKTILSQKNKAGGITLPDFKL
YYKATVTKAAWYCYQNRDIDQWNRTEPSEIIPHIYNHLIFDKPDKNKQWGKDSLFNKWCW
ENWLAIWRKLKLDPFLTPYTKINSRWIKDLNVRPKIIKTLEENLGNTIQDIDMGKDFMTK
TPKAMATKAKIDKWDLVKLKSFFTAKETTIGVNRQPTEWEKIFTIYPSDKVLISRIHKEL
KQIYKKKPKNPIKKWAKDMNRHFSKEDIYAANRHMKKCSSPLAIREMQIKTTMRYHLTPV
RMAIIKKSGNNSKDLEPTQMSINDDWIKKMWHIYIMEYDAAIKKDEFLSFVGTWMKLETI
ILSKLSQGEKTKHRMFSLIGQGPVQPQHLVLAWHRVECSFPENPHKVPTPGLLSGSENES
IVGPGRQSLPQSTSDDPTVLGLPDPRSQQTQTKESTFCLNAPAAVPERALIGLAWITCLS
QN

>gi568815583f:78339792_78598140|GENSCAN_predicted_CDS_2|2169_bp
atggataaattcctggacacatacaccctcccaagactaaaccaggaagaagttgaatcc
ctaaatagaccaataacaggctctgaaattgaggcaataattaagagcctaccaaccaaa
aaaagtccaggaccagacggattcacagctgaattctaccagaggtataacgaggagctg
gtaccattccttctgaaactattccaatcaatagaaaaagagggaatcctctctaactca
ttttatgaagccagcatcatcctgataccaaagcctggaagagacacaacaaaaaaagag
aattttagaccaatatccctaatgaacatcgatgcaaaaatcctcaataaaatactggca
aaccgaatccagcagcacatcaaaaagcttatccaccacaatcaagttagcttcatccct
gggatgcaaggctggttcaacatacacaaatcaatgaaagtaatccagcatataaacaga
accaaagacaaaaactacatgattatctcaatagatgcagaaaaggcctttgacaaaatt
caacagcccttcatgctaaaaactctcaaaaaactaggaatccaacttacaaaggatgtg
aaggacctcttcaaggagaactacaaagcactgctcaacgaaataaaagaggacacaaac
aaatggaagaacattccatgctcatggataggaagaatcaatatcatgaaaatggccata
ctgcccaaggtaatttatagattcaatgccatccccatcaagccaccaatgactttcttc
acagaattggaaaaaactactttaaagttcatatggaaccaaaaaagagtctgcattgcc
aagacaatcctaagccaaaagaacaaagctggaggcatcacgctacctgacttcaaacta
tactacaaggctacagtaaccaaagcagcatggtactgctaccaaaacagagatatagat
caatggaacagaacagagccctcagaaataataccacacatctacaaccatctgatcttt
gacaaacctgacaaaaacaagcaatggggaaaggattccctatttaataaatggtgctgg
gaaaactggctagccatatggagaaagctgaaactggatcctttccttacaccttataca
aaaattaattcaagatggattaaagacttaaatgttagacctaaaatcataaaaacccta
gaagaaaacctaggcaataccattcaggacatagacatgggcaaggacttcatgactaaa
acaccaaaagcaatggcaacaaaagccaaaattgacaaatgggatctagttaaactaaag
agcttctttacagcaaaagaaactaccatcggagtgaacagacaacctacagaatgggag
aaaatttttacaatctacccatctgacaaagtgctaatatccagaatccacaaagaactt
aaacaaatttacaagaaaaaaccaaagaaccccatcaaaaagtgggcaaaggatatgaac
agacacttctcaaaagaagacatttatgcagccaacagacacatgaaaaaatgctcatca
ccactggccatcagagaaatgcaaatcaaaaccacaatgagataccatctcacaccagtt
agaatggcgatcattaaaaagtcaggaaacaacagcaaagacttggaaccaacccaaatg
tccatcaatgatgactggattaagaaaatgtggcacatatacatcatggaatacgatgca
gccataaaaaaggatgagttcttgtcctttgtagggacatggatgaagctggaaaccatt
attctgagcaaactatcgcaaggagagaaaaccaaacaccgcatgttctcactcataggg
cagggccctgtgcagcctcagcacctggtgctggcctggcaccgagttgagtgctcattc
cctgagaacccgcataaggtgcccacaccaggccttctcagtggctctgaaaatgagagc
atagtgggacctgggaggcagagtctcccgcaaagtaccagtgatgaccccacagttcta
ggcttacctgatccccgtagccagcaaacccagacgaaagagagcaccttctgcctaaat
gctccggctgcggtcccagagagggctctgattggcttagcttggatcacatgcctgtct
cagaactga

>gi568815583f:78339792_78598140|GENSCAN_predicted_peptide_3|249_aa
MMTSWERWLMLVILALWEVRLEDHLREYNCSYGFNYHYIYTNNSKIFSVVQVCLLSSKTT
CPIQEVNMFTLILLIEFGVVAAPEGQQSTAVYHWCNTAVLDPLALAGMMAVAAPDDPPPP
SVSSSKPSTVGTRQLSGPWHARLCPREADIMELIKPTWTQVPGIQEAVLSHPPQQSPADA
GLPGSGQRSSKGAVPPCRLHIPEVMAAVITDKDVPMLPQSRLLQDAWGLGDPWTAVRATL
TDYGKEQSS

>gi568815583f:78339792_78598140|GENSCAN_predicted_CDS_3|750_bp
atgatgaccagctgggaacggtggctcatgcttgtaatcctagcactttgggaggtgagg
ctggaggatcatttgagggagtacaactgctcctatggcttcaattatcactacatctac
actaataattccaaaatattttctgtagtccaagtctgtcttctaagctccaaaaccaca
tgcccgattcaggaggtgaacatgtttactttaatactgctgattgaatttggtgtggtc
gccgcacctgagggccagcagtccacagctgtctaccactggtgcaataccgcggtgctg
gacccactggcattggctggcatgatggcagtggctgctcctgacgacccgccaccgcca
tcggtatcatcttccaagccatcaacagtgggcaccaggcagctgtcagggccctggcat
gcaaggctgtgcccaagggaagcggacatcatggagctaattaaaccaacgtggacccaa
gtgcctggaattcaggaggctgtcctctcccacccaccccaacagagccccgcagacgct
ggattaccaggaagtggacaaagaagcagcaaaggggcagttcctccatgtcgtctgcac
atcccagaagtgatggccgcggtcattactgacaaagatgtacccatgttgccacagtct
cggctgctgcaggatgcctggggtctgggcgatccatggacagcagtgagggctacattg
acagattacggcaaagagcagagcagttag

>gi568815583f:78339792_78598140|GENSCAN_predicted_peptide_4|622_aa
MCPEYGAILSFFPVDNVTLKHLEHTGFSKAKLESMETYLKAVKLFRNDQNSSGEPEYSQV
IQINLNSIVPSVSGPKRPQDRVAVTDMKSDFQACLNEKVGFKGFQIAAEKQKDIVSIHYE
GSEYKLSHGSVVIAAVISCTNNCNPSVMLAAGLLAKKAVEAGLRVKPYIRTSLSPGSGMV
THYLSSSGVLPYLSKLGLTPREFNSYGARRGNDAVMTRGTFANIKLFNKFIGKPAPKTIH
FPSGQTRRCAAEAALPVCGKAGSTPGRRVAADIMSSGNYQQSEALSKPTFSEEQASALVE
SVFGLKVSKVRPLPSYDDQNFHVYVSKTKDGPTEYVLKISNTKASKNPDLIEVQNHIIMF
LKAAGFPTASVCHTKGDNTASLVSVGRPIAELPVSPQLLYEIGKLAAKLDKTLQLSSLHR
ENFIWNLKNVPLLEKYLYALGQNRNREIVEHVIHLFKEEVMTKLSHFRECINHGDLNDHN
ILIESSKSASGNAEYQVSGILDFGDMSYGYYVFEVAITIMYMMIESKSPIQVGGHVLAGF
ESITPLTAVEKGALFLLVCSRFCQSLVMAAYSCQLYPENKDYLMVTAKTGWKHLQQMFDM
GQKAVEEIWFETAKSYESGISM

>gi568815583f:78339792_78598140|GENSCAN_predicted_CDS_4|1869_bp
atgtgtccggaatatggtgctatcctcagctttttccctgttgacaatgtgacattaaaa
catttagaacatacaggttttagcaaagccaaactcgaatcaatggaaacataccttaaa
gctgtgaaattgtttcgaaatgaccagaattcttcaggagaacctgaatactcccaggtg
atccagattaatctgaattcaatagttccatctgttagtggtccaaaaagacctcaggat
agagttgctgtgacagatatgaaaagcgatttccaggcttgcttaaatgaaaaggttgga
tttaaaggcttccaaattgcagctgaaaaacaaaaggatattgtctccattcattatgaa
ggaagtgaatataagctgtctcatggatcagtggtcattgctgcagttatcagttgtacc
aataattgcaatccatctgtcatgcttgctgcaggtcttttggctaaaaaggctgttgaa
gctggtctgcgtgttaaaccttatataagaacaagtttatctccaggcagtgggatggtt
acacattacctcagttcaagtggagtattaccatatctaagtaagcttggccttacccct
cgtgaattcaactcttacggagctcgaagaggtaatgatgctgtaatgacaagaggcact
tttgcaaatatcaagctttttaataagtttattggaaaaccagctcctaaaacaattcat
tttccatcaggacagacgcgccggtgcgcggccgaggccgcactacctgtctgcgggaaa
gcgggatccaccccaggacgtcgggtcgctgccgacataatgtcaagtggaaactatcag
cagtcagaggctcttagcaaacccactttcagtgaggaacaagcctctgcgttagtggag
tcagtgtttgggttgaaagtttccaaggtccggccacttcctagctatgatgaccaaaac
tttcatgtctacgtttcaaaaaccaaagatggcccaactgaatatgtcctcaaaataagc
aacaccaaggctagcaaaaatccagacctgattgaagtgcagaatcacatcatcatgttt
ctgaaagccgctggatttccaacagcctctgtgtgtcacactaaaggagacaacacagct
tctctcgtgtctgtaggaagacccatcgctgagcttcccgtcagcccccagctattgtat
gaaattggaaaactagctgccaaattggataagacactgcagttaagtagtcttcatcgg
gagaacttcatctggaatctgaaaaatgttcctcttctggagaaatacctgtatgccctg
ggccagaatcgaaaccgagagattgttgagcatgtcattcatctgttcaaggaggaagta
atgaccaaattaagtcattttcgagaatgtatcaatcacggagatcttaatgaccataat
attttaatagagtccagcaagtcagcctctggaaatgctgaatatcaagtgtctgggatt
ttagactttggtgacatgagctatggctactatgtgtttgaagtggcaattaccatcatg
tacatgatgattgagagcaagagtcctatacaagtaggaggccatgtccttgcagggttt
gaaagcatcaccccactgacagctgtagagaagggtgctttgtttttacttgtatgcagt
cgtttttgtcagtcacttgtcatggctgcatactcttgccagctatacccagagaacaaa
gactatctcatggttactgcaaaaaccgggtggaaacacttacagcaaatgtttgacatg
ggtcagaaagctgtagaagaaatctggtttgaaactgccaaatcctatgaatctgggatc
tccatgtga

>gi568815583f:78339792_78598140|GENSCAN_predicted_peptide_5|284_aa
MDNEHRNKYDNFSKEDKMVRVYADAVLRMRGGHISSGYSVSGGGLFFRGVKGSVDISGLQ
GLPSGRLYQVEYAMEAIGHAGTCLGILANDGVLLAAERRNIHKLLDEVFFSEKIYKLNEY
LLQYQEPIPCEQLVTALCDIKQAYTQFGGKRPFGVSLLYIGWDKHYGFQLYQSDPSGNYG
GWKATCIGNNSAAAVSMLKQDYKEGEMTLKSALALAIKVLNKTMDVSKLSAEKVEIATLT
RENGKTVIRVLKQKEVEQLIKKHEEEEAKAEREKKEKEQKEKDK

>gi568815583f:78339792_78598140|GENSCAN_predicted_CDS_5|855_bp
atggataatgaacacagaaacaagtatgacaatttcagtaaagaagataaaatggtgcga
gtatatgctgacgcggttctgcgcatgcgcgggggccatattagcagcggttattcggtg
agcggtggtggtttattcttccgtggagttaagggctccgtggacatctcaggtcttcag
ggtcttccatctggtcgcttataccaagttgaatatgccatggaagctattggacatgca
ggcacctgtttgggaattttagcaaatgatggtgttttgcttgcagcagagagacgcaac
atccacaagcttcttgatgaagtctttttttctgaaaaaatttataaactcaatgagtat
ttattacagtatcaggagccaataccttgtgagcagttggttacagcgctgtgtgatatc
aaacaagcttatacacaatttggaggaaaacgtccctttggtgtttcattgctgtacatt
ggctgggataagcactatggctttcagctctatcagagtgaccctagtggaaattacggg
ggatggaaggccacatgcattggaaataatagcgctgcagctgtgtcaatgttgaaacaa
gactataaagaaggagaaatgaccttgaagtcagcacttgctttagctatcaaagtacta
aataagaccatggatgttagtaaactctctgctgaaaaagtggaaattgcaacactaaca
agagagaatggaaagacagtaatcagagttctcaaacaaaaagaagtggagcagttgatc
aaaaaacatgaggaagaagaagccaaagctgagcgtgagaagaaagaaaaagaacagaaa
gaaaaggataaatag

>gi568815583f:78339792_78598140|GENSCAN_predicted_peptide_6|453_aa
MAARGSGPRALRLLLLVQLVAGRCGLAGAAGGAQRGLSEPSSIAKHEDSLLKDLFQDYER
WVRPVEHLNDKIKIKFGLAISQLVDVEWIDVKLRWNPDDYGGIKVIRVPSDSVWTPDIVL
FDNADGRFEGTSTKTVIRYNGTVTWTPPANYKSSCTIDVTFFPFDLQNCSMKFGSWTYDG
SQVDIILEDQDVDKRDFFDNGEWEIVSATGSKGNRTDSCCWYPYVTYSFVIKRLPLFYTL
FLIIPCIGLSFLTVLVFYLPSNEGEKICLCTSVLVSLTVFLLVIEEIIPSSSKVIPLIGE
YLVFTMIFVTLSIMVTVFAINIHHRSSSTHNAMAPLVRKIFLHTLPKLLCMRSHVDRYFT
QKEETESGSGPKSSRNTLEAALDSIRYITRHIMKENDVREVVEDWKFIAQVLDRMFLWTF
LFVSIVGSLGLFVPVIYKWANILIPVHIGNANK

>gi568815583f:78339792_78598140|GENSCAN_predicted_CDS_6|1362_bp
atggcggcgcgggggtcagggccccgcgcgctccgcctgctgctcttggtccagctggtc
gcggggcgctgcggtctagcgggcgcggcgggcggcgcgcagagaggattatctgaacct
tcttctattgcaaaacatgaagatagtttgcttaaggatttatttcaagactacgaaaga
tgggttcgtcctgtggaacacctgaatgacaaaataaaaataaaatttggacttgcaata
tctcaattggtggatgtggaatggatagatgtaaaattaagatggaaccctgatgactat
ggtggaataaaagttatacgtgttccttcagactctgtctggacaccagacatcgttttg
tttgataatgcagatggacgttttgaagggaccagtacgaaaacagtcatcaggtacaat
ggcactgtcacctggactccaccggcaaactacaaaagttcctgtaccatagatgtcacg
tttttcccatttgaccttcagaactgttccatgaaatttggttcttggacttatgatgga
tcacaggttgatataattctagaggaccaagatgtagacaagagagatttttttgataat
ggagaatgggagattgtgagtgcaacagggagcaaaggaaacagaaccgacagctgttgc
tggtatccgtatgtcacttactcatttgtaatcaagcgcctgcctctcttttataccttg
ttccttataataccctgtattgggctctcatttttaactgtacttgtcttctatcttcct
tcaaatgaaggtgaaaagatttgtctctgcacttcagtacttgtgtctttgactgtcttc
cttctggttattgaagagatcataccatcatcttcaaaagtcatacctctaattggagag
tatctggtatttaccatgatttttgtgacactgtcaattatggtaaccgtcttcgctatc
aacattcatcatcgttcttcctcaacacataatgccatggcgcctttggtccgcaagata
tttcttcacacgcttcccaaactgctttgcatgagaagtcatgtagacaggtacttcact
cagaaagaggaaactgagagtggtagtggaccaaaatcttctagaaacacattggaagct
gcgctcgattctattcgctacattacaagacacatcatgaaggaaaatgatgtccgtgag
gttgttgaagattggaaattcatagcccaggttcttgatcggatgtttctgtggactttt
cttttcgtttcaattgttggatctcttgggctttttgttcctgttatttataaatgggca
aatatattaataccagttcatattggaaatgcaaataagtga