GENSCAN 1.0 Date run: 3-Nov-116 Time: 22:34:42

Sequence gi568815592f:36799445_37023438 : 223994 bp : 48.11% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.09 Intr -    626    523  104  2  2  108   84   252 0.929  25.77
 1.08 Intr -   5660   5524  137  0  2   83   89    82 0.934   8.09
 1.07 Intr -  14914  14734  181  1  1   36   49   104 0.129   0.74
 1.06 Intr -  22716  22670   47  1  2  124   92    34 0.594   5.53
 1.05 Intr -  23654  23614   41  1  2  140  113    61 0.599  11.67
 1.04 Intr -  28134  27945  190  1  1   72   72    73 0.186   2.74
 1.03 Intr -  30888  30840   49  0  1   68   51    40 0.125  -3.45
 1.02 Intr -  32932  32797  136  0  1   44   78   109 0.547   6.07
 1.01 Init -  39933  39839   95  0  2   80   99   172 0.616  17.37
 1.00 Prom -  55225  55186   40                               -4.46

 2.07 PlyA -  55409  55404    6                                1.05
 2.06 Term -  56589  56369  221  1  2  108   38   193 0.912  13.50
 2.05 Intr -  57210  57142   69  1  0   89  114    78 0.374   9.65
 2.04 Intr -  63500  63370  131  1  2    2   69   108 0.217   0.64
 2.03 Intr -  64598  64355  244  0  1  -11   87   144 0.343   0.96
 2.02 Intr -  72428  72274  155  1  2   98   92   168 0.764  17.92
 2.01 Init -  75328  75273   56  1  2   73   94    35 0.394   3.36
 2.00 Prom -  79080  79041   40                               -5.66

 3.00 Prom +  84321  84360   40                               -6.26
 3.01 Init +  88211  88262   52  1  1   36  115    23 0.440   1.12
 3.02 Intr +  95060  95159  100  0  1  105   87    32 0.724   3.97
 3.03 Intr +  99982 100189  208  1  1  101   93   171 0.830  17.98
 3.04 Intr + 102777 102990  214  2  1  117   98   -17 0.848   0.49
 3.05 Intr + 114840 114991  152  1  2  101  105   163 0.994  19.18
 3.06 Intr + 120134 120257  124  1  1  113   88    94 0.952  12.06
 3.07 Term + 123903 123997   95  1  2   84   49   102 0.901   3.89
 3.08 PlyA + 124057 124062    6                                1.05

 4.00 Prom + 147517 147556   40                               -6.16
 4.01 Init + 155317 155487  171  0  0  108  115   336 0.972  35.74
 4.02 Intr + 159701 159922  222  1  0  108   64   479 0.910  45.82
 4.03 Intr + 162007 162116  110  0  2   64   75   158 0.994  11.18
 4.04 Intr + 162442 162550  109  1  1   81   45    65 0.469   1.79
 4.05 Intr + 162684 162792  109  2  1   38   90    82 0.474   3.26
 4.06 Intr + 163491 164168  678  1  0   94  110   228 0.940  17.19
 4.07 Term + 164379 164500  122  1  2  107   49    34 0.945   0.04
 4.08 PlyA + 165367 165372    6                                1.05

 5.13 PlyA - 166919 166914    6                                1.05
 5.12 Term - 169530 169459   72  0  0  118   37    52 0.638   1.01
 5.11 Intr - 170670 170595   76  2  1  101   99    31 0.639   5.02
 5.10 Intr - 171029 170962   68  2  2   69   77    76 0.820   2.30
 5.09 Intr - 171250 171203   48  1  0  107   98    80 0.998   9.78
 5.08 Intr - 171612 171499  114  0  0   81   80    23 0.694   1.44
 5.07 Intr - 173352 173208  145  2  1  101   77   129 0.597  13.38
 5.06 Intr - 177806 177755   52  0  1   92   94    16 0.849   0.57
 5.05 Intr - 178247 178190   58  2  1   83  103    41 0.823   3.56
 5.04 Intr - 178711 178634   78  1  0  103   90   109 0.908  12.35
 5.03 Intr - 179167 179061  107  2  2  112  105   119 0.999  15.93
 5.02 Intr - 182228 182144   85  2  1   47  113    18 0.731  -0.41
 5.01 Init - 186675 186409  267  0  0   93   75   307 0.845  24.88
 5.00 Prom - 189516 189477   40                               -7.26

 6.00 Prom + 193869 193908   40                               -6.96
 6.01 Init + 206374 206441   68  0  2   85   80    69 0.747   6.34
 6.02 Intr + 209390 209621  232  2  1   92   51   186 0.831  13.08
 6.03 Intr + 211529 211610   82  1  1  119  -44   131 0.975   2.21
 6.04 Intr + 212236 212410  175  2  1   81   81   261 0.982  23.70
 6.05 Intr + 214165 214321  157  1  1   87   94   248 0.999  25.31
 6.06 Intr + 214518 214656  139  2  1   71   42   237 0.866  17.44
 6.07 Intr + 215202 215260   59  1  2   86  103    82 0.992   8.10
 6.08 Intr + 215448 215594  147  2  0   57   91   306 0.999  28.23
 6.09 Intr + 216324 216416   93  2  0  100  116   128 0.997  16.96
 6.10 Intr + 217763 217858   96  1  0   78   52    53 0.648   0.91
 6.11 Intr + 221097 221176   80  2  2   99   60   122 0.977   8.85
 6.12 Intr + 221265 221295   31  0  1   82   80     9 0.968  -2.37
 6.13 Intr + 222068 222160   93  1  0  112   84   146 0.999  16.76
 6.14 Intr + 222795 222926  132  2  0   74   73   145 0.983  12.44
 6.15 Intr + 223250 223403  154  1  1   69   89   104 0.989   8.25

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815592f:36799445_37023438|GENSCAN_predicted_peptide_1|327_aa
MEQPEDMASLSEFDSLAGSIPATKVEITVSCRLGTGAIFPDFRESSSDLNNVDSVSRKRV
GNVCWMGKRGIYGAIWKSFVWEQLREAEDGKQDTGETEALRDGVSTCGIWTAAIRDIIGG
SLAATWINMASGALLQTYTARLGTEPKELPFYTPGQGNLLDKDMFSKSDPLCVMYTQGME
NKQWREFMETMLNYDLAFSILVMPLGPGSAVVLTDTGDKELNGMEYCVPPLEIFSQIDFS
FLIDLWVVFLPSLMGYKLGLTGPGGFSDFKIQVLAFFPFVIRPSNEGYKTSSFGRTEVID
NTLNPDFVRKFIVDYFFEEKQNLRFDL

>gi568815592f:36799445_37023438|GENSCAN_predicted_CDS_1|981_bp
atggagcagcctgaggacatggcgtcgctgagcgagttcgactccttggcgggcagcatc
ccggccaccaaggtggagatcaccgtgtcctgcagactagggacaggggccatcttcccg
gatttcagggagtcgtcatcagatctaaacaatgtggattctgtcagcaggaagagagtg
ggcaatgtttgctggatgggcaaacgtggcatctatggggccatttggaaatcgtttgtt
tgggagcagctccgagaggctgaggatgggaagcaggacacgggggaaactgaggcccta
agagacggtgtctcaacatgtggtatttggacggctgccatcagagacataataggggga
tctctggcggccacttggataaacatggcctctggagccctgcttcagacctacacagcc
agacttggaacagagcccaaggaactgccattttataccccggggcaggggaacctcctg
gacaaagacatgttttccaagtccgacccactgtgcgtcatgtatacccaagggatggag
aacaagcagtggcgggagtttatggagaccatgttgaattacgatcttgcattcagcatc
ttggttatgcctcttggtcctgggtcagctgtagtcctcaccgacacaggtgacaaggaa
ttgaatgggatggagtactgcgttcctccactagagatcttttcccagattgacttcagt
tttttaattgacctttgggttgtgttcctgccatcattaatggggtataaattgggcctg
acaggacctggtggcttctcagattttaaaatccaagttttagcatttttcccctttgtt
attcgtccaagtaatgaaggctataaaacatcttcatttgggcgcaccgaagtcatcgac
aacacgctcaatcctgacttcgtgcgcaagttcattgtggattactttttcgaggagaag
cagaacctccgttttgatctn

>gi568815592f:36799445_37023438|GENSCAN_predicted_peptide_2|291_aa
MAAIPPDSWQPPNVYLETSMGIIVLELYWKHAPKTCKNFAELARRGYYNGTKFHRIIKDF
MIQGGDPTGTANTKIEVYWVTPVKDKGSGNQVWKGRSSHSSAGLSNTCESRGAEEEQLPV
SSEEHQENVESWKPKEDSNSSRRVTSSVDQVKTGIQTSLTPKASEKEKTASSEPQSPLED
CGIVPVWYINKVFVSGRGGASIYGKQFEDELHPDLKFTGAGILAMANAGPDTNGSQFFVT
LAPTQWLDGKHTIFGRVCQGIGMVNRVGMVETNSQDRPVDDVKIIKAYPSG
>gi568815592f:36799445_37023438|GENSCAN_predicted_CDS_2|876_bp
atggcggcaattcccccagattcctggcagccacccaacgtttacttggagaccagcatg
ggaatcattgtgctggagctgtactggaagcatgctccaaagacctgtaagaactttgct
gagttggctcgtcgaggttactacaatggcacaaaattccacagaattatcaaagacttc
atgatccaaggaggtgacccaacagggacagcaaacaccaagatagaggtttattgggta
acacctgtgaaagataaagggtcagggaaccaggtttggaagggaaggtcttcacacagc
agcgcaggcctgagtaacacctgtgaaagcagaggggcggaggaggagcagctgccagtg
agcagcgaggaacaccaggagaatgtggagtcgtggaagccaaaggaagacagcaactca
agcagaagggtgacatcgagtgtggatcaagtaaaaactggaatccagacgtctctgacc
ccaaaggcaagtgagaaagagaagacagcatcttcagaaccacaaagccccttggaggac
tgtggcattgttcccgtctggtatattaacaaagtgtttgtgtcaggtcgaggtggtgca
tctatctatggcaaacagtttgaagatgaacttcatccagacttgaaattcacgggggct
ggaattctcgcaatggccaatgcggggccagataccaatggcagccagttctttgtgacc
ctcgcccccacccagtggcttgacggcaaacacaccatttttggccgagtgtgtcagggc
ataggaatggtgaatcgcgtgggaatggtagaaacaaactcccaggaccgccctgtggac
gacgtgaagatcattaaggcatacccttctgggtag

>gi568815592f:36799445_37023438|GENSCAN_predicted_peptide_3|314_aa
MGNETEQVGSDQIIKGHGIIVVNHFPVLSRSVVIKGHVVSELPAQNRKITWDFILEDMDL
AANEISIYDKLSETVDLVRQTGHQCGMSEKAIEKFIRQLLEKNEPQRPPPQYPLLIVVYK
VLATLGLILLTAYFVIQPFSPLAPEPVLSGAHTWRSLIHHIRLMSLPIAKKYMSENKGVP
LHGGDEDRPFPDFDPWWTNDCEQNESEPIPANCTGCAQKHLKVMLLEDAPRKFERLHPLV
IKMHKMPDLFIIGSGEAMLQLIPPFQCRRHCQSVAMPIEPGDIGYVDTTHWKVYVIARGV
QPLVICDGTAFSEL

>gi568815592f:36799445_37023438|GENSCAN_predicted_CDS_3|945_bp
atgggaaatgaaactgaacaggtaggcagtgaccagatcataaagggccatggaatcatt
gtagtcaatcattttccagttctcagccgctcagttgtgatcaagggacacgtggtttcc
gaactgccagctcagaataggaaaataacttgggattttatattggaagacatggatctt
gctgccaacgagatcagcatttatgacaaactttcagagactgttgatttggtgagacag
accggccatcagtgtggcatgtcagagaaggcaattgaaaaatttatcagacagctgctg
gaaaagaatgaacctcagagaccccccccgcagtatcctctccttatagttgtgtataag
gttctcgcaaccttgggattaatcttgctcactgcctactttgtgattcaacctttcagc
ccattagcacctgagccagtgctttctggagctcacacctggcgctcactcatccatcac
attaggctgatgtccttgcccattgccaagaagtacatgtcagaaaataagggagttcct
ctgcatgggggtgatgaagacagaccctttccagactttgacccctggtggacaaacgac
tgtgagcagaatgagtcagagcccattcctgccaactgcactggctgtgcccagaaacac
ctgaaggtgatgctcctggaagacgccccaaggaaatttgagaggctccatccactggtg
atcaagatgcataagatgcctgacctatttatcattggcagcggtgaggccatgttgcag
ctcatccctcccttccagtgccgaagacattgtcagtctgtggccatgccaatagagcca
ggggatatcggctatgtcgacaccacccactggaaggtctacgttatagccagaggggtc
cagcctttggtcatctgcgatggaaccgctttctcagaactgtag

>gi568815592f:36799445_37023438|GENSCAN_predicted_peptide_4|506_aa
MHGSCSFLMLLLPLLLLLVATTGPVGALTDEEKRLMVELHNLYRAQVSPTASDMLHMRWD
EELAAFAKAYARQCVWGHNKERGRRGENLFAITDEGMDVPLAMEEWHHEREHYNLSAATC
SPGQMCGHYTQVVWAKTERIGCGSHFCEKLQGVEETNIELLVCNYEPPGNVKGKRPYQEG
TPCSQCPSGYHCKNSLCGESTGGWPPTRSHFGALSFQVAGFQPFKGRMLESLAASGGPAR
EPIGSPEDAQDLPYLVTEAPSFRATEASDSRKMGTPSSLATGIPAFLVTEVSGSLATKAL
PAVETQAPTSLATKDPPSMATEAPPCVTTEVPSILAAHSLPSLDEEPVTFPKSTHVPIPK
SADKVTDKTKVPSRSPENSLDPKMSLTGARELLPHAQEEAEAEAELPPSSEVLASVFPAQ
DKPGELQATLDHTGHTSSKSLPNFPNTSATANATGGRALALQSSLPGAEGPDKPSVVSGL
NSGPGHVWGPLLGLLLLPPLVLAGIF

>gi568815592f:36799445_37023438|GENSCAN_predicted_CDS_4|1521_bp
atgcacggctcctgcagtttcctgatgcttctgctgccgctactgctactgctggtggcc
accacaggccccgttggagccctcacagatgaggagaaacgtttgatggtggagctgcac
aacctctaccgggcccaggtatccccgacggcctcagacatgctgcacatgagatgggac
gaggagctggccgccttcgccaaggcctacgcacggcagtgcgtgtggggccacaacaag
gagcgcgggcgccgcggcgagaatctgttcgccatcacagacgagggcatggacgtgccg
ctggccatggaggagtggcaccacgagcgtgagcactacaacctcagcgccgccacctgc
agcccaggccagatgtgcggccactacacgcaggtggtatgggccaagacagagaggatc
ggctgtggttcccacttctgtgagaagctccagggtgttgaggagaccaacatcgaatta
ctggtgtgcaactatgagcctccggggaacgtgaaggggaaacggccctaccaggagggg
actccgtgctcccaatgtccctctggctaccactgcaagaactccctctgtggtgagtcc
acgggtggatggccccccacgcgcagccactttggcgccctgtcgttccaagtggccgga
tttcaacccttcaaagggaggatgttagaaagtctggcggcttcgggggggcccgcgcga
gaacccatcggaagcccggaagatgctcaggatttgccttacctggtaactgaggcccca
tccttccgggcgactgaagcatcagactctaggaaaatgggtactccttcttccctagca
acggggattccggctttcttggtaacagaggtctcaggctccctggcaaccaaggctctg
cctgctgtggaaacccaggccccaacttccttagcaacgaaagacccgccctccatggca
acagaggctccaccttgcgtaacaactgaggtcccttccattttggcagctcacagcctg
ccctccttggatgaggagccagttaccttccccaaatcgacccatgttcctatcccaaaa
tcagcagacaaagtgacagacaaaacaaaagtgccctctaggagcccagagaactctctg
gaccccaagatgtccctgacaggggcaagggaactcctaccccatgcccaggaggaggct
gaggctgaggctgagttgcctccttccagtgaggtcttggcctcagtttttccagcccag
gacaagccaggtgagctgcaggccacactggaccacacggggcacacctcctccaagtcc
ctgcccaatttccccaatacctctgccaccgctaatgccacgggtgggcgtgccctggct
ctgcagtcgtccttgccaggtgcagagggccctgacaagcctagcgtcgtgtcagggctg
aactcgggccctggtcatgtgtggggccctctcctgggactactgctcctgcctcctctg
gtgttggctggaatcttctga

>gi568815592f:36799445_37023438|GENSCAN_predicted_peptide_5|389_aa
MAGAGAGAGARGGAAAGVEARARDPPPAHRAHPRHPRPAAQPSARRMDGGSGGLGSGDNA
PTTEALFVALGAGVTALSHPLLYVKLLIQVGHEPMPPTLGTNVLGRKVLYLPSFFTYAKY
IVQVDGKIGLFRGLSPRLMSNALSTVTRGSMKKVFPPDEIEQVSNKDDMKTSLKKVVKET
SYEMMMQCVSRMLAHPLHVISMRCMVQFVGREAKYSGLIPHLLGDVVFLWGCNLLAHFIN
AYLVDDSVSDTPGGLGNDQNPGSQARIRLWVSGCMSVLALTACQALDVDQADTGSALEAH
SPFSQALAIRSYTKFVMGIAVSMLTYPFLLVGDLMAVNNCGLQAGLPPYSPVFKSWIHCW
KYLSVQGQLFRGSSLLFRRVSSGSCFALE

>gi568815592f:36799445_37023438|GENSCAN_predicted_CDS_5|1170_bp
atggcgggagccggagctggagccggagctcgcggcggagcggcggcgggggtcgaggct
cgagctcgcgatccaccgcccgcgcaccgcgcacatcctcgccaccctcggcctgcggct
cagccctcggcccgcaggatggatggcgggtcagggggcctggggtctggggacaacgcc
ccgaccactgaggctcttttcgtggcactgggcgcgggcgtgacggcgctcagccatccc
ctgctctacgtgaagctgctcatccaggtgggtcatgagccgatgccccccacccttggg
accaatgtgctggggaggaaggtcctctatctgccgagcttcttcacctacgccaagtac
atcgtgcaagtggatggtaagatagggctgttccgaggcctgagtccccggctgatgtcc
aacgccctctctactgtgactcggggtagcatgaagaaggttttccctccagatgagatt
gagcaggtttccaacaaggatgatatgaagacttccctgaagaaagttgtgaaggagacc
tcctacgagatgatgatgcagtgtgtgtcccgcatgttggcccaccccctgcatgtcatc
tcaatgcgctgcatggtccagtttgtgggacgggaggccaagtacagtggattaatccct
cacctcctgggcgatgtggttttcttgtggggctgtaacctgctggcccacttcatcaat
gcctacctggtggatgacagcgtgagtgacaccccaggggggctgggaaacgaccagaat
ccaggttcccaggccaggattcgtctttgggtttcagggtgcatgtctgtcttagcactg
actgcgtgccaagccctggatgttgatcaggcagacacgggctcagcccttgaggctcac
agtccgttcagccaggccctggccatccggagctataccaagttcgtgatggggattgca
gtgagcatgctgacctaccccttcctgctagttggcgacctcatggctgtgaacaactgc
gggctgcaagctgggctccccccttactccccagtgttcaaatcctggattcactgctgg
aagtacctgagtgtgcagggccagctcttccgaggctccagcctgcttttccgccgggtg
tcatcaggatcatgctttgccctggagtaa

>gi568815592f:36799445_37023438|GENSCAN_predicted_peptide_6|580_aa
MKGASEEKLASVSNLVTVFENSRTPEAAPRGQRLEDVHHRPECRPPESPGPREKTNVGEA
VGSEPRTVSRRYLNSLKNKLSSEAWRKSCQPVTLSGSGTQEPEKKIVQELLETEQAYVAR
LHLLDQAMSDLSWRLQVFFQELLKTARSSKAFPEDVVRVIFSNISSIYQFHSQFFLPELQ
RRLDDWTANPRIGDVIQKLAPFLKMYSEYVKNFERAAELLATWTDKSPLFQEVLTRIQSS
EASGSLTLQHHMLEPVQRIPRYELLLKEYIQKLPAQAPDQADAQKALDMIFSAAQHSNAA
ITEMERLQDLWEVYQRLGLEDDIVDPSNTLLREGPVLKISFRRNDPMERYLFLFNNMLLY
CVPRVIQVGAQFQVRTRIDVAGMKMHRNFFNQSSAERHLDCFQLSAAANEAAVNSLVREL
MDAEFPHSFLVSGKQRTLELQARSQEEMISWMQAFQAAIDQIEKRNETFKAAAQGPEGDI
QEQELQSEELGLRAPQWVRDKMVTMCMRCQEPFNALTRRRHHCRACGYCHPGLYLSHLGL
YLPYLGLYPPSSGAAVPILAKSAGFNQCSPVVLGTSSEPX

>gi568815592f:36799445_37023438|GENSCAN_predicted_CDS_6|1740_bp
atgaagggggcaagtgaggagaagctggcatctgtgtccaacctggtcactgtgtttgag
aatagcaggaccccagaagcagcacccagaggccagaggctagaggacgtgcatcaccgc
cctgagtgcaggcctcccgagtccccaggaccacgggagaagacgaatgtcggggaggcc
gtggggtctgagcccaggacagtcagcaggaggtacctgaactccctgaagaacaagctg
tccagcgaagcctggaggaaatcttgccagcctgtgaccctctcaggatcggggacgcag
gagccagagaagaagatcgtccaggagctgctggagacagagcaggcctatgtggcgcgc
ctccacctgctagaccaggccatgagtgacctgtcgtggcggctacaggtgtttttccag
gagctgctgaagacagcccgcagcagcaaggccttcccagaggatgtggtcagggtcatc
ttctccaacatctcctccatctatcagttccattctcagttcttcctcccagagctgcag
cggcgcctggacgactggacagctaacccccgcatcggtgacgtgatccagaagctggcc
cccttcctgaagatgtacagtgagtatgtcaagaactttgagcgagcggctgagctgctg
gccacctggaccgacaagtctccactcttccaggaggttctcactcgcatccagagcagc
gaggcttcgggcagcctgaccctgcagcaccacatgctggaaccagtgcagagaattcca
cgttacgagctgctgctcaaggagtacatccagaagctgccagcccaggccccagaccag
gccgatgcccagaaagccctggacatgatcttctcagctgcccagcactccaatgcagcc
atcactgagatggagcggctgcaggacctgtgggaggtgtaccagcgcctgggcctcgag
gacgacatagtagacccctctaacaccctgctccgtgagggcccggtcctcaagatctcc
ttccgccgcaacgaccccatggagcgctaccttttcttgttcaacaacatgctgctctac
tgtgtgcccagggtgatccaggtgggcgcccagttccaggtgaggacccgcatcgatgtg
gccgggatgaagatgcaccggaatttcttcaaccagtcttctgctgagagacacttagat
tgttttcagttgtcagcagctgcaaatgaggctgcagtgaatagcctggtgcgggagctg
atggatgctgagtttccccactccttcctggtgtccgggaagcagcgcaccctggagctg
caagcccggtcccaggaggaaatgatttcctggatgcaggccttccaagcagccattgac
caaatcgagaagcggaatgaaaccttcaaggctgcggcccaggggcctgagggagacatc
caggagcaggagctgcagtctgaggagctgggcctccgggcaccgcagtgggtccgggac
aagatggtgaccatgtgcatgcgctgccaggagcccttcaacgctctgacgcgccgtcgc
caccactgccgggcctgcggctattgtcacccaggcctctacctgtcccacttaggcctc
tacctgccctacctgggcctctacccacccagctcaggtgctgctgtccccatcttggcc
aaatctgctggcttcaaccaatgcagcccagtggtgttgggcactagctctgagccagnn