GENSCAN 1.0 Date run: 7-Nov-116 Time: 16:47:34 Sequence gi568815597f:35473926_35694673 : 220748 bp : 44.05% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 191 242 52 1 1 59 67 59 0.137 2.22 1.02 Intr + 28485 28574 90 1 0 -52 28 253 0.328 5.27 1.03 Term + 33616 33728 113 2 2 62 48 84 0.616 0.52 1.04 PlyA + 36415 36420 6 1.05 2.00 Prom + 39387 39426 40 -4.16 2.01 Init + 41520 41680 161 2 2 85 72 91 0.742 6.50 2.02 Intr + 69540 69697 158 0 2 42 78 124 0.043 6.45 2.03 Intr + 83260 83406 147 2 0 70 67 126 0.064 8.91 2.04 Intr + 83449 83610 162 2 0 64 65 128 0.675 8.05 2.05 Intr + 83796 83958 163 1 1 49 93 50 0.855 0.63 2.06 Intr + 83998 84085 88 1 1 74 -44 189 0.965 4.27 2.07 Intr + 85152 85322 171 2 0 117 66 220 0.863 22.84 2.08 Intr + 86401 87369 969 0 0 84 94 1369 0.980 128.30 2.09 Intr + 88467 88708 242 2 2 37 117 350 0.724 29.15 2.10 Intr + 89277 89501 225 0 0 93 61 314 0.933 26.30 2.11 Intr + 89842 89984 143 1 2 131 110 61 0.999 12.60 2.12 Term + 91302 91738 437 1 2 104 55 621 0.999 55.85 2.13 PlyA + 92830 92835 6 1.05 3.00 Prom + 97868 97907 40 -6.86 3.01 Init + 99677 99679 3 1 0 98 101 0 0.516 2.30 3.02 Intr + 100002 100484 483 2 0 98 94 545 0.907 49.32 3.03 Intr + 101024 101075 52 1 1 120 116 64 0.867 10.88 3.04 Intr + 102235 102314 80 2 2 87 -29 99 0.797 -2.63 3.05 Intr + 102836 103329 494 1 2 44 80 401 0.817 26.70 3.06 Term + 103519 103660 142 1 1 96 49 83 0.934 2.60 3.07 PlyA + 104504 104509 6 -0.45 4.00 Prom + 107168 107207 40 -5.16 4.01 Init + 109900 109975 76 0 1 87 86 -34 0.480 -2.45 4.02 Intr + 114405 114627 223 1 1 120 91 207 0.900 21.39 4.03 Intr + 116005 116123 119 1 2 37 113 204 0.912 17.91 4.04 Intr + 116709 116850 142 1 1 73 89 284 0.999 26.41 4.05 Term + 120469 120751 283 1 1 98 36 254 0.990 16.10 4.06 PlyA + 121648 121653 6 1.05 5.07 PlyA - 122231 122226 6 1.05 5.06 Term - 129449 129342 108 2 0 125 38 137 0.996 10.91 5.05 Intr - 131357 131308 50 0 2 86 105 -25 0.965 -2.60 5.04 Intr - 135483 135321 163 0 1 114 91 196 0.999 22.05 5.03 Intr - 157419 157349 71 1 2 107 114 60 0.972 9.40 5.02 Intr - 167651 167417 235 2 1 48 93 267 0.711 20.46 5.01 Init - 169589 169545 45 2 0 72 113 -4 0.612 1.19 5.00 Prom - 170905 170866 40 -5.46 6.00 Prom + 176675 176714 40 -0.96 6.01 Init + 179495 179548 54 1 0 52 110 -5 0.401 -0.62 6.02 Intr + 180469 180703 235 0 1 77 18 139 0.138 3.06 6.03 Intr + 193748 193857 110 0 2 90 30 85 0.194 2.90 6.04 Term + 199116 199238 123 2 0 72 39 80 0.319 -0.12 6.05 PlyA + 200121 200126 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597f:35473926_35694673|GENSCAN_predicted_peptide_1|84_aa MGNDGPYNGPVRIGYNPVIAAVIIIIIIIIIIIITITIIKDYLSVHMGISQGQCENCGTR RTGSNVRTVGPEEQFPQQTGSNTD >gi568815597f:35473926_35694673|GENSCAN_predicted_CDS_1|255_bp atgggaaacgatggtccttacaacggtcctgtgaggataggatataacccggttattgct gctgtcatcatcatcatcatcatcatcatcatcatcatcatcaccatcaccatcattaaa gactacctcagtgtccacatgggtatttctcagggacaatgtgagaactgtgggaccaga agaacaggtagcaatgtgagaactgtgggaccagaagaacagtttccccagcaaactggc agcaacacagactga >gi568815597f:35473926_35694673|GENSCAN_predicted_peptide_2|1021_aa MPLNDFWVNNEIKAEIKKLFETNKNKDTTYQNLWDTAKAVLKGKFIALNTHIKRYMKSRD IMPPKKSPENAFPLKIKTNAYSKYTILLKEKDTTGSDWYNRGYSPQTSRPRPPTGAQGHI AGPRLSAASASSWSMGSGTPNLRSPHPEEEEEEEEASLDSDGGRKHRLLPPPRDRPGNTL SMLTKPSRPSPGKGNAGRETKGEEGQCWAEPGVKQPRIRQERCGGGGGAAAAWSQEWATR REQRPEAPGASRWEKTSPLGAAACRGLSEPALIDADTPRGPYRDSIAPYRDTIRDAQSLL CSQLGKASIMASDCEPALNQAEGRNPTLERYLGALREAKNDSEQFAALLLVTKAVKAGDI DAKTRRRIFDAVGFTFPNRLLTTKEAPDGCPDHVLRALGVALLACFCSDPELAAHPQVLN KIPILSTFLTARGDPDDAARRSMIDDTYQCLTAVAGTPRGPRHLIAGGTVSALCQAYLGH GYGFDQALALLVGLLAAAETQCWKEAEPDLLAVLRGLSEDFQKAEDASKFELCQLLPLFL PPTTVPPECYRDLQAGLARILGSKLSSWQRNPALKLAARLAHACGSDWIPAGSSGSKFLA LLVNLACVEVRLALEETGTEVKEDVVTACYALMELGIQECTRCEQSLLKEPQKVQLVSVM KEAIGAVIHYLLQVGSEKQKEPFVFASVRILGAWLAEETSSLRKEVCQLLPFLVRYAKTL YEEAEEANDLSQQVANLAISPTTPGPTWPGDALRLLLPGWCHLTVEDGPREILIKEGAPS LLCKYFLQQWELTSPGHDTSVLPDSVEIGLQTCCHIFLNLVVTAPGLIKRDACFTSLMNT LMTSLPALVQQQGRLLLAANVATLGLLMARLLSTSPALQGTPASRGFFAAAILFLSQSHV ARATPGSDQAVLALSPEYEGIWADLQELWFLGMQAFTGCVPLLPWLAPAALRSRWPQELL QLLGSVSPNSVKPEMVAAYQGVLVELARANRLCREAMRLQAGEETASHYRMAALEQCLSE P >gi568815597f:35473926_35694673|GENSCAN_predicted_CDS_2|3066_bp atgcccctgaatgacttttgggtaaataatgaaattaaggcagaaatcaagaagttattt gaaactaataagaacaaagatacaacataccagaatctctgggacacagctaaggcagtg ttaaaagggaaattcatagcactaaatacccacatcaaaagatacatgaagagcagagat ataatgccacctaaaaagtctcctgaaaatgcttttcccctgaagattaagaccaatgca tattcaaaatataccattctcttaaaggaaaaagacaccactggctctgactggtacaac agaggctacagccctcaaaccagccgcccccggccacctaccggggctcagggccacata gcggggccccggctctcggcggcctccgcctcctcctggtccatgggctcggggaccccc aaccttcgctcccctcacccggaggaggaggaggaagaggaagaagcctccctggacagc gacggcggccggaaacaccgcctcctcccacctccccgggaccgacccggaaacacactc tccatgctaaccaagccctcccgcccctcccccgggaagggcaatgccggccgcgagacc aagggggaggaggggcagtgctgggcggagcccggagtgaagcagccgcggattcgtcaa gagcggtgcgggggtgggggtggagctgcagcagcctggagccaggagtgggcaacgcgg cgtgagcagcggcccgaggctcccggagcatcgcgctgggagaagacttcgccgctcggg gccgcagcctgccggggcctctccgagccggcgctgatcgatgccgacacaccccgggga ccctatcgcgactccatcgcgccatatcgcgacaccatcagggatgctcagtccctcttg tgttcacagttgggcaaggcgagcatcatggcctcggattgcgagccagctctgaaccag gcagagggccgaaaccccaccctggagcgctacctgggagccctccgtgaggccaagaat gacagcgagcagtttgcagccctgctgctagtgaccaaggcagtcaaagcaggtgacata gatgccaaaactcggcggcggatcttcgatgctgtcggcttcaccttccccaatcgtctc ctgaccaccaaggaggcgccggatggctgccctgaccatgttctgcgggctttgggtgtg gccctgctggcctgcttctgcagtgaccctgaactggccgcccatccccaagtcctgaac aagattcccattcttagcaccttcctcacagcccggggggacccggacgatgctgcccgc cgctccatgattgatgacacctaccagtgcctgacggctgtagcaggcacacccagaggg cctcggcacctcattgctggtggcaccgtgtctgccctatgccaggcatacctggggcac ggctatggctttgaccaggccctggcactcctggtggggctgctggctgctgccgagaca cagtgctggaaggaggcggagcccgacctgctggccgtgttgcggggcctcagtgaggat ttccagaaagctgaggatgccagcaagtttgagctctgccagctgctgcccctctttttg cccccgacaaccgtgccccctgaatgctaccgggatctgcaggccgggctggcacgcatc ctgggaagcaagctgagctcctggcagcgcaaccctgcactgaagctggcagcccgcctg gcacacgcctgcggctccgactggatcccggcgggcagctccgggagcaagttcctggcc ctgctggtgaatctggcgtgcgtggaagtgcggctggcactggaggagacgggcacggag gtgaaagaggatgtggtgaccgcctgctatgccctcatggagttggggatccaggaatgc actcgctgtgagcagtcactgcttaaggagccacagaaggtgcagctcgtgagcgtcatg aaggaggccataggggctgttatccactacctgctgcaggtggggtcagagaagcagaag gagccctttgtgtttgcctcggtgcggatcctgggtgcctggctggccgaggagacctca tccttgcgtaaggaggtgtgccagctgctgcccttcctcgtccgctatgccaagaccctc tacgaggaggccgaggaggccaatgacctttcccagcaggtggccaacctggccatctcc cccaccaccccagggcccacctggccaggagacgctctccggctcctcctgcctggctgg tgccacctgaccgttgaagatgggccccgggagatcctgatcaaggaaggggccccctcg cttctgtgcaagtatttcctgcagcagtgggaactcacatcccctggccacgacacctcg gtgctgcctgacagcgtggagattggcctgcagacctgctgccacatcttcctcaacctc gtggtcaccgcaccggggctgatcaagcgtgacgcctgcttcacatctctaatgaacacc ctcatgacgtcgctaccagcactagtgcagcaacagggaaggctgcttctggctgctaat gtggccaccctggggctcctcatggcccggctccttagcacctctccagctcttcaggga acaccagcatcccgagggttcttcgcagctgccatcctcttcctatcacagtcccacgtg gcgcgggccaccccgggctcagaccaggcagtgctagccctgtcccctgagtatgagggc atctgggccgacctgcaggagctctggttcctgggcatgcaggccttcaccggctgtgtg cctctgctgccctggctggcccccgctgccctgcgctcccgctggccgcaggagctgctc cagctgctaggcagtgtcagccccaactctgtcaagcccgagatggtggccgcctatcag ggtgtcctggtggagctggcgcgggccaaccggctgtgccgggaggccatgaggctgcag gcgggcgaggagacggccagccactaccgcatggctgccttggagcagtgcctgtcagag ccctga >gi568815597f:35473926_35694673|GENSCAN_predicted_peptide_3|417_aa MERPDGLGAAAGGARLSSLPQAAYGPAPPLCHTPAATAAAEFQPPYFPPPYPQPPLPYGQ APDAAAAFPHLAGDPYGGLAPLAQPQPPQAAWAAPRAAARAHEEPPGLLAPPARALGLDP RRDYATAVPRLLHGLADGAHGLADAPLGLPGLAAAPGLEDLQAMDEPGMSLLDQSVIKKG EAKVEVVVAGLGLDFQELGLSYQELGILGVRRNPPKQSKEHNESPNIPGGGALAPWSGAG RGRADSRAQPRGTGPRGLTCTTAGPQRQRDPSASGTPAPAGPQRQRDPSASGTPAPAGPQ RQRVCGPVERVERWRPERRLRPGRPSLDVKLQPAQQQQRGRGRSRAQLDPEVVTQGKAWA VDPGQLRRRGRPSASAECGTGACPARALESEKVALWPPEASDPYSASQPGRERRLRR >gi568815597f:35473926_35694673|GENSCAN_predicted_CDS_3|1254_bp atggagcgccccgacgggctgggagcagctgccggcggggcccgcctgtcgtctctgccc caggcggcctacgggccggcgcccccgctctgccacacgccggccgccacagctgccgcc gaattccagccgccctacttcccgccgccctacccgcagccaccgctgccctacggtcag gcgcccgacgccgccgcagcctttccccacctggcaggggacccatatggcggcctggcg cccctggcgcagccgcagcctcctcaggccgcctgggccgcgccccgcgcagccgcccgc gcccacgaggagcctcccggcctgctggcaccgcccgcccgcgccctgggccttgacccg cgccgtgactatgccactgccgtgccccggctcctgcacggcctggccgacggcgcgcac ggcctggcagacgcacctctcggccttccggggctggcggcggcccccggtctggaggac ctgcaggcaatggacgagccgggaatgagcctcctagaccagtccgtgatcaagaaaggg gaggccaaggtggaagtggtggtagcagggctggggctggacttccaggagctggggctg agttaccaggagctggggatcctaggggtacgccgaaatcccccaaagcagtccaaagaa cacaacgagagtcctaacatcccaggtggcggcgcgctggctccctggagcggggcggga cgcggccgcgcggactcacgtgcacaaccgcgcgggacggggccacgcggactcacgtgc acaaccgcgggaccccagcgccagcgggaccccagcgccagcgggaccccagcgccagcg ggaccccagcgccagcgggaccccagcgccagcgggaccccagcgccagcgggaccccag cgccagcgggtctgtggcccagtggagcgagtggagcgctggcgacctgagcggagactg cgccctggacgccccagcctagacgtcaagttacagcccgcgcagcagcagcaaagggga aggggcaggagccgggcacagttggatccggaggtcgtgacccaggggaaagcgtgggcg gtcgacccagggcagctgcggcggcgaggcagaccttcggcctccgccgagtgcggtact ggagcctgccccgccagggccctggaatcagagaaagtcgctctttggccacctgaagcg tcggatccctacagtgcctcccagcctgggcgggagcggcggctgcgtcgctga >gi568815597f:35473926_35694673|GENSCAN_predicted_peptide_4|280_aa MDSIPLAIFKGRSSEKKRDRWGKQQVPIPSKASSLSALSLAKDSLVGGITNPGEVFCSVP GRLSLLSSTSKYKVTVGEVQRRLSPPECLNASLLGGVLRRAKSKNGGRCLRERLEKIGLN LPAGRRKAANVTLLTSLVEGEAVHLARDFGYVCETEFPAKAAAEYLCRQHADPGELHSRK SMLLAAKQICKEFADLMAQDRSPLGNSRPALILEPGVQSCLTHFSLITHGFGGPAICAAL TAFQNYLLESLKGLDKMFLSSVGSGHGETKASEKDAKHRK >gi568815597f:35473926_35694673|GENSCAN_predicted_CDS_4|843_bp atggattccatccctcttgccattttcaagggaagaagctctgagaaaaaaagggacagg tgggggaagcagcaagtgcccatcccctccaaagccagcagcctctcagccctctccttg gccaaagacagcctggtgggcggcatcacaaatcctggtgaggtcttctgctccgtgccc ggccggctttcactgctcagctcaacgtccaagtacaaggtgacggtgggggaggtgcag cggcgactctcgcctcccgagtgcctcaacgcctccctcctggggggtgtcctccgcagg gccaagtccaaaaatgggggccggtgtttgcgggaacggttagagaagattgggctcaac ctgccagctggccgtcgcaaggccgccaatgtgacgctgctgacttcgctagtggaagga gaggccgtgcacctggcccgagacttcggttacgtctgtgagacggagttcccagccaag gcagctgccgagtacctgtgccgacagcacgctgacccgggggagctgcacagccgcaag agcatgctgctggctgccaagcagatctgcaaggagtttgcagacttgatggctcaggac cgctcaccgctgggcaacagccgcccagcactcatcctggagcccggagtacagagctgc ttgacacactttagcctcatcacccatggcttcggtgggcctgccatctgtgctgccctc actgccttccagaactatttgctggagtcactcaaggggctggacaagatgtttctaagc agtgtgggcagtgggcatggtgaaaccaaggcttcggagaaggatgccaagcatcggaaa taa >gi568815597f:35473926_35694673|GENSCAN_predicted_peptide_5|223_aa MSHHTRPNNVSTFRHLYVIRERRGSRAAGVAPIFALRCCLTGETWKRASLVLCRTCSPWP SATMEYLIGIQGPDYVLVASDRVAASNIVQMKDGYELSPTAAANFTRRNLADCLRSRTPY HVNLLLAGYDEHEGPALYYMDYLAALAKAPFAAHGYGAFLTLSILDRYYTPTISRERAVE LLRKCLEELQKRFILNLPTFSVRIIDKNGIHDLDNISFPKQGS >gi568815597f:35473926_35694673|GENSCAN_predicted_CDS_5|672_bp atgagccaccacacccggcccaacaatgtctccacttttagacacttgtacgtcatccga gagcgccgtggaagtcgtgctgcaggcgtcgcgccaatcttcgctctgaggtgctgtctc accggtgagacctggaagcgggcgagtctcgtgctgtgtcggacctgcagcccctggcct tccgccaccatggagtacctcatcggtatccaaggccccgactatgttcttgtcgcctcc gaccgggtggccgccagcaatattgtccagatgaaggacggatatgaattgtctcccacg gcagcagctaacttcacacgccgaaacctggctgactgtcttcggagtcggaccccatat catgtgaacctcctcctggctggctatgatgagcatgaagggccagcgctgtattacatg gactacctggcagccttggccaaggccccttttgcagcccacggctatggtgccttcctg actctcagtatcctcgaccgatactacacaccgactatctcacgtgagagggcagtggaa ctccttaggaaatgtctggaggagctccagaaacgcttcatcctgaatctgccaaccttc agtgttcgaatcattgacaaaaatggcatccatgacctggataacatttccttccccaaa cagggctcctaa >gi568815597f:35473926_35694673|GENSCAN_predicted_peptide_6|173_aa MFKPQSHPIVIQRAKQCQKCVVGLKRKHQGCLMSSAAGSSIPAAFRQMAPSYSLHYPQGS GQSHLLMTKNTTLVGLTGADADPVELKWSMIRDWLYEPSSLESLESEADLVGNSQPGQRG GRMAKERNTDNEELGLAYDGLYDDGHGQTPLTTIRCLRISNTEECEACAWEAL >gi568815597f:35473926_35694673|GENSCAN_predicted_CDS_6|522_bp atgttcaagccccagagccatccaatagttattcaaagggccaagcaatgccagaagtgt gtggttggtctaaaacgcaagcatcagggctgcctgatgtcttcagcagcaggctccagt atcccagcggctttcagacaaatggcaccttcctactccttgcattatccccaggggtca gggcagtcacatctgctgatgactaaaaacacaacacttgtaggcctcacaggagctgat gcagacccagtagaactgaagtggtccatgatcagagactggttgtacgaaccttcatct ttagagagccttgaaagtgaggccgatttggttggaaacagccagccagggcagcgaggt ggaagaatggccaaggagagaaacacagacaatgaggagctaggtctagcttacgatggc ttgtatgatgacgggcatggtcagacacccctcaccaccatacgctgtctgaggatctca aacacagaggagtgtgaggcctgtgcctgggaagccttataa