GENSCAN 1.0 Date run: 3-Nov-116 Time: 02:17:02 Sequence gi568815597f:35459125_35665660 : 206536 bp : 43.47% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.07 Intr - 1313 1181 133 0 1 45 99 133 0.938 9.80 1.06 Intr - 3589 3497 93 2 0 77 72 68 0.917 4.04 1.05 Intr - 7571 7484 88 2 1 81 91 104 0.988 9.54 1.04 Intr - 11836 11706 131 2 2 75 75 91 0.662 6.91 1.03 Intr - 15782 15681 102 0 0 49 92 41 0.613 0.75 1.02 Intr - 20088 19842 247 0 1 97 110 53 0.389 5.43 1.01 Init - 41575 41528 48 1 0 51 100 -1 0.029 -1.65 1.00 Prom - 43681 43642 40 -0.86 2.00 Prom + 54188 54227 40 -4.16 2.01 Init + 56321 56481 161 1 2 85 72 91 0.742 6.50 2.02 Intr + 84341 84498 158 2 2 42 78 124 0.043 6.45 2.03 Intr + 98061 98207 147 1 0 70 67 126 0.064 8.91 2.04 Intr + 98250 98411 162 1 0 64 65 128 0.675 8.05 2.05 Intr + 98597 98759 163 0 1 49 93 50 0.855 0.63 2.06 Intr + 98799 98886 88 0 1 74 -44 189 0.965 4.27 2.07 Intr + 99953 100123 171 1 0 117 66 220 0.863 22.84 2.08 Intr + 101202 102170 969 2 0 84 94 1369 0.980 128.30 2.09 Intr + 103268 103509 242 1 2 37 117 350 0.724 29.15 2.10 Intr + 104078 104302 225 2 0 93 61 314 0.933 26.30 2.11 Intr + 104643 104785 143 0 2 131 110 61 0.999 12.60 2.12 Term + 106103 106539 437 0 2 104 55 621 0.999 55.85 2.13 PlyA + 107631 107636 6 1.05 3.00 Prom + 112669 112708 40 -6.86 3.01 Init + 114478 114480 3 0 0 98 101 0 0.516 2.30 3.02 Intr + 114803 115285 483 1 0 98 94 545 0.907 49.32 3.03 Intr + 115825 115876 52 0 1 120 116 64 0.867 10.88 3.04 Intr + 117036 117115 80 1 2 87 -29 99 0.797 -2.63 3.05 Intr + 117637 118130 494 0 2 44 80 401 0.817 26.70 3.06 Term + 118320 118461 142 0 1 96 49 83 0.934 2.60 3.07 PlyA + 119305 119310 6 -0.45 4.00 Prom + 121969 122008 40 -5.16 4.01 Init + 124701 124776 76 2 1 87 86 -34 0.480 -2.45 4.02 Intr + 129206 129428 223 0 1 120 91 207 0.900 21.39 4.03 Intr + 130806 130924 119 0 2 37 113 204 0.912 17.91 4.04 Intr + 131510 131651 142 0 1 73 89 284 0.999 26.41 4.05 Term + 135270 135552 283 0 1 98 36 254 0.990 16.10 4.06 PlyA + 136449 136454 6 1.05 5.07 PlyA - 137032 137027 6 1.05 5.06 Term - 144250 144143 108 1 0 125 38 137 0.996 10.91 5.05 Intr - 146158 146109 50 2 2 86 105 -25 0.965 -2.60 5.04 Intr - 150284 150122 163 2 1 114 91 196 0.999 22.05 5.03 Intr - 172220 172150 71 0 2 107 114 60 0.972 9.40 5.02 Intr - 182452 182218 235 1 1 48 93 267 0.708 20.46 5.01 Init - 193561 193505 57 1 0 85 44 65 0.068 3.11 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597f:35459125_35665660|GENSCAN_predicted_peptide_1|281_aa MTGEQHQFLVCAVFTKVHKAITISSPLTTDLTAELSGGPKNVSVQPEISEGLATTPSTQQ VKSSEKTQIAVPQPVAPSYSYATPTPQASFQSTSAPYPVIKELVVSAGESVQITLPKNEV QLNAYVLQEPPKGETYTYDWQLITHPRDYSGEMEGKHSQILKLSKVNLPPFTCRAKLTPG LYEFKVIVEGQNAHGEGYVNVTVKPEPRKNRPPIAIVSPQFQEISLPTTSTVIDGSQSTD DDKIVQYHWEELKGPLREEKISEDTAILKLSKLVPGNYTFS >gi568815597f:35459125_35665660|GENSCAN_predicted_CDS_1|843_bp atgactggcgagcagcaccagtttctagtttgtgctgttttcactaaggtccacaaggcg attacaatttccagtcccctaaccacagacctgactgcagagctgtctggtgggccaaag aatgtatcagtgcaacctgaaatatcagagggtcttgctactacgcccagcactcaacaa gtaaaaagttctgagaaaacccagattgctgtcccccagccagtggctccctcctacagt tatgctacccctaccccccaggcctctttccagagcacctcagcaccatacccagttata aaggaactggtggtatctgctggagagagtgtccagataaccctgcctaagaatgaagtt caattaaatgcatatgttctccaagaaccacctaaaggagaaacctacacctacgactgg cagctgattactcatcctagagactacagtggagaaatggaagggaaacattcccagatc ctcaaactatcgaaggtgaatttgcctcctttcacttgcagggcaaagctcactccaggc ctgtatgaattcaaagtgattgtagagggtcaaaatgcccatggggaaggctatgtgaac gtgacagtcaagccagagccccgtaagaatcggccccccattgctattgtgtcacctcag ttccaggagatctctttgccaaccacttctacagtcattgatggcagtcaaagcactgat gatgataaaatcgttcagtaccattgggaagaacttaaggggcctctaagagaagagaag atttctgaagatacagccatattaaaactaagtaaactcgtccctgggaactacactttc agn >gi568815597f:35459125_35665660|GENSCAN_predicted_peptide_2|1021_aa MPLNDFWVNNEIKAEIKKLFETNKNKDTTYQNLWDTAKAVLKGKFIALNTHIKRYMKSRD IMPPKKSPENAFPLKIKTNAYSKYTILLKEKDTTGSDWYNRGYSPQTSRPRPPTGAQGHI AGPRLSAASASSWSMGSGTPNLRSPHPEEEEEEEEASLDSDGGRKHRLLPPPRDRPGNTL SMLTKPSRPSPGKGNAGRETKGEEGQCWAEPGVKQPRIRQERCGGGGGAAAAWSQEWATR REQRPEAPGASRWEKTSPLGAAACRGLSEPALIDADTPRGPYRDSIAPYRDTIRDAQSLL CSQLGKASIMASDCEPALNQAEGRNPTLERYLGALREAKNDSEQFAALLLVTKAVKAGDI DAKTRRRIFDAVGFTFPNRLLTTKEAPDGCPDHVLRALGVALLACFCSDPELAAHPQVLN KIPILSTFLTARGDPDDAARRSMIDDTYQCLTAVAGTPRGPRHLIAGGTVSALCQAYLGH GYGFDQALALLVGLLAAAETQCWKEAEPDLLAVLRGLSEDFQKAEDASKFELCQLLPLFL PPTTVPPECYRDLQAGLARILGSKLSSWQRNPALKLAARLAHACGSDWIPAGSSGSKFLA LLVNLACVEVRLALEETGTEVKEDVVTACYALMELGIQECTRCEQSLLKEPQKVQLVSVM KEAIGAVIHYLLQVGSEKQKEPFVFASVRILGAWLAEETSSLRKEVCQLLPFLVRYAKTL YEEAEEANDLSQQVANLAISPTTPGPTWPGDALRLLLPGWCHLTVEDGPREILIKEGAPS LLCKYFLQQWELTSPGHDTSVLPDSVEIGLQTCCHIFLNLVVTAPGLIKRDACFTSLMNT LMTSLPALVQQQGRLLLAANVATLGLLMARLLSTSPALQGTPASRGFFAAAILFLSQSHV ARATPGSDQAVLALSPEYEGIWADLQELWFLGMQAFTGCVPLLPWLAPAALRSRWPQELL QLLGSVSPNSVKPEMVAAYQGVLVELARANRLCREAMRLQAGEETASHYRMAALEQCLSE P >gi568815597f:35459125_35665660|GENSCAN_predicted_CDS_2|3066_bp atgcccctgaatgacttttgggtaaataatgaaattaaggcagaaatcaagaagttattt gaaactaataagaacaaagatacaacataccagaatctctgggacacagctaaggcagtg ttaaaagggaaattcatagcactaaatacccacatcaaaagatacatgaagagcagagat ataatgccacctaaaaagtctcctgaaaatgcttttcccctgaagattaagaccaatgca tattcaaaatataccattctcttaaaggaaaaagacaccactggctctgactggtacaac agaggctacagccctcaaaccagccgcccccggccacctaccggggctcagggccacata gcggggccccggctctcggcggcctccgcctcctcctggtccatgggctcggggaccccc aaccttcgctcccctcacccggaggaggaggaggaagaggaagaagcctccctggacagc gacggcggccggaaacaccgcctcctcccacctccccgggaccgacccggaaacacactc tccatgctaaccaagccctcccgcccctcccccgggaagggcaatgccggccgcgagacc aagggggaggaggggcagtgctgggcggagcccggagtgaagcagccgcggattcgtcaa gagcggtgcgggggtgggggtggagctgcagcagcctggagccaggagtgggcaacgcgg cgtgagcagcggcccgaggctcccggagcatcgcgctgggagaagacttcgccgctcggg gccgcagcctgccggggcctctccgagccggcgctgatcgatgccgacacaccccgggga ccctatcgcgactccatcgcgccatatcgcgacaccatcagggatgctcagtccctcttg tgttcacagttgggcaaggcgagcatcatggcctcggattgcgagccagctctgaaccag gcagagggccgaaaccccaccctggagcgctacctgggagccctccgtgaggccaagaat gacagcgagcagtttgcagccctgctgctagtgaccaaggcagtcaaagcaggtgacata gatgccaaaactcggcggcggatcttcgatgctgtcggcttcaccttccccaatcgtctc ctgaccaccaaggaggcgccggatggctgccctgaccatgttctgcgggctttgggtgtg gccctgctggcctgcttctgcagtgaccctgaactggccgcccatccccaagtcctgaac aagattcccattcttagcaccttcctcacagcccggggggacccggacgatgctgcccgc cgctccatgattgatgacacctaccagtgcctgacggctgtagcaggcacacccagaggg cctcggcacctcattgctggtggcaccgtgtctgccctatgccaggcatacctggggcac ggctatggctttgaccaggccctggcactcctggtggggctgctggctgctgccgagaca cagtgctggaaggaggcggagcccgacctgctggccgtgttgcggggcctcagtgaggat ttccagaaagctgaggatgccagcaagtttgagctctgccagctgctgcccctctttttg cccccgacaaccgtgccccctgaatgctaccgggatctgcaggccgggctggcacgcatc ctgggaagcaagctgagctcctggcagcgcaaccctgcactgaagctggcagcccgcctg gcacacgcctgcggctccgactggatcccggcgggcagctccgggagcaagttcctggcc ctgctggtgaatctggcgtgcgtggaagtgcggctggcactggaggagacgggcacggag gtgaaagaggatgtggtgaccgcctgctatgccctcatggagttggggatccaggaatgc actcgctgtgagcagtcactgcttaaggagccacagaaggtgcagctcgtgagcgtcatg aaggaggccataggggctgttatccactacctgctgcaggtggggtcagagaagcagaag gagccctttgtgtttgcctcggtgcggatcctgggtgcctggctggccgaggagacctca tccttgcgtaaggaggtgtgccagctgctgcccttcctcgtccgctatgccaagaccctc tacgaggaggccgaggaggccaatgacctttcccagcaggtggccaacctggccatctcc cccaccaccccagggcccacctggccaggagacgctctccggctcctcctgcctggctgg tgccacctgaccgttgaagatgggccccgggagatcctgatcaaggaaggggccccctcg cttctgtgcaagtatttcctgcagcagtgggaactcacatcccctggccacgacacctcg gtgctgcctgacagcgtggagattggcctgcagacctgctgccacatcttcctcaacctc gtggtcaccgcaccggggctgatcaagcgtgacgcctgcttcacatctctaatgaacacc ctcatgacgtcgctaccagcactagtgcagcaacagggaaggctgcttctggctgctaat gtggccaccctggggctcctcatggcccggctccttagcacctctccagctcttcaggga acaccagcatcccgagggttcttcgcagctgccatcctcttcctatcacagtcccacgtg gcgcgggccaccccgggctcagaccaggcagtgctagccctgtcccctgagtatgagggc atctgggccgacctgcaggagctctggttcctgggcatgcaggccttcaccggctgtgtg cctctgctgccctggctggcccccgctgccctgcgctcccgctggccgcaggagctgctc cagctgctaggcagtgtcagccccaactctgtcaagcccgagatggtggccgcctatcag ggtgtcctggtggagctggcgcgggccaaccggctgtgccgggaggccatgaggctgcag gcgggcgaggagacggccagccactaccgcatggctgccttggagcagtgcctgtcagag ccctga >gi568815597f:35459125_35665660|GENSCAN_predicted_peptide_3|417_aa MERPDGLGAAAGGARLSSLPQAAYGPAPPLCHTPAATAAAEFQPPYFPPPYPQPPLPYGQ APDAAAAFPHLAGDPYGGLAPLAQPQPPQAAWAAPRAAARAHEEPPGLLAPPARALGLDP RRDYATAVPRLLHGLADGAHGLADAPLGLPGLAAAPGLEDLQAMDEPGMSLLDQSVIKKG EAKVEVVVAGLGLDFQELGLSYQELGILGVRRNPPKQSKEHNESPNIPGGGALAPWSGAG RGRADSRAQPRGTGPRGLTCTTAGPQRQRDPSASGTPAPAGPQRQRDPSASGTPAPAGPQ RQRVCGPVERVERWRPERRLRPGRPSLDVKLQPAQQQQRGRGRSRAQLDPEVVTQGKAWA VDPGQLRRRGRPSASAECGTGACPARALESEKVALWPPEASDPYSASQPGRERRLRR >gi568815597f:35459125_35665660|GENSCAN_predicted_CDS_3|1254_bp atggagcgccccgacgggctgggagcagctgccggcggggcccgcctgtcgtctctgccc caggcggcctacgggccggcgcccccgctctgccacacgccggccgccacagctgccgcc gaattccagccgccctacttcccgccgccctacccgcagccaccgctgccctacggtcag gcgcccgacgccgccgcagcctttccccacctggcaggggacccatatggcggcctggcg cccctggcgcagccgcagcctcctcaggccgcctgggccgcgccccgcgcagccgcccgc gcccacgaggagcctcccggcctgctggcaccgcccgcccgcgccctgggccttgacccg cgccgtgactatgccactgccgtgccccggctcctgcacggcctggccgacggcgcgcac ggcctggcagacgcacctctcggccttccggggctggcggcggcccccggtctggaggac ctgcaggcaatggacgagccgggaatgagcctcctagaccagtccgtgatcaagaaaggg gaggccaaggtggaagtggtggtagcagggctggggctggacttccaggagctggggctg agttaccaggagctggggatcctaggggtacgccgaaatcccccaaagcagtccaaagaa cacaacgagagtcctaacatcccaggtggcggcgcgctggctccctggagcggggcggga cgcggccgcgcggactcacgtgcacaaccgcgcgggacggggccacgcggactcacgtgc acaaccgcgggaccccagcgccagcgggaccccagcgccagcgggaccccagcgccagcg ggaccccagcgccagcgggaccccagcgccagcgggaccccagcgccagcgggaccccag cgccagcgggtctgtggcccagtggagcgagtggagcgctggcgacctgagcggagactg cgccctggacgccccagcctagacgtcaagttacagcccgcgcagcagcagcaaagggga aggggcaggagccgggcacagttggatccggaggtcgtgacccaggggaaagcgtgggcg gtcgacccagggcagctgcggcggcgaggcagaccttcggcctccgccgagtgcggtact ggagcctgccccgccagggccctggaatcagagaaagtcgctctttggccacctgaagcg tcggatccctacagtgcctcccagcctgggcgggagcggcggctgcgtcgctga >gi568815597f:35459125_35665660|GENSCAN_predicted_peptide_4|280_aa MDSIPLAIFKGRSSEKKRDRWGKQQVPIPSKASSLSALSLAKDSLVGGITNPGEVFCSVP GRLSLLSSTSKYKVTVGEVQRRLSPPECLNASLLGGVLRRAKSKNGGRCLRERLEKIGLN LPAGRRKAANVTLLTSLVEGEAVHLARDFGYVCETEFPAKAAAEYLCRQHADPGELHSRK SMLLAAKQICKEFADLMAQDRSPLGNSRPALILEPGVQSCLTHFSLITHGFGGPAICAAL TAFQNYLLESLKGLDKMFLSSVGSGHGETKASEKDAKHRK >gi568815597f:35459125_35665660|GENSCAN_predicted_CDS_4|843_bp atggattccatccctcttgccattttcaagggaagaagctctgagaaaaaaagggacagg tgggggaagcagcaagtgcccatcccctccaaagccagcagcctctcagccctctccttg gccaaagacagcctggtgggcggcatcacaaatcctggtgaggtcttctgctccgtgccc ggccggctttcactgctcagctcaacgtccaagtacaaggtgacggtgggggaggtgcag cggcgactctcgcctcccgagtgcctcaacgcctccctcctggggggtgtcctccgcagg gccaagtccaaaaatgggggccggtgtttgcgggaacggttagagaagattgggctcaac ctgccagctggccgtcgcaaggccgccaatgtgacgctgctgacttcgctagtggaagga gaggccgtgcacctggcccgagacttcggttacgtctgtgagacggagttcccagccaag gcagctgccgagtacctgtgccgacagcacgctgacccgggggagctgcacagccgcaag agcatgctgctggctgccaagcagatctgcaaggagtttgcagacttgatggctcaggac cgctcaccgctgggcaacagccgcccagcactcatcctggagcccggagtacagagctgc ttgacacactttagcctcatcacccatggcttcggtgggcctgccatctgtgctgccctc actgccttccagaactatttgctggagtcactcaaggggctggacaagatgtttctaagc agtgtgggcagtgggcatggtgaaaccaaggcttcggagaaggatgccaagcatcggaaa taa >gi568815597f:35459125_35665660|GENSCAN_predicted_peptide_5|227_aa MTVQTKVKVDTMKDKDVIKLYVIRERRGSRAAGVAPIFALRCCLTGETWKRASLVLCRTC SPWPSATMEYLIGIQGPDYVLVASDRVAASNIVQMKDGYELSPTAAANFTRRNLADCLRS RTPYHVNLLLAGYDEHEGPALYYMDYLAALAKAPFAAHGYGAFLTLSILDRYYTPTISRE RAVELLRKCLEELQKRFILNLPTFSVRIIDKNGIHDLDNISFPKQGS >gi568815597f:35459125_35665660|GENSCAN_predicted_CDS_5|684_bp atgactgtccagactaaagtaaaagtagataccatgaaggacaaagatgtgataaagttg tacgtcatccgagagcgccgtggaagtcgtgctgcaggcgtcgcgccaatcttcgctctg aggtgctgtctcaccggtgagacctggaagcgggcgagtctcgtgctgtgtcggacctgc agcccctggccttccgccaccatggagtacctcatcggtatccaaggccccgactatgtt cttgtcgcctccgaccgggtggccgccagcaatattgtccagatgaaggacggatatgaa ttgtctcccacggcagcagctaacttcacacgccgaaacctggctgactgtcttcggagt cggaccccatatcatgtgaacctcctcctggctggctatgatgagcatgaagggccagcg ctgtattacatggactacctggcagccttggccaaggccccttttgcagcccacggctat ggtgccttcctgactctcagtatcctcgaccgatactacacaccgactatctcacgtgag agggcagtggaactccttaggaaatgtctggaggagctccagaaacgcttcatcctgaat ctgccaaccttcagtgttcgaatcattgacaaaaatggcatccatgacctggataacatt tccttccccaaacagggctcctaa