GENSCAN 1.0 Date run: 8-Nov-116 Time: 14:07:04

Sequence gi568815593f:54355790_54556239 : 200450 bp : 41.40% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   2696   2775   80  1  2   57  103    22 0.126   1.25
 1.02 Intr +  10199  10351  153  2  0   79   35   105 0.009   2.57
 1.03 Intr +  30380  30563  184  2  1   38   61   177 0.198   8.87
 1.04 Intr +  31791  31844   54  2  0   76   49    81 0.426   1.26
 1.05 Term +  45854  45994  141  1  0   82   48   170 0.796   9.35
 1.06 PlyA +  46178  46183    6                               1.05

 2.00 Prom +  59952  59991   40                              -4.15
 2.01 Init +  64447  64567  121  0  1   68   74    99 0.788   6.90
 2.02 Intr +  71406  71433   28  1  1   27  101    43 0.047  -4.24
 2.03 Intr +  72410  72515  106  2  1   50   90    80 0.125   3.70
 2.04 Intr +  76668  76733   66  2  0   82   93    66 0.043   4.68
 2.05 Intr +  82757  82860  104  1  2   57   55    90 0.347   0.65
 2.06 Term +  93993  94353  361  0  1  109   43   160 0.791   6.62
 2.07 PlyA +  94695  94700    6                               1.05

 3.00 Prom +  97832  97871   40                              -6.65
 3.01 Sngl + 100001 100453  453  1  0   61   44   493 0.970  38.15
 3.02 PlyA + 100899 100904    6                              -1.75

 4.00 Prom + 101318 101357   40                              -6.25
 4.01 Init + 104912 105053  142  1  1   70   76    63 0.909   3.66
 4.02 Term + 106605 106780  176  1  2  101   49   122 0.968   6.54
 4.03 PlyA + 107795 107800    6                               1.05

 5.02 PlyA - 108503 108498    6                               1.05
 5.01 Sngl - 126652 126347  306  1  0   84   47   262 0.963  16.27
 5.00 Prom - 128077 128038   40                              -7.25

 6.02 PlyA - 128539 128534    6                               1.05
 6.01 Sngl - 130633 129977  657  1  0   29   48   263 0.870  12.32
 6.00 Prom - 134975 134936   40                              -5.45

 7.05 PlyA - 138820 138815    6                               1.05
 7.04 Term - 145293 144701  593  1  2    3   37   302 0.477  10.30
 7.03 Intr - 148181 147967  215  1  2   13   74   184 0.250   6.94
 7.02 Intr - 148919 148396  524  2  2  -15   48   427 0.265  19.22
 7.01 Init - 149862 149731  132  0  0   85   36    43 0.481  -1.01
 7.00 Prom - 151976 151937   40                              -4.85

 8.00 Prom + 152612 152651   40                              -7.95
 8.01 Init + 162164 163784 1621  1  1  115   98  1933 0.615 187.49
 8.02 Term + 164774 164844   71  0  2   71   39    56 0.640  -3.88
 8.03 PlyA + 165753 165758    6                               1.05

 9.00 Prom + 167893 167932   40                              -5.05
 9.01 Init + 169203 169286   84  2  0   84   28   119 0.075   6.37
 9.02 Intr + 173218 173297   80  0  2   63   34    80 0.036  -2.47
 9.03 Intr + 175273 175463  191  1  2   58   81   147 0.055   9.31
 9.04 Term + 187390 187643  254  2  2   67   39   196 0.441   7.42
 9.05 PlyA + 187698 187703    6                               1.05

10.04 PlyA - 188883 188878    6                               1.05
10.03 Term - 196478 196245  234  2  0   50   47   228 0.987  10.24
10.02 Intr - 196931 196795  137  0  2   53    3   152 0.634   2.57
10.01 Init - 198367 198274   94  1  1   54   93   113 0.937   8.99

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815593f:54355790_54556239|GENSCAN_predicted_peptide_1|203_aa
MLSNTLAISHMWLLSTRNVSSLNRDVWVMDEAGNLYSQQTNTGTENQTPHVLTDKWELNN
ENTWTQGGEHHTPGPVGGLGQETLAVAKATAEFWLLSWLCEEYLRAPDASAVTHCQTKCK
GLGSELTETMRPFLNSEKQSLVTIFVATTTVALAKRGITHSGGSPLACNEQSYGEAHVTV
AMRVTLEVKPAAPVKPLEDCSPG
>gi568815593f:54355790_54556239|GENSCAN_predicted_CDS_1|612_bp
atgctgtccaatacactagccatcagccacatgtggctactgagcactagaaatgtgtca
agtctcaatcgagatgtgtgggtcatggatgaagctggaaacctttattctcagcaaact
aacacaggaacagaaaaccaaacaccacatgttctcactgataagtgggagttgaacaat
gagaacacatggactcagggaggggaacatcacacaccggggcctgtcgggggcctggga
caggaaactttagctgttgctaaggcaactgcagagttctggctcttaagctggctgtgc
gaggagtatctcagagccccagatgcctctgctgtcactcactgtcagaccaagtgcaaa
ggacttggtagtgaactcacagaaacaatgagaccatttttgaattctgaaaagcaatct
ctggtaaccatttttgttgcaactactactgtggcccttgcaaaacgagggatcactcat
tctgggggcagccctctggcatgtaatgagcagtcctatggagaggcccatgtgacagta
gccatgcgagtgacactggaagtgaaacctgcagctccagtcaagcctttagaagactgc
agccctggctga
>gi568815593f:54355790_54556239|GENSCAN_predicted_peptide_2|261_aa
MTLNEHAAFKHLFNKAHLAPPLIHLTLSGHGTCFREHRVGVLSEKQQASWQCDSRLFQLS
ASSHSATTAKSTLPFHEQSIWRLSEAARVPSQDGGCLASEEETEENKFADAVFPAGNMTD
TFKSPRKQLFPRWYKILSLPQRVYRGHAEILVSSLGMMFSFGSRTLKMPSCYLEHSDVLP
LLLCSKLPVAHSVSLLIPAESWLFLENLLGLQAKLLLEFAACVQWSMISKSLQFNVHPTD
DSDLGLVEDSIAASILSSLST
>gi568815593f:54355790_54556239|GENSCAN_predicted_CDS_2|786_bp
atgactcttaacgagcatgctgccttcaagcatctgtttaacaaagcacatcttgcaccg
cccttaatccatttaaccctgagtggacatggcacgtgtttcagagagcaccgggttggg
gttttaagtgaaaagcagcaggcgtcatggcagtgtgacagcagactgtttcagctgtct
gcttcttcacactctgccaccactgcaaagagcaccttgcccttccatgagcagtccatc
tggcggctgagcgaggctgctcgagtaccctcacaagatggtggctgccttgcctcagag
gaagagactgaagaaaataagtttgcagatgccgtattccctgctgggaacatgactgac
acattcaagtctcctcgaaagcagttatttccaagatggtataagatcctgtccctgcct
cagagagtgtatcgtggccacgctgagattcttgtatcttctcttggcatgatgttttct
tttggctccaggaccttgaagatgccatcctgttacctggaacactcagatgtcctccct
ctccttctctgctcaaagcttcctgtggcccactctgtcagccttcttatcccagctgag
tcatggctctttttagaaaaccttctcggacttcaagccaagctgttattggaatttgcc
gcctgtgttcaatggtcaatgatctccaagtcactgcaatttaacgttcaccctactgat
gactctgaccttggtttggtcgaggacagcattgctgcatctatactgtcatccttatcc
acctaa
>gi568815593f:54355790_54556239|GENSCAN_predicted_peptide_3|150_aa
MAKIILRHLIEIPVRYQEEFEARGLEDCRLDHALYALPGPTIVDLRKTRAAQSPPVDSAA
ETPPREGKSHFQILLDVVQFLPEDIIIQTFEGWLLIKAQHGTRMDEHGFISRSFTRQYKL
PDGVEIKDLSAVLCHDGILVVEVKDPVGTK
>gi568815593f:54355790_54556239|GENSCAN_predicted_CDS_3|453_bp
atggcaaaaatcattttgaggcacctcatagagattccagtgcgttaccaggaagagttt
gaagctcgaggtctagaagactgcaggctggatcatgctttatatgcactgcctgggcca
accatcgtggacctgaggaaaaccagggcagcgcagtctcctccagtggactcagcggca
gagacgccaccccgagaaggcaaatcccactttcagatcctgctggacgtggtccagttc
ctccctgaagacatcatcattcagaccttcgaaggctggctgctgataaaagcacaacac
ggaaccagaatggatgagcacggttttatctcaagaagcttcacccgacagtacaaacta
ccagatggtgtggaaatcaaagatttgtctgcagtcctctgtcatgatggaattttggtg
gtggaagtaaaggatccagttgggactaagtga
>gi568815593f:54355790_54556239|GENSCAN_predicted_peptide_4|105_aa
MSRAQGFSQWTLAALEAWQNCWQKWQQHGLGGRRCLQKQGVSQKDMQAYCAPHYEQHAKS
AWFLPKILPERCPASPRLLLATSALTAFGNRMHLPGTLALLKEPP
>gi568815593f:54355790_54556239|GENSCAN_predicted_CDS_4|318_bp
atgagtagggctcaaggcttcagtcagtggactcttgctgccttggaagcctggcaaaac
tgctggcaaaagtggcagcagcatggtctgggaggcaggcgatgccttcagaaacaaggt
gtcagtcaaaaagatatgcaagcatactgtgccccacactatgagcagcatgccaaaagt
gcctggttcctacccaaaatacttcctgaaagatgtccagctagtccaaggctactgtta
gccaccagcgcactgactgcatttggcaatcgcatgcacttaccaggcacgctggccctt
ctcaaagagccaccttga
>gi568815593f:54355790_54556239|GENSCAN_predicted_peptide_5|101_aa
MAPRLLKLQFLPWLVTSWAKVVNWVEKLLSGDPQEPPTNHLMNCPDPSSLAIKDLVTRLL
RMLPENSLQLTAFWGAALAEENGLVQSHASFRHSLHPTTNQ
>gi568815593f:54355790_54556239|GENSCAN_predicted_CDS_5|306_bp
atggctcccaggctcctcaaactccaatttctcccttggctggtgacatcttgggctaag
gtggttaattgggtagagaaacttctttcaggagacccacaggagcctcccactaaccac
ttgatgaactgcccagatccctcctctttagcaataaaggacttggtcacccggctgctg
cgaatgttgcccgaaaacagccttcagctgactgccttctggggggctgccttggctgaa
gaaaacggcctcgtccagagtcacgcttccttccggcacagcctgcatccaacgactaat
caatga
>gi568815593f:54355790_54556239|GENSCAN_predicted_peptide_6|218_aa
MNIDAKILNKILANRIQQHIKKLIQHDQVGFIPGMQGWFNICKSINIIHHINRTKDKNHM
IISIDAEKAFDKIQQPFMLKTLSKLGTDGTYLKTIRAIYDKPTANIILNGQKPEAFPLKT
GTRQGCPLSPLLFNIVLEVLTRATRQEKEIKGNQLGKEEVKLSLFADDMIIYSENPITSA
QNLLSKQISFSKVSGYKINVQNHKHSYTPITDKQRAKS
>gi568815593f:54355790_54556239|GENSCAN_predicted_CDS_6|657_bp
atgaacattgatgcaaaaatcctcaataaaatactggcaaaccgaatccagcagcacatc
aaaaagcttatccaacacgatcaagttggcttcatccctgggatgcaaggctggttcaac
atatgcaaatcaataaacataatccatcatataaacagaaccaaagacaaaaaccacatg
attatctcaatagatgcagaaaaggcctttgacaaaattcaacagcccttcatgctaaaa
actctcagtaaactaggtactgatgggacgtatctcaaaacaataagagctatttatgac
aaacccacagccaatatcatactgaatggacaaaaaccggaagcattccctttgaaaact
ggcacaagacagggatgccctctctcaccactcctattcaacatagtgttggaagttctg
accagggcaaccaggcaggagaaagaaataaagggtaatcaattaggaaaagaggaagtc
aaactgtctctgtttgcagatgacatgattatatattcagaaaaccccatcacctcagcc
caaaatctccttagtaagcaaatctccttcagcaaagtctcaggatacaaaatcaatgtg
caaaatcataagcattcttatacaccaataacagacaaacagagagccaaatcatga
>gi568815593f:54355790_54556239|GENSCAN_predicted_peptide_7|487_aa
MNLLGSYTFTPTEVIPQNMALQNISSSDLRPYNSDVIPRNNLEREEQEYSEVTEEVTEQV
YLPAKAKVAKEGEVHPYPSAPPHYYFEENDPPDLSFPEDTGRKVVAPVTVRAAPRGTALS
SIQAGIQQARQEGDLEAWQFPVRIHPPDQQGNIIATFEPFPFKLLKEFKQAISQYGPGSP
FVMGLLKNVTVSSQKIPTDWDTLTRACLTPAQFLQFKTCAMKQGPREPYVDFIARLQESL
KKMIADSAAQEIVLQLLAFDNAHPDCQAALRPIRGKAHLVDYVKACDGIGADFVGILDNH
FPKTKLFQFLKLTNWILLKITKFKPIEGAENVFADGSSNGKASYFGSKSKVFQTSCTSAQ
KAELVAVIEVLTAFDMPINVISDSSYVVHSTQLIENAQLRFHTDEQLMTLFTQLQTAVRS
RMHPFYSTHIRAHTPLPGPLTEGNQMADCLLATAISNARHFHNLTHVNASGLKCRYSITW
KEAKAII
>gi568815593f:54355790_54556239|GENSCAN_predicted_CDS_7|1464_bp
atgaacctgttgggcagttatactttcacccctacagaggtgatacctcaaaatatggct
ctgcaaaatatctcaagcagtgatcttaggccttacaatagtgatgttatccccagaaac
aacttggagagggaggaacaagagtatagtgaagtaacagaagaggttacagagcaggtt
tatttgccagctaaagccaaagtggcaaaggagggagaggttcatccctacccttctgca
ccccctcattattattttgaagaaaatgaccccccagatctttcttttccggaggacact
gggcgaaaagtagttgcccccgtgactgttcgagcagcacctcgagggactgctcttagt
tctattcaggcaggcattcagcaagctagacaagagggtgatttagaggcttggcagttc
cctgttagaatacaccccccagatcaacagggaaatattatagctacatttgagcctttt
ccttttaaattactcaaagaattcaaacaagcaataagtcagtatggaccaggttctccc
tttgtaatgggactgttaaagaatgttacagtttccagtcagaagattcctactgactgg
gacactcttactcgagcttgtctaactcctgctcagttcttacaatttaaaacttgtgct
atgaaacagggaccaagggaaccatatgttgattttatagctcggttacaggagtctctt
aaaaagatgattgcagattcggctgctcaggagatagtgttgcagttactagctttcgac
aatgctcatcccgattgccaggctgctctgcgacctatcagagggaaagcacatttagtt
gattatgtcaaggcctgtgatggtattggagctgactttgtgggtattctcgataatcat
tttcctaaaacgaagctgtttcagtttctgaaattaactaattggattctccttaaaata
actaaatttaaaccaattgaaggtgctgagaatgtttttgcagatgggtctagtaatggt
aaagcttcttattttggctcaaaaagtaaagttttccagacgtcctgtacttcagctcaa
aaagcggagcttgtagctgtaattgaggtattgactgcttttgatatgcctattaatgtg
atttccgattcttcatacgtggttcattccacacagttaattgaaaatgctcagttacga
tttcatacagatgaacaactgatgactttatttacccaactgcaaacagcagttaggagt
agaatgcaccctttttacagcactcacattagagctcatacacctcttccaggacctttg
actgaagggaatcaaatggctgattgcctacttgctactgcaatatctaatgctagacac
tttcacaatttaacccatgttaatgcttctggtctcaaatgcagatacagcattacctgg
aaagaagctaaagctattatctag
>gi568815593f:54355790_54556239|GENSCAN_predicted_peptide_8|563_aa
MALRARALYDFRSENPGEISLREHEVLSLCSEQDIEGWLEGVNSRGDRGLFPASYVQVIR
APEPGPAGDGGPGAPARYANVPPGGFEPLPVAPPASFKPPPDAFQALLQPQQAPPPSTFQ
PPGAGFPYGGGALQPSPQQLYGGYQASQGSDDDWDDEWDDSSTVADEPGALGSGAYPDLD
GSSSAGVGAAGRYRLSTRSDLSLGSRGGSVPPQHHPSGPKSSATVSRNLNRFSTFVKSGG
EAFVLGEASGFVKDGDKLCVVLGPYGPEWQENPYPFQCTIDDPTKQTKFKGMKSYISYKL
VPTHTQVPVHRRYKHFDWLYARLAEKFPVISVPHLPEKQATGRFEEDFISKRRKGLIWWM
NHMASHPVLAQCDVFQHFLTCPSSTDEKAWKQGKRKAEKDEMVGANFFLTLSTPPAAALD
LQEVESKIDGFKCFTKKMDDSALQLNHTANEFARKQVTGFKKEYQKVGQSFRGLSQAFEL
DQQAFSVGLNQAIAFTGDAYDAIGELFAEQPRQDLDPVMDLLALYQGHLANFPDIIHVQK
AGLAFTGEVLTAFLVGQSVYEQP
>gi568815593f:54355790_54556239|GENSCAN_predicted_CDS_8|1692_bp
atggcgctgcgcgcccgggcgctgtacgacttcaggtcggagaacccaggagagatctcg
ctgcgagagcacgaggtgctgagcctgtgcagcgagcaggacatcgagggctggctcgag
ggggtcaacagccgcggcgaccgcggcctcttcccggcctcctatgtgcaggtgatccgc
gcccccgagcctggcccggcgggagacggcggcccgggcgccccggcccgctacgccaat
gtgccccccgggggcttcgagcccctgcctgtcgcgccccccgcctccttcaagccgccg
cctgacgccttccaggcgctgctgcagccacagcaggcgccgcctccgagcaccttccag
ccgcccggcgcgggcttcccgtacggcgggggcgccctgcagccgtcgcctcagcagctc
tacggcggctaccaggccagccaaggcagcgatgatgactgggacgacgagtgggacgac
agctccacggtggcggacgagccgggcgctctgggcagcggagcatacccggacctcgac
ggctcgtcttcggcgggtgtgggcgcagccggccgctaccgcctgtccacgcgctccgac
ctgtccctgggttcccgcggcggctcggtccccccgcagcaccacccgtcggggcccaag
agctcggccaccgtgagccgcaacctcaatcgcttctccaccttcgtcaagtccggcggg
gaggccttcgtgctgggggaggcgtcaggcttcgtgaaggacggggacaagctgtgcgtg
gtgctggggccctatggccccgagtggcaggagaacccctacccgttccagtgcaccatc
gacgaccccaccaagcagaccaagttcaagggcatgaagagctacatctcctacaagctg
gtgcccacgcacacgcaggtgccggtgcatcggcgctacaagcacttcgactggctgtac
gcgcgcctggcggagaagttcccggtcatctccgtgccccacctgcccgagaagcaggcc
accggccgcttcgaggaggacttcatctctaagcgcaggaagggcctgatctggtggatg
aaccacatggccagccacccagtgctggcgcagtgcgacgtcttccagcacttcctgacg
tgccccagcagcaccgacgagaaagcctggaagcagggcaagaggaaggccgagaaggac
gagatggtgggcgccaacttcttcctgacccttagcacgccccccgccgctgcccttgac
ctgcaggaggtggagagcaagatcgacggcttcaagtgcttcaccaagaagatggacgac
agcgcgctgcagctcaaccacacggccaacgagttcgcgcgcaagcaggtgaccggcttc
aaaaaggagtatcagaaggtgggccagtccttccgcggcctcagccaggcctttgagctg
gaccagcaggccttctcggtgggcctgaaccaggctatcgccttcaccggagatgcctat
gacgccattggcgagctcttcgcggagcagcccaggcaggacctggatcccgtcatggac
ctattagcgctgtatcaggggcatctggctaacttcccggacatcatccacgttcagaaa
gcgggtctggcctttactggtgaagtgctcactgcctttctggtgggccagtccgtttat
gagcaaccctga
>gi568815593f:54355790_54556239|GENSCAN_predicted_peptide_9|202_aa
MPSQEAVGAEPHRSGSWPVGTSMGSGGPLLLEVIAETRKIKWGADMARPALKMDRNLNCE
FLEASFTGAPWLPEAGLLEVTPVGWAEMRRLLEYSRGRVGFGKEQVNSRHLPGIMAGPGA
LTKVKESRRHVEEGKMEVQKADGIQDRCNTISFATLAEIHHFHQIRVRDFKSQMQHFLQQ
QIIFFQKVTQKLEEALHKYDSV
>gi568815593f:54355790_54556239|GENSCAN_predicted_CDS_9|609_bp
atgccaagccaagaagccgtgggtgcagagcctcatcgcagtgggagctggcctgtcggg
accagcatgggctctggagggccactgctgttggaggtgatagcggaaacacggaagatc
aaatggggtgccgacatggccaggcctgcacttaagatggacagaaatttaaactgcgag
ttcctggaagcatccttcactggtgcaccctggctccctgaggcagggcttttggaagtt
actcctgttggttgggcagagatgaggaggctcttagagtactccaggggaagagttggc
tttggaaaggaacaggtgaactcgagacacctgccaggtatcatggcaggacctggagct
cttaccaaagtcaaggagagtaggcgacacgtggaggaagggaagatggaggtgcagaag
gctgacggcattcaggatcgctgtaacactatttcttttgccactttggctgaaattcac
cacttccatcaaattcgagtgagagactttaaatcacagatgcagcatttcttacaacaa
caaataatatttttccaaaaagttacccagaagttggaagaagctcttcacaaatatgat
agtgtttaa
>gi568815593f:54355790_54556239|GENSCAN_predicted_peptide_10|154_aa
MVAIDEQMEAEERFNTSLAKGNLVVKEQQSPVPTTVPEAQQAFNKYLQNEGLYRSQRPGC
LHRADGIIKLFKQNDSNMCLRRTNGSNLADADVTVKTPASRSRMSQKGVMRGNGSRRGGT
DLNSAGWWTDAAPTAEQSQHCGCSLLLEPTPPFS
>gi568815593f:54355790_54556239|GENSCAN_predicted_CDS_10|465_bp
atggtagcaatagatgaacaaatggaagcagaagaaaggttcaatacaagccttgccaag
ggcaatctggtggtaaaagagcagcagagccctgtgcctacaacagtgcccgaagcacaa
caagcattcaataaatatctgcagaatgaaggactctatcggtcccagcgtccaggatgt
ctccaccgtgccgatggcatcatcaagcttttcaagcaaaatgactccaatatgtgttta
aggagaacaaatggcagcaacttggcagacgcagatgtgacagtcaaaacaccagccagc
cgctcacgaatgtctcagaagggtgtgatgcgtggtaatgggtcacgtcgcggaggcaca
gatttgaattccgcaggctggtggacagacgcggcacccacagcggaacaaagccagcac
tgtggctgctctctcctcctggagccaactcctccattctcctga