GENSCAN 1.0 Date run: 4-Nov-116 Time: 15:18:31 Sequence gi568815589r:90513231_90713827 : 200597 bp : 39.84% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 PlyA - 3224 3219 6 1.05 1.02 Term - 4523 4359 165 2 0 60 40 166 0.658 5.93 1.01 Init - 10145 10053 93 2 0 80 82 86 0.615 7.63 1.00 Prom - 12011 11972 40 -3.45 2.00 Prom + 30381 30420 40 -5.15 2.01 Init + 43858 43978 121 0 1 17 71 140 0.912 5.60 2.02 Intr + 47420 47568 149 0 2 98 49 67 0.476 2.83 2.03 Term + 49178 49402 225 1 0 44 28 190 0.478 4.40 2.04 PlyA + 49605 49610 6 1.05 3.00 Prom + 56850 56889 40 -7.45 3.01 Init + 60656 60850 195 1 0 46 95 156 0.913 11.08 3.02 Intr + 62116 62204 89 0 2 7 71 36 0.106 -8.35 3.03 Intr + 69218 69336 119 2 2 123 49 83 0.360 7.09 3.04 Intr + 70546 70814 269 2 2 84 85 194 0.303 15.03 3.05 Term + 71404 71688 285 0 0 67 48 276 0.982 15.92 3.06 PlyA + 72380 72385 6 1.05 4.00 Prom + 83844 83883 40 -3.75 4.01 Init + 84218 84338 121 1 1 48 58 145 0.476 7.90 4.02 Term + 92586 92740 155 1 2 -33 37 225 0.088 2.40 4.03 PlyA + 93299 93304 6 1.05 5.02 PlyA - 94974 94969 6 1.05 5.01 Sngl - 100597 99998 600 1 0 90 49 844 0.806 76.64 5.00 Prom - 111010 110971 40 -5.65 6.00 Prom + 116024 116063 40 -6.85 6.01 Init + 119889 119998 110 2 2 80 100 80 0.583 7.86 6.02 Intr + 129166 129282 117 1 0 86 65 53 0.274 1.46 6.03 Intr + 129439 129590 152 1 2 59 61 161 0.385 9.39 6.04 Intr + 145159 145286 128 2 2 51 82 83 0.301 3.48 6.05 Intr + 151918 152122 205 0 1 40 41 117 0.174 0.05 6.06 Intr + 152805 152943 139 1 1 9 78 147 0.382 4.40 6.07 Intr + 157139 157283 145 2 1 86 90 22 0.441 1.56 6.08 Intr + 167485 167615 131 0 2 16 82 77 0.031 -1.53 6.09 Intr + 169218 169496 279 0 0 24 102 170 0.070 7.67 6.10 Intr + 174046 174371 326 1 2 20 -14 310 0.020 8.49 6.11 Term + 174417 175084 668 1 2 -37 39 310 0.545 6.60 6.12 PlyA + 175315 175320 6 1.05 7.00 Prom + 175480 175519 40 -6.15 7.01 Sngl + 176438 177412 975 1 0 70 42 422 0.964 32.31 7.02 PlyA + 177444 177449 6 1.05 8.00 Prom + 178992 179031 40 -3.65 8.01 Init + 182260 182314 55 0 1 55 44 62 0.405 -0.00 8.02 Term + 182409 182671 263 1 2 104 33 141 0.475 4.90 8.03 PlyA + 184057 184062 6 1.05 9.04 PlyA - 184081 184076 6 1.05 9.03 Term - 189750 189630 121 2 1 119 49 71 0.135 3.27 9.02 Intr - 195212 195118 95 2 2 67 20 97 0.103 -1.36 9.01 Init - 195986 195849 138 2 0 81 53 89 0.646 3.17 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815589r:90513231_90713827|GENSCAN_predicted_peptide_1|85_aa MVPPEGDMMLTREEARMEQRKEAGLGQSALWYLRAEAQEEELDIRPVAPQSILHHTSPSR SPSQELILRVDLISRLPLWFPCMFA >gi568815589r:90513231_90713827|GENSCAN_predicted_CDS_1|258_bp atggtccctcctgaaggagacatgatgttaaccagggaagaggctcggatggaacagagg aaagaggctgggcttggtcagtcagcattgtggtatctgagagcggaggcgcaggaggag gagctggacattcgtccagttgcccctcagtccattctccaccacacttctccatcccgc tccccgtcacaggagctgattttacgagtggacctcatcagcaggctcccactctggttc ccatgtatgtttgcctag >gi568815589r:90513231_90713827|GENSCAN_predicted_peptide_2|164_aa MHNIVSHAEYTGVQVFRDTSSMLQEGATGSTVKAKCTESIGCSVRPLLPRCLHAWLVVGP PACLSFLFIEDKTTSAFGYGDIFYLEEQERQHMTGQLPEVRDHDNGTLKPHLNDLTFSVG AGGCTISNEWVSLPSVPKRVYKAMSPEPAVTHTALNPIQSSLPL >gi568815589r:90513231_90713827|GENSCAN_predicted_CDS_2|495_bp atgcacaacattgtgagtcatgctgagtacacaggggtccaggtattcagggacacttca tcaatgctacaggaaggagctacaggaagcactgttaaggctaaatgcactgaaagcata ggatgttcagtgcgtcctctcttgccacgttgtctccatgcttggcttgtggttgggcct ccagcatgcttatcttttcttttcattgaggataagacgacatctgcatttggatatgga gacattttctacttagaagaacaggaaaggcagcacatgactggtcaactgccagaggtg agggatcatgacaatgggacattgaagccacacctaaacgacctgacattttcagtgggg gcaggtggctgtaccatcagtaatgagtgggtatcactgccttcagtgcccaagagagtg tacaaggcaatgtctccagagcctgctgtcactcacacggccctcaaccccatccagtcc tccctccctctttaa >gi568815589r:90513231_90713827|GENSCAN_predicted_peptide_3|318_aa MLNLTPPKDNNQNPEHGKVLNKWADFFNNELEGGENEKKIKDKKYVKGLSSRPMGAFYLD PDSNMKTGILGEMADPRTRTENIQDKPGASCSTNKAREAAASAQWYWGKKEAAKRVLGKS SPAPEGSGDKMALSAQRTIYPVGGGPLFLLGTELSAGFQVVRCGPQATSKWRQCCRALGS HGGPAAHHTMPVQLLLATLWAAVIVMYRILGLAPATLWIPWVVQGSRDHVEDEDLAVEGN RKAEHTGSPASQPQVPLKSMDDNGDLGPSFKCPEGEIGTADVQLWEELTRSKERVTQDQS LTRAGLSAGQGHSHGGSS >gi568815589r:90513231_90713827|GENSCAN_predicted_CDS_3|957_bp atgttaaatttgacaccacccaaagacaataatcagaatccagaacatgggaaggtattg aacaaatgggcagattttttcaacaacgaattggaagggggagaaaatgagaagaagatc aaggataaaaaatatgtaaagggcctatcatcaagaccaatgggtgcattttatttggat cctgactcaaacatgaaaacagggatccttggagaaatggctgatcctaggactaggaca gaaaatatacaagacaagcctggagcatcttgtagtacaaacaaggccagagaggccgcg gcctctgcccagtggtactgggggaaaaaggaagccgcgaagcgggtcctggggaagagc tcacctgctcctgagggaagtggcgacaagatggcgctttcagcccaacggacgatctac cctgttggtggtgggcccttgtttctgttaggcacggaactctccgcagggttccaagtg gtccgttgtggcccacaggccactagcaagtggaggcagtgctgcagagcccttgggagc cacggagggcctgctgcccaccacacaatgcctgttcagctgctgctggccaccctgtgg gcagcagtgatagtgatgtacagaatcctgggactagcgccagcgaccctgtggataccc tgggtggtccagggcagtagagaccacgtggaggatgaggacttagcagtggaggggaac aggaaagcagaacatactggttccccagcgtcccagccacaagtccctctgaagagtatg gatgacaatggtgacttaggtccatccttcaaatgtcctgaaggggagattggaacagca gatgtacaattgtgggaagaattaacaaggagtaaagaacgcgtaacacaagaccaaagt ctgaccagagcaggcctctctgcaggacaagggcacagccatggagggagctcctga >gi568815589r:90513231_90713827|GENSCAN_predicted_peptide_4|91_aa MHHTHQVPRHGEDRAYHEDLGPQQGDQEPLASPAAEEEEGEPVPWVKGILWMWLQQQPWS AECKELRPRTPHSKYLGQQQVGKAKAELMGA >gi568815589r:90513231_90713827|GENSCAN_predicted_CDS_4|276_bp atgcatcacacccatcaggtgccgcgtcatggggaggacagggcctatcatgaagacctt ggcccacagcaaggtgaccaggagcccctggccagtcccgcagcggaggaggaggaagga gagccagtgccgtgggttaaaggaatcctgtggatgtggctgcagcagcagccatggagt gctgaatgcaaggaactgcggccacggaccccacattcaaagtatctggggcaacagcaa gtcggcaaagccaaggcagaattgatgggggcataa >gi568815589r:90513231_90713827|GENSCAN_predicted_peptide_5|199_aa MPEQSNDYRVAVFGAGGVGKSSLVLRFVKGTFRESYIPTVEDTYRQVISCDKSICTLQIT DTTGSHQFPAMQRLSISKGHAFILVYSITSRQSLEELKPIYEQICEIKGDVESIPIMLVG NKCDESPSREVQSSEAEALARTWKCAFMETSAKLNHNVKELFQELLNLEKRRTVSLQIDG KKSKQQKRKEKLKGKCVIM >gi568815589r:90513231_90713827|GENSCAN_predicted_CDS_5|600_bp atgcctgagcagagtaacgattaccgggtggccgtgtttggggctggcggtgttggcaag agctccctggtgttgaggtttgtgaaaggcacattccgggagagctacatcccgacggtg gaagacacctaccggcaagtgatcagctgtgacaagagcatatgcacattgcagatcacc gacacgacggggagccaccagttcccggccatgcagcggctgtccatctccaaagggcac gccttcatcctggtgtactccattaccagccgacagtccttggaggagctcaagcccatc tacgaacaaatctgcgagatcaaaggggacgtggagagcatccccatcatgctggtgggg aacaagtgtgatgagagccccagccgcgaggtgcagagcagcgaggcggaggccttggcc cgcacatggaagtgtgccttcatggagacctcagccaagctcaaccataacgtgaaggag cttttccaggagctgctcaacctggagaagcgcaggaccgtgagtctccagatcgacggg aaaaagagcaagcagcagaaaaggaaagagaagctcaaaggcaagtgcgtgatcatgtga >gi568815589r:90513231_90713827|GENSCAN_predicted_peptide_6|799_aa MLTSQAACRGQRAATEGALASLSSGQFGQKQRQLGLRPHLLTLFLNPLNFHINRNTCTVE FSKTVNPVLEGIPNNTGWQTIMPKPPVPQCGRSEATAEGGNRLPGEEHNSAQSFFRAPLA QDRAAQGNSKFFGYLTEFAAATCFFQRICEFLQPSWYVPMMVLGAKVYNVWTYYTEETKV WQLQPSASRCRSACHHAVIRHSDRGKAVAGRTGAPAGGDCAVALEVVLICSGVLASAGRS ESSVSWRTLVGVARDPDWKAPPSEKTLSALSTPPTSCSQPKERSYNEFKDTLGCTGGYTW ESEERVHFCKVLPLLNLISPATDRSSLGYTKRSNVNIAQVNSESSKHVSGAQNPRENSPM SSSYNQESSEHNGLHSTRVSHDAQSKEMHKRRGVHRGSQADGGQGGMTNKTEEEDTSLWK SLCTKCRGSELMLHTNSKATTNAEAQIPSPGPQEGWSAAARSECRKSIRPKVDKATKMGK KQSRKTRNSKNQSTSPPPKERSSSPATEQSWMENDFDELREEGFRRSNYSKLKEEVRTNG KEVKNFEKKLDEWITRITDAEKSLKDLMELKTTARELQRVSAMEDEMNEMKHEEKFREKR IKRNEQSLQEIWDSVKRPNLCLIGVPESDGENGTKLENTLQDIIQENFPNLARQANIQIQ EIQRTPQRYSSRRATPRHIIVRFTKVEMKEKMLRAAREKGQVTHKGKPIRLTADVSAETL QARREWGPIFNILKEKSFQPRISYPAKQSFISEGEIKYFTDKQMLRDFVTTRPALKELLK EALNMERKNRYKPLQKHAK >gi568815589r:90513231_90713827|GENSCAN_predicted_CDS_6|2400_bp atgctgacatcacaggcggcttgtagagggcagagggcagccactgaaggggccctggct tccctgtcaagtggacaatttggtcaaaaacaaaggcagcttgggctaagaccacatctt cttactcttttcctaaatccattaaacttccacattaacagaaacacctgcacagtagaa ttttccaaaacggtaaatcctgttttggaaggaatccctaataatacgggctggcaaaca atcatgccaaagccgcccgtgccccaatgcggacgcagcgaagccacggcagaaggggga aatcggttacctggggaagaacacaactccgctcagagcttcttcagagctccactcgcg caggacagggcagcgcaggggaattcgaagttttttggctatctcacagagtttgcagca gcaacctgcttctttcaaaggatctgtgaattcttacagccttcctggtatgttcccatg atggttcttggagcaaaagtttacaatgtttggacctactacactgaggagaccaaggtg tggcagctgcagccgagtgctagcagatgcaggagtgcttgccaccatgcagtcattcgc cacagtgacagaggcaaggcggttgcagggagaacgggggcccctgctggaggagactgt gctgttgcactggaggtggtgttgatttgcagtggggtgctggctagtgcagggaggtct gaatcctctgtcagctggagaacactggtgggagtagccagagaccctgactggaaggct ccacctagtgaaaagaccctgagtgccttatccacacctccaacaagctgcagtcagccc aaggagaggagttacaatgagtttaaggatacattgggctgtactggggggtacacatgg gaaagtgaagagagggttcacttttgcaaggtacttccccttctaaatttaatttctcca gccactgatagatcttccctgggctatactaaaaggtccaatgtaaatattgctcaagta aactcagagagctcaaaacacgttagtggagctcagaatccaagagaaaactcacccatg tcctccagttacaatcaagagagtagtgagcacaatgggctccacagcaccagggtgagc cacgatgcacaaagcaaagagatgcacaaaagaagaggtgtgcacagaggatcccaggca gatggtggtcaaggcggaatgacaaataagacagaggaggaggatacatctctctggaaa tctttgtgcaccaaatgcagaggatctgagttgatgcttcacacaaacagcaaggcaact actaatgcagaggctcaaatcccctctccgggaccacaggagggctggtctgcagcggcc aggtcagaatgtaggaaatctataagaccaaaggtagataaagccacaaagatggggaaa aaacagagcagaaaaaccagaaactctaaaaatcagagcacctctcctcctccaaaggaa cgcagctcctcaccagcaacggaacaaagctggatggagaatgactttgatgagttgaga gaagaaggcttcagaagatcaaattactccaagctaaaggaggaagttcgaaccaatggc aaagaagttaagaactttgaaaaaaaattagatgaatggataactagaataaccgatgca gagaagtccttaaaggacctgatggagctgaaaaccactgcacgagaactacaaagggta tcagcgatggaagatgaaatgaatgaaatgaagcatgaagagaagtttagagaaaaaaga ataaaaagaaatgaacaaagcctccaagaaatatgggactctgtgaaaagaccaaatcta tgtctaattggtgtacctgaaagtgacggggagaatggaaccaagttggaaaatactctg caggatattatccaggagaacttccccaatctagcaaggcaggccaacattcaaattcag gaaatacagagaacgccacaaagatactcctcgagaagagcaactccaagacacataatt gtcagattcaccaaagttgaaatgaaggaaaaaatgttaagggcagccagagagaaaggt caagttacccacaaagggaagcccatcagactaacagctgatgtctcagcagaaactcta caagccagaagagagtgggggccaatattcaacattcttaaagaaaagagttttcaaccc agaatttcatatccagccaaacaaagcttcataagtgaaggagaaataaaatactttaca gacaagcaaatgctgagagattttgtcaccaccaggcctgccctaaaagagctcctgaag gaagcactaaacatggaaaggaaaaatcggtacaagccactgcaaaaacatgccaaatag >gi568815589r:90513231_90713827|GENSCAN_predicted_peptide_7|324_aa MDKFLNTYTLPRLNQEEVESLNRPITGSEIEAIINSLPTKKSPGPDGFTAEFYQRYKEEL VPFLLKLFQSIEKEGILPNSFYEASIILIPKPGRDTTKKENFRPISLMNIDAKILNKILA NRIQQHIKKLIHHDQVDFIPGMQGWFNMQKSINVIQHINRTKDKNHMIISIDAEKAFDKI QQHFMLKTLNQLVIDGTYLKIIRAIYDKPTANIILNGQKLEAFPLKTGTRQGCPLSPLLF NIVLEVLAREIKQEKEIKGIQLGKEEVKLSLFADDMIVYLENPIVSAQNLLKLISNFNKV SGYKIDVQKSQAFLYTNNRQRAKS >gi568815589r:90513231_90713827|GENSCAN_predicted_CDS_7|975_bp atggataaattcctcaacacatacaccctcccaagactaaaccaggaagaagttgaatct ctgaatagaccaataacaggctctgaaattgaggcaataattaatagcttaccaaccaaa aaaagtccaggaccagatggattcacagccgaattctaccagaggtacaaggaggagctg gtaccattccttctgaaactattccaatcaatagaaaaagagggaatcctccctaactca ttttatgaagccagcatcatcctgataccaaagccgggcagagacacaaccaaaaaagag aattttagaccaatatccttgatgaacattgatgcaaaaatcctcaataaaatactggca aaccgaatccagcagcacatcaaaaagcttatccaccatgatcaagtggacttcatccct gggatgcaaggctggttcaacatgcaaaaatcaataaacgtaatccagcatataaacaga accaaagacaaaaaccacatgattatctcaatagatgcagaaaaggcctttgacaaaatt caacaacacttcatgctaaaaactctcaatcaattagttattgatgggacatatctcaaa ataataagagctatctatgacaaacccacagccaatatcatactgaatgggcaaaaactg gaagcattccctttgaaaactggcacaagacaaggatgccctctctcaccactcctattc aacatagtgttggaagttctggccagggaaatcaagcaggagaaggaaataaagggcatt cagttaggaaaagaggaagtcaaattgtccctgtttgcagatgacatgattgtatatcta gaaaaccccatcgtctcagcccaaaatctccttaagctgataagcaacttcaacaaagtt tcaggatacaaaatcgatgtgcaaaaatcgcaagcattcttatacaccaataacagacag agagccaaatcatga >gi568815589r:90513231_90713827|GENSCAN_predicted_peptide_8|105_aa MGAGSGGAHQVTSHGADQGTLPGSSAHCLVSVLRVSSSDSTFSPVVSATPKSVLMFPKCV PRSTLFVAVAYLQRGWHLHVDTPRASQTRQVQTGAPCHPSTASVL >gi568815589r:90513231_90713827|GENSCAN_predicted_CDS_8|318_bp atgggtgcaggttctggtggagcccatcaggtcacctcgcacggtgctgaccagggtact ttgccgggcagctctgctcactgcctggtgtcagtcctgcgcgtgtcctcttctgactct acattttctcctgtggtctcagccactcctaaatctgtcctgatgttccccaaatgcgta cctaggtccacactttttgtggctgtcgcatacctacagagagggtggcatctccacgtg gacacccctcgggcatctcagactcgtcaagttcaaaccggagcaccttgtcatccctcc acagcctctgtcctctaa >gi568815589r:90513231_90713827|GENSCAN_predicted_peptide_9|117_aa MPCLEMVRGVAAGVLLGDCALCKQELRMGELNYLKRLFKATKTEIKIKVEEMSTSLPEHR PLLAIEAQRTALRGHALFFDTTTEQHSLLIRDHKPWSGPRQSTEPVHRRPSVSSFDV >gi568815589r:90513231_90713827|GENSCAN_predicted_CDS_9|354_bp atgccttgtttggagatggttaggggggtggctgctggagtcctgcttggagactgtgct ctgtgcaaacaggaattgaggatgggagagctaaattacttgaaaagactatttaaggcc acaaagacagaaataaaaattaaagtggaggagatgagcacctcccttcccgagcacagg cccctgctggccattgaggctcagagaacagcactgcgtggccatgcgctcttttttgac acaacaactgaacagcattctttgttgataagagaccacaaaccatggagtggtcctcgc cagtctacagagcctgtgcacagaagaccttctgtttcatcttttgatgtatag