GENSCAN 1.0 Date run: 8-Nov-116 Time: 15:40:46 Sequence gi568815597f:230542607_230793390 : 250784 bp : 43.88% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.09 PlyA - 1457 1452 6 1.05 1.08 Term - 7008 6920 89 1 2 78 52 42 0.099 -2.58 1.07 Intr - 8676 8530 147 1 0 69 96 63 0.194 5.41 1.06 Intr - 25586 25531 56 1 2 105 57 46 0.171 1.82 1.05 Intr - 35336 34992 345 1 0 14 37 807 0.203 62.71 1.04 Intr - 35994 35954 41 0 2 41 80 24 0.295 -6.08 1.03 Intr - 36782 36676 107 0 2 37 110 74 0.073 4.43 1.02 Intr - 69819 69590 230 2 2 90 42 123 0.250 5.41 1.01 Init - 82395 82364 32 0 2 105 73 40 0.214 3.44 1.00 Prom - 84636 84597 40 -2.16 2.00 Prom + 95661 95700 40 -0.76 2.01 Init + 100001 100072 72 1 0 91 91 119 0.008 13.57 2.02 Intr + 116858 117019 162 1 0 118 25 208 0.695 17.67 2.03 Intr + 118152 118217 66 2 0 62 119 90 0.713 8.60 2.04 Intr + 120535 120615 81 0 0 62 69 83 0.900 3.63 2.05 Intr + 121878 121981 104 2 2 97 111 -23 0.919 -0.13 2.06 Intr + 126070 126178 109 1 1 71 131 27 0.915 5.59 2.07 Intr + 126750 126929 180 2 0 80 58 153 0.886 11.56 2.08 Intr + 128910 129034 125 2 2 98 91 68 0.967 7.48 2.09 Intr + 132392 132518 127 2 1 77 55 40 0.211 0.28 2.10 Intr + 136307 136446 140 1 2 95 81 5 0.178 -0.24 2.11 Intr + 140968 141029 62 1 2 91 93 48 0.937 3.98 2.12 Intr + 142479 142630 152 1 2 105 94 22 0.962 4.38 2.13 Intr + 144329 144526 198 1 0 33 115 166 0.971 13.35 2.14 Intr + 147408 147547 140 2 2 81 97 72 0.969 6.66 2.15 Intr + 148778 148958 181 2 1 65 91 165 0.982 14.37 2.16 Intr + 152626 152741 116 0 2 78 82 49 0.528 2.55 2.17 Term + 159022 159091 70 1 1 76 54 62 0.047 -0.99 2.18 PlyA + 159140 159145 6 1.05 3.05 PlyA - 159352 159347 6 1.05 3.04 Term - 160723 160535 189 1 0 119 54 138 0.956 10.95 3.03 Intr - 161731 161587 145 0 1 93 87 175 0.781 18.18 3.02 Intr - 163594 163327 268 2 1 97 74 333 0.985 29.39 3.01 Init - 168217 167389 829 1 1 67 116 1011 0.998 94.68 3.00 Prom - 181628 181589 40 -2.06 4.00 Prom + 188503 188542 40 -4.56 4.01 Init + 204891 205704 814 2 1 99 33 190 0.235 9.65 4.02 Intr + 208454 208590 137 0 2 61 101 61 0.611 4.99 4.03 Intr + 212731 212800 70 0 1 99 92 51 0.654 5.35 4.04 Intr + 216906 217024 119 1 2 75 110 71 0.822 8.18 4.05 Intr + 220047 220180 134 2 2 45 89 219 0.975 17.14 4.06 Intr + 223835 223975 141 2 0 69 45 71 0.474 0.37 4.07 Intr + 224935 225103 169 1 1 127 95 213 0.999 25.85 4.08 Intr + 226574 226657 84 1 0 88 96 58 0.988 6.62 4.09 Intr + 229408 229493 86 0 2 118 82 108 0.993 11.82 4.10 Intr + 231414 231508 95 0 2 85 50 30 0.522 -1.49 4.11 Intr + 231948 232083 136 1 1 77 60 76 0.549 3.33 4.12 Intr + 235988 236053 66 2 0 80 61 55 0.393 0.02 4.13 Intr + 236367 236527 161 0 2 96 69 173 0.757 15.83 4.14 Intr + 237573 237730 158 1 2 32 92 109 0.943 5.43 4.15 Intr + 237894 238102 209 2 2 92 117 211 0.992 22.28 4.16 Intr + 243375 243411 37 0 1 99 105 31 0.941 4.16 4.17 Intr + 244916 244996 81 1 0 43 75 77 0.400 1.73 4.18 Intr + 247526 247583 58 1 1 66 89 36 0.310 -0.04 4.19 Intr + 249746 249888 143 0 2 96 34 146 0.981 10.07 4.20 Intr + 250244 250364 121 1 1 77 84 115 0.603 10.07 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 40404 40500 97 2 1 69 98 106 0.967 10.17 S.002 Init - 100107 99777 331 0 1 73 65 282 0.815 20.26 S.003 Init + 172163 172217 55 1 1 62 115 31 0.871 4.66 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597f:230542607_230793390|GENSCAN_predicted_peptide_1|348_aa MGTQGDSHKARSPAIMNDTVTIWTRELMTNQLLQRKQMVIDILNTGKATVPKTEILEKLV KIYKTTLDVIFVFGFRTYFGGGKIADFDGFEGVNIYRGKAGGRGKRENMVTVQNPQVARE KEQEGFQDPVYSFFMFKTRPCLEKRKKKEKKKEKGKGKGKKEGKGKGEEKGEEEEEEKKD EEEEEDEEEEEEEKKEEEGEEEKEEEEEEEEEKKEEEEEKKEEEGEEEEEEEKKKEEGEE EEEEEKKKEEEGQQELQQLPPPHLFSLWWPAILLKDAHPDCSTIHNSKDVEPTQIPINDR LDKENVGHIHHGILCSHKKEQGIPTAAMNAPALSISPPGSVHSIRVLF >gi568815597f:230542607_230793390|GENSCAN_predicted_CDS_1|1047_bp atggggacccagggagactcgcacaaggcgagatcacctgccatcatgaatgacacagta actatctggaccagggaactcatgaccaaccaactacttcaaaggaaacaaatggtcata gatatccttaacactgggaaggcaacagtaccaaagacagaaattctggaaaaattagtc aaaatatacaagaccacactggatgtcatctttgtatttggattcagaacttattttggt ggtggcaagatagctgactttgatggttttgagggcgtcaatatttacaggggaaaagca ggtggaagaggaaagagggagaatatggtcacagtacagaacccacaagttgcaagagaa aaggagcaggaaggtttccaggaccctgtgtacagctttttcatgttcaaaacaagacct tgtctggagaagaggaagaagaaggagaagaagaaggagaaggggaaggggaaggggaag aaggaggggaaggggaagggggaggagaagggggaggaggaggaagaggagaagaaggac gaggaggaggaagaagatgaggaggaggaagaagaggagaagaaggaggaggagggggag gaagagaaggaggaggaagaagaggaggaggaggagaagaaggaggaggaggaggagaag aaggaggaggaaggggaggaagaggaggaggaggagaagaagaaggaggaaggggaggaa gaggaggaggaggagaagaagaaggaggaggaggggcagcaggagctgcagcagctgcca ccaccacacctcttcagcctctggtggccagcaatcctcctgaaagatgctcatcctgat tgcagcactattcacaatagcaaagacgtggaaccaacccaaattcccatcaatgataga ctggataaagaaaatgtgggacatatacaccatggaatactatgcagccataaaaaggag cagggcatcccgacagcagctatgaatgcacctgctctgagcatctccccgccaggctct gtgcactccatcagggtcctcttctga >gi568815597f:230542607_230793390|GENSCAN_predicted_peptide_2|694_aa MEKSRMNLPKGPDTLCFDKDEFMKEDFDVDHFVSDCRKRVQLEELRDDLELYYKLLKTAM VELINKDYADFVNLSTNLVGMDKALNQLSVPLGQLREEVLSLRSSVSEGIRAVDERMSKQ EDIRKKKMCVLRLIQVIRSVEKIEKILNSQSSKETSALEASSPLLTGQILERIATEFNQL QFHAVQSKGMPLLDKVRPRIAGITAMLQQSLEGLLLEGLQTSDVDIIRHCLRTYATIDKT RDAEALVGQVLVKPYIDEVIIEQFVESHPNGLQVMYNKLLEFVPHHCRLLREVTGGAISS EKGNTVPGYDFLVNSVWPQIVQGLEEKLPSLFNPGNPDAFHEKYTISMDFVRRLERQCGS QASVKRLRAHPAYHSFNKKWNLPVYFQIRFREIAGSLEAALTDVLEDAPAESPYCLLASH RTWSSLRRCWSDEMFLPLLVHRLWRLTLQILARYSVFVNELSLRPISNESPKEIKKPLVT GSKEPSITQGNTEDQGSGPSETKPVVSISRTQLVYVVADLDKLQEQEVPTTASSYVDSAL KPLFQLQSGHKDKLKQAIIQQWLEGTLSESTHKYYETVSDVLNSVKKMEESLKRLKQARK TTPANPVGPSGGMSDDDKIRLQLALDVEYLGEQLFPVTKANLKPSQEGSRGEHRSSLAKP IQCKATSVATWSSSEKLGILQPFRRVHVCQLYPL >gi568815597f:230542607_230793390|GENSCAN_predicted_CDS_2|2085_bp atggagaaaagtaggatgaacctgcccaaggggccggacacgctctgcttcgacaaggac gagttcatgaaggaagatttcgatgtcgatcattttgtgtctgactgtaggaagcgggtc cagctggaagaactgagagatgacctggagctctactataaacttcttaaaacagccatg gtcgaactcatcaacaaggattatgcagattttgtcaatctttcaacaaacttggttggc atggacaaagccctcaaccagctttctgtgcctttgggacaattacgagaagaggttctg agccttagatcgtctgtcagtgaaggaattcgggcagttgatgaacgaatgtctaaacaa gaggacattaggaaaaaaaagatgtgtgtattgaggcttatacaagttattcggtcagtt gagaaaattgaaaaaatcttaaactctcaaagttctaaagaaacctctgcactagaagca agcagcccccttttgactggacaaattttggagagaattgccacagaatttaatcagtta cagtttcatgctgttcaaagcaaaggcatgcctcttttggacaaagtaagaccgcgtata gctggcattacagccatgttacagcagtcactggaaggtctcctattagaaggccttcag acgtctgacgtcgatataatacggcactgcttgcggacttacgccacgattgacaagaca cgggacgcggaggccttagttggccaagtactagtgaaaccatacatagacgaggtgatt atagagcagtttgttgaatctcatcccaatggccttcaggtcatgtataataaactcctg gagtttgttcctcaccattgccgccttcttcgagaagtcacaggaggtgccatctccagt gaaaaaggcaatactgttcctggatatgactttttggtgaattctgtttggccacaaata gtacaaggattagaagaaaagttaccctcgctttttaatcctgggaatcccgatgcattt catgagaaatataccataagtatggattttgtcagaagattggaacggcagtgtggatca caggctagtgtaaagagattaagagcccatcctgcctatcacagcttcaataagaagtgg aacttgcctgtttattttcaaataagatttagagaaatagcgggatccttagaagcagca cttacagatgtcctggaagatgccccagctgaaagtccgtattgccttttggcttctcat agaacttggagcagccttaggaggtgttggtcagatgagatgttcttgccattactggtg catcgcctgtggagactcactctgcagattttggcacgatactctgtgtttgtcaatgag ctttcactcaggcccatttctaatgaaagtcccaaggagatcaagaaacctttggtaact ggtagcaaagaaccttccatcacccaaggaaacactgaagaccaaggaagtggtccttcg gaaacaaagcctgtggtttccatttcccgcactcagctcgtgtatgtggttgcagacctg gacaagcttcaggagcaggaggtcccaaccacagcttcctcctatgtggacagtgctctg aagcccttattccagcttcagagcggacacaaggataagctcaaacaagcaataattcag cagtggctagaaggcactctcagtgaaagcactcataagtactatgaaaccgtgtcagat gtattaaactctgtgaagaagatggaagagagcctgaaaaggctgaaacaagccagaaaa accactcccgccaaccccgtcggtcccagtggtggcatgagcgacgacgacaaaatcagg ctgcagttggccctagatgttgagtacttgggagagcagctgtttcctgtgaccaaggcc aacctcaaaccatcccaggaagggagtcgtggggaacacaggtccagcctcgctaagccg atccagtgcaaagctacctcagtagccacatggagttcttccgagaaacttggcatcttg cagccgtttcggagagtccatgtgtgccagctctacccgctttga >gi568815597f:230542607_230793390|GENSCAN_predicted_peptide_3|476_aa MAPAGVSLRATILCLLAWAGLAAGDRVYIHPFHLVIHNESTCEQLAKANAGKPKDPTFIP APIQAKTSPVDEKALQDQLVLVAAKLDTEDKLRAAMVGMLANFLGFRIYGMHSELWGVVH GATVLSPTAVFGTLASLYLGALDHTADRLQAILGVPWKDKNCTSRLDAHKVLSALQAVQG LLVAQGRADSQAQLLLSTVVGVFTAPGLHLKQPFVQGLALYTPVVLPRSLDFTELDVAAE KIDRFMQAVTGWKTGCSLMGASVDSTLAFNTYVHFQGKMKGFSLLAEPQEFWVDNSTSVS VPMLSGMGTFQHWSDIQDNFSVTQVPFTESACLLLIQPHYASDLDKVEGLTFQQNSLNWM KKLSPRTIHLTMPQLVLQGSYDLQDLLAQAELPAILHTELNLQKLSNDRIRVGEVLNSIF FELEADEREPTESTQQLNKPEVLEVTLNRPFLFAVYDQSATALHFLGRVANPLSTA >gi568815597f:230542607_230793390|GENSCAN_predicted_CDS_3|1431_bp atggctcctgccggtgtgagcctgagggccaccatcctctgcctcctggcctgggctggc ctggctgcaggtgaccgggtgtacatacaccccttccacctcgtcatccacaatgagagt acctgtgagcagctggcaaaggccaatgccgggaagcccaaagaccccaccttcatacct gctccaattcaggccaagacatcccctgtggatgaaaaggccctacaggaccagctggtg ctagtcgctgcaaaacttgacaccgaagacaagttgagggccgcaatggtcgggatgctg gccaacttcttgggcttccgtatatatggcatgcacagtgagctatggggcgtggtccat ggggccaccgtcctctccccaacggctgtctttggcaccctggcctctctctatctggga gccttggaccacacagctgacaggctacaggcaatcctgggtgttccttggaaggacaag aactgcacctcccggctggatgcgcacaaggtcctgtctgccctgcaggctgtacagggc ctgctagtggcccagggcagggctgatagccaggcccagctgctgctgtccacggtggtg ggcgtgttcacagccccaggcctgcacctgaagcagccgtttgtgcagggcctggctctc tatacccctgtggtcctcccacgctctctggacttcacagaactggatgttgctgctgag aagattgacaggttcatgcaggctgtgacaggatggaagactggctgctccctgatggga gccagtgtggacagcaccctggctttcaacacctacgtccacttccaagggaagatgaag ggcttctccctgctggccgagccccaggagttctgggtggacaacagcacctcagtgtct gttcccatgctctctggcatgggcaccttccagcactggagtgacatccaggacaacttc tcggtgactcaagtgcccttcactgagagcgcctgcctgctgctgatccagcctcactat gcctctgacctggacaaggtggagggtctcactttccagcaaaactccctcaactggatg aagaaactatctccccggaccatccacctgaccatgccccaactggtgctgcaaggatct tatgacctgcaggacctgctcgcccaggctgagctgcccgccattctgcacaccgagctg aacctgcaaaaattgagcaatgaccgcatcagggtgggggaggtgctgaacagcattttt tttgagcttgaagcggatgagagagagcccacagagtctacccaacagcttaacaagcct gaggtcttggaggtgaccctgaaccgcccattcctgtttgctgtgtatgatcaaagcgcc actgccctgcacttcctgggccgcgtggccaacccgctgagcacagcatga >gi568815597f:230542607_230793390|GENSCAN_predicted_peptide_4|1007_aa MPYLYRAPGPQAHPVPKDARITHSSGQSFEQMRQECLQRGTLFEDADFPASNSSLFYSER PQIPFVWKRPGVSGASRGRSIDEAEVQQPPQEVETGVLGRGRLRSVQLRCIIPVYRGKLC SLEAENNVQAWDALRWGPGASGCTRSPAPSCLTPTEAEDLESEWVLSAGDRHESKPYPQG QAPTLCVVTLRSRYAQCSVASGPELRVLSVDGCLAQQGLSNRVTDGIQGKQESSPENHAP PIPRDQGEQCPTSDSLEVSSQEAPAGHWVPRGNSPPPILSVNPGHLPESAHTPLAHVNNP CGRPVHSVACDHSLSCTEIVKNPEFILGGATRTDICQGELGDCWLLAAIASLTLNQKALA RVIPQDQSFGPGYAGIFHFQFWQHSEWLDVVIDDRLPTFRDRLVFLHSADHNEFWSALLE KAYANPSLTRRKNIRQILILQYACPGLLKTVKVIKNKDSLINCHSHKKPKDMLNGSYEAL KGGSAIEAMEDFTGGVAETFQTKEAPENFYEILEKALKRGSLLGCFIDTRSAAESEARTP FGLIKGHAYSVTGIDQVSFRGQRIELIRIRNPWGQVEWNGSWSDRTGTPSSQNPTWLASE VTYKRCRVQTTSSMGRFLRSGVLLVQLSRSVCVTLLWMMGNSGTVLVPVLTADTSKSHGP GAPPESCPAQDLVVQVSTQNVPHQMAFKDFKAHFDKVEICNLTPDALEEDAIHKWEVTVH QGSWVRGSTAGGCRNFLDTFWTNPQIKLSLTEKDEGQEECSFLVALMQKDRRKLKRFGAN VLTIGYAIYECPDKDEHLNKDFFRYHASRARSKTFINLREVSDRFKLPPGEYILIPSTFE PHQEADFCLRIFSEKKAITRDMDGNVDIDLPEPPKPTPPDQETEEEQRFRALFEQVAGED MEVTAEELEYVLNAVLQKIPELRSHEEPQVETLLMSPLTHMAQTSGNGKLEFDEFKVFWD KLKQWINLFLRFDADKSGTMSTYELRTALKAAGKEKTGVQVADCMPX >gi568815597f:230542607_230793390|GENSCAN_predicted_CDS_4|3021_bp atgccttacctctaccgggccccagggcctcaggcacacccggttcccaaggacgcccgg atcacccactcctcaggccagagctttgagcaaatgaggcaggagtgcctgcagagaggc accctgtttgaggatgcagacttcccagccagcaattcctccctgttctacagtgagagg ccgcagatcccctttgtgtggaaacgaccaggggtgagtggggcgagcaggggaaggagc atagatgaggccgaggttcagcagcccccgcaggaagtggaaacaggggtgctggggagg ggccggctacgctcagtgcaactgaggtgcatcatcccagtctaccgtggaaagctttgc agtttggaggcagaaaataatgtacaggcctgggatgctctgcggtggggacccggggca tctggttgtacccgctccccggctccaagctgtctgactcccaccgaggctgaggacctg gagtcagagtgggtcttgtcagcaggtgataggcacgaatcaaaaccttatccacaagga caggcccccaccctgtgtgtggtaacactgagatcacgttatgcacagtgttctgtggct tctggcccagagctgagggtgctcagtgtggatggatgtttggctcagcaaggtctcagc aaccgggtgacagatggaatacaagggaagcaggaatcttctcccgagaaccatgcaccc cccattcccagagatcagggtgaacagtgccccacctctgacagtctggaggtgtcgtcc caagaggcaccagctggtcactgggtcccaagaggcaattcccccccaccaattctcagt gtaaatccaggacacctccccgaaagtgcccacacccctctagcccatgtgaacaacccc tgtggcagacctgtgcacagcgtggcatgtgaccacagcctctcctgcacggaaatcgtg aaaaacccagaattcattcttggaggggccaccaggactgatatctgccagggagagctg ggagactgctggctattagccgccatcgcctcccttacgcttaatcaaaaagcactggcc agagtcatcccccaggaccaaagctttggccctggttatgccgggatattccatttccag ttctggcagcacagtgagtggctggacgtggtgatcgatgaccgcctgcccaccttcagg gaccgcttggttttcctccactctgccgaccacaacgagttctggagcgccttgctggaa aaagcctacgccaaccctagtcttaccaggagaaaaaacattagacaaattctaatcctg cagtatgcctgcccaggactcctcaaaaccgtcaaggtcatcaaaaacaaggacagtctt ataaactgtcacagccacaaaaagcctaaggatatgctaaatgggagctatgaagctctg aagggaggcagcgccatcgaggccatggaagacttcactgggggtgtggcagagaccttc caaactaaagaggcccccgagaacttctatgagattctagagaaggctttgaagagaggc tccctgctgggctgcttcattgataccagaagtgctgcagaatctgaggcccggacgccg tttggtcttattaagggtcatgcctacagtgtaacgggaattgaccaggtaagcttccga ggccagagaatcgagctcatccgaatccggaacccttggggccaggttgagtggaacggg tcgtggagcgacaggacgggcacacctagctcacagaaccccacctggcttgccagcgag gtcacatacaaaaggtgccgggtacagacaacatcaagcatgggcaggtttctccggagt ggcgttctgttggtccagctgagcagaagcgtctgtgtcacactgctctggatgatgggg aattctggtaccgtgcttgttcctgtgttaactgcagatacgagcaagtcccatggccct ggggcacccccagagagctgccctgcccaggacctggtagttcaggtctccactcagaat gtgccccatcagatggcatttaaggacttcaaggcccactttgataaagtggagatctgc aacctcactcccgatgccctggaggaagacgcgatccacaaatgggaggtgacggtccat cagggaagctgggttcgcggctccacggctgggggctgccgcaatttcctggataccttt tggaccaatccacaaataaaattgtctctgactgagaaagatgaggggcaggaggagtgt agtttccttgtagccctgatgcagaaagatagaaggaaactcaagagatttggtgccaat gtgctgacaatcggctatgccatttatgagtgccctgacaaagacgaacacctgaacaaa gacttcttcagataccacgcttctcgggccagaagcaagacgttcatcaacctgagagaa gtctccgaccggttcaagctgccccctggggagtacatcctgattcccagcacttttgag ccccaccaggaagctgatttctgtctgagaatcttttcagagaaaaaagccattacccgg gatatggatggaaatgtagacattgaccttcctgagcctccaaagccaactccacctgac caggagacagaggaggagcagcggtttcgggctctgtttgaacaagtcgctggtgaggac atggaggtgacagcagaggaacttgagtatgttttaaatgctgtgctgcaaaagatccca gagctgaggtcccacgaggagcctcaggttgaaacactgctcatgtctcccttaacccac atggcacagaccagcggcaatgggaagctggagtttgatgaattcaaagtgttctgggac aagctgaagcagtggattaaccttttccttcggtttgatgctgacaagtccggcaccatg tctacctatgaactacggactgcactgaaagctgcaggtaaagaaaagactggagtacag gtggctgactgcatgccagnn