GENSCAN 1.0 Date run: 5-Nov-116 Time: 06:24:21 Sequence gi568815597r:230603144_230810850 : 207707 bp : 44.54% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.04 PlyA - 121 116 6 1.05 1.03 Term - 7540 7530 11 2 2 126 44 1 0.514 -2.14 1.02 Intr - 9282 9053 230 2 2 90 42 123 0.220 5.41 1.01 Init - 21858 21827 32 0 2 105 73 40 0.205 3.44 1.00 Prom - 24099 24060 40 -2.16 2.00 Prom + 35124 35163 40 -0.76 2.01 Init + 39464 39535 72 1 0 91 91 119 0.008 13.57 2.02 Intr + 56321 56482 162 1 0 118 25 208 0.695 17.67 2.03 Intr + 57615 57680 66 2 0 62 119 90 0.713 8.60 2.04 Intr + 59998 60078 81 0 0 62 69 83 0.900 3.63 2.05 Intr + 61341 61444 104 2 2 97 111 -23 0.919 -0.13 2.06 Intr + 65533 65641 109 1 1 71 131 27 0.915 5.59 2.07 Intr + 66213 66392 180 2 0 80 58 153 0.886 11.56 2.08 Intr + 68373 68497 125 2 2 98 91 68 0.967 7.48 2.09 Intr + 71855 71981 127 2 1 77 55 40 0.211 0.28 2.10 Intr + 75770 75909 140 1 2 95 81 5 0.178 -0.24 2.11 Intr + 80431 80492 62 1 2 91 93 48 0.937 3.98 2.12 Intr + 81942 82093 152 1 2 105 94 22 0.962 4.38 2.13 Intr + 83792 83989 198 1 0 33 115 166 0.971 13.35 2.14 Intr + 86871 87010 140 2 2 81 97 72 0.969 6.66 2.15 Intr + 88241 88421 181 2 1 65 91 165 0.982 14.37 2.16 Intr + 92089 92204 116 0 2 78 82 49 0.528 2.55 2.17 Term + 98485 98554 70 1 1 76 54 62 0.047 -0.99 2.18 PlyA + 98603 98608 6 1.05 3.05 PlyA - 98815 98810 6 1.05 3.04 Term - 100186 99998 189 1 0 119 54 138 0.956 10.95 3.03 Intr - 101194 101050 145 0 1 93 87 175 0.781 18.18 3.02 Intr - 103057 102790 268 2 1 97 74 333 0.985 29.39 3.01 Init - 107680 106852 829 1 1 67 116 1011 0.998 94.68 3.00 Prom - 121091 121052 40 -2.06 4.00 Prom + 127966 128005 40 -4.56 4.01 Init + 144354 145167 814 2 1 99 33 190 0.235 9.65 4.02 Intr + 147917 148053 137 0 2 61 101 61 0.611 4.99 4.03 Intr + 152194 152263 70 0 1 99 92 51 0.654 5.35 4.04 Intr + 156369 156487 119 1 2 75 110 71 0.822 8.18 4.05 Intr + 159510 159643 134 2 2 45 89 219 0.975 17.14 4.06 Intr + 163298 163438 141 2 0 69 45 71 0.474 0.37 4.07 Intr + 164398 164566 169 1 1 127 95 213 0.999 25.85 4.08 Intr + 166037 166120 84 1 0 88 96 58 0.988 6.62 4.09 Intr + 168871 168956 86 0 2 118 82 108 0.993 11.82 4.10 Intr + 170877 170971 95 0 2 85 50 30 0.522 -1.49 4.11 Intr + 171411 171546 136 1 1 77 60 76 0.549 3.33 4.12 Intr + 175451 175516 66 2 0 80 61 55 0.393 0.02 4.13 Intr + 175830 175990 161 0 2 96 69 173 0.757 15.83 4.14 Intr + 177036 177193 158 1 2 32 92 109 0.943 5.43 4.15 Intr + 177357 177565 209 2 2 92 117 211 0.992 22.28 4.16 Intr + 182838 182874 37 0 1 99 105 31 0.941 4.16 4.17 Intr + 184379 184459 81 1 0 43 75 77 0.400 1.73 4.18 Intr + 186989 187046 58 1 1 66 89 36 0.310 -0.04 4.19 Intr + 189209 189351 143 0 2 96 34 146 0.981 10.07 4.20 Intr + 189707 189827 121 1 1 77 84 115 0.749 10.07 4.21 Intr + 192020 192136 117 0 0 109 97 251 0.979 28.54 4.22 Intr + 194810 194968 159 0 0 63 31 132 0.758 4.96 4.23 Term + 203958 204154 197 1 2 96 41 77 0.052 1.37 4.24 PlyA + 205356 205361 6 -0.45 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 39570 39240 331 0 1 73 65 282 0.815 20.26 S.002 Init + 111626 111680 55 1 1 62 115 31 0.871 4.66 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597r:230603144_230810850|GENSCAN_predicted_peptide_1|90_aa MGTQGDSHKARSPAIMNDTVTIWTRELMTNQLLQRKQMVIDILNTGKATVPKTEILEKLV KIYKTTLDVIFVFGFRTYFGGGKIADFGAA >gi568815597r:230603144_230810850|GENSCAN_predicted_CDS_1|273_bp atggggacccagggagactcgcacaaggcgagatcacctgccatcatgaatgacacagta actatctggaccagggaactcatgaccaaccaactacttcaaaggaaacaaatggtcata gatatccttaacactgggaaggcaacagtaccaaagacagaaattctggaaaaattagtc aaaatatacaagaccacactggatgtcatctttgtatttggattcagaacttattttggt ggtggcaagatagctgactttggtgcagcctga >gi568815597r:230603144_230810850|GENSCAN_predicted_peptide_2|694_aa MEKSRMNLPKGPDTLCFDKDEFMKEDFDVDHFVSDCRKRVQLEELRDDLELYYKLLKTAM VELINKDYADFVNLSTNLVGMDKALNQLSVPLGQLREEVLSLRSSVSEGIRAVDERMSKQ EDIRKKKMCVLRLIQVIRSVEKIEKILNSQSSKETSALEASSPLLTGQILERIATEFNQL QFHAVQSKGMPLLDKVRPRIAGITAMLQQSLEGLLLEGLQTSDVDIIRHCLRTYATIDKT RDAEALVGQVLVKPYIDEVIIEQFVESHPNGLQVMYNKLLEFVPHHCRLLREVTGGAISS EKGNTVPGYDFLVNSVWPQIVQGLEEKLPSLFNPGNPDAFHEKYTISMDFVRRLERQCGS QASVKRLRAHPAYHSFNKKWNLPVYFQIRFREIAGSLEAALTDVLEDAPAESPYCLLASH RTWSSLRRCWSDEMFLPLLVHRLWRLTLQILARYSVFVNELSLRPISNESPKEIKKPLVT GSKEPSITQGNTEDQGSGPSETKPVVSISRTQLVYVVADLDKLQEQEVPTTASSYVDSAL KPLFQLQSGHKDKLKQAIIQQWLEGTLSESTHKYYETVSDVLNSVKKMEESLKRLKQARK TTPANPVGPSGGMSDDDKIRLQLALDVEYLGEQLFPVTKANLKPSQEGSRGEHRSSLAKP IQCKATSVATWSSSEKLGILQPFRRVHVCQLYPL >gi568815597r:230603144_230810850|GENSCAN_predicted_CDS_2|2085_bp atggagaaaagtaggatgaacctgcccaaggggccggacacgctctgcttcgacaaggac gagttcatgaaggaagatttcgatgtcgatcattttgtgtctgactgtaggaagcgggtc cagctggaagaactgagagatgacctggagctctactataaacttcttaaaacagccatg gtcgaactcatcaacaaggattatgcagattttgtcaatctttcaacaaacttggttggc atggacaaagccctcaaccagctttctgtgcctttgggacaattacgagaagaggttctg agccttagatcgtctgtcagtgaaggaattcgggcagttgatgaacgaatgtctaaacaa gaggacattaggaaaaaaaagatgtgtgtattgaggcttatacaagttattcggtcagtt gagaaaattgaaaaaatcttaaactctcaaagttctaaagaaacctctgcactagaagca agcagcccccttttgactggacaaattttggagagaattgccacagaatttaatcagtta cagtttcatgctgttcaaagcaaaggcatgcctcttttggacaaagtaagaccgcgtata gctggcattacagccatgttacagcagtcactggaaggtctcctattagaaggccttcag acgtctgacgtcgatataatacggcactgcttgcggacttacgccacgattgacaagaca cgggacgcggaggccttagttggccaagtactagtgaaaccatacatagacgaggtgatt atagagcagtttgttgaatctcatcccaatggccttcaggtcatgtataataaactcctg gagtttgttcctcaccattgccgccttcttcgagaagtcacaggaggtgccatctccagt gaaaaaggcaatactgttcctggatatgactttttggtgaattctgtttggccacaaata gtacaaggattagaagaaaagttaccctcgctttttaatcctgggaatcccgatgcattt catgagaaatataccataagtatggattttgtcagaagattggaacggcagtgtggatca caggctagtgtaaagagattaagagcccatcctgcctatcacagcttcaataagaagtgg aacttgcctgtttattttcaaataagatttagagaaatagcgggatccttagaagcagca cttacagatgtcctggaagatgccccagctgaaagtccgtattgccttttggcttctcat agaacttggagcagccttaggaggtgttggtcagatgagatgttcttgccattactggtg catcgcctgtggagactcactctgcagattttggcacgatactctgtgtttgtcaatgag ctttcactcaggcccatttctaatgaaagtcccaaggagatcaagaaacctttggtaact ggtagcaaagaaccttccatcacccaaggaaacactgaagaccaaggaagtggtccttcg gaaacaaagcctgtggtttccatttcccgcactcagctcgtgtatgtggttgcagacctg gacaagcttcaggagcaggaggtcccaaccacagcttcctcctatgtggacagtgctctg aagcccttattccagcttcagagcggacacaaggataagctcaaacaagcaataattcag cagtggctagaaggcactctcagtgaaagcactcataagtactatgaaaccgtgtcagat gtattaaactctgtgaagaagatggaagagagcctgaaaaggctgaaacaagccagaaaa accactcccgccaaccccgtcggtcccagtggtggcatgagcgacgacgacaaaatcagg ctgcagttggccctagatgttgagtacttgggagagcagctgtttcctgtgaccaaggcc aacctcaaaccatcccaggaagggagtcgtggggaacacaggtccagcctcgctaagccg atccagtgcaaagctacctcagtagccacatggagttcttccgagaaacttggcatcttg cagccgtttcggagagtccatgtgtgccagctctacccgctttga >gi568815597r:230603144_230810850|GENSCAN_predicted_peptide_3|476_aa MAPAGVSLRATILCLLAWAGLAAGDRVYIHPFHLVIHNESTCEQLAKANAGKPKDPTFIP APIQAKTSPVDEKALQDQLVLVAAKLDTEDKLRAAMVGMLANFLGFRIYGMHSELWGVVH GATVLSPTAVFGTLASLYLGALDHTADRLQAILGVPWKDKNCTSRLDAHKVLSALQAVQG LLVAQGRADSQAQLLLSTVVGVFTAPGLHLKQPFVQGLALYTPVVLPRSLDFTELDVAAE KIDRFMQAVTGWKTGCSLMGASVDSTLAFNTYVHFQGKMKGFSLLAEPQEFWVDNSTSVS VPMLSGMGTFQHWSDIQDNFSVTQVPFTESACLLLIQPHYASDLDKVEGLTFQQNSLNWM KKLSPRTIHLTMPQLVLQGSYDLQDLLAQAELPAILHTELNLQKLSNDRIRVGEVLNSIF FELEADEREPTESTQQLNKPEVLEVTLNRPFLFAVYDQSATALHFLGRVANPLSTA >gi568815597r:230603144_230810850|GENSCAN_predicted_CDS_3|1431_bp atggctcctgccggtgtgagcctgagggccaccatcctctgcctcctggcctgggctggc ctggctgcaggtgaccgggtgtacatacaccccttccacctcgtcatccacaatgagagt acctgtgagcagctggcaaaggccaatgccgggaagcccaaagaccccaccttcatacct gctccaattcaggccaagacatcccctgtggatgaaaaggccctacaggaccagctggtg ctagtcgctgcaaaacttgacaccgaagacaagttgagggccgcaatggtcgggatgctg gccaacttcttgggcttccgtatatatggcatgcacagtgagctatggggcgtggtccat ggggccaccgtcctctccccaacggctgtctttggcaccctggcctctctctatctggga gccttggaccacacagctgacaggctacaggcaatcctgggtgttccttggaaggacaag aactgcacctcccggctggatgcgcacaaggtcctgtctgccctgcaggctgtacagggc ctgctagtggcccagggcagggctgatagccaggcccagctgctgctgtccacggtggtg ggcgtgttcacagccccaggcctgcacctgaagcagccgtttgtgcagggcctggctctc tatacccctgtggtcctcccacgctctctggacttcacagaactggatgttgctgctgag aagattgacaggttcatgcaggctgtgacaggatggaagactggctgctccctgatggga gccagtgtggacagcaccctggctttcaacacctacgtccacttccaagggaagatgaag ggcttctccctgctggccgagccccaggagttctgggtggacaacagcacctcagtgtct gttcccatgctctctggcatgggcaccttccagcactggagtgacatccaggacaacttc tcggtgactcaagtgcccttcactgagagcgcctgcctgctgctgatccagcctcactat gcctctgacctggacaaggtggagggtctcactttccagcaaaactccctcaactggatg aagaaactatctccccggaccatccacctgaccatgccccaactggtgctgcaaggatct tatgacctgcaggacctgctcgcccaggctgagctgcccgccattctgcacaccgagctg aacctgcaaaaattgagcaatgaccgcatcagggtgggggaggtgctgaacagcattttt tttgagcttgaagcggatgagagagagcccacagagtctacccaacagcttaacaagcct gaggtcttggaggtgaccctgaaccgcccattcctgtttgctgtgtatgatcaaagcgcc actgccctgcacttcctgggccgcgtggccaacccgctgagcacagcatga >gi568815597r:230603144_230810850|GENSCAN_predicted_peptide_4|1163_aa MPYLYRAPGPQAHPVPKDARITHSSGQSFEQMRQECLQRGTLFEDADFPASNSSLFYSER PQIPFVWKRPGVSGASRGRSIDEAEVQQPPQEVETGVLGRGRLRSVQLRCIIPVYRGKLC SLEAENNVQAWDALRWGPGASGCTRSPAPSCLTPTEAEDLESEWVLSAGDRHESKPYPQG QAPTLCVVTLRSRYAQCSVASGPELRVLSVDGCLAQQGLSNRVTDGIQGKQESSPENHAP PIPRDQGEQCPTSDSLEVSSQEAPAGHWVPRGNSPPPILSVNPGHLPESAHTPLAHVNNP CGRPVHSVACDHSLSCTEIVKNPEFILGGATRTDICQGELGDCWLLAAIASLTLNQKALA RVIPQDQSFGPGYAGIFHFQFWQHSEWLDVVIDDRLPTFRDRLVFLHSADHNEFWSALLE KAYANPSLTRRKNIRQILILQYACPGLLKTVKVIKNKDSLINCHSHKKPKDMLNGSYEAL KGGSAIEAMEDFTGGVAETFQTKEAPENFYEILEKALKRGSLLGCFIDTRSAAESEARTP FGLIKGHAYSVTGIDQVSFRGQRIELIRIRNPWGQVEWNGSWSDRTGTPSSQNPTWLASE VTYKRCRVQTTSSMGRFLRSGVLLVQLSRSVCVTLLWMMGNSGTVLVPVLTADTSKSHGP GAPPESCPAQDLVVQVSTQNVPHQMAFKDFKAHFDKVEICNLTPDALEEDAIHKWEVTVH QGSWVRGSTAGGCRNFLDTFWTNPQIKLSLTEKDEGQEECSFLVALMQKDRRKLKRFGAN VLTIGYAIYECPDKDEHLNKDFFRYHASRARSKTFINLREVSDRFKLPPGEYILIPSTFE PHQEADFCLRIFSEKKAITRDMDGNVDIDLPEPPKPTPPDQETEEEQRFRALFEQVAGED MEVTAEELEYVLNAVLQKIPELRSHEEPQVETLLMSPLTHMAQTSGNGKLEFDEFKVFWD KLKQWINLFLRFDADKSGTMSTYELRTALKAAGKEKTGVQVADCMPGFQLSSHLLQLIVL RYADEELQLDFDDFLNCLVRLENASRLDASIRSEKFTFSPPLMRVPSPPLMKVLVYHQME QLPHRLHWKALWSLGNKLELRSALRGQGFRKPRACDLVGRGTRVIGASPIQCTGSLVETK HHSTESQDRLPSTSAKAASSPQL >gi568815597r:230603144_230810850|GENSCAN_predicted_CDS_4|3492_bp atgccttacctctaccgggccccagggcctcaggcacacccggttcccaaggacgcccgg atcacccactcctcaggccagagctttgagcaaatgaggcaggagtgcctgcagagaggc accctgtttgaggatgcagacttcccagccagcaattcctccctgttctacagtgagagg ccgcagatcccctttgtgtggaaacgaccaggggtgagtggggcgagcaggggaaggagc atagatgaggccgaggttcagcagcccccgcaggaagtggaaacaggggtgctggggagg ggccggctacgctcagtgcaactgaggtgcatcatcccagtctaccgtggaaagctttgc agtttggaggcagaaaataatgtacaggcctgggatgctctgcggtggggacccggggca tctggttgtacccgctccccggctccaagctgtctgactcccaccgaggctgaggacctg gagtcagagtgggtcttgtcagcaggtgataggcacgaatcaaaaccttatccacaagga caggcccccaccctgtgtgtggtaacactgagatcacgttatgcacagtgttctgtggct tctggcccagagctgagggtgctcagtgtggatggatgtttggctcagcaaggtctcagc aaccgggtgacagatggaatacaagggaagcaggaatcttctcccgagaaccatgcaccc cccattcccagagatcagggtgaacagtgccccacctctgacagtctggaggtgtcgtcc caagaggcaccagctggtcactgggtcccaagaggcaattcccccccaccaattctcagt gtaaatccaggacacctccccgaaagtgcccacacccctctagcccatgtgaacaacccc tgtggcagacctgtgcacagcgtggcatgtgaccacagcctctcctgcacggaaatcgtg aaaaacccagaattcattcttggaggggccaccaggactgatatctgccagggagagctg ggagactgctggctattagccgccatcgcctcccttacgcttaatcaaaaagcactggcc agagtcatcccccaggaccaaagctttggccctggttatgccgggatattccatttccag ttctggcagcacagtgagtggctggacgtggtgatcgatgaccgcctgcccaccttcagg gaccgcttggttttcctccactctgccgaccacaacgagttctggagcgccttgctggaa aaagcctacgccaaccctagtcttaccaggagaaaaaacattagacaaattctaatcctg cagtatgcctgcccaggactcctcaaaaccgtcaaggtcatcaaaaacaaggacagtctt ataaactgtcacagccacaaaaagcctaaggatatgctaaatgggagctatgaagctctg aagggaggcagcgccatcgaggccatggaagacttcactgggggtgtggcagagaccttc caaactaaagaggcccccgagaacttctatgagattctagagaaggctttgaagagaggc tccctgctgggctgcttcattgataccagaagtgctgcagaatctgaggcccggacgccg tttggtcttattaagggtcatgcctacagtgtaacgggaattgaccaggtaagcttccga ggccagagaatcgagctcatccgaatccggaacccttggggccaggttgagtggaacggg tcgtggagcgacaggacgggcacacctagctcacagaaccccacctggcttgccagcgag gtcacatacaaaaggtgccgggtacagacaacatcaagcatgggcaggtttctccggagt ggcgttctgttggtccagctgagcagaagcgtctgtgtcacactgctctggatgatgggg aattctggtaccgtgcttgttcctgtgttaactgcagatacgagcaagtcccatggccct ggggcacccccagagagctgccctgcccaggacctggtagttcaggtctccactcagaat gtgccccatcagatggcatttaaggacttcaaggcccactttgataaagtggagatctgc aacctcactcccgatgccctggaggaagacgcgatccacaaatgggaggtgacggtccat cagggaagctgggttcgcggctccacggctgggggctgccgcaatttcctggataccttt tggaccaatccacaaataaaattgtctctgactgagaaagatgaggggcaggaggagtgt agtttccttgtagccctgatgcagaaagatagaaggaaactcaagagatttggtgccaat gtgctgacaatcggctatgccatttatgagtgccctgacaaagacgaacacctgaacaaa gacttcttcagataccacgcttctcgggccagaagcaagacgttcatcaacctgagagaa gtctccgaccggttcaagctgccccctggggagtacatcctgattcccagcacttttgag ccccaccaggaagctgatttctgtctgagaatcttttcagagaaaaaagccattacccgg gatatggatggaaatgtagacattgaccttcctgagcctccaaagccaactccacctgac caggagacagaggaggagcagcggtttcgggctctgtttgaacaagtcgctggtgaggac atggaggtgacagcagaggaacttgagtatgttttaaatgctgtgctgcaaaagatccca gagctgaggtcccacgaggagcctcaggttgaaacactgctcatgtctcccttaacccac atggcacagaccagcggcaatgggaagctggagtttgatgaattcaaagtgttctgggac aagctgaagcagtggattaaccttttccttcggtttgatgctgacaagtccggcaccatg tctacctatgaactacggactgcactgaaagctgcaggtaaagaaaagactggagtacag gtggctgactgcatgccaggctttcagctgagcagccacctcctgcagctgattgtgctc aggtatgcggatgaggagctccagctggacttcgatgacttcctcaactgcctggtccgg ctggagaatgcgagccgactggatgcctccatccgttcagagaagttcaccttcagcccg cccttgatgagagtcccatcgccgcccttgatgaaagtccttgtttaccaccagatggag cagcttccccacaggctccactggaaggccctttggtcgctgggaaacaaacttgagcta cgcagtgcgctgaggggccagggcttcagaaagcccagagcatgtgacctggttggcaga ggtaccagggtcattggtgccagcccaattcagtgtactggatcattagtagaaacaaag caccattctactgagagccaagaccgcctgccaagtaccagtgcgaaggctgcctcgtca ccacagctgtga