GENSCAN 1.0 Date run: 3-Nov-116 Time: 21:32:49 Sequence gi568815586f:7939833_8152847 : 213015 bp : 43.44% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 10579 10628 50 0 2 100 45 90 0.735 4.32 1.02 Intr + 17158 17240 83 1 2 91 90 17 0.750 1.48 1.03 Term + 41620 41705 86 2 2 80 41 89 0.190 1.22 1.04 PlyA + 42338 42343 6 1.05 2.00 Prom + 61281 61320 40 -4.56 2.01 Init + 61704 61796 93 2 0 62 84 82 0.496 5.58 2.02 Intr + 70264 70379 116 0 2 43 60 161 0.568 8.05 2.03 Intr + 75176 75407 232 2 1 62 31 74 0.186 -3.12 2.04 Intr + 75458 75846 389 1 2 -34 36 455 0.351 21.69 2.05 Intr + 78947 79046 100 2 1 55 69 97 0.546 4.61 2.06 Intr + 79538 79599 62 1 2 71 86 20 0.282 -2.37 2.07 Intr + 86528 86622 95 2 2 32 38 148 0.103 3.81 2.08 Intr + 99987 100333 347 1 2 124 81 420 0.791 40.01 2.09 Intr + 102826 102900 75 0 0 127 116 71 0.999 13.51 2.10 Intr + 103869 103937 69 2 0 56 95 51 0.794 1.98 2.11 Intr + 104928 105126 199 2 1 85 113 117 0.999 12.82 2.12 Intr + 108050 108457 408 0 0 103 97 284 0.996 25.04 2.13 Intr + 108865 108966 102 2 0 98 92 39 0.978 5.45 2.14 Intr + 109530 109739 210 1 0 85 79 193 0.997 16.98 2.15 Intr + 110690 110788 99 0 0 99 110 16 0.969 4.98 2.16 Term + 112930 113018 89 2 2 98 39 127 0.950 6.62 2.17 PlyA + 113673 113678 6 1.05 3.02 PlyA - 114615 114610 6 1.05 3.01 Sngl - 120275 118905 1371 2 0 58 48 461 0.754 33.23 3.00 Prom - 122679 122640 40 -5.36 4.00 Prom + 124558 124597 40 -7.06 4.01 Init + 140117 140274 158 1 2 67 94 80 0.659 5.88 4.02 Intr + 141802 141931 130 1 1 44 56 80 0.496 1.10 4.03 Intr + 142292 142551 260 1 2 74 79 232 0.586 17.16 4.04 Intr + 150104 150204 101 2 2 110 89 66 0.999 8.65 4.05 Intr + 150363 150467 105 1 0 74 99 138 0.987 13.69 4.06 Intr + 151937 152018 82 0 1 70 89 85 0.991 5.50 4.07 Intr + 152844 152952 109 0 1 62 84 108 0.998 8.09 4.08 Term + 156210 156326 117 2 0 117 55 56 0.670 3.74 4.09 PlyA + 158015 158020 6 1.05 5.03 PlyA - 158043 158038 6 1.05 5.02 Term - 167878 167559 320 2 2 52 37 150 0.772 1.34 5.01 Init - 173495 173363 133 2 1 78 75 59 0.449 3.90 5.00 Prom - 185443 185404 40 -4.56 6.03 PlyA - 185835 185830 6 1.05 6.02 Term - 194490 193939 552 0 0 20 51 539 0.998 37.71 6.01 Init - 195118 194522 597 1 0 72 -95 420 0.555 17.38 6.00 Prom - 195185 195146 40 -8.46 7.00 Prom + 195490 195529 40 -8.76 7.01 Init + 195863 195904 42 1 0 72 80 30 0.921 1.02 7.02 Intr + 196956 197071 116 2 2 149 94 90 0.859 14.95 7.03 Term + 203835 203937 103 0 1 76 54 108 0.771 3.95 7.04 PlyA + 205133 205138 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586f:7939833_8152847|GENSCAN_predicted_peptide_1|72_aa MTRCILELLAQARSGLQCWEYTDKKDVQAGTVAHTYNPSSLGGREQRQGWKLENLGISKD YGTDCATDFAKT >gi568815586f:7939833_8152847|GENSCAN_predicted_CDS_1|219_bp atgactcgctgcatcctagagctcctggctcaagcgaggtcggggctgcaatgctgggaa tatacagataaaaaagatgttcaggccggcacagtggctcacacgtataatcctagcagt ttgggaggccgagaacaaagacaaggttggaagttagaaaacctgggtatcagcaaagac tatggtacagactgtgctacagacttcgcaaaaacttaa >gi568815586f:7939833_8152847|GENSCAN_predicted_peptide_2|894_aa MIGKLVTKKFEEDMQMDLSEWSKTVKIFVYHKLLPPPPAFGDHDPDQSAAINVEARPSTS KKIVTRSRLSRPAEMLLPLPTVFPQMRLLSRVLAPHLTRAYAKDVKFGADARALMLQGVD LLADAVAVTMGPKGRTVIIEQSWGSPQIQIRAKLVQDIANNTNEEAGNGTTSATVLARSI AKEGFKKISKGANPVEIRKGVMLAVDAVIAELRKQSKFVTTPEEIAQVATTSANGDKEIG NIISNAMKKVGRKGVITVKDGKTLNDELEIIESGWASRWDPLTASKSLGFDFEWSVNVVA KRDSVHRALPGKAPADSTKVFWRDCSPPTQGFQKASVHADFGGSKQETIDPVQEMTLFEG NEEPRSTMASDLESSLTSIDWLPQLTLRATIEKLGSASQAGPPGSSRKCSPGSPTDPNAT LSKDEAAVHQDGKPRYSYATLITYAINSSPAKKMTLSEIYRWICDNFPYYKNAGIGWKNS IRHNLSLNKCFRKVPRPRDDPGKGSYWTIDTCPDISRKRRHPPDDDGTGSVDGGAVAAGA SGRESAEGPPPLYNTNHDFKFSYSEINFQDLSWSFRNLYKSMLEKSSSSSQHGFSSLLGD IPPSNNYYMYQQQQPPPPQQQQQQQQPPQPPPQQSQPQQQQAPAQGPSAVGGAPPLHTPS TDGCTPPGGKQAGAEGYGPPPVMAMHPPPLQHGGYHPHQHHPHSHPAQQPPPPQPQAQGQ APINNTGFAFPSDWCSNIDSLKESFKMVNRLNWSSIEQSQFSELMESLRQAEQKNWTLDQ HHIANLCDSLNHFLTQTGHVPPQGGTHRPPAPARIADSCALTSGKQESAMSQVNSYGHPQ APHLYPGPSPMYPIPTQDSAGYNRPAHHMVPRPSVPPPGANEEIPDDFDWDLIT >gi568815586f:7939833_8152847|GENSCAN_predicted_CDS_2|2685_bp atgattggaaaattggtgacaaagaaatttgaggaagacatgcagatggacctctctgag tggtcaaaaactgtgaagatatttgtataccataaattgctaccaccacccccagccttc ggcgaccacgaccctgatcagtcagcagccatcaacgttgaggcaagaccctccaccagc aaaaagattgtgacacgctcaaggctcagccgccccgcagaaatgcttcttccgttaccc acagtctttccccagatgagactgctgtccagggtactggcccctcatctcactcgggct tatgccaaagatgtaaaatttggcgcagatgcccgagccttaatgcttcaaggtgtagac cttttagccgatgctgtagctgttacgatggggccaaagggaagaacagtgattattgag cagagctggggaagtccccaaatacaaattagagctaaacttgttcaagatattgctaat aacacaaatgaagaggctggaaatggcaccacctctgctaccgtactggcacgctctatt gccaaggagggcttcaagaagattagcaaaggtgctaatccagtggaaatcaggaaaggt gtgatgttggctgttgatgctgtaattgctgaacttagaaagcagtctaaatttgtgacc acccctgaagaaattgcacaggttgctacaacttctgcaaacggagacaaagaaattggc aatatcatctccaatgcaatgaaaaaggttggaagaaagggtgtcatcacagtaaaggat ggaaaaacactgaatgatgaattagaaattattgaaagtggctgggcgagccgctgggac ccactaactgcctcgaaaagcctaggattcgactttgaatggtccgttaatgtggtcgca aaacgtgactcggttcatcgggcgctccctggtaaggcccctgcagacagcacgaaggtg ttttggagagattgttctcccccgacccaaggattccagaaagccagtgtacacgcagac ttcggaggcagtaaacaggaaaccatcgatcccgtgcaggaaatgaccttgtttgaaggc aacgaagaacccagaagtaccatggcttctgacctagagagtagcctcacctccatagac tggctcccccagctgaccctccgagctaccattgagaagcttggaagtgcctcccaggct gggcctcccgggagcagccgcaagtgttcaccagggtcacccacagatcctaatgccacc ctgagcaaagacgaggcagcagtgcaccaggacggcaagccacgatacagctatgccact ctcatcacctatgccatcaactcctctccagccaagaagatgaccctcagcgagatttac cgctggatctgtgataacttcccctattacaagaatgctggcattggttggaagaattca atacggcacaacctttctctcaacaagtgtttccggaaggtgcccagacctcgggatgac cctgggaagggttcctattggacaattgacacctgccctgacatttcccgaaagagaaga caccctccagatgatgatggcacaggatctgtggatggtggagcagtggcagcaggggct tcaggccgagaaagtgctgagggtccccctcccctctataacaccaaccatgactttaaa ttctcctactcagagatcaactttcaggatctaagctggtccttccgcaacctctataag tccatgctggagaagtcctcttcctcctctcagcacggcttttcttctctcctgggggac atcccaccctcgaacaactactacatgtatcagcagcagcagccaccgccacctcaacag cagcagcagcagcagcagccgccacagccacctccccagcagtcccagccacagcagcag caggcacctgcccagggcccctcagctgtagggggtgctcctccactgcacaccccaagc acagatggttgtaccccaccagggggaaagcaagctggggcggaaggctatgggcctccc cctgtaatggccatgcatccacccccgctgcagcatggaggctaccaccctcatcagcac catccccactcccaccctgcccagcagccaccacctccacagccacaggcacaaggccag gctcccatcaacaacactggctttgcctttccttctgactggtgctctaatattgactct ttaaaggaaagcttcaagatggtgaatcggctcaattggtccagcattgagcagtcacaa ttctcagaactgatggagagtctacgacaggcagagcagaagaactggaccctcgaccag catcacattgccaatctgtgtgactccctcaaccacttccttactcagactggtcacgtg ccccctcaagggggtacccaccgcccaccagcccctgcccgtattgctgactcctgtgcc ctcaccagtggcaaacaggagtcagccatgagccaagtgaactcttatgggcacccacaa gctccccacctctaccctggcccatcaccaatgtacccaatccccacccaggactcagca ggatacaatcgcccagcacaccatatggtccctcggccatcagtgccacctcctggtgcc aatgaggagatccctgatgacttcgactgggacttgatcacttag >gi568815586f:7939833_8152847|GENSCAN_predicted_peptide_3|456_aa MVILSLTFLLGLPGNGLVLWVAGLKMQRTVNTIWFLHLTLADLLCCLSLPFSLAHLALQG QWPYGRFLCKLIPSIIVLNMFASVFLLTAISLDRCLVVFKPIWCQNHRNVGMACSICGCI WVVAFVMCIPVFVYREIFTTDNHNRCGYKFGLSSSLDYPDFYGDPLENRSLENIVQPPGE MNDRLDPSSFQTNDHPWTVPTVFQPQTFQRPSADSLPRGSARLTSQNLYSNVFKPADVVS PKIPSGFPIEDHETSPLDNSDAFLSTHLKLFPSASSNSFYESELPQGFQDYYNLGQFTDD DQVPTPLVAITITRLVVGFLLPSVIMIACYSFIVFRMQRGRFAKSQSKTFRVAVVVVAVF LVCWTPYHIFGVLSLLTDPETPLGKTLMSWDHVCIALASANSCFNPFLYALLGKDFRKKA RQSIQGILEAAFSEELTRSTHCPSNNVISERNSTTV >gi568815586f:7939833_8152847|GENSCAN_predicted_CDS_3|1371_bp atggtcattctcagccttacttttttactgggattgccaggcaatgggctggtgctgtgg gtggctggcctgaagatgcagcggacagtgaacacaatttggttcctccacctcaccttg gcggacctcctctgctgcctctccttgcccttctcgctggctcacttggctctccaggga cagtggccctacggcaggttcctatgcaagctcatcccctccatcattgtcctcaacatg tttgccagtgtcttcctgcttactgccattagcctggatcgctgtcttgtggtattcaag ccaatctggtgtcagaatcatcgcaatgtagggatggcctgctctatctgtggatgtatc tgggtggtggcttttgtgatgtgcattcctgtgttcgtgtaccgggaaatcttcactaca gacaaccataatagatgtggctacaaatttggtctctccagctcattagattatccagac ttttatggagatccactagaaaacaggtctcttgaaaacattgttcagccgcctggagaa atgaatgataggttagatccttcctctttccaaacaaatgatcatccttggacagtcccc actgtcttccaacctcaaacatttcaaagaccttctgcagattcactccctaggggttct gctaggttaacaagtcaaaatctgtattctaatgtatttaaacctgctgatgtggtctca cctaaaatccccagtgggtttcctattgaagatcacgaaaccagcccactggataactct gatgcttttctctctactcatttaaagctgttccctagcgcttctagcaattccttctac gagtctgagctaccacaaggtttccaggattattacaatttaggccaattcacagatgac gatcaagtgccaacacccctcgtggcaataacgatcactaggctagtggtgggtttcctg ctgccctctgttatcatgatagcctgttacagcttcattgtcttccgaatgcaaaggggc cgcttcgccaagtctcagagcaaaacctttcgagtggccgtggtggtggtggctgtcttt cttgtctgctggactccataccacatttttggagtcctgtcattgcttactgacccagaa actcccttggggaaaactctgatgtcctgggatcatgtatgcattgctctagcatctgcc aatagttgctttaatcccttcctttatgccctcttggggaaagattttaggaagaaagca aggcagtccattcagggaattctggaggcagccttcagtgaggagctcacacgttccacc cactgtccctcaaacaatgtcatttcagaaagaaatagtacaactgtgtga >gi568815586f:7939833_8152847|GENSCAN_predicted_peptide_4|353_aa MVGIYKTGVFWVGCDGKTKRKGNLRMLARELLKQGDGRNKEKANNVQNMKKNRNWVDPTY SLFNRIKFSKKIIGIHAESVAKHFNRCRPEKSEIKSVSIWLPASPKRLISETNLNSRGSD ATEVVALCISASRAPPPVLRFAPGSADSGPKMATELEYESVLCVKPDVSVYRIPPRASNR GYRASDWKLDQPDWTGRLRITSKGKTAYIKLEDKVSGELFAQAPVEQYPGIAVETVTDSS RYFVIRIQDGTGRSAFIGIGFTDRGDAFDFNVSLQDHFKWVKQESEISKESQEMDARPKL DLGFKEGQTIKLCIGLCSKPGTTAIQLGPVLNGIGRTLRTDLRNKNDLEGTNL >gi568815586f:7939833_8152847|GENSCAN_predicted_CDS_4|1062_bp atggttggaatatataagactggagtcttttgggttggctgtgatggaaagacaaagagg aaaggaaacttaaggatgctggcaagagaactgttgaaacaaggagatgggaggaataaa gagaaggctaacaatgtgcagaacatgaagaagaacaggaattgggttgatcccacgtat agtctatttaacaggatcaaattttccaaaaagataatcggaatacacgccgaaagtgtg gcaaaacattttaacaggtgtcgtcctgaaaaatcggagataaagagtgtttcaatatgg ctccccgcgtcaccgaaaaggttaatctcagaaactaacttgaactcgcgcggaagtgac gcaacagaagttgtcgcgctttgcatctccgcctcccgtgctccgcctccggtcttacgt ttcgcccccggcagcgccgacagcggacccaagatggcgaccgagttggagtacgagtct gtgctgtgtgtgaagccagacgtcagcgtctaccggattccgccccgggcctccaaccgc ggttacagggcctctgactggaaattagaccagcctgattggactggtcgcctccgaatc acttcaaaagggaagactgcctatatcaaactcgaggataaagtttcaggggagctcttt gctcaggcaccagtagaacaatatcctggtattgctgtggagacggtgacagattctagc cgctactttgtaatccggatccaggatggtactgggcgcagtgctttcattggcattggc ttcacagatcggggagatgccttcgactttaatgtctccttgcaggatcacttcaagtgg gtaaagcaggaatctgagatttccaaggaatctcaagaaatggatgctcgtcctaagttg gatctgggcttcaaggaaggacaaaccatcaagttgtgtatcgggctctgttccaaacca ggcaccacagccatccaactgggtccagttctgaatggcattggcaggacattaaggaca gacttgaggaataaaaatgaccttgagggcaccaatctgtga >gi568815586f:7939833_8152847|GENSCAN_predicted_peptide_5|150_aa MEYYAAIKNDEFMSFVGTWMKLVTIILSKLSQGQKTKHRMFSLTEIQKTLRDYYEHLYAH KLENLEKLEMDKFLETCNLPRLTQEEIESLNRSITSSETESVIKSLPTRKGPGPGGFTAK FYWMYKEEVPAELFLLKLFQKFEEEGLIAN >gi568815586f:7939833_8152847|GENSCAN_predicted_CDS_5|453_bp atggaatactatgcagccataaaaaatgatgagttcatgtcctttgtagggacatggatg aaactggtaaccatcattctcagcaaactatcgcaaggacaaaaaaccaaacaccgcatg ttctcactcacagaaatacaaaaaaccctcagagactattatgaacacctctatgcacac aaactagaaaacctagaaaaactagaaatggacaaattcttggaaacatgcaacctccca agattgacccaggaagaaattgaatccctgaacagatcaataacaagttctgaaactgaa tcagtaataaaaagcctaccaaccagaaaaggcccaggaccaggtggattcacagccaaa ttctactggatgtataaagaagaggttcctgctgaactattcctactgaaactgttccaa aaatttgaggaggaaggactcatcgctaactga >gi568815586f:7939833_8152847|GENSCAN_predicted_peptide_6|382_aa MLETNEKLESFSKELEDIKKNQDFTMLGAPSFPMAGHLASDFAFSPPPGGGGDGPGGPEP GWVDPRTWLSFQGPPGGPGIGPGFGPGSEEWGIPPCPPPYEFCGGMAYCGPQTGVGLVPQ DGLETSQPEGEAGVGVESNSDGASPEPCTVPSGAVKLEKEKLEQNPEESQDIKALQKELE QFAKLLKQKRITLGYTQADVFSQTTICRFEALQLSFKNMCELRPLLQKWVEEADNNENLQ EICKAETLVQARKRKRTSIENQVRGNLENLFLRCPKPTLQQISHIAQQLGLEKDVVRVWF CNRRQKGKRSSSGYAQREDFEAVGSPFSGGPVSFPLAPGPHFGTPGYGSPHFTALYSSVP FPEGEAFPPVSVTTLGSPMHSN >gi568815586f:7939833_8152847|GENSCAN_predicted_CDS_6|1149_bp atgcttgaaacaaatgaaaaactagaaagtttcagcaaagaactagaagatataaagaaa aaccaagatttcaccatgcttggggcgccttccttccccatggcgggacacctggcttcg gatttcgccttctcaccccctccaggcggtggaggtgatgggccaggggggccggagccg ggctgggttgatcctcggacctggctaagcttccaaggccctcctggagggccaggaatc gggccggggtttgggccaggctctgaggagtgggggattcccccatgtcccccgccgtat gagttctgcggggggatggcgtactgtgggcctcagactggagtggggctagtgccccaa gacggcttggagacctctcagcctgagggcgaagcaggagtcggggtggagagcaactcc gatggggcctccccggagccctgcaccgtcccctctggtgccgtgaagctggagaaggag aagctggagcaaaacccggaggagtcccaggacatcaaagctctgcagaaagaactcgag caatttgccaagctcctgaagcagaagaggatcaccctgggatatacacaggccgatgtg ttcagccaaacgaccatctgccgctttgaggctctgcagcttagcttcaagaacatgtgt gagctgcggcccttgctgcagaagtgggtggaggaagctgacaacaatgaaaatcttcag gagatatgcaaagcagaaaccctcgtgcaggcccgaaagagaaagcgaaccagtatcgag aaccaagtgagaggcaacctggagaatttgttcctgcggtgcccgaaacccacactgcag cagatcagccacatcgcccagcagcttgggctggagaaggatgtggtccgagtgtggttc tgtaaccggcgccagaagggcaagcgatcaagcagtggctatgcacaacgagaggatttt gaggctgttgggtctcctttctcagggggaccagtgtcctttcctctggccccagggccc cattttggtaccccaggctatgggagccctcacttcactgcactgtactcctcggtccct ttccctgagggggaagcctttccccctgtctccgtcaccactctgggctctcccatgcat tcaaactga >gi568815586f:7939833_8152847|GENSCAN_predicted_peptide_7|86_aa MEAHLLVINTQEEQDFIFQNLQEESAYFVGLSDPEGQRHWQWVDQTPYNESSTGVDTVPH SSRLMGLLYCSILDLEPLLECVLLYM >gi568815586f:7939833_8152847|GENSCAN_predicted_CDS_7|261_bp atggaggctcacctgctggtgataaacactcaagaagagcaggatttcatcttccagaat ctgcaagaagaatctgcttattttgtggggctctcagatccagaaggtcagcgacattgg caatgggttgatcagacaccatacaatgaaagttccacaggagttgacactgtgccccat tcatcacggctgatggggctgctgtactgcagcatcttggacctggaaccactgctggag tgtgtcctgctctacatgtga