GENSCAN 1.0    Date run: 5-Nov-116    Time: 00:56:03

Sequence gi568815586r:7958740_8160185 : 201446 bp : 43.01% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +  42797  42889   93  1  0   62   84    82 0.496   5.58
 1.02 Intr +  51357  51472  116  2  2   43   60   161 0.568   8.05
 1.03 Intr +  56269  56500  232  1  1   62   31    74 0.186  -3.12
 1.04 Intr +  56551  56939  389  0  2  -34   36   455 0.351  21.69
 1.05 Intr +  60040  60139  100  1  1   55   69    97 0.546   4.61
 1.06 Intr +  60631  60692   62  0  2   71   86    20 0.282  -2.37
 1.07 Intr +  67621  67715   95  1  2   32   38   148 0.103   3.81
 1.08 Intr +  81080  81426  347  0  2  124   81   420 0.791  40.01
 1.09 Intr +  83919  83993   75  2  0  127  116    71 0.999  13.51
 1.10 Intr +  84962  85030   69  1  0   56   95    51 0.794   1.98
 1.11 Intr +  86021  86219  199  1  1   85  113   117 0.999  12.82
 1.12 Intr +  89143  89550  408  2  0  103   97   284 0.996  25.04
 1.13 Intr +  89958  90059  102  1  0   98   92    39 0.978   5.45
 1.14 Intr +  90623  90832  210  0  0   85   79   193 0.997  16.98
 1.15 Intr +  91783  91881   99  2  0   99  110    16 0.969   4.98
 1.16 Term +  94023  94111   89  1  2   98   39   127 0.950   6.62
 1.17 PlyA +  94766  94771    6                               1.05

 2.02 PlyA -  95708  95703    6                               1.05
 2.01 Sngl - 101368  99998 1371  1  0   58   48   461 0.754  33.23
 2.00 Prom - 103772 103733   40                              -5.36

 3.00 Prom + 105651 105690   40                              -7.06
 3.01 Init + 121210 121367  158  0  2   67   94    80 0.659   5.88
 3.02 Intr + 122895 123024  130  0  1   44   56    80 0.496   1.10
 3.03 Intr + 123385 123644  260  0  2   74   79   232 0.586  17.16
 3.04 Intr + 131197 131297  101  1  2  110   89    66 0.999   8.65
 3.05 Intr + 131456 131560  105  0  0   74   99   138 0.987  13.69
 3.06 Intr + 133030 133111   82  2  1   70   89    85 0.991   5.50
 3.07 Intr + 133937 134045  109  2  1   62   84   108 0.998   8.09
 3.08 Term + 137303 137419  117  1  0  117   55    56 0.670   3.74
 3.09 PlyA + 139108 139113    6                               1.05

 4.03 PlyA - 139136 139131    6                               1.05
 4.02 Term - 148971 148652  320  1  2   52   37   150 0.772   1.34
 4.01 Init - 154588 154456  133  1  1   78   75    59 0.449   3.90
 4.00 Prom - 166536 166497   40                              -4.56

 5.03 PlyA - 166928 166923    6                               1.05
 5.02 Term - 175583 175032  552  2  0   20   51   539 0.998  37.71
 5.01 Init - 176211 175615  597  0  0   72  -95   420 0.555  17.38
 5.00 Prom - 176278 176239   40                              -8.46

 6.00 Prom + 176583 176622   40                              -8.76
 6.01 Init + 176956 176997   42  0  0   72   80    30 0.921   1.02
 6.02 Intr + 178049 178164  116  1  2  149   94    90 0.862  14.95
 6.03 Term + 184928 185030  103  2  1   76   54   108 0.773   3.95
 6.04 PlyA + 186226 186231    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815586r:7958740_8160185|GENSCAN_predicted_peptide_1|894_aa
MIGKLVTKKFEEDMQMDLSEWSKTVKIFVYHKLLPPPPAFGDHDPDQSAAINVEARPSTS
KKIVTRSRLSRPAEMLLPLPTVFPQMRLLSRVLAPHLTRAYAKDVKFGADARALMLQGVD
LLADAVAVTMGPKGRTVIIEQSWGSPQIQIRAKLVQDIANNTNEEAGNGTTSATVLARSI
AKEGFKKISKGANPVEIRKGVMLAVDAVIAELRKQSKFVTTPEEIAQVATTSANGDKEIG
NIISNAMKKVGRKGVITVKDGKTLNDELEIIESGWASRWDPLTASKSLGFDFEWSVNVVA
KRDSVHRALPGKAPADSTKVFWRDCSPPTQGFQKASVHADFGGSKQETIDPVQEMTLFEG
NEEPRSTMASDLESSLTSIDWLPQLTLRATIEKLGSASQAGPPGSSRKCSPGSPTDPNAT
LSKDEAAVHQDGKPRYSYATLITYAINSSPAKKMTLSEIYRWICDNFPYYKNAGIGWKNS
IRHNLSLNKCFRKVPRPRDDPGKGSYWTIDTCPDISRKRRHPPDDDGTGSVDGGAVAAGA
SGRESAEGPPPLYNTNHDFKFSYSEINFQDLSWSFRNLYKSMLEKSSSSSQHGFSSLLGD
IPPSNNYYMYQQQQPPPPQQQQQQQQPPQPPPQQSQPQQQQAPAQGPSAVGGAPPLHTPS
TDGCTPPGGKQAGAEGYGPPPVMAMHPPPLQHGGYHPHQHHPHSHPAQQPPPPQPQAQGQ
APINNTGFAFPSDWCSNIDSLKESFKMVNRLNWSSIEQSQFSELMESLRQAEQKNWTLDQ
HHIANLCDSLNHFLTQTGHVPPQGGTHRPPAPARIADSCALTSGKQESAMSQVNSYGHPQ
APHLYPGPSPMYPIPTQDSAGYNRPAHHMVPRPSVPPPGANEEIPDDFDWDLIT

>gi568815586r:7958740_8160185|GENSCAN_predicted_CDS_1|2685_bp
atgattggaaaattggtgacaaagaaatttgaggaagacatgcagatggacctctctgag
tggtcaaaaactgtgaagatatttgtataccataaattgctaccaccacccccagccttc
ggcgaccacgaccctgatcagtcagcagccatcaacgttgaggcaagaccctccaccagc
aaaaagattgtgacacgctcaaggctcagccgccccgcagaaatgcttcttccgttaccc
acagtctttccccagatgagactgctgtccagggtactggcccctcatctcactcgggct
tatgccaaagatgtaaaatttggcgcagatgcccgagccttaatgcttcaaggtgtagac
cttttagccgatgctgtagctgttacgatggggccaaagggaagaacagtgattattgag
cagagctggggaagtccccaaatacaaattagagctaaacttgttcaagatattgctaat
aacacaaatgaagaggctggaaatggcaccacctctgctaccgtactggcacgctctatt
gccaaggagggcttcaagaagattagcaaaggtgctaatccagtggaaatcaggaaaggt
gtgatgttggctgttgatgctgtaattgctgaacttagaaagcagtctaaatttgtgacc
acccctgaagaaattgcacaggttgctacaacttctgcaaacggagacaaagaaattggc
aatatcatctccaatgcaatgaaaaaggttggaagaaagggtgtcatcacagtaaaggat
ggaaaaacactgaatgatgaattagaaattattgaaagtggctgggcgagccgctgggac
ccactaactgcctcgaaaagcctaggattcgactttgaatggtccgttaatgtggtcgca
aaacgtgactcggttcatcgggcgctccctggtaaggcccctgcagacagcacgaaggtg
ttttggagagattgttctcccccgacccaaggattccagaaagccagtgtacacgcagac
ttcggaggcagtaaacaggaaaccatcgatcccgtgcaggaaatgaccttgtttgaaggc
aacgaagaacccagaagtaccatggcttctgacctagagagtagcctcacctccatagac
tggctcccccagctgaccctccgagctaccattgagaagcttggaagtgcctcccaggct
gggcctcccgggagcagccgcaagtgttcaccagggtcacccacagatcctaatgccacc
ctgagcaaagacgaggcagcagtgcaccaggacggcaagccacgatacagctatgccact
ctcatcacctatgccatcaactcctctccagccaagaagatgaccctcagcgagatttac
cgctggatctgtgataacttcccctattacaagaatgctggcattggttggaagaattca
atacggcacaacctttctctcaacaagtgtttccggaaggtgcccagacctcgggatgac
cctgggaagggttcctattggacaattgacacctgccctgacatttcccgaaagagaaga
caccctccagatgatgatggcacaggatctgtggatggtggagcagtggcagcaggggct
tcaggccgagaaagtgctgagggtccccctcccctctataacaccaaccatgactttaaa
ttctcctactcagagatcaactttcaggatctaagctggtccttccgcaacctctataag
tccatgctggagaagtcctcttcctcctctcagcacggcttttcttctctcctgggggac
atcccaccctcgaacaactactacatgtatcagcagcagcagccaccgccacctcaacag
cagcagcagcagcagcagccgccacagccacctccccagcagtcccagccacagcagcag
caggcacctgcccagggcccctcagctgtagggggtgctcctccactgcacaccccaagc
acagatggttgtaccccaccagggggaaagcaagctggggcggaaggctatgggcctccc
cctgtaatggccatgcatccacccccgctgcagcatggaggctaccaccctcatcagcac
catccccactcccaccctgcccagcagccaccacctccacagccacaggcacaaggccag
gctcccatcaacaacactggctttgcctttccttctgactggtgctctaatattgactct
ttaaaggaaagcttcaagatggtgaatcggctcaattggtccagcattgagcagtcacaa
ttctcagaactgatggagagtctacgacaggcagagcagaagaactggaccctcgaccag
catcacattgccaatctgtgtgactccctcaaccacttccttactcagactggtcacgtg
ccccctcaagggggtacccaccgcccaccagcccctgcccgtattgctgactcctgtgcc
ctcaccagtggcaaacaggagtcagccatgagccaagtgaactcttatgggcacccacaa
gctccccacctctaccctggcccatcaccaatgtacccaatccccacccaggactcagca
ggatacaatcgcccagcacaccatatggtccctcggccatcagtgccacctcctggtgcc
aatgaggagatccctgatgacttcgactgggacttgatcacttag

>gi568815586r:7958740_8160185|GENSCAN_predicted_peptide_2|456_aa
MVILSLTFLLGLPGNGLVLWVAGLKMQRTVNTIWFLHLTLADLLCCLSLPFSLAHLALQG
QWPYGRFLCKLIPSIIVLNMFASVFLLTAISLDRCLVVFKPIWCQNHRNVGMACSICGCI
WVVAFVMCIPVFVYREIFTTDNHNRCGYKFGLSSSLDYPDFYGDPLENRSLENIVQPPGE
MNDRLDPSSFQTNDHPWTVPTVFQPQTFQRPSADSLPRGSARLTSQNLYSNVFKPADVVS
PKIPSGFPIEDHETSPLDNSDAFLSTHLKLFPSASSNSFYESELPQGFQDYYNLGQFTDD
DQVPTPLVAITITRLVVGFLLPSVIMIACYSFIVFRMQRGRFAKSQSKTFRVAVVVVAVF
LVCWTPYHIFGVLSLLTDPETPLGKTLMSWDHVCIALASANSCFNPFLYALLGKDFRKKA
RQSIQGILEAAFSEELTRSTHCPSNNVISERNSTTV

>gi568815586r:7958740_8160185|GENSCAN_predicted_CDS_2|1371_bp
atggtcattctcagccttacttttttactgggattgccaggcaatgggctggtgctgtgg
gtggctggcctgaagatgcagcggacagtgaacacaatttggttcctccacctcaccttg
gcggacctcctctgctgcctctccttgcccttctcgctggctcacttggctctccaggga
cagtggccctacggcaggttcctatgcaagctcatcccctccatcattgtcctcaacatg
tttgccagtgtcttcctgcttactgccattagcctggatcgctgtcttgtggtattcaag
ccaatctggtgtcagaatcatcgcaatgtagggatggcctgctctatctgtggatgtatc
tgggtggtggcttttgtgatgtgcattcctgtgttcgtgtaccgggaaatcttcactaca
gacaaccataatagatgtggctacaaatttggtctctccagctcattagattatccagac
ttttatggagatccactagaaaacaggtctcttgaaaacattgttcagccgcctggagaa
atgaatgataggttagatccttcctctttccaaacaaatgatcatccttggacagtcccc
actgtcttccaacctcaaacatttcaaagaccttctgcagattcactccctaggggttct
gctaggttaacaagtcaaaatctgtattctaatgtatttaaacctgctgatgtggtctca
cctaaaatccccagtgggtttcctattgaagatcacgaaaccagcccactggataactct
gatgcttttctctctactcatttaaagctgttccctagcgcttctagcaattccttctac
gagtctgagctaccacaaggtttccaggattattacaatttaggccaattcacagatgac
gatcaagtgccaacacccctcgtggcaataacgatcactaggctagtggtgggtttcctg
ctgccctctgttatcatgatagcctgttacagcttcattgtcttccgaatgcaaaggggc
cgcttcgccaagtctcagagcaaaacctttcgagtggccgtggtggtggtggctgtcttt
cttgtctgctggactccataccacatttttggagtcctgtcattgcttactgacccagaa
actcccttggggaaaactctgatgtcctgggatcatgtatgcattgctctagcatctgcc
aatagttgctttaatcccttcctttatgccctcttggggaaagattttaggaagaaagca
aggcagtccattcagggaattctggaggcagccttcagtgaggagctcacacgttccacc
cactgtccctcaaacaatgtcatttcagaaagaaatagtacaactgtgtga

>gi568815586r:7958740_8160185|GENSCAN_predicted_peptide_3|353_aa
MVGIYKTGVFWVGCDGKTKRKGNLRMLARELLKQGDGRNKEKANNVQNMKKNRNWVDPTY
SLFNRIKFSKKIIGIHAESVAKHFNRCRPEKSEIKSVSIWLPASPKRLISETNLNSRGSD
ATEVVALCISASRAPPPVLRFAPGSADSGPKMATELEYESVLCVKPDVSVYRIPPRASNR
GYRASDWKLDQPDWTGRLRITSKGKTAYIKLEDKVSGELFAQAPVEQYPGIAVETVTDSS
RYFVIRIQDGTGRSAFIGIGFTDRGDAFDFNVSLQDHFKWVKQESEISKESQEMDARPKL
DLGFKEGQTIKLCIGLCSKPGTTAIQLGPVLNGIGRTLRTDLRNKNDLEGTNL

>gi568815586r:7958740_8160185|GENSCAN_predicted_CDS_3|1062_bp
atggttggaatatataagactggagtcttttgggttggctgtgatggaaagacaaagagg
aaaggaaacttaaggatgctggcaagagaactgttgaaacaaggagatgggaggaataaa
gagaaggctaacaatgtgcagaacatgaagaagaacaggaattgggttgatcccacgtat
agtctatttaacaggatcaaattttccaaaaagataatcggaatacacgccgaaagtgtg
gcaaaacattttaacaggtgtcgtcctgaaaaatcggagataaagagtgtttcaatatgg
ctccccgcgtcaccgaaaaggttaatctcagaaactaacttgaactcgcgcggaagtgac
gcaacagaagttgtcgcgctttgcatctccgcctcccgtgctccgcctccggtcttacgt
ttcgcccccggcagcgccgacagcggacccaagatggcgaccgagttggagtacgagtct
gtgctgtgtgtgaagccagacgtcagcgtctaccggattccgccccgggcctccaaccgc
ggttacagggcctctgactggaaattagaccagcctgattggactggtcgcctccgaatc
acttcaaaagggaagactgcctatatcaaactcgaggataaagtttcaggggagctcttt
gctcaggcaccagtagaacaatatcctggtattgctgtggagacggtgacagattctagc
cgctactttgtaatccggatccaggatggtactgggcgcagtgctttcattggcattggc
ttcacagatcggggagatgccttcgactttaatgtctccttgcaggatcacttcaagtgg
gtaaagcaggaatctgagatttccaaggaatctcaagaaatggatgctcgtcctaagttg
gatctgggcttcaaggaaggacaaaccatcaagttgtgtatcgggctctgttccaaacca
ggcaccacagccatccaactgggtccagttctgaatggcattggcaggacattaaggaca
gacttgaggaataaaaatgaccttgagggcaccaatctgtga

>gi568815586r:7958740_8160185|GENSCAN_predicted_peptide_4|150_aa
MEYYAAIKNDEFMSFVGTWMKLVTIILSKLSQGQKTKHRMFSLTEIQKTLRDYYEHLYAH
KLENLEKLEMDKFLETCNLPRLTQEEIESLNRSITSSETESVIKSLPTRKGPGPGGFTAK
FYWMYKEEVPAELFLLKLFQKFEEEGLIAN

>gi568815586r:7958740_8160185|GENSCAN_predicted_CDS_4|453_bp
atggaatactatgcagccataaaaaatgatgagttcatgtcctttgtagggacatggatg
aaactggtaaccatcattctcagcaaactatcgcaaggacaaaaaaccaaacaccgcatg
ttctcactcacagaaatacaaaaaaccctcagagactattatgaacacctctatgcacac
aaactagaaaacctagaaaaactagaaatggacaaattcttggaaacatgcaacctccca
agattgacccaggaagaaattgaatccctgaacagatcaataacaagttctgaaactgaa
tcagtaataaaaagcctaccaaccagaaaaggcccaggaccaggtggattcacagccaaa
ttctactggatgtataaagaagaggttcctgctgaactattcctactgaaactgttccaa
aaatttgaggaggaaggactcatcgctaactga

>gi568815586r:7958740_8160185|GENSCAN_predicted_peptide_5|382_aa
MLETNEKLESFSKELEDIKKNQDFTMLGAPSFPMAGHLASDFAFSPPPGGGGDGPGGPEP
GWVDPRTWLSFQGPPGGPGIGPGFGPGSEEWGIPPCPPPYEFCGGMAYCGPQTGVGLVPQ
DGLETSQPEGEAGVGVESNSDGASPEPCTVPSGAVKLEKEKLEQNPEESQDIKALQKELE
QFAKLLKQKRITLGYTQADVFSQTTICRFEALQLSFKNMCELRPLLQKWVEEADNNENLQ
EICKAETLVQARKRKRTSIENQVRGNLENLFLRCPKPTLQQISHIAQQLGLEKDVVRVWF
CNRRQKGKRSSSGYAQREDFEAVGSPFSGGPVSFPLAPGPHFGTPGYGSPHFTALYSSVP
FPEGEAFPPVSVTTLGSPMHSN

>gi568815586r:7958740_8160185|GENSCAN_predicted_CDS_5|1149_bp
atgcttgaaacaaatgaaaaactagaaagtttcagcaaagaactagaagatataaagaaa
aaccaagatttcaccatgcttggggcgccttccttccccatggcgggacacctggcttcg
gatttcgccttctcaccccctccaggcggtggaggtgatgggccaggggggccggagccg
ggctgggttgatcctcggacctggctaagcttccaaggccctcctggagggccaggaatc
gggccggggtttgggccaggctctgaggagtgggggattcccccatgtcccccgccgtat
gagttctgcggggggatggcgtactgtgggcctcagactggagtggggctagtgccccaa
gacggcttggagacctctcagcctgagggcgaagcaggagtcggggtggagagcaactcc
gatggggcctccccggagccctgcaccgtcccctctggtgccgtgaagctggagaaggag
aagctggagcaaaacccggaggagtcccaggacatcaaagctctgcagaaagaactcgag
caatttgccaagctcctgaagcagaagaggatcaccctgggatatacacaggccgatgtg
ttcagccaaacgaccatctgccgctttgaggctctgcagcttagcttcaagaacatgtgt
gagctgcggcccttgctgcagaagtgggtggaggaagctgacaacaatgaaaatcttcag
gagatatgcaaagcagaaaccctcgtgcaggcccgaaagagaaagcgaaccagtatcgag
aaccaagtgagaggcaacctggagaatttgttcctgcggtgcccgaaacccacactgcag
cagatcagccacatcgcccagcagcttgggctggagaaggatgtggtccgagtgtggttc
tgtaaccggcgccagaagggcaagcgatcaagcagtggctatgcacaacgagaggatttt
gaggctgttgggtctcctttctcagggggaccagtgtcctttcctctggccccagggccc
cattttggtaccccaggctatgggagccctcacttcactgcactgtactcctcggtccct
ttccctgagggggaagcctttccccctgtctccgtcaccactctgggctctcccatgcat
tcaaactga

>gi568815586r:7958740_8160185|GENSCAN_predicted_peptide_6|86_aa
MEAHLLVINTQEEQDFIFQNLQEESAYFVGLSDPEGQRHWQWVDQTPYNESSTGVDTVPH
SSRLMGLLYCSILDLEPLLECVLLYM

>gi568815586r:7958740_8160185|GENSCAN_predicted_CDS_6|261_bp
atggaggctcacctgctggtgataaacactcaagaagagcaggatttcatcttccagaat
ctgcaagaagaatctgcttattttgtggggctctcagatccagaaggtcagcgacattgg
caatgggttgatcagacaccatacaatgaaagttccacaggagttgacactgtgccccat
tcatcacggctgatggggctgctgtactgcagcatcttggacctggaaccactgctggag
tgtgtcctgctctacatgtga