GENSCAN 1.0 Date run: 6-Nov-116 Time: 17:59:21 Sequence gi568815596f:230660002_230918701 : 258700 bp : 45.28% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.08 PlyA - 3144 3139 6 1.05 1.07 Term - 8614 8444 171 1 0 114 39 129 0.848 8.43 1.06 Intr - 30460 30396 65 2 2 -1 83 100 0.008 -0.96 1.05 Intr - 34687 34550 138 2 0 75 80 80 0.633 6.34 1.04 Intr - 41053 40907 147 2 0 52 47 75 0.350 0.01 1.03 Intr - 42948 42620 329 2 2 53 38 222 0.926 9.04 1.02 Intr - 44475 44024 452 0 2 48 113 91 0.332 -0.30 1.01 Init - 47271 47140 132 0 0 47 41 116 0.538 2.94 1.00 Prom - 50964 50925 40 -5.76 2.02 PlyA - 52090 52085 6 1.05 2.01 Sngl - 53325 52567 759 0 0 78 31 225 0.743 10.03 2.00 Prom - 63060 63021 40 -5.06 3.02 PlyA - 63861 63856 6 1.05 3.01 Sngl - 65402 65142 261 2 0 70 44 339 0.888 22.86 3.00 Prom - 70943 70904 40 -3.46 4.05 PlyA - 70966 70961 6 1.05 4.04 Term - 75102 74643 460 2 1 22 41 196 0.466 2.86 4.03 Intr - 80826 80754 73 1 1 58 72 66 0.509 0.46 4.02 Intr - 87432 87307 126 1 0 73 43 114 0.658 6.05 4.01 Init - 95130 95076 55 0 1 63 12 58 0.126 -2.85 4.00 Prom - 95436 95397 40 -1.36 5.00 Prom + 98538 98577 40 -6.46 5.01 Init + 100001 100114 114 1 0 103 86 89 0.982 10.41 5.02 Intr + 102406 102501 96 0 0 59 65 70 0.308 2.01 5.03 Intr + 130871 131035 165 1 0 68 94 132 0.793 11.96 5.04 Intr + 136538 136602 65 1 2 73 65 34 0.109 -2.88 5.05 Intr + 138728 138896 169 2 1 104 87 45 0.206 5.95 5.06 Intr + 150262 150321 60 0 0 102 89 22 0.226 2.63 5.07 Intr + 154048 154113 66 0 0 106 86 30 0.271 3.70 5.08 Intr + 157753 157896 144 0 0 45 75 105 0.298 5.38 5.09 Term + 158515 158703 189 0 0 132 33 240 0.998 20.35 5.10 PlyA + 158709 158714 6 1.05 6.00 Prom + 159401 159440 40 -8.06 6.01 Init + 162098 162148 51 1 0 89 90 74 0.953 8.86 6.02 Intr + 167898 168021 124 2 1 79 66 52 0.818 2.26 6.03 Intr + 168928 169064 137 2 2 70 64 66 0.227 2.69 6.04 Term + 184638 184799 162 2 0 58 33 133 0.307 2.74 6.05 PlyA + 188695 188700 6 1.05 7.00 Prom + 198502 198541 40 -5.16 7.01 Init + 205025 205144 120 1 0 106 76 213 0.994 22.09 7.02 Intr + 213416 213556 141 1 0 96 94 226 0.974 24.55 7.03 Intr + 215517 215540 24 2 0 129 65 5 0.581 0.52 7.04 Intr + 215619 215807 189 2 0 84 76 440 0.996 42.18 7.05 Intr + 216856 216966 111 0 0 94 91 193 0.999 20.78 7.06 Intr + 217399 217549 151 0 1 92 69 305 0.999 28.74 7.07 Term + 218007 218098 92 1 2 104 54 225 0.918 18.58 7.08 PlyA + 219226 219231 6 1.05 8.04 PlyA - 219971 219966 6 1.05 8.03 Term - 230183 230043 141 2 0 49 43 89 0.313 -1.57 8.02 Intr - 234535 234108 428 2 2 106 77 111 0.513 5.31 8.01 Init - 236145 235998 148 0 1 84 82 86 0.834 7.85 8.00 Prom - 237738 237699 40 -2.66 9.02 PlyA - 237811 237806 6 1.05 9.01 Sngl - 250961 250002 960 2 0 76 43 1314 0.988 122.36 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596f:230660002_230918701|GENSCAN_predicted_peptide_1|477_aa MSARLKSIPTKPSRYAEKEAYRGCGAGLREKSAEETSCSPGSEEKHQTEQLTPQKSTFLR TKNQVRDYSTWFKLHITERGTEEGRKDSLELLMFPIPPSPGSNHGARRESVVLGEGEHSD CRILHFQLWWLWGENHLLEERSGKSKGDFGNLGISSASGEQSTKQAFRVPDSRPWLLDGI SGPALDQRGVYCPERNQKISTSIKTVQENSSSNKLNKTSGTSPGETEIWELSDRQFKIAI LKKLNKIQDNTEKEFRILSDKFNKEIEIIKKNQAEILELKNTTDMLKNTSESPTSRIDQA EERINVHILIPGTCEYVTLGGKGNFVEVIKILEVGKLSRILWWAQCNQNGVYKLPATNSH KGSQHFAPDCPAGVTTICAEPLCSPPVAHPSDSRPFNISAILGPMQKIATVSEEQYITVH KICKAWQCGTAQASKPETQHGKPHQGKEAWMGNFLDTGEEQPESLILTHFTLFHLPF >gi568815596f:230660002_230918701|GENSCAN_predicted_CDS_1|1434_bp atgtcagcacgtttaaagtccatcccaaccaagcccagccgttacgccgagaaggaggcc tacaggggctgtggggctggactcagagagaaatcagctgaggaaacatcctgcagccct ggaagtgaggagaaacaccaaactgagcaactaaccccacaaaaaagcaccttcctgaga accaaaaatcaagtgagagattatagcacctggtttaaacttcatatcactgaaagaggc actgaagagggtaggaaagatagtcttgaattgctgatgttccctattcccccatcccct ggcagcaaccatggggcacgaagagaatctgtggtcttgggtgagggagagcacagtgat tgcaggattttgcatttccagctgtggtggctatggggagagaaccatctgcttgaggaa aggagtgggaagagtaaaggggactttggcaacttaggtatcagctcagcctcaggggag cagagcaccaaacaggcttttagagtccctgattctaggccgtggctcttggatggcatt tctggacctgccctggaccagaggggagtctactgccctgaaagaaaccagaaaatatcc acaagcatcaagaccgtccaggaaaacagctcgtcaaacaaactaaataagacatcaggg accagtcctggagagacagagatatgggaactttcagacagacaattcaaaatagctatt ttgaagaaactcaataaaattcaagataacacagagaaggaattcagaatactatcagat aaatttaacaaagagattgaaatcattaaaaagaatcaagcagaaattctggagttgaaa aacacaactgacatgctgaagaatacatcagagtctcctaccagcagaattgatcaagca gaagaaagaattaatgtccacatcctaatccctggaacctgtgaatacgtgaccttaggt ggtaaagggaactttgtagaagtgattaagatccttgaagtggggaaattatccaggatc ctctggtgggctcagtgtaatcagaacggtgtttataagctgccagcgacgaactctcat aagggcagccagcacttcgcccctgactgtcctgctggagtgacaaccatctgtgctgag cctctttgcagtcctcctgtggctcatccctcggacagccgccctttcaacatctcagct attttaggaccaatgcagaagattgccacggtgtctgaggaacagtacatcacagtacat aagatctgcaaagcctggcagtgtggcactgcccaggccagcaaaccagagacacagcac ggcaagccccaccaggggaaagaggcttggatgggcaacttcctggacactggggaggag caaccagaaagcctcatcctcacccactttaccctgttccaccttcctttctga >gi568815596f:230660002_230918701|GENSCAN_predicted_peptide_2|252_aa MEAAPGPVRAQRGRRKPGRRPHHEQGRPGVTHGAGPGVPFSASASSLSPSSAAAASGCSF GPRGVARTLGPSVVFRPLLLPATGSRGGGGCAGGRLPRGSRGCRRAAALAWAAAAAVGPG AALVFACAPAPAPEGSVSPRPRRSPNPSPRLRPEPGFDVRTAQPPRTRRRAVPQRPLPPR APPSGRRTQGAASRREARAGGAAEKSRSQAPQFLFLRRSFQGGAQPFLLRLSALTAGIAR PLSTTTSSTLQA >gi568815596f:230660002_230918701|GENSCAN_predicted_CDS_2|759_bp atggaggcagccccggggccggtgagggcccagcggggcaggcggaagccgggcaggcgt ccgcaccacgagcagggccggccgggggtcactcacggggctgggccgggcgttcccttc tctgcctccgcgtcgtcgctgtccccgtcctcggccgccgccgcttcgggctgctcgttc ggtccccgcggagtggctcggacgctcggcccctccgttgtcttccgcccgctcctgctc ccggcgacgggctcccgcggcggcggcggctgcgcgggcggacggctgcccaggggctcc cgcggctgccgccgcgctgctgcgctcgcttgggctgcggctgctgctgtggggcccggc gccgccttagtcttcgcctgtgcccccgccccggcgcctgaaggctctgtgtctcccagg ccgcgccgctccccgaacccctcgccccgcttgcggcccgagcccggctttgacgtgcgc actgcgcagccgccgaggacgcgacgtcgggcggtgccgcaacgcccccttccgcctcgg gcgcccccgagcggccggcggactcagggcgccgcaagtaggcgcgaggcccgagcggga ggggccgcagagaaaagcagaagccaagccccgcagttcctcttcctccgccgcagcttc caaggcggggctcagccctttctcctccgcctgagcgctcttactgcgggaatcgcacgc ccactttcaacgaccacttcctcaactcttcaagcttaa >gi568815596f:230660002_230918701|GENSCAN_predicted_peptide_3|86_aa MTTSQKHRDFVAEPMGEKPVGSLAGIGEVLGKKLEERGFDKAYVVLGQFLVLKKDEDLYR EWLKDTCGASAKQSRDCLREWCDAFL >gi568815596f:230660002_230918701|GENSCAN_predicted_CDS_3|261_bp atgacaacctcccaaaagcaccgagacttcgtggcagagcccatgggggagaagccagtg gggagcctggctgggattggtgaagtcctgggcaagaagctggaggaaaggggctttgac aaggcctacgttgtccttggccagtttctggtgctaaagaaagatgaagacctctaccgg gaatggctgaaagacacttgtggcgccagcgccaagcagtcccgggactgccttcgagag tggtgcgacgccttcttgtga >gi568815596f:230660002_230918701|GENSCAN_predicted_peptide_4|237_aa MEPIQMPINQRVDKETVECKRKNIYRGNSSMGNAINKWFIVLLSPSLREIIFGNYSNANG GVPSPRAVEWYGSSDLLVTEEVSSGKLRKTTRIQGVSGQATSKEHNVNEKQADEISQNAT QSTSIGHLMLSYDSTENTTPPPPPNIGGLDQQTNNAEQCSTRVGHIFQFLPPQLRECKCS DSFVGVECLWDPVGQLFRLPQAFMIPVAKSPKAGPTFKTNMRTKGQNNRNEDVVWVK >gi568815596f:230660002_230918701|GENSCAN_predicted_CDS_4|714_bp atggaaccaatccaaatgcccatcaatcaacgagtggataaagaaactgttgagtgtaag aggaagaatatctacagaggaaacagttccatgggcaatgccatcaacaagtggttcata gtgctcctctccccttctctccgtgaaataatttttggaaactacagcaatgccaatggg ggggtccccagcccccgggctgtggagtggtacgggtccagtgacctgctagtaacagag gaggtgagcagtgggaaattaaggaaaacaacaagaattcaaggggtatcagggcaggcg accagcaaagagcacaatgtaaatgaaaaacaagccgatgagataagccaaaatgctacc caatcaacaagtattggtcacttgatgctttcctatgattcaacagaaaacacaacccca ccaccgccaccaaatattggtggtttagaccagcagactaataatgcagaacagtgttcc acccgggtggggcacatattccagttcctccctccacagttgagagaatgtaagtgcagt gactccttcgtcggggtggaatgcctctgggatcctgtgggccagctctttagactccct caggccttcatgatacctgtggccaaaagccctaaagcagggcctacatttaagactaac atgaggactaaaggacaaaataacaggaatgaagatgtggtctgggtcaaataa >gi568815596f:230660002_230918701|GENSCAN_predicted_peptide_5|355_aa MPFPFGKSHKSPADIVKNLKESMAVLEKQDISDKKAEKLDIFNFRKHSAVLIALQALHQV EAANSQQRAEATEEVSKNLVAMKEILYGTNEKEPQTEAVAQLAQELYNSGLLSTLVADLQ LIDFELSKDELYDQPFCEQIFPSSKSRYESPEIALNCGIMLRECIRHEPLAKIILWSEQF YDFFRYVEMSTFDIASDAFATFKDLLTRHKLLSAEFLEQHYDRFFSEYEKLLHSENYVTK RQSLKLLGELLLDRHNFTIMTKYISKPENLKLMMNLLRDKSRNIQFEAFHVFKVFVANPN KTQPILDILLKNQAKLIEFLSKFQNDRTEDEQFNDEKTYLVKQIRDLKRPAQQEA >gi568815596f:230660002_230918701|GENSCAN_predicted_CDS_5|1068_bp atgccgttcccgtttgggaagtctcacaaatctccagcagacattgtgaagaatctgaag gagagcatggctgttctggaaaagcaagacatttctgataaaaaagcagaaaagttggac atcttcaactttaggaagcactcagcagtcctgatagccctacaggctttgcaccaggtg gaggcagctaacagccagcaaagagcagaggctacagaagaagtttccaaaaatctggtt gccatgaaagaaattctgtatggcacaaatgaaaaagagcctcagacagaagcagtagct caacttgctcaagaactctataatagtgggctccttagcaccctggtagctgatttacag ctcattgactttgagctcagcaaagatgaattgtatgatcagccattttgtgagcagata tttccatcaagtaagagccggtatgaatctccagaaatagctctaaattgtggaataatg ttaagagaatgcatcagacatgaaccacttgcaaaaatcattttgtggtcggaacagttt tatgatttcttcagatatgtcgaaatgtcaacatttgacatagcttcagatgcatttgcc acattcaaggatttacttacaagacataaattgctcagtgcagaatttttggaacagcat tatgatagatttttcagtgaatatgagaagttacttcattcagaaaattatgtgacaaaa agacagtcactgaagcttctcggtgaactactactagatagacacaacttcacaattatg acaaaatacatcagtaaacctgagaacctcaaattaatgatgaacctgctgcgagacaaa agtcgcaacatccagtttgaggcctttcacgtttttaaggtgtttgtagccaatcctaac aagacgcagcccatcctagacatcctcctcaagaaccaggccaaactcatagagttcctc agcaagtttcagaacgacaggacggaggatgagcagtttaacgacgagaagacctattta gttaaacagatcagggatttgaagagaccagctcagcaagaagcttaa >gi568815596f:230660002_230918701|GENSCAN_predicted_peptide_6|157_aa MAVSSLTIWFPEDMKDRNLQSTLLGPEEQCQGSRNPICSDQQSQDRRRGKRICHRPLTEP VPAVSKACAGLEAVSHSEGLSLLPQGTVPDAGPRPGKWTLKSPQPASQPAIHSAIHVAIQ TSINQGQFCKELPAMPRDRYSLVITIGGEEQDASGIQ >gi568815596f:230660002_230918701|GENSCAN_predicted_CDS_6|474_bp atggctgtgtcctcactgaccatctggtttccagaagacatgaaggacaggaacctccag agcacattgttggggcctgaggagcagtgccaagggtcccggaatcccatctgctctgat caacagtcacaggaccgccgacgaggaaagaggatttgccaccgtcccctgacagagcct gtccctgctgtgtcgaaggcctgtgcaggactcgaggccgtctcccactcagagggcctg tccttgctgccccaagggactgtccctgacgcagggcctaggcctggaaagtggacactc aagagtccccagccagccagccagccagccattcattcagccatccatgtggccatccaa acatctatcaaccaggggcaattttgcaaggaacttccagcaatgcctagagatagatat tctttggttatcacaattggtggtgaggagcaggatgcttctggcatccagtag >gi568815596f:230660002_230918701|GENSCAN_predicted_peptide_7|275_aa MVKISFQPAVAGIKGDKADKASASAPAPASATEILLTPAREEQPPQHRSKRGGSVGGVCY LSMGMVVLLMGLVFASVYIYRYFFLAQAFGACSGRLARDNFFRCGVLYEDSLSSQVRTQM ELEEDVKIYLDENYERINVPVPQFGGGDPADIIHDFQRGLTAYHDISLDKCYVIELNTTI VLPPRNFWELLMNVKRGTYLPQTYIIQEEMVVTEHVSDKEALGSFIYHLCNGKDTYRLRR RATRRRINKRGAKNCNAIRHFENTFVVETLICGVV >gi568815596f:230660002_230918701|GENSCAN_predicted_CDS_7|828_bp atggtgaagattagcttccagcccgccgtggctggcatcaagggcgacaaggctgacaag gcgtcggcgtcggcccctgcgccggcctcggccaccgagatcctgctgacgccggctagg gaggagcagcccccacaacatcgatccaagagggggggctcagtgggcggcgtgtgctac ctgtcgatgggcatggtcgtgctgctcatgggcctcgtgttcgcctctgtctacatctac agatacttcttccttgcgcaggcttttggggcatgtagcggacggctggcccgagataac ttcttccgctgtggtgtgctgtatgaggactccctgtcctcccaggtccggactcagatg gagctggaagaggatgtgaaaatctacctcgacgagaactacgagcgcatcaacgtgcct gtgccccagtttggcggcggtgaccctgcagacatcatccatgacttccagcggggtctg actgcgtaccatgatatctccctggacaagtgctatgtcatcgaactcaacaccaccatt gtgctgccccctcgcaacttctgggagctcctcatgaacgtgaagagggggacctacctg ccgcagacgtacatcatccaggaggagatggtggtcacggagcatgtcagtgacaaggag gccctggggtccttcatctaccacctgtgcaacgggaaagacacctaccggctccggcgc cgggcaacgcggaggcggatcaacaagcgtggggccaagaactgcaatgccatccgccac ttcgagaacaccttcgtggtggagacgctcatctgcggggtggtgtga >gi568815596f:230660002_230918701|GENSCAN_predicted_peptide_8|238_aa MGSDKTESPLSSSRTPGVHTSNQGFLVFAQRDYQVARNDQTGAVIGWREGCSLGELNNVM HKRNSLNSLENCDTTFCAETEVEVVGGVFTAPSLGARGVGCLHLPCTQDPKERGTVCETG ERSPPEPGHLISQALGCPSGATKTLRLGTSHRNAVLSILFAPILVNVSFITSSNNKKTKS EAIQLEAFLGLEPQVFLYSNTHRLRQLVTSPAPTPDAKPSRYCPKLGGAEEPSEGEEK >gi568815596f:230660002_230918701|GENSCAN_predicted_CDS_8|717_bp atggggtcagataaaactgagtcaccactgtcctcctcaagaacacctggtgtccacacc tctaaccagggcttcctggtgtttgcacaaagagattatcaggtggccaggaacgatcag actggagcagtcattgggtggagggaaggctgcagcctgggtgagctgaacaatgtcatg cacaaaagaaattctcttaacagtttggaaaactgtgacaccaccttctgtgcagaaacg gaagtggaagtggtggggggtgtcttcacggctcccagcctcggggcacgtggtgtgggg tgtttgcacctgccgtgcacccaagaccccaaggagcgggggaccgtgtgtgaaactggt gaacgcagccctcctgagccagggcatctaatttcccaggctcttggctgcccaagtggg gctactaaaaccctgagacttggcaccagtcaccgaaatgctgttctttccattctgttt gccccaattctggtaaatgtgagttttattaccagcagcaacaacaagaagacaaagtca gaagcgatccaactggaagccttcctgggtttggagcctcaggtgttcctttacagcaac acacaccgactaagacaactagtgacctctcctgctccgacgcccgacgccaagccctca cgctactgtcccaaacttggaggcgccgaggagcctagtgaaggggaggaaaagtag >gi568815596f:230660002_230918701|GENSCAN_predicted_peptide_9|319_aa MSQQNTSGDCLFDGVNELMKTLQFAVHIPTFVLGLLLNLLAIHGFSTFLKNRWPDYAATS IYMINLAVFDLLLVLSLPFKMVLSQVQSPFPSLCTLVECLYFVSMYGSVFTICFISMDRF LAIRYPLLVSHLRSPRKIFGICCTIWVLVWTGSIPIYSFHGKVEKYMCFHNMSDDTWSAK VFFPLEVFGFLLPMGIMGFCCSRSIHILLGRRDHTQDWVQQKACIYSIAASLAVFVVSFL PVHLGFFLQFLVRNSFIVECRAKQSISFFLQLSMCFSNVNCCLDVFCYYFVIKEFRMNIR AHRPSRVQLVLQDTTISRG >gi568815596f:230660002_230918701|GENSCAN_predicted_CDS_9|960_bp atgagtcagcaaaacaccagtggggactgcctgtttgacggtgtcaacgagctgatgaaa accctacagtttgcagtccacatccccaccttcgtcctgggcctgctcctcaacctgctg gccatccatggcttcagcaccttccttaagaacaggtggcccgattatgctgccacctcc atctacatgatcaacctggcagtctttgacctgctgctggtgctctccctcccattcaag atggtcctgtcccaggtacagtcccccttcccgtccctgtgcaccctggtggagtgcctt tacttcgtcagcatgtacggaagcgtcttcaccatctgcttcatcagcatggaccggttc ttggccatccgttacccgctactggtgagccacctccggtcccccaggaagatctttggg atctgctgcaccatctgggtcctggtgtggaccggaagcatccctatctacagtttccat gggaaagtggaaaaatacatgtgcttccacaacatgtctgatgatacctggagcgccaag gtcttcttcccgctggaggtgtttggcttcctccttcccatgggcatcatgggcttctgc tgctccaggagcatccacatcctgctgggccgccgagaccacacccaggactgggtgcag cagaaagcctgcatctacagcatcgcagccagcctggctgtcttcgtggtctccttcctc ccagtccacctggggttcttcctgcagttcctggtgagaaacagctttatcgtagagtgc agagccaagcagagcatcagcttcttcttgcaattgtccatgtgtttctccaacgtcaac tgctgcctggatgttttctgctactactttgtcatcaaagaattccgcatgaacatcagg gcccaccggccttccagggtccagctggtcctgcaggacaccacgatctcccggggctaa