GENSCAN 1.0 Date run: 6-Nov-116 Time: 14:36:01

Sequence gi568815585f:48606818_48807855 : 201038 bp : 40.44% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   3945   4109  165  2  0  -18   95   114 0.309   0.74
 1.02 Term +   4296   4406  111  2  0  118   45    67 0.813   2.98
 1.03 PlyA +   6028   6033    6                               1.05

 2.00 Prom +  14808  14847   40                              -4.35
 2.01 Init +  16573  16850  278  0  2   39   77   133 0.100   3.80
 2.02 Intr +  17376  17480  105  0  0  103   59    48 0.060   1.81
 2.03 Intr +  26468  26597  130  2  1   70   79    57 0.136   2.78
 2.04 Intr +  34151  34291  141  1  0   44   70   104 0.318   3.83
 2.05 Intr +  39536  39581   46  1  1  110   87    56 0.945   4.76
 2.06 Intr +  40062  40220  159  1  0  106   82    53 0.890   5.54
 2.07 Term +  46114  46277  164  2  2   65   43   141 0.882   4.42
 2.08 PlyA +  49382  49387    6                               1.05

 3.04 PlyA -  50142  50137    6                               1.05
 3.03 Term -  51787  51623  165  1  0  100   49    95 0.655   3.73
 3.02 Intr -  67741  67020  722  2  2   -5   77   395 0.019  19.32
 3.01 Init -  69281  69227   55  2  1  102   37    71 0.236   4.90
 3.00 Prom -  81925  81886   40                              -4.85

 4.00 Prom +  82220  82259   40                              -7.65
 4.01 Sngl +  91125  91763  639  2  0   54   43   283 0.770  16.33
 4.02 PlyA +  91991  91996    6                               1.05

 5.00 Prom +  92471  92510   40                              -3.35
 5.01 Sngl +  93107  93934  828  1  0   70   43   187 0.547   7.88
 5.02 PlyA +  94103  94108    6                               1.05

 6.00 Prom +  96046  96085   40                              -4.25
 6.01 Sngl + 100001 101041 1041  1  0   99   49   567 0.940  50.77
 6.02 PlyA + 101885 101890    6                               1.05

 7.00 Prom + 110011 110050   40                              -5.45
 7.01 Init + 116096 116123   28  1  1   83   95    12 0.254   0.66
 7.02 Intr + 121236 121363  128  1  2  103   80    61 0.373   6.28
 7.03 Term + 122247 122345   99  2  0   97   36    52 0.284  -1.95
 7.04 PlyA + 122568 122573    6                               1.05

 8.09 PlyA - 122909 122904    6                               1.05
 8.08 Term - 124408 124228  181  0  1   94   49   120 0.214   4.80
 8.07 Intr - 125041 124942  100  2  1   40  100    26 0.268  -2.75
 8.06 Intr - 125851 125771   81  2  0   75   94    72 0.603   5.19
 8.05 Intr - 126822 126787   36  1  0  122   61    59 0.164   3.92
 8.04 Intr - 127326 127162  165  1  0   62   42   137 0.660   5.51
 8.03 Intr - 131808 131453  356  2  2   44   35   177 0.200   2.01
 8.02 Intr - 132395 132330   66  1  0   91   80    53 0.203   1.90
 8.01 Init - 136871 136810   62  2  2   78   61    46 0.258   1.67
 8.00 Prom - 138326 138287   40                              -3.05

 9.00 Prom + 139979 140018   40                              -5.35
 9.01 Sngl + 144374 144820  447  1  0   71   54   158 0.276   6.68
 9.02 PlyA + 145120 145125    6                               1.05

10.00 Prom + 147407 147446   40                              -6.35
10.01 Init + 152947 153007   61  0  1   83   70    59 0.178   5.06
10.02 Term + 164801 165012  212  0  2    5   46   256 0.369   9.57
10.03 PlyA + 165031 165036    6                               1.05

11.00 Prom + 168862 168901   40                              -4.55
11.01 Init + 171340 171392   53  0  2   74   52    87 0.833   4.38
11.02 Intr + 176292 176471  180  0  0  113   95   166 0.977  17.86
11.03 Term + 179710 179824  115  1  1   78   55    58 0.501  -1.44
11.04 PlyA + 180703 180708    6                               1.05

12.03 PlyA - 181936 181931    6                               1.05
12.02 Term - 195304 195144  161  2  2   20   48   163 0.652   2.62
12.01 Init - 197276 197258   19  2  1   95   78    45 0.835   4.43

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815585f:48606818_48807855|GENSCAN_predicted_peptide_1|91_aa
WGITGGGGHWNCAGSDLKPAQHWISSKAHCNYNLATSMFTQGPGTLRSAGGKASKRQRSL
TLWPPTPQANYEYFQATANNPLMPKGSSVSL

>gi568815585f:48606818_48807855|GENSCAN_predicted_CDS_1|276_bp
tggggtatcactgggggtggtggccactggaactgtgctgggtcagacctgaagccagca
cagcactggatctcatccaaggcccactgtaattacaacctggctacatctatgttcact
caaggcccagggactctacgatcagcaggtggcaaggccagcaagaggcagagaagcctc
accttgtggccaccaactccacaggcaaactatgagtacttccaagctactgctaataat
cccttaatgcccaagggctcttcagtcagtttgtga

>gi568815585f:48606818_48807855|GENSCAN_predicted_peptide_2|340_aa
MYGYPWHIDSDSGIHFAGRDPQDWAREHDIAGHFHLPHKPQAVGSIERKQWPIKSTTSNF
TGESCLAQVDECVICSLYLFKFSQREDMCPIDSCYVLSPLLPAVTYEANLFLQWAKTMLV
AHRKTPVGDCEFRVRCANKPGQLRYVPVAVWFPLFRGPWRCSSEGNQCPVVWWREGGRQL
HLIGGRVLPGESSGGREEADCRKGAKGPRALSKGREKSPCEEGVCLPFAFHHDCGKGGKT
PAPLQLPPALLDVRACACTEPRLEGSQVMMLLLPCMCSAQCSADLPAFYAGNEDLNSDGE
MSNGSIWNIFYGQNKWNLLAQKWVVEERGKLKVTSRFMDG

>gi568815585f:48606818_48807855|GENSCAN_predicted_CDS_2|1023_bp
atgtatggatatccatggcacattgacagtgactcagggatccattttgctggacgtgac
cctcaggattgggccagggagcatgatatagcagggcatttccacctcccacataaaccc
caagcagtagggtcaattgagagaaaacaatggcctattaaaagcacgacttcaaacttt
actggggaatcctgtcttgcacaggtagatgaatgtgttatctgcagcctttatctcttt
aaattcagccagagggaggacatgtgccccatagacagttgttatgttctgtcaccatta
ttaccagctgtcacctatgaagcaaaccttttcctacaatgggccaagactatgctggtg
gctcacagaaaaacacccgttggggactgtgaatttagagtgaggtgcgcaaacaagcca
gggcagctcagatatgttccagtggcagtatggttccctttgttcaggggcccatggcga
tgtagcagtgagggaaaccagtgtccagtggtgtggtggcgtgaaggaggcagacagttg
catttaatcgggggtagagttttaccaggtgaatctagtggaggaagagaagaggcagat
tgtaggaaaggggctaaaggaccacgggctctaagcaagggaagagagaagtcaccttgt
gaagaaggtgtctgcttacccttcgccttccaccatgattgtgggaaaggaggaaagaca
ccagccccactgcagctaccacccgcactgctagatgtccgagcatgtgcctgcactgag
ccacgcctagagggcagccaggtgatgatgctgctcctgccttgcatgtgctctgcccag
tgctcagcagacctccctgctttctatgcaggaaacgaggatttgaacagtgatggagag
atgagtaatggttcgatttggaacatattttatgggcagaacaaatggaacttgttggct
cagaagtgggtggtggaagaaagagggaagttaaaagtgacttctaggtttatggatgga
tga

>gi568815585f:48606818_48807855|GENSCAN_predicted_peptide_3|313_aa
MAEQEQLRFAAPSKTNAETQCKKSKNIDNRLKKLLTRITSLEQNIKDLMELKNTARELYE
AYTSINSRIDQMEERISEIEDLLNEIKHEDKIKEKIMKRNKQSLKICDYMKRPNLHLIGV
PESDGENGSKLGNTLPDIIQENFPTLARQANIQIQEIQRTPLRYSSIRATPRHIIIRFTR
AEVKEKMLRTVRQKGWVTHKGKPIRLTADLSAETLQARKEWGPIFNILKEKNFQPGISRP
AKISFTGEGEIKSFTDKQMTLGFPIQHLLQGQQLSPVYPYPPDCLPMEGKHVSTAVPQAL
AQREAHECAQINK

>gi568815585f:48606818_48807855|GENSCAN_predicted_CDS_3|942_bp
atggctgaacaggaacagctccggtttgcagctcccagtaagaccaatgcagaaacccaa
tgcaagaaatctaagaacattgataacaggttaaagaaactgctaactagaataaccagt
ttagagcagaacataaaagacctgatggagctgaagaacacagcacgagaactttatgaa
gcatatacaagtatcaatagcagaatcgatcaaatggaagaaaggatatcagagattgaa
gatctacttaatgaaataaagcatgaagacaagattaaagaaaaaataatgaaaaggaac
aaacaaagcctcaaaatatgtgactatatgaaaagaccaaacctacatctgattggtgta
cctgaaagtgacggggagaatggatctaagttgggaaacacacttccagatattatccaa
gagaacttccctaccctagcaagacaggccaacattcaaattcaggaaatacagagaaca
ccactaagatactcctcaataagagcaaccccaagacacataatcatcagattcaccagg
gctgaagtgaaggaaaaaatgttaaggacagtcagacagaaaggttgggttacccacaaa
gggaagcccatcagactaaccgcagatctctccgcagaaactctacaagccagaaaagag
tgggggccaatattcaacattcttaaagaaaagaattttcaacccggaatttcacgtcct
gctaaaataagcttcacaggtgaaggagaaataaaatcgtttacagacaagcaaatgacc
cttgggtttcccatccagcatttactccaggggcagcagttgtctcctgtctatccctac
ccaccagactgtttgcccatggaaggcaagcacgtgtctactgctgtaccccaggcccta
gcacagcgtgaagcacatgaatgtgctcagatcaataaatga

>gi568815585f:48606818_48807855|GENSCAN_predicted_peptide_4|212_aa
MKQEEKHREKKVKRNEKSLQEIWEDYVKRPKLRLIGVPASDGENGTKLENTLQDIIQENF
PNLARQANIEIQEIQRTPLRYSSRRATPRHIIVRFTKVEMKEKMLKAPREKGRVTHKGKP
IRLTADPLAETLQARREWGPIFNILKEKNFQPRISYPAKLSFISEGEIKSFTDKQMLRDF
VTTKPALQELLKEALNMERNNQFQPLQKHAKL

>gi568815585f:48606818_48807855|GENSCAN_predicted_CDS_4|639_bp
atgaagcaagaagagaagcatagagagaaaaaagtaaaaagaaatgaaaaaagcctccaa
gaaatatgggaggactatgtgaaaagaccaaaactacgtctgattggtgtacctgcaagt
gacggggagaatggaaccaagttggaaaacactcttcaggatattatccaggagaacttc
cccaacctagcgaggcaggccaacattgaaattcaggaaatacagagaacaccactaaga
tactcctcgagaagagcaactccaagacacataattgtcagattcaccaaagttgaaatg
aaggaaaaaatgttaaaggcacccagagagaaaggtagggttacccacaaagggaagccc
atcagactaacagcagatcccttggcagaaactctacaagccagaagagagtgggggcca
attttcaacattcttaaagaaaagaattttcaacccagaatttcatatccagccaaacta
agcttcataagtgaaggagaaataaaatcctttacagacaagcaaatgctgagagatttt
gtcaccaccaagcctgccttacaagagctcctgaaggaagcactaaacatggaaaggaac
aaccagttccagccactgcaaaaacatgccaaattgtaa

>gi568815585f:48606818_48807855|GENSCAN_predicted_peptide_5|275_aa
MDKFLDTYTLPRLNQKEVESLNRPITGSEIEGIINILPTKKSPGPDGFTAEFYQRYKEEL
VPFLLKLFQSIEKEGILPNSFYEANIILIPKPGRDTTTKENFRLMSFMNIDAKILNKILA
NQIQQHIKKLIHHDQVGFIPGMQGWFNICKSINVIHHVNRTKDKNHMIISIDAEKAFNKI
QQPFMLKSVNKLGIWTYLKIIRAIYDKPTANIILNGQKLEAFPLKTGTRQGCPLSPLLFN
IVLDVLARAIRQEKEIKGIKLGKEEVKLSLFADDI

>gi568815585f:48606818_48807855|GENSCAN_predicted_CDS_5|828_bp
atggataaattcctggacacatataccctcccaagactaaatcagaaagaagttgaatcc
ctgaatagaccaataacaggttctgaaattgagggaataattaatatcctaccaaccaaa
aaaagtccaggaccagatggattcacagccgaattctaccagaggtacaaagaggagctg
gtaccattccttctgaaactattccaatcaatagaaaaagagggaatcctccctaattca
ttttatgaggccaacatcatcctgataccaaagcctggcagagacacaacaacaaaagag
aattttagactgatgtccttcatgaacattgatgcaaaaatcctaaataaaatactggca
aaccaaatccagcagcacatcaaaaagcttatccaccacgatcaagttggcttcatccct
gggatgcaaggctggttcaacatatgcaaatcaataaacgtaatccatcatgtaaacaga
accaaagacaaaaaccacatgattatctccatagatgcagaaaaggccttcaacaaaatt
caacaacccttcatgctaaaaagtgtcaataaactaggtatttggacgtatctcaaaata
ataagagctatttatgacaaacccacagccaatatcatactgaatgggcaaaaattggaa
gcattccctttgaaaactggcacaagacagggatgccctctctcaccactcctattcaac
atagtgttggacgttctggccagggcaatcaggcaagagaaagaaataaagggtattaaa
ttaggaaaagaggaagtcaaattgtcactgtttgcagatgacatttag

>gi568815585f:48606818_48807855|GENSCAN_predicted_peptide_6|346_aa
MERKFMSLQPSISVSEMEPNGTFSNNNSRNCTIENFKREFFPIVYLIIFFWGVLGNGLSI
YVFLQPYKKSTSVNVFMLNLAISDLLFISTLPFRADYYLRGSNWIFGDLACRIMSYSLYV
NMYSSIYFLTVLSVVRFLAMVHPFRLLHVTSIRSAWILCGIIWILIMASSIMLLDSGSEQ
NGSVTSCLELNLYKIAKLQTMNYIALVVGCLLPFFTLSICYLLIIRVLLKVEVPESGLRV
SHRKALTTIIITLIIFFLCFLPYHTLRTVHLTTWKVGLCKDRLHKALVITLALAAANACF
NPLLYYFAGENFKDRLKSALRKGHPQKAKTKCVFPVSVWLRKETRV

>gi568815585f:48606818_48807855|GENSCAN_predicted_CDS_6|1041_bp
atggagagaaaatttatgtccttgcaaccatccatctccgtatcagaaatggaaccaaat
ggcaccttcagcaataacaacagcaggaactgcacaattgaaaacttcaagagagaattt
ttcccaattgtatatctgataatatttttctggggagtcttgggaaatgggttgtccata
tatgttttcctgcagccttataagaagtccacatctgtgaacgttttcatgctaaatctg
gccatttcagatctcctgttcataagcacgcttcccttcagggctgactattatcttaga
ggctccaattggatatttggagacctggcctgcaggattatgtcttattccttgtatgtc
aacatgtacagcagtatttatttcctgaccgtgctgagtgttgtgcgtttcctggcaatg
gttcacccctttcggcttctgcatgtcaccagcatcaggagtgcctggatcctctgtggg
atcatatggatccttatcatggcttcctcaataatgctcctggacagtggctctgagcag
aacggcagtgtcacatcatgcttagagctgaatctctataaaattgctaagctgcagacc
atgaactatattgccttggtggtgggctgcctgctgccatttttcacactcagcatctgt
tatctgctgatcattcgggttctgttaaaagtggaggtcccagaatcggggctgcgggtt
tctcacaggaaggcactgaccaccatcatcatcaccttgatcatcttcttcttgtgtttc
ctgccctatcacacactgaggaccgtccacttgacgacatggaaagtgggtttatgcaaa
gacagactgcataaagctttggttatcacactggccttggcagcagccaatgcctgcttc
aatcctctgctctattactttgctggggagaattttaaggacagactaaagtctgcactc
agaaaaggccatccacagaaggcaaagacaaagtgtgttttccctgttagtgtgtggttg
agaaaggaaacaagagtataa

>gi568815585f:48606818_48807855|GENSCAN_predicted_peptide_7|84_aa
MATIDSPSTYEKLRHKQFKASWIPNQPLPSTAADQTTPLRSQRESPLQFDAEQCWLPAST
RNQQACSHPRDLAKAITFARNAFL

>gi568815585f:48606818_48807855|GENSCAN_predicted_CDS_7|255_bp
atggctaccatcgattccccctctacctatgaaaagctcaggcataaacaatttaaggca
tcttggattccaaaccaacccctacccagtactgctgcagaccagactacccctcttaga
agtcagagggaatcccctctccaatttgatgctgagcaatgctggcttcctgcctctact
cgaaaccagcaagcatgctcccatcccagggacttggcaaaggctattacctttgcccgg
aatgctttcctttaa

>gi568815585f:48606818_48807855|GENSCAN_predicted_peptide_8|348_aa
MTSATGQLLGRFQEAYNHGRRFSHFYQVALSSSSVPELKNMCGQLLHEEPWKRLQEMTGA
SLLQQKAFRTAGQHGKAACCVRAPAVITVFLGAMGLRKGKQRSLDSLEENTKVYLLFPGL
AVHTHITSFNPCFSRKIAESLRGSGSCQGHTAPKEQGSSDSANQTGSSHESPHSPSDHAK
APNNPHRAQPTSAAAALKAPASLFWVCSHIPANQECYPSQTRLYPSVTSHSRHGKLETNT
VHSCAKCFRRPITLQEVRLYSEPPFCFPYAGNLGSYRSALKAKENEQKSRCPSLMNSRAK
KLLPPVETCVSPTGLNCPVSAQHSSGHPGGSSLGLSSSTRWVVALHLC

>gi568815585f:48606818_48807855|GENSCAN_predicted_CDS_8|1047_bp
atgacatcggcaactggtcagcttctggggaggtttcaggaagcttacaatcatggcaga
aggttcagccatttttatcaagtcgctttgtcgtcttcatcagttcctgagctgaagaac
atgtgcgggcagcttctccatgaagagccttggaagcgcctgcaggaaatgacaggtgca
tctcttttgcagcaaaaggccttcagaacagcagggcagcacggcaaagctgcttgctgt
gtgcgtgccccggcagtcattactgtgttccttggagctatggggctgagaaaggggaag
cagaggagccttgactcgctggaagagaacacaaaagtgtaccttctttttcctggcctt
gcagtccacacacacattacctcatttaatccctgtttttcaaggaagatagcagagtcc
ctaagagggtcaggaagttgtcaaggtcacacagctcctaaagagcagggctcctcagac
tcagctaatcaaactggcagtagccatgagtcacctcacagcccaagtgaccatgctaag
gctcccaacaaccctcacagagctcagccaacgtctgctgctgctgccctgaaggcccca
gcttctctcttctgggtttgctctcacattccagccaaccaggaatgctacccttctcag
actcgcctctatcctagtgtcacttcccacagcagacatgggaaactggaaaccaacacc
gttcattcctgtgccaagtgctttaggcgcccaataaccctacaagaggtgcggctttac
tctgagcctcctttttgctttccatacgctggaaatctaggctcttacaggtctgctctg
aaggctaaagagaatgaacagaaaagcaggtgcccaagcctgatgaattcccgtgctaaa
aaactgctgcctcctgtggagacctgtgtgtctcccacagggctgaattgccctgtgtct
gcccagcacagctctgggcacccaggtggctcctccctgggcctcagcagcagcacaaga
tgggttgtagccctgcatctctgttaa

>gi568815585f:48606818_48807855|GENSCAN_predicted_peptide_9|148_aa
MADMPFKLSSDIASFAEPHRPHSEFHVLSSPSEDSVKASTEHFSYCCSPICQTSALDYEL
PEGRTHSCSPLAAQSSPVPGMKERSISCLPNEWILKYPVCSLVTQCNTLGHKISSCLKAG
AGSMHSQLESESEQLCSKARQESGTHYY

>gi568815585f:48606818_48807855|GENSCAN_predicted_CDS_9|447_bp
atggcagatatgccttttaaacttagctcagacattgcttcctttgcagaaccacaccgg
ccacactcagagttccatgtgctctcttcaccctcagaggactctgtaaaagccagcacg
gagcacttctcttactgttgttcacctatttgtcagacttcagcactagactatgagctc
cctgagggccggactcactcttgttcacctttagctgctcagtccagcccagtgcctgga
atgaaagagaggtccatttcatgtttaccaaatgaatggattctgaaataccctgtctgc
tctctggttacgcagtgcaacaccttaggccataaaattagttcctgcttaaaagcagga
gcaggctctatgcactcccagttagagtcagaatctgagcagctatgttccaaggctaga
caagagtctgggacacactactactga

>gi568815585f:48606818_48807855|GENSCAN_predicted_peptide_10|90_aa
MAEGKEEAGTSYKARAGGKKKRGDAAAKASEETHVMDYRALVHERDEAAYGELRAMVLDL
RAFYAELYHIINSNLEKIVNPKGEKKPSMY

>gi568815585f:48606818_48807855|GENSCAN_predicted_CDS_10|273_bp
atggcagaaggcaaagaggaggcaggcacttcttacaaggctagagcaggaggaaaaaag
aaacgtggggatgctgcggccaaggcctccgaggagactcatgtaatggattaccgggcc
ttggtgcatgagcgagatgaggcagcctatggggagctcagggccatggtgctggacctg
agggccttctatgctgagctttatcatatcatcaacagcaacctggagaaaattgtcaac
ccaaagggtgaaaagaagccatctatgtactga

>gi568815585f:48606818_48807855|GENSCAN_predicted_peptide_11|115_aa
MPQDWCGEDAVLQPLDVRNGSPSGPDYCESCCFSGSSYPLELPDSMLVLENVCKGSSDVT
CPLVFQRWVPAAADESDRFQGSSLSQTMPEFSYKKPGKTVTSTHVIIQKPFGVKF

>gi568815585f:48606818_48807855|GENSCAN_predicted_CDS_11|348_bp
atgcctcaggactggtgtggtgaagacgctgtgcttcagccgctggacgttaggaatggc
agtccctcagggccagactactgtgaatcctgctgcttttctggatctagttacccactg
gagctgccagactccatgctggtgctggagaatgtttgcaagggatctagtgatgtgacc
tgtcctctagtcttccagcggtgggtaccagcagcagctgatgagagtgatagatttcag
ggttccagtctttcccaaacaatgcctgaattctcatacaagaaacctggcaaaactgtg
acttcaactcatgttataattcaaaaaccctttggtgtcaaattttga

>gi568815585f:48606818_48807855|GENSCAN_predicted_peptide_12|59_aa
MEEEQVYKIIIHYSEDFGPKFDAILAVLDFFKSSNSNDLVAKGDIENQGKFEEEKQTQL

>gi568815585f:48606818_48807855|GENSCAN_predicted_CDS_12|180_bp
atggaggaggaacaagtttacaaaattattattcattactcagaggactttggaccaaag
tttgatgcaatattggctgtgttagatttcttcaaatccagtaacagcaatgacttagta
gccaaaggagacattgaaaaccaaggcaagtttgaagaggaaaagcaaacacaattgtaa