GENSCAN 1.0	Date run: 3-Nov-116	Time: 14:42:18

Sequence gi568815592r:96698475_96899731 : 201257 bp : 38.23% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   6490   6564   75  0  0   84  108   100 0.798  12.74
 1.02 Intr +  29664  29766  103  2  1   92   95    38 0.221   3.73
 1.03 Term +  33946  34196  251  2  2   79   42   128 0.664   2.18
 1.04 PlyA +  34499  34504    6                               1.05

 2.00 Prom +  40146  40185   40                              -4.65
 2.01 Init +  47156  47228   73  1  1   62   52   107 0.362   5.78
 2.02 Intr +  52632  52732  101  1  2   48   97    93 0.331   5.11
 2.03 Term +  53645  53911  267  1  0  131   43    81 0.956   2.41
 2.04 PlyA +  54177  54182    6                               1.05

 3.05 PlyA -  54978  54973    6                               1.05
 3.04 Term -  63842  63401  442  1  1   91   39   282 0.729  17.34
 3.03 Intr -  66461  66397   65  2  2  101    6    73 0.179  -3.00
 3.02 Intr -  70136  69957  180  2  0   25   35   166 0.211   4.14
 3.01 Init -  72279  72274    6  0  0   71   84    17 0.370  -0.47
 3.00 Prom -  72327  72288   40                              -6.05

 4.00 Prom +  74112  74151   40                              -3.65
 4.01 Init +  76271  76357   87  1  0   60   89    66 0.172   4.60
 4.02 Intr +  86649  86857  209  2  2   91   72    91 0.525   4.75
 4.03 Intr +  87004  87095   92  1  2   71   37    41 0.290  -3.98
 4.04 Term +  89747  89928  182  0  2   14   41   204 0.402   5.09
 4.05 PlyA +  90090  90095    6                               1.05

 5.02 PlyA -  90754  90749    6                               1.05
 5.01 Sngl - 101257  99998 1260  1  0   91   42   580 0.985  49.31
 5.00 Prom - 120848 120809   40                              -4.55

 6.00 Prom + 133130 133169   40                              -3.25
 6.01 Init + 134443 134536   94  0  1   64   -7   112 0.620  -0.11
 6.02 Intr + 138903 139114  212  1  2   92  105   198 0.770  19.61
 6.03 Intr + 144591 144796  206  2  2   85   52    59 0.003  -0.92
 6.04 Intr + 161196 161418  223  0  1   49   85   144 0.027   7.41
 6.05 Intr + 161765 162112  348  1  0   42   45   192 0.037   4.93
 6.06 Intr + 165084 165171   88  2  1   43   87    26 0.026  -3.38
 6.07 Term + 178824 179041  218  1  2   68   39   152 0.768   4.62
 6.08 PlyA + 179934 179939    6                               1.05

 7.04 PlyA - 180213 180208    6                               1.05
 7.03 Term - 192917 192630  288  2  0   96   49   197 0.999  10.99
 7.02 Intr - 198373 198270  104  2  2   93   99   165 0.984  17.07
 7.01 Init - 199327 198886  442  1  1   66    4   331 0.923  18.77

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815592r:96698475_96899731|GENSCAN_predicted_peptide_1|142_aa
MHSNTPSSNGSGIHVIESEQALKAQAFPECLICGKPCPDARDTGMNRLGSYLYRAFNSNA
HPLPLRDNSLGILSCFLCIILKCTRSSEVVIKPNDLALLPLSLLLSHSHIVMDMIVLNAD
ISEIQEGFLELHLCNPESSKFI
>gi568815592r:96698475_96899731|GENSCAN_predicted_CDS_1|429_bp
atgcacagcaacaccccatcatcaaatggaagtggtatacatgtcatcgagtctgagcag
gccctaaaagcacaagcatttcctgagtgcttaatatgtggcaagccctgccctgatgct
agggatacaggaatgaacagactcggttcctatctttatagagcttttaatagcaatgca
catcccctgcctcttagagataattcccttggaattctcagttgcttcctctgcatcatt
ctgaagtgtactagatccagtgaagttgtcatcaagcccaacgacttagctctcctccct
ctgtctttgttactcagccattcccacatcgtgatggacatgatagtgttaaatgcagat
atctctgagattcaggaaggctttcttgaactgcacctttgcaaccctgaatcctccaaa
tttatctga
>gi568815592r:96698475_96899731|GENSCAN_predicted_peptide_2|146_aa
MSVLGSPLELKKYKSCGKYKAMDFESLPKKEIPCKKKNHQSSGEGRQKAEAQKKEEKKVT
TPPISHKRLNKPCSLGVTISTYFLIAAVPYRPPYYSLSFDKDFGIWLMVFLFSPSPAITM
RDSSAHLNPWSLRFQSPYVWLFSRNF
>gi568815592r:96698475_96899731|GENSCAN_predicted_CDS_2|441_bp
atgagcgtcctgggctccccactggaattgaagaaatacaagagctgtgggaaatataaa
gccatggacttcgagagcttgcctaagaaggagatcccatgcaaaaagaagaatcatcaa
tcatcaggagagggaaggcagaaggctgaagctcagaagaaggaagagaagaaggtgaca
actcctccaatttctcataaaagactaaacaaaccctgctctctgggggtcactatttct
acttacttcttaattgctgctgtcccctacagacctccttattactctctttcttttgac
aaagactttggcatctggctcatggtcttcctcttcagcccaagccctgccatcaccatg
agggactccagtgcccatcttaacccctggtctctgagatttcagtctccttatgtctgg
ttattttctaggaacttttga
>gi568815592r:96698475_96899731|GENSCAN_predicted_peptide_3|230_aa
MLSKLPSLPDTSREKWLIGATVMAATPLPGNSVALDSVQPAAAGCSPSRLHSSVLETQGP
GGGPVDLQRGGSNVNLVERIVERVTRTWELPNGEAKRAITQTVLQHAPCSPRCGRKEGEK
SCSPLGSPDLGAPGARAVTPSLGPCSSWHLQAFRCHRIPQCQLGKLLVVHLVQPQPFREP
APMPGPGADCPAAAAGISDGAAARPHTPSSTPCRSMPDSKTSLEAWDPGW
>gi568815592r:96698475_96899731|GENSCAN_predicted_CDS_3|693_bp
atgctgtccaaactgcccagtcttcctgataccagcagagaaaaatggctgattggagcc
acagtgatggcggccacccctctgcctgggaactcagtagccttagacagtgtccagcct
gctgctgctggctgcagcccaagcaggttgcacagctctgtgcttgagacccaaggccct
ggtgggggtcctgtggatttacagagaggaggatcaaatgtaaatctagtggagagaatt
gtggaaagggtgacaagaacttgggaactgccaaatggtgaggctaaaagagctatcaca
caaacagtgctgcaacatgccccttgctcaccacgttgtgggcgaaaagaaggagagaag
agctgcagccctttggggagcccagacctgggagctcctggagccagggctgtgactccc
tctttggggccctgcagttcctggcatctccaagctttcaggtgccaccgcattccccag
tgccagctggggaagctgcttgtggtgcacctggtccagccacagcctttcagagagccg
gcacccatgccgggacctggagctgactgccctgcggcagcagccggcatatctgacggc
gcagcggccagacctcacactccctcatccaccccttgccgctccatgcctgactcaaag
acttccttggaggcatgggatccaggctggtag
>gi568815592r:96698475_96899731|GENSCAN_predicted_peptide_4|189_aa
MSFAGTWMELEAIILSKLMEEQKTKHFSLPGYQDCQSFCLIHKFHSLGYRAHQCHKEVED
LDCVRGNLFGMGLKMEAGAVHGLTLMGYYQTVRETHSLRKVQPHNQLLSTEIFVKVMEAI
FKFGHKNLPVTCGDRFEIDLPFLAATLEAFFPGDTRLSDGFFRSEQQGTPGVLVTVPPLY
ESRSLLKSN
>gi568815592r:96698475_96899731|GENSCAN_predicted_CDS_4|570_bp
atgtcctttgcagggacatggatggagctggaagccatcattctcagcaaactaatggag
gaacagaaaaccaaacacttctcacttccaggatatcaagactgtcagtcattctgcctc
atccataaatttcacagcctgggatacagagctcatcaatgccataaggaagttgaggac
ttagactgtgtcagaggaaatctttttggcatgggtctgaagatggaagctggagctgta
cacgggctaactctcatgggctactatcaaacagtgagggaaacccacagtctcaggaag
gtgcaaccacataaccaacttctatccacagaaatatttgtgaaagtgatggaggccata
ttcaagtttggccataaaaatcttccagtcacttgcggagacagatttgagattgatctc
ccattcttggctgcaacactagaagccttcttccctggtgatactcgtctcagtgatggc
tttttccgtagtgagcagcagggaacccctggtgttttggtaacagtacctccactttat
gagtcccgtagcctcttaaaatccaattaa
>gi568815592r:96698475_96899731|GENSCAN_predicted_peptide_5|419_aa
MVFSAVLTAFHTGTSNTTFVVYENTYMNITLPPPFQHPDLSPLLRYSFETMAPTGLSSLT
VNSTAVPTTPAAFKSLNLPLQITLSAIMIFILFVSFLGNLVVCLMVYQKAAMRSAINILL
ASLAFADMLLAVLNMPFALVTILTTRWIFGKFFCRVSAMFFWLFVIEGVAILLIISIDRF
LIIVQRQDKLNPYRAKVLIAVSWATSFCVAFPLAVGNPDLQIPSRAPQCVFGYTTNPGYQ
AYVILISLISFFIPFLVILYSFMGILNTLRHNALRIHSYPEGICLSQASKLGLMSLQRPF
QMSIDMGFKTRAFTTILILFAVFIVCWAPFTTYSLVATFSKHFYYQHNFFEISTWLLWLC
YLKSALNPLIYYWRIKKFHDACLDMMPKSFKFLPQLPGHTKRRIRPSAVYVCGEHRTVV
>gi568815592r:96698475_96899731|GENSCAN_predicted_CDS_5|1260_bp
atggtcttctcggcagtgttgactgcgttccataccgggacatccaacacaacatttgtc
gtgtatgaaaacacctacatgaatattacactccctccaccattccagcatcctgacctc
agtccattgcttagatatagttttgaaaccatggctcccactggtttgagttccttgacc
gtgaatagtacagctgtgcccacaacaccagcagcatttaagagcctaaacttgcctctt
cagatcaccctttctgctataatgatattcattctgtttgtgtcttttcttgggaacttg
gttgtttgcctcatggtttaccaaaaagctgccatgaggtctgcaattaacatcctcctt
gccagcctagcttttgcagacatgttgcttgcagtgctgaacatgccctttgccctggta
actattcttactacccgatggatttttgggaaattcttctgtagggtatctgctatgttt
ttctggttatttgtgatagaaggagtagccatcctgctcatcattagcatagataggttc
cttattatagtccagaggcaggataagctaaacccatatagagctaaggttctgattgca
gtttcttgggcaacttccttttgtgtagcttttcctttagccgtaggaaaccccgacctg
cagataccttcccgagctccccagtgtgtgtttgggtacacaaccaatccaggctaccag
gcttatgtgattttgatttctctcatttctttcttcatacccttcctggtaatactgtac
tcatttatgggcatactcaacacccttcggcacaatgccttgaggatccatagctaccct
gaaggtatatgcctcagccaggccagcaaactgggtctcatgagtctgcagagacctttc
cagatgagcattgacatgggctttaaaacacgtgccttcaccactattttgattctcttt
gctgtcttcattgtctgctgggccccattcaccacttacagccttgtggcaacattcagt
aagcacttttactatcagcacaacttttttgagattagcacctggctactgtggctctgc
tacctcaagtctgcattgaatccgctgatctactactggaggattaagaaattccatgat
gcttgcctggacatgatgcctaagtccttcaagtttttgccgcagctccctggtcacaca
aagcgacggatacgtcctagtgctgtctatgtgtgtggggaacatcggacggtggtgtga
>gi568815592r:96698475_96899731|GENSCAN_predicted_peptide_6|462_aa
MEKPDIQLILETPIIQEMNNAKEGKTSEEVLKNNAARRRRKEDNEEEVVAAAAIQAAVAG
AGAESEEHRGRAPPPREPPRGSRPQGNQRSLLKAPAPVLDREGGWCPVTSSGCQLPLCAA
VAYSPSISGLPSFSGGWCPVTSSGCLHLGMSMVKSVVKSIGTKWLMLLFFLDLWNFELER
DDLGYLVEEISKQQSIQEVTWVLLKAFSFKRETDHKSLENLQPDNAIEKKIPFSEEKFKP
AAEICMMYGNAWMPWQKFAAGVVPSWRTSARSVKKGNVGLEPPHRITTGALPSGAVRRGP
LSSIPQSGRSTVSFHHVPGKSADTQCQSMKAARREAVPCKATGVELPKTMGTHLLPQCDL
NDLSKMPQNSSVLLMGSTPSIQGSNQWANRESTEAPEKLATQKMCGTVPAPRSNAECFCI
QVLQKLHALQNPCLLTEFSSFSFPLSCGSPTSCAIIEHTEAP
>gi568815592r:96698475_96899731|GENSCAN_predicted_CDS_6|1389_bp
atggagaaacctgatattcaacttatactcgaaactcctatcattcaagaaatgaataat
gcaaaagaaggaaaaacatctgaagaagtgctgaagaacaacgctgccaggcggaggagg
aaggaggacaacgaagaggaggtggtggcggcggcggcgatacaggcggcggtggcggga
gctggagctgaaagtgaggagcatcgcggacgagcgccgccgcctcgcgagccgccgagg
ggttcccgcccccagggtaaccaacgctccctcttaaaggcgccggccccggttttagac
cgggagggtggctggtgtccagtgactagctctggctgccagctgcctctctgcgctgca
gttgcttactctccaagtatctctgggctcccatctttctcgggtggctggtgtccagtg
actagctctggctgcctccatcttgggatgtccatggtgaagtctgtggttaagtccata
gggactaaatggttgatgcttctgttctttttagatttgtggaactttgaacttgagaga
gatgatttagggtatctggtggaagaaatttctaagcagcaaagcattcaagaggtgact
tgggtgctattaaaggcattcagtttcaaaagggaaacagatcataaaagtttggaaaat
ttgcagcctgacaatgcaatagaaaagaaaatcccattttctgaggagaaattcaagcca
gctgcagaaatttgcatgatgtatggaaatgcctggatgccctggcagaaatttgctgca
ggggtggtgccctcatggagaacctctgctaggtcagtgaagaagggaaatgtggggttg
gagcccccacacagaatcactactggggcactgcctagtggggctgtgagaagagggcca
ctgtcctccataccccagagtggtagatccactgtcagctttcaccatgtgcctggaaaa
tctgcagatactcaatgccagtccatgaaagcagccagaagggaggctgtaccctgcaaa
gccacaggggtggagctgcccaagaccatgggaacccaccttttgcctcagtgtgacctg
aatgatctttccaaaatgccccagaattcttctgttttactaatgggatcaactccctca
atccaaggtagcaatcaatgggccaacagagagagcacggaagctccagagaaactggca
actcagaagatgtgtggcaccgtgcccgctccaaggagtaatgctgaatgcttttgcatt
caggttttgcaaaaacttcatgcacttcagaacccatgcttactaactgaattttcaagc
ttctcttttcccctttcttgtggaagcccaacttcctgtgccatcattgagcatactgag
gctccctga
>gi568815592r:96698475_96899731|GENSCAN_predicted_peptide_7|277_aa
MGALVIRGIRNFNLENRAEREISKMKPSVAPRHPSTNSLLREQISREWCEGVSGPGGAES
LCCPAVPSVGPEAEGSASACSAAAARVGGALEGGAARPVTLRLSSAPAVRSAAPPTSETQ
RQQPVLPLTRPSPAAVTDLIQGIKIREVYPEVKGEIARKDEKLLSFLKDVYVDSKDPVSS
LQVKAAETCQEPKEFRLPKDHHFDMINIKSIPKGKISIVEALTLLNNHKLFPETWTAEKI
MQEYQLEQKDVNSLLKYFVTFEVEIFPPEDKKAIRSK
>gi568815592r:96698475_96899731|GENSCAN_predicted_CDS_7|834_bp
atgggagcactagtgattcgcggtatcaggaatttcaacctagagaaccgagcggaacgg
gaaatcagcaagatgaagccctctgtcgctcccagacacccctctaccaacagcctcctg
cgagagcagattagtcgtgagtggtgcgagggcgtttcggggcccgggggcgcggagtcc
ctgtgctgcccggctgtccctagcgtgggtccggaggccgagggatcggcgtcagcctgc
tcggccgcagccgctcgggttggcggagcgttggaagggggagccgccaggccggtgaca
ttgagactgtcctccgcgcccgcggtgaggtcggcggccccgcctacttccgagacccag
aggcagcagccagtgctcccgctaaccaggccctcgccggctgctgtcacggacttgata
cagggaatcaaaatccgagaggtctatccagaagttaaaggagagattgctcgtaaagat
gaaaagctgctgtcgtttctaaaagatgtgtatgttgattccaaagatcctgtgtcttcc
ttgcaggtaaaagctgctgaaacatgtcaagagccgaaggaattcagattgccgaaagac
catcattttgatatgataaatattaagagcattcccaaaggcaaaatttccattgtagaa
gcattgacacttctcaataatcataagcttttcccagaaacctggactgctgagaaaata
atgcaggaataccagttagaacagaaagatgtgaattctcttcttaaatattttgttact
tttgaagtcgaaatcttccctcctgaagacaagaaagcaatacgatcaaaatga