GENSCAN 1.0 Date run: 8-Nov-116 Time: 14:04:13 Sequence gi568815592r:96791107_96997801 : 206695 bp : 38.67% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.02 PlyA - 2431 2426 6 1.05 1.01 Sngl - 8625 7366 1260 0 0 91 42 580 0.987 49.31 1.00 Prom - 28216 28177 40 -4.55 2.00 Prom + 40498 40537 40 -3.25 2.01 Init + 41811 41904 94 2 1 64 -7 112 0.620 -0.11 2.02 Intr + 46271 46482 212 0 2 92 105 198 0.770 19.61 2.03 Intr + 51959 52164 206 1 2 85 52 59 0.003 -0.92 2.04 Intr + 68564 68786 223 2 1 49 85 144 0.027 7.41 2.05 Intr + 69133 69480 348 0 0 42 45 192 0.037 4.93 2.06 Intr + 72452 72539 88 1 1 43 87 26 0.026 -3.38 2.07 Term + 86192 86409 218 0 2 68 39 152 0.768 4.62 2.08 PlyA + 87302 87307 6 1.05 3.04 PlyA - 87581 87576 6 1.05 3.03 Term - 100285 99998 288 1 0 96 49 197 0.999 10.99 3.02 Intr - 105741 105638 104 1 2 93 99 165 0.984 17.07 3.01 Init - 106695 106254 442 0 1 66 4 331 0.924 18.77 3.00 Prom - 108092 108053 40 -4.65 4.07 PlyA - 109362 109357 6 1.05 4.06 Term - 121809 121696 114 0 0 118 49 44 0.547 1.09 4.05 Intr - 133958 133731 228 2 0 94 34 200 0.469 12.34 4.04 Intr - 137379 137261 119 1 2 61 72 57 0.442 0.66 4.03 Intr - 147440 147198 243 0 0 27 93 213 0.365 12.25 4.02 Intr - 172285 172250 36 2 0 80 79 49 0.257 0.52 4.01 Init - 179156 179000 157 2 1 62 63 112 0.608 6.22 4.00 Prom - 180016 179977 40 -6.65 5.00 Prom + 182102 182141 40 -4.45 5.01 Init + 184736 185071 336 1 0 53 114 250 0.358 21.42 5.02 Term + 185910 185987 78 2 0 72 34 66 0.707 -3.62 5.03 PlyA + 186745 186750 6 1.05 6.03 PlyA - 187050 187045 6 1.05 6.02 Term - 188921 188542 380 0 2 10 42 222 0.341 3.77 6.01 Init - 193020 192795 226 0 1 66 60 88 0.465 2.48 6.00 Prom - 193478 193439 40 -6.15 7.06 PlyA - 193643 193638 6 1.05 7.05 Term - 194820 194582 239 1 2 61 48 167 0.950 5.35 7.04 Intr - 195173 194905 269 1 2 28 71 223 0.479 10.85 7.03 Intr - 195599 195445 155 2 2 44 65 29 0.358 -5.85 7.02 Intr - 196326 195817 510 0 0 17 69 235 0.460 6.74 7.01 Init - 197931 197926 6 0 0 61 58 17 0.679 -4.07 7.00 Prom - 198109 198070 40 -5.35 8.03 PlyA - 198234 198229 6 1.05 8.02 Term - 200855 200557 299 0 2 72 48 213 0.333 10.14 8.01 Init - 201118 201067 52 1 1 99 55 100 0.803 9.17 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 115417 115461 45 0 0 74 110 49 0.850 6.43 S.002 Sngl + 129062 129304 243 1 0 95 48 157 0.864 7.23 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815592r:96791107_96997801|GENSCAN_predicted_peptide_1|419_aa MVFSAVLTAFHTGTSNTTFVVYENTYMNITLPPPFQHPDLSPLLRYSFETMAPTGLSSLT VNSTAVPTTPAAFKSLNLPLQITLSAIMIFILFVSFLGNLVVCLMVYQKAAMRSAINILL ASLAFADMLLAVLNMPFALVTILTTRWIFGKFFCRVSAMFFWLFVIEGVAILLIISIDRF LIIVQRQDKLNPYRAKVLIAVSWATSFCVAFPLAVGNPDLQIPSRAPQCVFGYTTNPGYQ AYVILISLISFFIPFLVILYSFMGILNTLRHNALRIHSYPEGICLSQASKLGLMSLQRPF QMSIDMGFKTRAFTTILILFAVFIVCWAPFTTYSLVATFSKHFYYQHNFFEISTWLLWLC YLKSALNPLIYYWRIKKFHDACLDMMPKSFKFLPQLPGHTKRRIRPSAVYVCGEHRTVV >gi568815592r:96791107_96997801|GENSCAN_predicted_CDS_1|1260_bp atggtcttctcggcagtgttgactgcgttccataccgggacatccaacacaacatttgtc gtgtatgaaaacacctacatgaatattacactccctccaccattccagcatcctgacctc agtccattgcttagatatagttttgaaaccatggctcccactggtttgagttccttgacc gtgaatagtacagctgtgcccacaacaccagcagcatttaagagcctaaacttgcctctt cagatcaccctttctgctataatgatattcattctgtttgtgtcttttcttgggaacttg gttgtttgcctcatggtttaccaaaaagctgccatgaggtctgcaattaacatcctcctt gccagcctagcttttgcagacatgttgcttgcagtgctgaacatgccctttgccctggta actattcttactacccgatggatttttgggaaattcttctgtagggtatctgctatgttt ttctggttatttgtgatagaaggagtagccatcctgctcatcattagcatagataggttc cttattatagtccagaggcaggataagctaaacccatatagagctaaggttctgattgca gtttcttgggcaacttccttttgtgtagcttttcctttagccgtaggaaaccccgacctg cagataccttcccgagctccccagtgtgtgtttgggtacacaaccaatccaggctaccag gcttatgtgattttgatttctctcatttctttcttcatacccttcctggtaatactgtac tcatttatgggcatactcaacacccttcggcacaatgccttgaggatccatagctaccct gaaggtatatgcctcagccaggccagcaaactgggtctcatgagtctgcagagacctttc cagatgagcattgacatgggctttaaaacacgtgccttcaccactattttgattctcttt gctgtcttcattgtctgctgggccccattcaccacttacagccttgtggcaacattcagt aagcacttttactatcagcacaacttttttgagattagcacctggctactgtggctctgc tacctcaagtctgcattgaatccgctgatctactactggaggattaagaaattccatgat gcttgcctggacatgatgcctaagtccttcaagtttttgccgcagctccctggtcacaca aagcgacggatacgtcctagtgctgtctatgtgtgtggggaacatcggacggtggtgtga >gi568815592r:96791107_96997801|GENSCAN_predicted_peptide_2|462_aa MEKPDIQLILETPIIQEMNNAKEGKTSEEVLKNNAARRRRKEDNEEEVVAAAAIQAAVAG AGAESEEHRGRAPPPREPPRGSRPQGNQRSLLKAPAPVLDREGGWCPVTSSGCQLPLCAA VAYSPSISGLPSFSGGWCPVTSSGCLHLGMSMVKSVVKSIGTKWLMLLFFLDLWNFELER DDLGYLVEEISKQQSIQEVTWVLLKAFSFKRETDHKSLENLQPDNAIEKKIPFSEEKFKP AAEICMMYGNAWMPWQKFAAGVVPSWRTSARSVKKGNVGLEPPHRITTGALPSGAVRRGP LSSIPQSGRSTVSFHHVPGKSADTQCQSMKAARREAVPCKATGVELPKTMGTHLLPQCDL NDLSKMPQNSSVLLMGSTPSIQGSNQWANRESTEAPEKLATQKMCGTVPAPRSNAECFCI QVLQKLHALQNPCLLTEFSSFSFPLSCGSPTSCAIIEHTEAP >gi568815592r:96791107_96997801|GENSCAN_predicted_CDS_2|1389_bp atggagaaacctgatattcaacttatactcgaaactcctatcattcaagaaatgaataat gcaaaagaaggaaaaacatctgaagaagtgctgaagaacaacgctgccaggcggaggagg aaggaggacaacgaagaggaggtggtggcggcggcggcgatacaggcggcggtggcggga gctggagctgaaagtgaggagcatcgcggacgagcgccgccgcctcgcgagccgccgagg ggttcccgcccccagggtaaccaacgctccctcttaaaggcgccggccccggttttagac cgggagggtggctggtgtccagtgactagctctggctgccagctgcctctctgcgctgca gttgcttactctccaagtatctctgggctcccatctttctcgggtggctggtgtccagtg actagctctggctgcctccatcttgggatgtccatggtgaagtctgtggttaagtccata gggactaaatggttgatgcttctgttctttttagatttgtggaactttgaacttgagaga gatgatttagggtatctggtggaagaaatttctaagcagcaaagcattcaagaggtgact tgggtgctattaaaggcattcagtttcaaaagggaaacagatcataaaagtttggaaaat ttgcagcctgacaatgcaatagaaaagaaaatcccattttctgaggagaaattcaagcca gctgcagaaatttgcatgatgtatggaaatgcctggatgccctggcagaaatttgctgca ggggtggtgccctcatggagaacctctgctaggtcagtgaagaagggaaatgtggggttg gagcccccacacagaatcactactggggcactgcctagtggggctgtgagaagagggcca ctgtcctccataccccagagtggtagatccactgtcagctttcaccatgtgcctggaaaa tctgcagatactcaatgccagtccatgaaagcagccagaagggaggctgtaccctgcaaa gccacaggggtggagctgcccaagaccatgggaacccaccttttgcctcagtgtgacctg aatgatctttccaaaatgccccagaattcttctgttttactaatgggatcaactccctca atccaaggtagcaatcaatgggccaacagagagagcacggaagctccagagaaactggca actcagaagatgtgtggcaccgtgcccgctccaaggagtaatgctgaatgcttttgcatt caggttttgcaaaaacttcatgcacttcagaacccatgcttactaactgaattttcaagc ttctcttttcccctttcttgtggaagcccaacttcctgtgccatcattgagcatactgag gctccctga >gi568815592r:96791107_96997801|GENSCAN_predicted_peptide_3|277_aa MGALVIRGIRNFNLENRAEREISKMKPSVAPRHPSTNSLLREQISREWCEGVSGPGGAES LCCPAVPSVGPEAEGSASACSAAAARVGGALEGGAARPVTLRLSSAPAVRSAAPPTSETQ RQQPVLPLTRPSPAAVTDLIQGIKIREVYPEVKGEIARKDEKLLSFLKDVYVDSKDPVSS LQVKAAETCQEPKEFRLPKDHHFDMINIKSIPKGKISIVEALTLLNNHKLFPETWTAEKI MQEYQLEQKDVNSLLKYFVTFEVEIFPPEDKKAIRSK >gi568815592r:96791107_96997801|GENSCAN_predicted_CDS_3|834_bp atgggagcactagtgattcgcggtatcaggaatttcaacctagagaaccgagcggaacgg gaaatcagcaagatgaagccctctgtcgctcccagacacccctctaccaacagcctcctg cgagagcagattagtcgtgagtggtgcgagggcgtttcggggcccgggggcgcggagtcc ctgtgctgcccggctgtccctagcgtgggtccggaggccgagggatcggcgtcagcctgc tcggccgcagccgctcgggttggcggagcgttggaagggggagccgccaggccggtgaca ttgagactgtcctccgcgcccgcggtgaggtcggcggccccgcctacttccgagacccag aggcagcagccagtgctcccgctaaccaggccctcgccggctgctgtcacggacttgata cagggaatcaaaatccgagaggtctatccagaagttaaaggagagattgctcgtaaagat gaaaagctgctgtcgtttctaaaagatgtgtatgttgattccaaagatcctgtgtcttcc ttgcaggtaaaagctgctgaaacatgtcaagagccgaaggaattcagattgccgaaagac catcattttgatatgataaatattaagagcattcccaaaggcaaaatttccattgtagaa gcattgacacttctcaataatcataagcttttcccagaaacctggactgctgagaaaata atgcaggaataccagttagaacagaaagatgtgaattctcttcttaaatattttgttact tttgaagtcgaaatcttccctcctgaagacaagaaagcaatacgatcaaaatga >gi568815592r:96791107_96997801|GENSCAN_predicted_peptide_4|298_aa MGSHRKGHSSCSTKSSRGDWARLEQTWTVAVVQLNDGVGPWSPRSTQMKFQKGRGGLLTL INGEGSLNTTGHQGGPEQSEAATAAQRMLMRYLQDTRSMPEGGAQKRRCPLWGPALPQYH LARDPRTLWQGTIGNLVELKSDPDAVSFPNLSSSIPSHPSAVASSAALHLPDHSSSTAFI TRSATVCRTGYKERDWLTRWGERSAGLGARDGSAGTGLSLMHNGSASPHDAEVNRTRSSF ALLVRGCSSEDERPSVFVCAKGLQGTLVRREYVQVVGRGADRQISSNYNAEELDHGSS >gi568815592r:96791107_96997801|GENSCAN_predicted_CDS_4|897_bp atgggaagtcacagaaaaggtcattcaagctgtagtaccaagagcagtaggggagactgg gccagactagagcagacctggactgtggctgtggttcagctgaatgatggagttgggcct tggagcccaaggagcactcagatgaagtttcagaaaggaagaggaggactcctaactctt atcaatggagaaggcagccttaataccactggtcatcagggaggaccagagcagtcagaa gcagcaacagcagcacagagaatgctgatgaggtatctgcaggacacgagaagcatgccg gaaggtggagcccagaagagaaggtgtccactgtggggcccagccctgccccagtaccac ctggccagggaccccaggaccctctggcagggaacaatagggaacctagtggaactgaag tctgatcctgatgctgtctccttcccaaatttgtcttcctccatcccctcccacccatcc gcagttgcatcctctgctgcactccatcttccagaccatagttcttccacagctttcatc actcgctcagctacagtctgcagaacaggttacaaggagagagactggcttacccgatgg ggtgagcggtctgctgggttaggtgccagagacggcagcgcaggcaccggcctctccttg atgcacaatggctctgcttcgccccacgatgctgaggttaacagaacccgcagtagcttc gctctgctggttcggggatgcagtagcgaagacgagcgtcctagcgtgtttgtgtgtgcg aaaggcttacagggtaccttggtgagaagggaatatgtgcaggtggtagggagaggagca gacagacaaataagcagcaattacaatgcagaggaactggatcatggaagtagttag >gi568815592r:96791107_96997801|GENSCAN_predicted_peptide_5|137_aa MVERKKMREQMVTLIECTGTRVASVDVQGMREQLAHRAGKRKNVDLIALLCSIQEMLTGQ RLCHSESHNDSVLAALNQQRSDGILCDITLIAEEQKFHAHKAVLAACSDYFRSFGVDFQQ QITCGNKIAVFLEDEST >gi568815592r:96791107_96997801|GENSCAN_predicted_CDS_5|414_bp atggtggagagaaagaaaatgagggaacagatggtcacactcatagagtgcactggaacc cgtgttgcctcagtcgatgttcaaggaatgagagagcagctggcccacagggctgggaaa aggaaaaatgttgacttgattgctctcctttgtagtattcaagaaatgctgacaggccag aggctctgccactccgaatctcacaatgacagtgtcctggcagcgctgaatcagcagagg agtgatggcatcctctgcgacatcaccctgattgctgaggaacagaaattccatgctcac aaggcagtcctagcagcatgcagtgactatttccggagctttggggttgatttccaacag caaattacatgtggtaataagattgctgtttttcttgaagatgaaagtacataa >gi568815592r:96791107_96997801|GENSCAN_predicted_peptide_6|201_aa MKAEIKMCFETNENKDTTYQDLWDAFKAVCRGKFIPLNAHKRKQERSKIDTLTSPLKELE KQEQTHSKASRRQEITKFQDTKSIVRISVAFLYTNNIQAESQIKIAISFTIVTKRIKYLG RQPTGEVKELYKENYKTLLIEISDDTNKWKNISCSWIGRINVVKMAILPKAMYTFNVIPI KLPMMFFTELEKTYSKIHMEP >gi568815592r:96791107_96997801|GENSCAN_predicted_CDS_6|606_bp atgaaggcagaaataaagatgtgctttgaaaccaatgagaacaaagacacaacatatcag gatctctgggacgcattcaaagcagtgtgtagagggaaatttataccactaaatgcccac aagagaaagcaggagagatctaaaattgacaccctaacatcaccattaaaagaactagag aagcaagagcaaacacattcaaaagctagcagaaggcaagaaataacaaagtttcaggat acaaaatcaattgtacgaatatcagtagcatttctatacaccaacaacatccaagctgag agccaaatcaagattgcaatttcattcacaatagtcacaaaaagaataaaatacctagga agacagccaaccggagaggtaaaagagctctacaaagagaattacaaaacacttctcatt gaaattagcgatgataccaacaagtggaaaaatatttcatgctcatggataggaagaatc aatgtcgtgaaaatggccatactgcctaaagcaatgtacacattcaatgttattcctatc aaactaccaatgatgtttttcacagaattagaaaaaacctattctaaaattcatatggaa ccataa >gi568815592r:96791107_96997801|GENSCAN_predicted_peptide_7|392_aa MLWFANENKDTTYQNLWDTFKAVCRGKFIPLNAHKRKQERSKIDTLTSQLKELEKQEQTH SKASRRQEISKIRAELKEIETQKTLQKINESRSWFFEKINKIDRPLARLIKKKREKNEID AIKNDKGDITTDPIEIQTTGGWSQDGRIGTAPVYSSQRERCRRQVISAFPTEQMAHQEII SRALLGGSYTHEASLIASTAVGDPTARQQPGWGRGARHCQGLSRGRLTPHTAGYSSETKI PEERSGSNICCSPISAVLQPPLVIPRQTGSGVDLQQTPKDLQLRVLTVRRKTNKQKRHPH QNPICTSPSSKTKGHSSSPAMEQSWTENDFDELRQEGFRQSNYSELKEEVRTHGKEVKNL EKKLDEWLTRITNSREVLKGPDGTENQGTRTT >gi568815592r:96791107_96997801|GENSCAN_predicted_CDS_7|1179_bp atgctgtggtttgccaacgagaacaaagacacaacataccagaatctctgggacacattc aaagcagtgtgtagagggaaatttataccactaaatgcccacaagagaaagcaggagaga tctaaaattgacaccctaacatcacaattaaaagaactagagaagcaagagcaaacacat tcaaaagctagcagaaggcaagaaataagtaagatcagagcagaactgaaggaaatagag actcaaaaaacccttcaaaaaatcaatgaatccaggagctggttttttgaaaagatcaac aaaattgatagaccgctagcaagactaataaagaagaaaagagagaagaatgaaatagat gcaataaaaaatgataaaggggatatcaccactgatcccatagaaatacaaactactggg gggtggagccaagatggccgcataggaacagctccagtctacagctcccagcgtgagcga tgcagaagacaggtgatttctgcatttccaactgagcaaatggcacaccaggagattata tcccgtgcattgcttggagggtcctacacccacgaagcctcgctcattgctagcacagca gtgggagatccaactgcaaggcagcagccaggctgggggaggggcgcccgccattgccaa ggcttgagtaggggcagactgacacctcacacggccggatactcctctgagacaaaaatt ccagaggaacgatccggcagcaacatctgctgttcaccaatatctgctgttctgcagcct ccactggtgatacccaggcaaacagggtctggagtggacctccaacaaactccaaaagac ctgcagctgagggtcctgactgttagaaggaaaactaacaaacagaaaagacatccacac caaaaccccatctgtacgtcaccatcatcaaagaccaaaggacacagctcctcaccagca atggaacaaagctggacggagaatgactttgacgagttgagacaagaaggcttcagacaa tcaaactactctgagctaaaggaggaagttcgtacccatggcaaagaagttaaaaacctt gaaaaaaaattagatgaatggctaactagaataaccaattctagagaagtccttaaagga cctgatggaactgaaaaccaaggcacgagaactacgtga >gi568815592r:96791107_96997801|GENSCAN_predicted_peptide_8|116_aa MDPLVPDTGSCISGNKRGWGGSTVIRDYHRPPAQRSHLTRKSGLFSMPSPAPATPHWAGP LDLRPWPLPARALRGSSSALSWKGSPGGNRQATTFAALKPSLLWLSGLGRNAVIRH >gi568815592r:96791107_96997801|GENSCAN_predicted_CDS_8|351_bp atggatcccctggtaccggacaccggcagctgcatctctggcaacaagagaggctgggga gggagcacagtgattagggactatcatagacccccagcacagcgcagccaccttacccgg aaaagcggcctgttttcaatgccgtcccccgcccctgctactcctcactgggcagggcct ctagatctgcgaccttggccactccctgctagggctctccgcggtagcagctctgcactt tcttggaagggatctcccggaggtaacaggcaagctactacttttgctgctctgaagccc tcgctcctgtggctctccggcttgggaaggaacgcagtgatcaggcactaa