GENSCAN 1.0 Date run: 5-Nov-116 Time: 18:21:15

Sequence gi568815580f:31992777_32229184 : 236408 bp : 41.26% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   8255   8312   58  1  1   42  114    23 0.912   1.82
 1.02 Intr +   9147   9322  176  1  2  135   50    27 0.884   2.34
 1.03 Intr +   9516   9623  108  2  0   12   80   120 0.869   3.16
 1.04 Intr +  10297  10378   82  0  1   61  110   127 0.960  10.49
 1.05 Intr +  11895  12061  167  1  2   51   65    82 0.380   0.96
 1.06 Term +  13492  13551   60  0  0  102   45    37 0.337  -2.37
 1.07 PlyA +  15240  15245    6                               1.05

 2.00 Prom +  18435  18474   40                              -6.65
 2.01 Init +  26088  26251  164  2  2   85   86   260 0.996  22.75
 2.02 Term +  26490  27084  595  0  1   57   37   190 0.765   3.81
 2.03 PlyA +  28273  28278    6                               1.05

 3.05 PlyA -  29375  29370    6                               1.05
 3.04 Term -  34378  34305   74  2  2   96   29    30 0.282  -5.01
 3.03 Intr -  34896  34847   50  2  2   66   92    94 0.435   5.01
 3.02 Intr -  41595  41522   74  0  2   83   67   115 0.482   6.19
 3.01 Init -  45124  44930  195  1  0   42   52   127 0.093   2.64
 3.00 Prom -  48596  48557   40                              -2.65

 4.04 PlyA -  48655  48650    6                               1.05
 4.03 Term -  56078  55789  290  0  2   77   38   102 0.927  -1.35
 4.02 Intr -  56440  56324  117  2  0   71   86   167 0.512  14.32
 4.01 Init -  62683  62674   10  1  1   73   74     3 0.288  -1.85
 4.00 Prom -  66642  66603   40                              -3.05

 5.00 Prom +  69845  69884   40                              -4.15
 5.01 Init +  70159  70212   54  0  0   56   90    48 0.757   1.13
 5.02 Intr +  73126  73233  108  0  0   93   92   107 0.997  11.16
 5.03 Term +  75522  75608   87  2  0   85   28   101 0.972   0.48
 5.04 PlyA +  76720  76725    6                               1.05

 6.02 PlyA -  76976  76971    6                               1.05
 6.01 Sngl -  83496  83284  213  0  0   90   40   231 0.996  13.43
 6.00 Prom -  86257  86218   40                              -6.95

 7.03 PlyA -  89646  89641    6                               1.05
 7.02 Term -  99402  99129  274  2  1   73   45   163 0.172   4.46
 7.01 Init - 100002  99953   50  0  2  101   51    64 0.312   4.50
 7.00 Prom - 115061 115022   40                              -6.05

 8.00 Prom + 115391 115430   40                              -5.75
 8.01 Init + 119006 119143  138  1  0   61   80    58 0.774   2.49
 8.02 Intr + 120969 121084  116  2  2   56  115    58 0.769   3.63
 8.03 Intr + 130742 130798   57  2  0   71  121    44 0.606   3.08
 8.04 Intr + 147470 147672  203  2  2   20    9   262 0.362   9.51
 8.05 Term + 148007 148182  176  0  2  -17   44   202 0.587   2.24
 8.06 PlyA + 149438 149443    6                               1.05

 9.03 PlyA - 150049 150044    6                               1.05
 9.02 Term - 160445 160152  294  2  0   85   37   125 0.197   1.42
 9.01 Init - 185068 184907  162  1  0  107   99    38 0.293   6.58
 9.00 Prom - 185690 185651   40                              -6.15

10.05 PlyA - 186214 186209    6                               1.05
10.04 Term - 191224 190840  385  0  1   68   55   185 0.937   6.58
10.03 Intr - 191530 191403  128  1  2   82   60    66 0.904   1.76
10.02 Intr - 203642 203403  240  2  0   -9   28   309 0.367  12.02
10.01 Init - 204085 203654  432  1  0   75   47   459 0.877  36.16
10.00 Prom - 207241 207202   40                              -6.75

11.00 Prom + 207425 207464   40                              -4.25
11.01 Init + 208880 208886    7  1  1   67   91     0 0.301  -0.60
11.02 Intr + 210117 210234  118  1  1  113  106    78 0.312  10.70
11.03 Intr + 211406 211584  179  2  2   34   88   113 0.919   4.54
11.04 Intr + 214476 214694  219  1  0   72   34   136 0.841   3.95
11.05 Intr + 215343 215495  153  1  0   80   87    75 0.939   5.72
11.06 Intr + 217725 217940  216  1  0  111   82   187 0.998  17.95
11.07 Intr + 220340 220783  444  0  0   52   38   244 0.803   8.25
11.08 Intr + 222306 222485  180  1  0   73   95    73 0.975   5.42
11.09 Intr + 224215 224341  127  2  1   78  110    97 0.999   9.72
11.10 Intr + 224985 225189  205  0  1   76  101   124 0.832  10.78

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815580f:31992777_32229184|GENSCAN_predicted_peptide_1|216_aa
MTSGPQTNQPKEHLTNFKSGCCQVPVLPQPLLPNPIILLSPPLPTPRPGLQFRFATSPPP
PAQQFPLGEVAGAEGIVRPNLTLTTAGFMSQTSRKALEQFPEKIPNGTIRQIPQWLKTDA
ARSPWKPPRPSRMLWVTLTVEERNFLTMQGSSSIINTSLIKTLLKAALLPKEAGVIHCKG
HQKASDPITQGNTYADKGHLRQFIVINQNATRWILA

>gi568815580f:31992777_32229184|GENSCAN_predicted_CDS_1|651_bp
atgacctcgggtcctcagaccaaccaacccaaggaacatctcaccaatttcaaatcgggc
tgctgccaggtcccggttcttcctcagcctctgctccccaatcctataatccttctatca
cctcccctcccaacacccaggcctggcttacagtttcgttttgcgactagccctccccca
cctgcccaacaatttcctcttggagaggtggctggagctgaaggcatagtcaggcccaat
ctcacactgacaaccgccggcttcatgagccagacctccaggaaggcattagagcagttc
cctgagaagatccccaatggaactatcaggcagattccccagtggctgaagactgacgct
gcccgatcgccttggaagccccctagaccatcacggatgctttgggtaactctcacagtg
gaggaaagaaatttcctcactatgcaagggtcctcctccatcattaacacttctttaata
aaaactcttctcaaagccgctttacttccaaaggaagctggagtcattcactgcaagggc
catcagaaggcatcagatcccatcactcagggcaacacttatgctgataagggacacctg
agacagttcatagtaatcaatcaaaatgctacccgttggatattggcttga

>gi568815580f:31992777_32229184|GENSCAN_predicted_peptide_2|252_aa
MGSVLSTDSGKSAPASATARALERRRDPELPVTSFDCAVCLEVLHQPVRTRCGHVSRSPG
AELQRRGSRHGSRPEPGLANWGGCSWLENQRSPRHGKETSSSYKAGAPAKRSYPAEHGRT
GCASLCPRNPGPFCFAGAVHVFPECPQRLVLSKQGSVQSVFPPPPLLPFPNTDTQPLKGQ
VKLELPSDFPQLLEFNNLLGWGLDVSRQAPECTPSKPSHNVTGGGRLPALRTVSPNRRKL
QSKITFSKPHST

>gi568815580f:31992777_32229184|GENSCAN_predicted_CDS_2|759_bp
atgggctccgtgctgagcaccgacagcggcaaatcggcgcccgcctctgccaccgcgcgg
gccctggagcgcaggagggacccggagttgcccgtcacgtccttcgactgcgccgtgtgc
cttgaggtgttacaccagcctgtccggacccgctgtggccacgtctcccgaagcccaggt
gcggaactccagagaaggggatccaggcacgggtcccgcccagagccagggctcgccaac
tggggagggtgctcttggctggagaaccagcgctcgcccaggcacggaaaggaaacgagc
tccagctacaaggcaggggctcccgcaaagcgcagttaccccgctgaacacggtagaacg
ggctgcgcttccctctgccccagaaacccgggtccgttttgttttgctggagctgtgcac
gtgttcccagaatgcccacagaggctcgtcttaagcaaacaaggatctgtgcaaagtgtc
tttccacccccacccctgctaccctttcccaacacagacacccagcccctcaaagggcaa
gtcaaactggagcttccgagtgactttccacagttgttagagtttaacaacctgctaggg
tggggcttggacgttagccgccaagcccctgaatgtactccttcaaaacccagccacaac
gtaacaggaggtggcaggctacctgctttacgcacggtctctccaaaccggcggaaatta
cagagtaaaataactttctcaaaaccgcatagtacttag

>gi568815580f:31992777_32229184|GENSCAN_predicted_peptide_3|130_aa
MFRFCPIRVQFFQSSLRLATFRILLIGAFYRVLIGALYRALIDAFYRALIGAFYRALIGV
FYNPLGLCPAAAAALLKRKTERDAKEITPRANEWDLGTTDEKPAKKGNKTPGDVNPLTQE
GKIKEILGKN

>gi568815580f:31992777_32229184|GENSCAN_predicted_CDS_3|393_bp
atgttccgtttctgtcctatcagagtgcagtttttccaatcctccctgcgattggctact
ttcagaatcctgctgattggtgcgttttacagagtgctgattggtgcactttacagagca
ctgattgatgcgttttacagagcactgattggtgcattttacagagcgctaattggtgtg
ttttacaatcctcttggcttatgtcccgctgcagctgccgccctgctcaagagaaagact
gagcgggatgctaaggagataacgccaagagccaatgaatgggatcttggaacaacggat
gaaaagccagcaaagaaagggaacaagaccccaggagatgtaaatcctttaactcaggaa
ggaaaaataaaagaaatacttggaaaaaattag

>gi568815580f:31992777_32229184|GENSCAN_predicted_peptide_4|138_aa
MCNCQLSSMTLKTCKIMEDLASWATFNLGQTEQHVDWISAGAAPQGPIVKQLPSPVPPTE
QLEQEKPVTIGSAPSTPNKLLLSHSNCQLPRPLRSITNISACFSLELAIDHRGFPHSPGA
RSLYSNSVFVTYQLCDLV

>gi568815580f:31992777_32229184|GENSCAN_predicted_CDS_4|417_bp
atgtgcaactgtcagctgtcttcaatgaccctgaagacctgtaaaatcatggaggaccta
gcttcatgggccaccttcaatttgggccaaactgagcagcatgtggactggatttctgct
ggagcagctccccagggacctattgtgaaacaactccctagcccagttcctcctacagag
caacttgagcaggaaaagcccgtgaccatcggctctgctccctcaacacctaacaagctc
ctacttagccattccaattgccaactccctaggccattaagatcaataacaaatatctct
gcatgcttttctcttgaacttgctatagaccacaggggtttcccacacagccctggagct
agatcactgtattcaaattctgtctttgttacttaccagctgtgtgaccttgtatga

>gi568815580f:31992777_32229184|GENSCAN_predicted_peptide_5|82_aa
MGRVGWLMLVIPALWEAEFCPLCRLIPDENPSSFSGSLIRHLQVSHTLFYDDFIDFNIIE
EALIRRVLDRSLLEYVNHSNTT

>gi568815580f:31992777_32229184|GENSCAN_predicted_CDS_5|249_bp
atgggccgggtggggtggctcatgcttgtaatcccagcactttgggaggcagagttctgt
ccactttgccgtttaatacccgatgagaatccaagcagcttcagtggcagtttaataaga
catctgcaagttagtcacactttgttttatgatgatttcatagattttaatataattgag
gaagctcttatccgaagagtcttagaccggtcacttcttgaatatgtgaatcactcgaac
accacataa

>gi568815580f:31992777_32229184|GENSCAN_predicted_peptide_6|70_aa
MVRINVLADALKSINNAKKRGKHQVLIGPCSKVIIRFLTVIMKHGYIGESEIIDDQSWED
CEPHRQAKQV

>gi568815580f:31992777_32229184|GENSCAN_predicted_CDS_6|213_bp
atggtgcgcataaatgtcctggctgatgctctcaagagcatcaacaatgccaaaaagaga
ggcaaacaccaggttcttattgggccatgctccaaagtcatcatccggtttctaactgtg
ataatgaagcatggttacattggcgagtctgaaatcattgatgatcagagctgggaagat
tgtgaacctcacaggcaggctaaacaagtgtag

>gi568815580f:31992777_32229184|GENSCAN_predicted_peptide_7|107_aa
MAGDGETRRWRQRQRPRDPAQRFPRASCPAAAIFAAEAPGQMEGGVRCAYRPLTGPEPRL
MVQGDSPAAASSSVLALRCLQPTRPSPNAGAPGIWAEEPGEAGRHGV

>gi568815580f:31992777_32229184|GENSCAN_predicted_CDS_7|324_bp
atggcgggggatggggaaacaaggcgatggcgacagcggcagcggccgagagacccggcc
cagcgctttcctcgagcttcctgtccggccgccgctatcttcgcagccgaggccccgggg
caaatggagggcggcgtgcgctgtgcttatcggcccctaacggggccggagccgaggctc
atggtgcagggagattcgcccgccgccgccagcagctctgtcttggcattgcgctgccta
cagcctacccggccgtcaccgaacgcgggagcgccggggatctgggcggaggagccagga
gaggcggggaggcacggggtgtga

>gi568815580f:31992777_32229184|GENSCAN_predicted_peptide_8|229_aa
MRESGAHCPLCRGNVTRRERACPERALDLENIMRKFSGSCRCCAKQIKFYRMRHHYKSCK
KYQDEYGVSSIIPNFQISQDSVGNSNRSETSTSDNTETYQENTRPLRDPLGEASWAPESG
GDLENLYVYLRDCKHTNQHPVSSSGIVNTPIGTLYLAQGLQTPISILRLAQEASKTTNPP
EGRNSEHIRTSERTNSRRATLRAVTLTARVRSFILEVSETKNPPIPDTA

>gi568815580f:31992777_32229184|GENSCAN_predicted_CDS_8|690_bp
atgagggaaagcggagcacattgtcccctatgtcgtggaaatgtgactagaagagagaga
gcatgtcctgaacgggccttagaccttgaaaatataatgaggaagttttctggtagctgc
agatgctgtgcaaaacagattaaattctatcgcatgagacatcattacaaatcttgtaag
aagtatcaggatgaatatggtgtttcttctatcattccaaactttcagatctctcaagat
tcagtagggaacagcaataggagtgaaacatccacatctgataacacagaaacttaccaa
gagaatacaaggcccctgcgggatccactgggtgaagccagctgggctcctgagtctggt
ggggacctggagaacctttatgtctatctcagggattgtaaacacaccaatcagcaccct
gtgtctagctcagggattgtaaatacaccaatcggcactctgtatctagctcaaggtttg
caaacaccaatcagcatcctgcgtctagctcaggaagccagcaagaccacgaacccacca
gaaggaagaaactccgaacacatccgaacatcagaaagaacaaactccagacgcgccacc
ttaagagctgtaacactcactgcgagggtccgcagcttcattcttgaagtcagtgagacc
aagaacccaccaattcctgacacagcatga

>gi568815580f:31992777_32229184|GENSCAN_predicted_peptide_9|151_aa
MVVRRSCFPYICTVGTLGSMIILSCSMTEDFCFDSCCMWWALIVDFAPCGQKGQNTKKYI
YSTEIFTSNNPELRSEDETVFRALEKWKTSEQTIGEMDFYICNDPHPDSALYQVCEKSLP
NSVSTIEKVKLRWSTSFPTFLGSLAGTLSLL

>gi568815580f:31992777_32229184|GENSCAN_predicted_CDS_9|456_bp
atggtggtgagaagaagctgcttcccctacatatgcactgtgggaaccctaggatctatg
ataatactcagctgttcaatgactgaggacttctgctttgacagctgctgcatgtggtgg
gccttgattgttgactttgctccatgtggccagaagggccagaataccaaaaagtatata
tacagcactgagattttcaccagcaacaatccagagctcagatctgaggatgagacagtt
ttcagggccttagagaaatggaaaacctctgagcagacaataggagaaatggatttctac
atctgtaatgaccctcaccccgattctgccctgtaccaagtatgtgaaaaatctctccca
aactcagtttctacaatagaaaaagtgaaactgaggtggtcaaccagcttccccactttc
ttgggttccctggcaggaaccctgtccttgctttaa

>gi568815580f:31992777_32229184|GENSCAN_predicted_peptide_10|394_aa
MGKYAEALRSQQKAVLMSVRVMGIEHPNTIQENMHLALHCFTSRQLSLALSLLQGAHYLM
RWCLGKTPRGGVLDNIRRVLHRVMEYDLSLCFLDNALAVSTKYQGPKALKVALGHHLITS
VYESKAEFPVGPAAPEGRAGCTPSLGEDQEKTKESSEYLKCLTQLAVALRRAMHEIYRNG
SSNNIPPLNFTAPSMASVLEQFKGINGILFIPLSQKDLESLKAEVLIKPAGTHLTEGKHP
ILSDSALPCGGTTQLQSTLAILPHLWGYQEKNARHTKKHKQFEETEQAMEQDSDMADMLE
LSDQKFKTTMINMLRALTNKVDSMQEDMGIISREMEILRKTQKEMLEIKNTVTEMKNPFD
RFISRLDTAERNPQLEDISIETSKTEKQREKKKD

>gi568815580f:31992777_32229184|GENSCAN_predicted_CDS_10|1185_bp
atgggcaagtacgcagaggccctgagaagccagcagaaggcagtgctgatgagcgtgcgg
gtgatgggcatcgagcaccccaacaccatccaggaaaacatgcacctggccctgcactgc
ttcaccagcaggcagctgtccctggccctcagcctgctgcagggcgcccactacctcatg
cgctggtgcttggggaagacaccccgaggtggcgtgctggacaacatcaggcgggtgctg
cacagggtgatggagtacgacctgtcgctgtgcttcctggacaacgcgctggccgtcagc
accaagtaccaagggcccaaagctctcaaggtggcccttggccaccacctcatcacctcg
gtctatgagagcaaagctgagtttccggtcggccctgcagcaccagaaggaagggctggc
tgcacaccctctctgggcgaggaccaggagaagaccaaggagagctctgagtacctcaag
tgcctgacccagctggccgtggccctgcggcgcgccatgcatgagatctaccgcaacggc
tccagcaacaacatcccgcccctcaacttcacagcccccagcatggccagcgtcttggag
cagttcaaaggcatcaatggcatcctcttcattcctctcagccaaaaagacttggagagt
ctgaaagccgaggttctgataaaacctgctggaactcacctgacggaaggaaaacatcca
attctatctgactctgcccttccgtgtggaggaactacccaactccagtccactctagcc
atcctgccccacctatggggctatcaagaaaaaaatgcaaggcatactaaaaagcacaaa
cagtttgaagagaccgagcaagcaatggaacaagactcagatatggcagatatgttggaa
ttatcagaccagaaatttaaaacaactatgattaatatgctaagggctctaacaaataaa
gtagacagcatgcaagaggacatggggattataagcagagagatggaaattctaagaaag
acccaaaaagaaatgctagaaataaaaaatactgtaacagaaatgaagaatccctttgac
agatttattagtagactggacacagctgaaaggaatccacaactggaggatatctcaata
gaaacctccaaaactgaaaagcagagagaaaaaaaaaaagactga

>gi568815580f:31992777_32229184|GENSCAN_predicted_peptide_11|616_aa
MYEMNAKGVILNAFERYRLKTCIDFKPWAGETNYISVFKGSGCWSSVGNRRVGKQELSIG
ANCDRIATVQHEFLHALGFWHEQSRSDRDDYVRIMWDRILSGREHNFNTYSDDISDSLNV
PYDYTSVMHYSKTAFQNGTEPTIVTRISDFEDVIGQRMDFSDSDLLKLNQLYNCSSSLSF
MDSCSFELENVCGMIQSSGDNADWQRVSQVPRGPESDHSNMGQCQGSGFFMHFDSSSVNV
GATAVLESRTLYPKRGFQCLQFYLYNSGSESDQLNIYIREYSADNVDGNLTLVEEIKEIP
TGSWQLYHVTLKVTKKFRVVFEGRKGSGASLGGLSIDDINLSETRCPHHIWHIRNFTQFI
GSPNGTLYSPPFYSSKGYAFQIYLNLAHVTNAGIYFHLISGANDDQLQWPCPWQQATMTL
LDQNPDIRQRMSNQRSITTDPFMTTDNGNYFWDRPSKVGTVALFSNGTQFRRGGGYGTSA
FITHERLKSRDFIKGDDVYILLTVEDISHLNSTQIQLTPAPSVQDLCSKTTCKNDGVCTV
RDGKAECRCQSGEDWWYMGERCEKRGSTRDTIVIAVSSTVAVFALMLIITLVSVYCTRKK
YRERMSSNRPNLTPQN

>gi568815580f:31992777_32229184|GENSCAN_predicted_CDS_11|1848_bp
atgtatgaaatgaatgctaagggagttatcctcaatgcatttgaacgttatcgccttaaa
acatgtattgactttaagccttgggctggagaaacaaactatatatcagtgttcaagggc
agtggctgctggtcttcagtaggaaataggcgggttgggaagcaagaactttccatcggg
gcaaactgtgaccgaatagcaacagttcaacacgagttcctccacgctctgggattctgg
catgagcagtcgcgttctgaccgggatgactatgtcaggataatgtgggacagaattctg
tcaggcagagagcacaattttaacacctatagtgacgatatatcagattccctgaatgtt
ccctatgattacacttcagtaatgcactacagtaaaactgcattccaaaatggaacagag
ccgacaattgtcacaagaatctcagactttgaggatgtgatcggccaacgaatggatttc
agtgactctgatctcctaaagttgaatcaactgtataactgctcctcttccttgagtttt
atggactcgtgcagttttgaactggaaaatgtgtgtggcatgatccaaagttcaggagat
aatgctgactggcaacgggtttcacaggttcccagggggccagagagtgatcactccaac
atgggccagtgccaaggttctggtttcttcatgcatttcgatagcagctctgtaaatgtg
ggggccacagcagtgctggaaagtagaacgctgtaccctaaaagaggatttcagtgcctg
caattttacttatataacagtggcagtgaaagtgatcaactgaacatctatatcagggag
tattctgcagacaatgtggatggcaatttaacccttgtggaagaaataaaagaaataccc
actgggagctggcaactttatcatgtaacattgaaagtgaccaagaagtttagagtggtg
tttgaaggacgcaaaggctctggtgcatcactgggtggtctgtctattgatgacatcaat
ctttcggaaacacggtgccctcatcatatctggcatataaggaatttcacacagttcatt
ggcagcccaaatggaactctgtatagccctccattttactcttctaaaggttatgccttt
cagatttacttaaatctagcccatgtgactaatgcagggatatatttccacttgatctct
ggagccaatgatgatcaattacagtggccatgtccttggcaacaagccacaatgacactt
ttggatcaaaatcctgacattcgacagcgtatgtccaatcagcggagtataactacagac
ccatttatgaccaccgataatggaaactatttctgggacaggccttctaaagtgggaaca
gtggctttgttctctaatggaactcagtttagaagaggtgggggctatggaaccagtgcc
tttataacccacgaaaggctgaaaagcagagattttataaaaggagatgatgtttatatc
ctactgacagtggaagacatatctcacctcaactctacacaaatccagctaacaccagcc
cctagtgttcaagacctctgctcaaaaaccacctgtaaaaatgacggtgtctgcactgtt
cgagatggcaaagctgagtgcaggtgccagtcaggggaagactggtggtacatgggagaa
aggtgtgaaaagagaggctccacccgagacaccatagtcattgctgtttcatctactgtt
gctgtgtttgccttgatgctgatcatcacccttgtcagtgtctattgcaccaggaagaaa
tatcgtgaaaggatgagctcaaatcgaccaaatttgactccgcaaaat