GENSCAN 1.0 Date run: 5-Nov-116 Time: 10:19:07 Sequence gi568815588r:3679542_3885014 : 205473 bp : 42.66% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 12134 12173 40 1 1 71 49 63 0.327 1.10 1.02 Intr + 12210 12452 243 1 0 15 25 202 0.424 3.15 1.03 Term + 16254 16396 143 1 2 96 35 121 0.478 4.71 1.04 PlyA + 16726 16731 6 1.05 2.00 Prom + 17276 17315 40 -7.65 2.01 Init + 17340 17548 209 2 2 65 113 156 0.819 14.14 2.02 Intr + 17792 17924 133 2 1 11 30 165 0.499 2.73 2.03 Intr + 21993 22132 140 2 2 39 29 128 0.097 0.34 2.04 Intr + 22827 22935 109 0 1 103 15 79 0.107 1.47 2.05 Intr + 23753 23848 96 1 0 69 42 111 0.013 3.89 2.06 Term + 38032 38202 171 0 0 4 54 227 0.056 7.74 2.07 PlyA + 38508 38513 6 -0.45 3.07 PlyA - 39371 39366 6 -3.24 3.06 Term - 40064 39837 228 2 0 75 47 235 0.884 13.75 3.05 Intr - 41341 41186 156 1 0 71 102 32 0.720 2.19 3.04 Intr - 43817 43658 160 1 1 53 87 57 0.107 1.17 3.03 Intr - 50548 50387 162 0 0 105 56 143 0.470 10.97 3.02 Intr - 56854 56746 109 2 1 91 66 81 0.005 4.62 3.01 Init - 69060 69015 46 0 1 55 75 139 0.766 10.20 3.00 Prom - 70856 70817 40 -4.85 4.19 PlyA - 70864 70859 6 1.05 4.18 Term - 77124 76850 275 1 2 34 48 318 0.469 17.05 4.17 Intr - 78047 77946 102 0 0 100 109 42 0.637 6.73 4.16 Intr - 79892 79803 90 0 0 104 41 83 0.556 4.25 4.15 Intr - 85521 85336 186 1 0 56 50 107 0.478 2.44 4.14 Intr - 86851 86634 218 0 2 69 73 145 0.761 8.32 4.13 Intr - 87390 87256 135 2 0 86 76 109 0.641 8.26 4.12 Intr - 88545 88412 134 0 2 47 46 127 0.656 2.92 4.11 Intr - 88940 88764 177 2 0 38 98 109 0.051 6.09 4.10 Intr - 96862 96752 111 1 0 112 20 91 0.128 4.36 4.09 Intr - 100688 100658 31 1 1 110 41 41 0.022 -1.19 4.08 Intr - 101812 101635 178 2 1 46 58 149 0.177 5.76 4.07 Intr - 102854 102100 755 1 2 88 82 502 0.504 39.77 4.06 Intr - 104819 104641 179 2 2 45 37 162 0.592 4.60 4.05 Intr - 105450 105372 79 2 1 47 97 129 0.146 8.33 4.04 Intr - 106583 106437 147 1 0 35 72 127 0.122 4.23 4.03 Intr - 111271 111152 120 0 0 92 64 61 0.012 2.79 4.02 Intr - 115708 115543 166 2 1 68 59 159 0.437 9.10 4.01 Init - 118477 118216 262 1 1 41 48 119 0.421 0.47 4.00 Prom - 123859 123820 40 -5.05 5.09 PlyA - 125583 125578 6 1.05 5.08 Term - 126612 126461 152 1 2 116 41 173 0.982 12.49 5.07 Intr - 128125 127966 160 1 1 0 97 160 0.733 6.74 5.06 Intr - 131980 131861 120 1 0 94 41 74 0.378 3.07 5.05 Intr - 141071 140976 96 2 0 38 70 140 0.290 6.49 5.04 Intr - 146712 146456 257 1 2 -10 52 188 0.125 1.64 5.03 Intr - 146946 146761 186 1 0 43 41 200 0.486 9.54 5.02 Intr - 162488 162451 38 1 2 107 101 42 0.109 4.39 5.01 Init - 171119 171058 62 2 2 85 81 19 0.271 1.67 5.00 Prom - 175582 175543 40 -5.35 6.00 Prom + 175710 175749 40 -8.05 6.01 Init + 178913 179066 154 1 1 57 99 152 0.669 12.17 6.02 Intr + 189382 189542 161 2 2 62 65 43 0.063 -1.81 6.03 Intr + 190262 190436 175 1 1 115 48 104 0.079 7.69 6.04 Term + 198159 198298 140 1 2 46 48 192 0.439 8.14 6.05 PlyA + 198565 198570 6 1.05 7.03 PlyA - 200745 200740 6 1.05 7.02 Term - 202715 202557 159 2 0 70 48 110 0.522 2.16 7.01 Intr - 203871 203551 321 0 0 56 17 194 0.497 4.23 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 1012 1122 111 0 0 80 44 117 0.891 4.08 S.002 Term + 190262 190374 113 0 2 115 47 98 0.836 6.14 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815588r:3679542_3885014|GENSCAN_predicted_peptide_1|141_aa MASVDSSLQSKATAPRGQDEPRGGMNCKRGMNRRGADCGKSVLELKTPHAGVMMHLLQAT HASTLDLVLTATETRAYALRTMGEAALALQGDRYRCVHQGGQEDTAHGGKRLRAQNRAES SGKWIWRNMKKITGKIVLITN >gi568815588r:3679542_3885014|GENSCAN_predicted_CDS_1|426_bp atggcctctgtggacagctccctgcagtccaaagctacagctccacgaggacaggatgaa cctcgaggaggcatgaactgcaagagaggcatgaatcgccgtggagcagattgtggcaag tctgtgttggagctcaaaactcctcatgctggagttatgatgcatctcttgcaggccaca catgcctccactctggacttggtgctgacagccacagagaccagggcttatgctcttagg accatgggggaggctgcccttgccctccagggagacagatatcgctgtgttcatcaagga ggccaagaagacacagcccatggaggcaaacgtctcagagcacagaacagagcagaaagc agtgggaaatggatttggaggaacatgaagaagatcaccggcaaaattgtccttattact aattag >gi568815588r:3679542_3885014|GENSCAN_predicted_peptide_2|285_aa MQPSVFSEKWCHDKGSSWASSVGSLGPQRWLEPEHAGEGARVAKIRVIPVLPSWTLQGCP VPSWSSYVERQPDMMLEILPTDTWSIPQGTGMQRPSTQAGVDASSARPVLSPPLYYSDHL SCGMFTELAVAHGPACLQRNCIVLSDEGEDAEGLVHSRQLMCWLTTTGLQRKGKRCPRNH NLHDDHSPIKFHARVNKRSCIVLPDEGEDAEDLVHPRQLTGVWAGTLTQRMGAWASLGYH KSSIFLRSAESALLKRHGPQRTGQGTLQEEAPQGAERRTREGTGT >gi568815588r:3679542_3885014|GENSCAN_predicted_CDS_2|858_bp atgcagccaagtgttttctctgaaaaatggtgccatgacaaaggttcatcatgggcatct tccgtgggcagcttgggccctcagaggtggctagagccagagcatgctggtgaaggagcc cgtgttgctaagatcagggtgatcccagtgctgccatcatggacacttcaagggtgccct gtgccttcatggagctcctatgtggaaagacaaccagatatgatgctggaaattctgccc actgacacctggagcattcctcaggggactgggatgcagcggccatccactcaggctggt gtagatgcatcgtcggccaggcccgtgctttctcctcctctgtactacagcgaccactta tcttgtgggatgttcacagagctggctgtggcacatggtcctgcttgtcttcagagaaat tgtattgttctttctgatgagggtgaggatgctgagggcctggttcattcaagacagcta atgtgctggctgaccacgactggcctgcagaggaaaggaaagagatgcccccgtaaccac aacctgcatgacgatcacagccccatcaaattccatgcaagagtgaacaagaggagttgt attgttcttcctgatgagggtgaggatgctgaggacctggttcatccaagacagctaact ggtgtttgggcaggaacgctgacccagcgtatgggggcctgggctagtctgggctaccac aagagcagcattttcctgaggtctgcagagtccgcccttcttaagcgacatgggccccag agaacagggcaagggaccctgcaggaggaggccccccaaggagccgagagaaggacccga gaaggaaccgggacctga >gi568815588r:3679542_3885014|GENSCAN_predicted_peptide_3|286_aa MDVIVKIRDCGQECAANEAGFFSPWFTPKSVKSMSPIVLLGRLEKGADSSKRVLTTFSHT IARIHYIAYLHLSSLAVIQAPEGKGSSSIVSTDVFQAPQTCREHSRISQNCEAQKTRPQS MAPEHVKCFELEEMGRPPTPGLSDPLLPSCLSLPSFCPKLTCELFRNHWACKLEDHVFGL FKLASDCLLKLPLLPVIELNFSSSALCRVTYGLGRLQAQVLLRSYWFPGMATSQQNNFST SSSPRSSRRSEDMKQTHEKVTLQVELEAAQLLRSQPEKAVQEQADI >gi568815588r:3679542_3885014|GENSCAN_predicted_CDS_3|861_bp atggatgtcatcgtcaagattcgcgactgtgggcaggaatgtgcagctaatgaagcaggc ttcttctctccttggttcacccccaaatcagtcaagagcatgagtcccattgttctcttg ggacgacttgaaaaaggtgcagactctagcaagcgagtgcttaccaccttctcccatacc attgcacgcattcattacattgcatatttacatttgtcttcactggctgtaatacaagcc cctgaaggcaaaggatcttcatctattgtatccactgatgtatttcaagctcctcaaacc tgccgagaacatagtagaatctctcagaattgtgaggctcagaaaacaagaccccaaagt atggcgccagagcatgtcaagtgctttgagctggaggaaatgggaaggccgcctacgcca ggtctctccgaccctctcctgccctcatgtctctctctgccctctttctgccccaaactg acatgtgaattatttagaaaccactgggcatgtaaactggaagaccatgtgtttggactg ttcaaattggcaagtgactgtctgctgaaacttcctctgcttcctgtaatagaactcaac ttctcctccagcgccctgtgcagagtgacttacgggcttggcaggctgcaggctcaggtg ctgttacggagctactggttcccgggcatggccacgtcccagcagaataacttctcaact tccagtagccctcgcagctcccgccgttcagaagacatgaaacaaacacacgagaaagtg actctgcaggtagaactagaagctgcccagctgcttcgttcccagcctgagaaagccgtc caggagcaagcggacatctga >gi568815588r:3679542_3885014|GENSCAN_predicted_peptide_4|1114_aa MHLFKLFEVFAEEEQCIDWWKGATRWQIEISDSSGCANGNEKVAWKKIRSSELSKWDIPF IQEGISVFNRSTTKIGPMYKFVRICTSVATGCGDGSQSLPLEAHRPKRKAQQSTKTAGSI APRETSVESSGSTVRAPDGCCRKGPSPHRHTSITTGVQQWSKWVCNRSTEGHPGTPVSML IARGACVLPSPTLVARRTEGTLGLEGDGDGLRDGGGQRGTRRWRRTERGTPSIFQELQIV HETGYFSALPSLEEYWQQKNKNKTVYTPVTCQSCDNEPVQTASSLSLERRTSVSDIVVEL EFGGIGNLRNLPGPRAYRNRFPLTSVRTRMLNYKRFCPCSDRAAMLSVLHSCMFLEIAFA FTFGRHGNHVPSLVHFAQTCLELERYLQSEPCYVSASEIKFDSQEDLWTKIILAREKKEE SELKISSSPPEDTLISPSFCYNLETNSLNSDVSSESSDSSEELSPTAKFTSDPIGEVLVS SGKLSSSVTSTPPSSPELSREPSQLWGCVPGELPSPGKVRSGTSGKPGDKGNGDASPDGR RRVHRCHFNGCRKVYTKSSHLKAHQRTHTVVLGIRGHCPFTSALARGYLERPWGFLNETQ GSGQRVAGQRGVCGFDADGPRPEWDRPALRKALQMLMGRPVLGAGEDVSEPDIALPECIK YSSPRTDKNNPPAVGTMLQKSQVHLDEICGRTTSMPYCLPVEPILTLSPLSNGTPCQPAM GHSGARYAFSSFTHTLNITVFRQKWAQYLLSCSRDHDGQRAHSTSMCGSFAAAFTAPSDC RWKAYGKDDVVRQQKTMTRDVHTPVNSVLNSTTPGRTPTWERKDLSTRQLSCHCSSCIHL NAEHCKVHVVFFQLKVTCDAVSVSDGKMDFESLRPIRGQTGSAPPPPVQSPGRAAVGRVQ SALPTLCHCEGQCLLWLRDEENFGYQVASGQQNRFSRDYHLSLANNNSGSYKANTEMIFV ACAWLTDCLDLNLNTVTYYMHALGQFAEERQIRFDDAGISGTLLLGLPGNLDPSLSKMVF INTGEPLKGTKSIYSLNLGLQLYGRSKRRRGRRGRRSGGGGGGRGGGGREEEEEEGGGVA GGEEEEEEEECICIPSGKELSLGKAKTHSCSLDR >gi568815588r:3679542_3885014|GENSCAN_predicted_CDS_4|3345_bp atgcatttatttaaactctttgaagtctttgcagaagaagagcagtgtattgactggtgg aaaggtgctaccagatggcagatagagatttctgactcatctggatgtgctaatggaaat gagaaagttgcatggaagaagataagatcttctgaattgtctaagtgggatatccctttt attcaggaggggatttcggttttcaacagaagcaccacaaaaattgggcccatgtacaaa tttgtcagaatttgtacttctgtggcaacgggatgtggagatggaagtcagtcccttccc ttagaagctcacagacctaagaggaaggcacaacaatccaccaagactgcaggatccata gcgcccagggagaccagtgtggaatcctcaggatccacagtgagagcaccggatggttgc tgcagaaagggcccatctcctcaccggcacacttccatcacaacaggtgtgcagcaatgg tcaaagtgggtgtgcaacaggagcactgaggggcacccaggcaccccagtctccatgtta attgcaaggggagcctgcgtgctcccctcccccactctggtggcccggaggacagagggg accctaggactggagggggacggagacgggctcagggatggagggggacagagagggacc cggaggtggaggaggacagagaggggtaccccaagcatcttccaggagctccagatcgtg cacgagaccggctacttctcggcgctgccgtctctggaggagtactggcaacagaaaaat aaaaataaaactgtgtacaccccagtgacttgtcagtcttgtgacaatgagcccgtccaa actgcttcaagtcttagtttggaaagaaggacctcggtttccgacattgtcgtagaactg gaatttgggggcattggaaatttaagaaacttaccaggtccccgtgcttatagaaacaga tttcctctaacttccgtaaggacacggatgcttaattacaaaaggttttgcccctgtagt gaccgggcagcaatgttatctgtccttcattcttgcatgtttttggaaattgcttttgct tttacttttggtcgtcatggcaatcacgtgccttctctggttcattttgcacagacctgc ctagagctggaacgttacctccagagcgagccctgctatgtttcagcctcagaaatcaaa tttgacagccaggaagatctgtggaccaaaatcattctggctcgggagaaaaaggaggaa tccgaactgaagatatcttccagtcctccagaggacactctcatcagcccgagcttttgt tacaacttagagaccaacagcctgaactcagatgtcagcagcgaatcctctgacagctcc gaggaactttctcccacggccaagtttacctccgaccccattggcgaagttttggtcagc tcgggaaaattgagctcctctgtcacctccacgcctccatcttctccggaactgagcagg gaaccttctcaactgtggggttgcgtgcccggggagctgccctcgccagggaaggtgcgc agcgggacttcggggaagccaggtgacaagggaaatggcgatgcctcccccgacggcagg aggagggtgcaccggtgccactttaacggctgcaggaaagtttacaccaaaagctcccac ttgaaagcacaccagcggacgcacacagtagtgctggggatccgaggccactgccccttc accagtgcactcgcacgaggctacctcgagcggccctggggtttcctaaatgaaactcaa gggtcaggacagagggttgctgggcagcgtggagtgtgtgggtttgatgctgacggcccg aggcccgagtgggaccggcctgctctgagaaaagccttacagatgctcatgggaaggcct gtcctgggagctggagaggatgtcagcgagcctgacattgccctccctgaatgcatcaaa tactcttctccaaggactgacaaaaacaaccccccggctgtgggcacaatgctgcagaag tcacaggtgcaccttgatgaaatatgtggccgaaccacatctatgccctattgtcttcca gtggagcccatactcaccctctcacctctaagcaacggaaccccatgccaaccagcaatg ggccattctggagccaggtacgcattttcttcattcacccacacactcaatatcacagtc ttcagacagaagtgggctcagtacctgctttcttgcagtcgtgaccatgatggtcagcgt gcccattccacgtccatgtgtggctccttcgctgctgccttcacagccccttctgactgt agatggaaagcctatgggaaagacgacgtggttagacagcagaagacaatgaccagggat gtccacacacctgtcaattcagttctcaatagtacaacaccaggaagaacccccacctgg gaaagaaaggacctgtcaaccagacagttaagctgtcactgttcatcatgtattcatctt aacgcagaacattgcaaagtgcatgttgtctttttccagttaaaggtaacgtgtgatgca gtcagtgtttcagatggaaaaatggactttgaatccctgcgtcccatccgagggcagact gggtctgcccctcctccaccagtgcagtcccctgggagggctgctgtgggcagagtgcag agtgcattgcccacactctgtcactgtgagggtcagtgcttgctgtggttgagggatgaa gaaaactttggatatcaagtggcctcagggcaacagaacaggttttcaagggactatcac ctttccctagcaaacaacaacagtggcagctacaaggcaaatacggaaatgatctttgta gcatgtgcgtggttgacagattgcctggacttgaatctcaataccgtcacttactacatg catgcgcttgggcagtttgctgaagagcggcaaataagattcgacgatgcaggaatctct gggactttgctcctaggactgcccgggaatcttgacccatcgctaagtaagatggtgttt ataaacacaggggaacctctcaaaggaacaaaaagcatttattcactcaacctgggactt caactttatggaagaagtaagagaagaagaggaagaagaggaagaagaagcggtggcggc ggaggaggaagaggaggaggaggaagagaagaggaagaagaagaaggaggaggagtagca ggaggagaagaagaggaggaggaggaggaatgcatttgtattccgtctggaaaagagctt tccctaggcaaagctaagacacacagctgcagccttgaccgatga >gi568815588r:3679542_3885014|GENSCAN_predicted_peptide_5|356_aa MGIMLKDSFQHNPCSPEGNTRFLINSHSDGDDDESTHYPPKDSSITRNPTASGLEISKPE KTEKYPHKYFLPPNCRDASKRLHIVPIGVHSLRVSPINAHMETEEHTGKWKNFTASLFVT RAKALSQPRRSPVSCSRPHCPAGAVATTSWRSGRARDEEGEDENQSQILVAPPGCEDASL WGRPCGSTVVLITVTLSPRQKGADAVGLELQLKWNVLFEIDDFQGDHYVPETPVSSFRTP NLDSPQLPSWLSRATLSSAKAASRTVSPHKQPARRREVTLQLLTALRAGMQTDTAGAMGC YRDAELSVSAESQLMSSGRCHQVQGLESAAAELELFVKNGSSKTIDRHLVLKAAGQ >gi568815588r:3679542_3885014|GENSCAN_predicted_CDS_5|1071_bp atgggcatcatgttaaaagacagctttcaacacaatccatgtagccctgagggtaataca aggttcctgataaatagtcacagtgatggtgatgatgatgaaagcactcactaccctcca aaagattctagtatcacaagaaatcccactgcctcaggtttggaaatttccaaacctgag aaaactgagaaatacccacataagtacttcctaccaccaaactgccgggatgccagcaaa aggctccatatcgtccccatcggggtccacagcctgcgcgtcagccctataaacgcacac atggagacagaggagcacacaggaaaatggaagaacttcacggccagtttatttgtaacc agagccaaagccttgagccagccacgccggtcaccggtcagctgcagtcggcctcactgc ccggcaggagctgtggccaccacgtcatggaggagtgggagggcaagggatgaagagggg gaagatgagaatcagagccaaatcctcgtggctcccccaggctgtgaggacgcatccctg tggggaagaccgtgtggaagcactgttgttctcatcacggtcacactctctccccgacag aaaggtgctgacgctgtggggttggagctccagttgaagtggaatgttctttttgaaatc gatgactttcaaggtgatcattacgtgccagagacccctgtgagctccttcagaacacca aatttggactctccccagctcccgagctggctgtcaagggcaacactctcatcagcaaag gcagccagcaggacagtcagtccacataaacaaccagccagaaggagagaagtgacttta cagctcttgaccgcactgcgggctgggatgcagacggatacggctggagctatgggatgt tatagggacgcagagctaagtgtgtctgcagagagccagctgatgtcttccggccgatgt caccaggtgcagggtctcgagtctgcagctgctgagctggagttatttgtaaaaaatggt agcagcaagacaatagatagacatcttgtcctgaaggcagcagggcaatga >gi568815588r:3679542_3885014|GENSCAN_predicted_peptide_6|209_aa MQALFKGGSTRPQSLAAWDTAVSLHSAKKLGEKGLAKKRNSIRYARCTQCAEASYKVQLT RRGAELNLTFSKKDCQRICRYIFKTPHNAFDEILIIDPLAYFMWTGGQLFVLFRPSADRM GLTHIREISPKNILTEISRIKIDHISGHHGPTKLTHKINRHTSGSGQLSNAGLVAMEDAL NGQRKTHMKAPKMWMKAMPSSRAKECVNA >gi568815588r:3679542_3885014|GENSCAN_predicted_CDS_6|630_bp atgcaagcattgttcaagggagggagcacaagaccccagagcttggccgcttgggacaca gctgtgagcctccattctgcaaagaagcttggagagaaaggccttgcaaagaagaggaac agcatccgctatgcccgatgcactcagtgtgcagaagcaagttacaaagtccagctgaca cgcagaggagcagaattgaacctcaccttttcaaagaaggattgtcaaaggatttgtaga tatattttcaagacaccacacaatgcatttgatgagattctgattatagaccctttggca tattttatgtggactggaggtcagctttttgtgctcttcaggccttcggcagatcggatg gggctcacccatatcagggaaatctcacccaaaaatatcctcacagaaatatccagaata aagattgaccacatatctgggcaccatggcccaaccaagctgacacataaaattaaccgt cacactagtggaagcggccagctttctaatgctggcttggtggccatggaagatgccctc aacggtcaaagaaaaactcatatgaaagcaccaaaaatgtggatgaaagcaatgccatcc tcacgtgctaaggagtgtgtgaatgcatga >gi568815588r:3679542_3885014|GENSCAN_predicted_peptide_7|159_aa DAKAGTCKLPAVQGTRWAHKLEEEEGLWVLHVCFLGLLGLCFCGHHTGHTASSGQELAPV WSLSFQNHLYYTPVPMTPAAVTGTSSQSPCSELRDTDRTREHLHFRPQYENGLIQPVSPV WEVLSGVHLHLGPYSICLQLTRAGGLKATPCSLGAANIQ >gi568815588r:3679542_3885014|GENSCAN_predicted_CDS_7|480_bp gatgccaaagctgggacctgcaagctgcctgcagtacagggcaccagatgggcccacaag ctggaggaggaagaagggctctgggtccttcatgtttgcttcctggggcttcttggcctc tgcttctgtggccatcacactggtcacactgcttcctctgggcaggagctggctccagta tggagcttatccttccaaaaccacctttattacacacccgttcccatgaccccagctgca gtcacagggacctcctcccagagcccctgctctgagctcagagatactgaccgcaccagg gagcacctccacttcagacctcagtatgaaaatggactgatacagcctgtttcccccgtg tgggaagtgttgtctggtgttcacttacatctgggtccttacagcatttgccttcaattg acgagagctggtgggctcaaggctacaccctgctccttgggagcagccaacatccagtga