GENSCAN 1.0 Date run: 7-Nov-116 Time: 15:13:56 Sequence gi568815586f:64510633_64794909 : 284277 bp : 43.38% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Sngl + 7724 7870 147 1 0 90 49 124 0.604 2.74 1.02 PlyA + 9132 9137 6 1.05 2.04 PlyA - 9730 9725 6 1.05 2.03 Term - 17820 17732 89 1 2 64 48 94 0.671 0.82 2.02 Intr - 24402 24254 149 2 2 16 44 99 0.027 -1.82 2.01 Init - 32768 32380 389 2 2 80 23 221 0.463 9.21 2.00 Prom - 34931 34892 40 -3.26 3.00 Prom + 35904 35943 40 -4.76 3.01 Init + 39041 39092 52 1 1 91 75 10 0.459 1.54 3.02 Intr + 45929 46143 215 0 2 88 -34 157 0.058 1.93 3.03 Term + 48344 48523 180 1 0 81 40 136 0.202 5.71 3.04 PlyA + 50711 50716 6 1.05 4.00 Prom + 62169 62208 40 -3.86 4.01 Init + 72806 72808 3 1 0 98 53 0 0.320 -2.50 4.02 Intr + 73957 74075 119 0 2 144 78 44 0.311 8.26 4.03 Term + 85762 85921 160 1 1 52 36 105 0.346 -0.89 4.04 PlyA + 88789 88794 6 1.05 5.00 Prom + 97446 97485 40 -5.26 5.01 Init + 100001 100111 111 1 0 89 89 175 0.960 17.91 5.02 Intr + 122993 123136 144 1 0 50 24 102 0.108 0.48 5.03 Term + 139586 139678 93 1 0 84 48 75 0.341 0.93 5.04 PlyA + 140643 140648 6 1.05 6.00 Prom + 159351 159390 40 -2.76 6.01 Init + 163443 163475 33 2 0 60 89 101 0.761 5.28 6.02 Intr + 174155 174262 108 1 0 126 99 128 0.999 18.18 6.03 Intr + 177584 177821 238 1 1 90 107 161 0.981 15.39 6.04 Intr + 180838 180947 110 2 2 86 107 82 0.999 9.90 6.05 Term + 184131 184280 150 2 0 69 42 259 0.919 17.31 6.06 PlyA + 185568 185573 6 1.05 7.13 PlyA - 185979 185974 6 1.05 7.12 Term - 196141 195937 205 0 1 65 48 100 0.499 0.54 7.11 Intr - 209550 209390 161 0 2 117 114 42 0.973 8.49 7.10 Intr - 211073 210963 111 2 0 52 94 95 0.852 7.08 7.09 Intr - 212481 212374 108 0 0 63 65 46 0.544 0.28 7.08 Intr - 218425 218324 102 1 0 59 111 122 0.449 11.97 7.07 Intr - 226475 226372 104 0 2 105 100 94 0.966 12.19 7.06 Intr - 228867 228749 119 2 2 80 71 162 0.976 13.81 7.05 Intr - 230056 229974 83 1 2 79 116 -17 0.895 -1.36 7.04 Intr - 232676 232509 168 2 0 23 22 216 0.962 8.64 7.03 Intr - 234275 234177 99 2 0 110 97 50 0.997 8.41 7.02 Intr - 235092 235027 66 0 0 92 97 11 0.670 1.50 7.01 Init - 248644 248453 192 1 0 64 48 325 0.914 23.07 7.00 Prom - 249487 249448 40 -6.36 8.00 Prom + 255812 255851 40 -5.66 8.01 Init + 256725 256830 106 2 1 69 99 33 0.494 2.89 8.02 Intr + 257018 257112 95 0 2 63 47 84 0.322 1.48 8.03 Intr + 260964 261048 85 2 1 65 24 82 0.250 -1.11 8.04 Term + 263031 263428 398 1 2 77 49 172 0.276 7.44 8.05 PlyA + 263885 263890 6 -0.45 9.00 Prom + 264497 264536 40 -6.46 9.01 Sngl + 270177 270824 648 2 0 105 32 867 0.994 76.98 9.02 PlyA + 271688 271693 6 1.05 10.00 Prom + 273560 273599 40 -5.76 10.01 Init + 274703 274751 49 1 1 86 89 38 0.992 2.81 10.02 Intr + 275249 275413 165 0 0 63 100 198 0.635 18.43 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586f:64510633_64794909|GENSCAN_predicted_peptide_1|48_aa MVLTSAPVKARKLPVMVEGKEEDSVSHGKREQERQKEVPNSFKQPDLT >gi568815586f:64510633_64794909|GENSCAN_predicted_CDS_1|147_bp atggtgctgacatctgctcctgtgaaggccaggaagcttccagtcatggtggaaggcaaa gaagaagacagtgtatcacacggcaagagggagcaagaaagacagaaggaagttccaaac tcttttaaacaaccagatctcacatga >gi568815586f:64510633_64794909|GENSCAN_predicted_peptide_2|208_aa MAGCRSRALPRGEAAKARREIERSAGGPALLWDPKHPPQLLARVLSPSLPGRAAPAGHSK CGARHSKCGARHSKCGARHSKCGARHSKCGARHSKCGARHSKCGARHSKCGARHSKCGAR HSKCGARHSNMLKLTAFSLVVHKMTVYVTIDGALEMRKYGNAMPYYKASYSRNTTTYWCS TPCMTFENPLNHSKRQLPELKLDEINVS >gi568815586f:64510633_64794909|GENSCAN_predicted_CDS_2|627_bp atggctggctgcaggtcccgagccctgccccgtggggaggcagctaaggcccggcgagaa atcgagcgcagcgctggtgggccggcactgctgtgggacccaaagcaccctccgcagctg ctagcccgggtgctaagcccctcactgcctgggcgggcggcgccggccggccactccaag tgcggggcccgccactccaagtgcggggcccgccactccaagtgcggggcccgccactcc aagtgcggggcccgccactccaagtgcggggcccgccactccaagtgcggggcccgccac tccaagtgcggggcccgccactccaagtgcggggcccgccactccaagtgcggggcccgc cactccaagtgcggggcccgccactccaacatgctgaaattgacagcattttccttagtg gtacataaaatgacggtgtacgtcacaatagatggtgccttagagatgaggaaatatggt aatgctatgccttactacaaggcaagctattctagaaatacaactacttactggtgttca actccctgcatgacctttgaaaatccacttaaccactctaagcgtcagttgccggaactg aagttggatgagatcaatgtttcctga >gi568815586f:64510633_64794909|GENSCAN_predicted_peptide_3|148_aa MAASSQHSLSLLSPISGGWGQPVGSVVFKKMGPGVCSRGHQSVMLATAEELSSGDSQLPQ FTPVSQIYFSVQTGEQLLFSSRGESPIAALPSGPALWLQPDLASHTGDCGCLAPDTLEPH IPMDINESSSATVTSAIGADLRRRATTA >gi568815586f:64510633_64794909|GENSCAN_predicted_CDS_3|447_bp atggcggccagcagccagcacagcctctctctcctgagccccataagcggaggctggggg cagcctgtaggaagtgtggtcttcaagaaaatgggtccaggggtctgcagccggggccat cagtcagttatgctcgccacagcagaagagctgagcagtggggactcacagctgccacag ttcacccctgtatcccagatctacttttccgtgcagactggggagcagctcctcttctct tcccgaggggaaagccctatagctgcactcccttcaggtccagccctctggctgcagcct gaccttgctagtcatactggtgattgtggctgcctggctcctgacactctggagcctcat attcccatggatattaatgagtccagctctgctactgtcacttctgccatcggcgctgac ctgcggaggagggccaccactgcttag >gi568815586f:64510633_64794909|GENSCAN_predicted_peptide_4|93_aa MISYIFMTVSFPMNSKGNARKQIHSAKVSKKDITVTLGRWRKPEIAKFVKKRGLFGSQFC RLYKHGTSDVQGFGVKFPSAQQVLILCTACVPR >gi568815586f:64510633_64794909|GENSCAN_predicted_CDS_4|282_bp atgatatcctacatcttcatgactgtgtcctttcctatgaattccaaaggaaatgcacgc aaacaaatacattcagccaaagtgtctaagaaagacataacggtcacccttgggaggtgg aggaaacccgaaattgcaaaatttgtgaagaaaagaggtttatttggctcacagttctgc aggctgtacaagcatggcaccagtgatgttcaaggatttggagttaaatttcccagtgcc caacaagtgctcatcctgtgcactgcatgtgtccctcgctaa >gi568815586f:64510633_64794909|GENSCAN_predicted_peptide_5|115_aa MSSGYSSLEEDAEDFFFTARTSFFRRAPQGKPRSGQQPQILFYSNEEKTNTENWYQERGI AIKIPENVELALALGNRQRLEEFGGMSLANLINKFDYLHAKEEAVPQGKPEAALP >gi568815586f:64510633_64794909|GENSCAN_predicted_CDS_5|348_bp atgagcagcggctacagcagcctggaggaggacgccgaggacttcttcttcaccgccagg acctccttcttcaggagagcgccccagggcaagccccgctccggccaacaacctcagata ctcttttatagcaatgaagaaaagacaaacacagaaaattggtaccaggagaggggcatc gcaataaagatacctgaaaatgtggaactggctttggcactgggtaataggcagagatta gaagagtttggagggatgtcattggccaatttgataaacaaattcgattatctccatgcc aaagaggaagcagtgcctcaaggcaagcctgaagcagcgctgccatga >gi568815586f:64510633_64794909|GENSCAN_predicted_peptide_6|212_aa MLGLFLLSLPQDVEKEKETHSYLSKEEIKEKVHKYNLAVTDKLKMTLNSNGIYTGFIKVQ MELCKPPQTSPNSGKLSPSSNGCMNTLHISSTNTVGEVIEALLKKFLVTESPAKFALYKR CHREDQVYACKLSDREHPLYLRLVAGPRTDTLSFVLREHEIGEWEAFSLPELQNFLRILD KEEDEQLQNLKRRYTAYRQKLEEALREVWKPD >gi568815586f:64510633_64794909|GENSCAN_predicted_CDS_6|639_bp atgctgggcctgttcctgctttcgctgccccaggatgttgagaaagagaaggaaacccac agttacctcagcaaagaggagatcaaagagaaagttcataaatacaacttagcagtcaca gacaagttgaagatgaccttgaattcaaatgggatttacactggcttcattaaagtacag atggaactctgcaaacctccacagacttctccaaattctggaaaactctctcccagtagc aatggctgtatgaatacacttcatatcagcagcacaaacactgtcggggaagtgatcgag gccctgctcaaaaagtttctcgtgactgagagccctgccaagtttgcactttataagcgt tgtcacagggaagaccaagtctacgcctgcaagctctcagaccgggaacatccactctac ctgcgtttggtagcagggcccagaacagacacacttagttttgttcttcgtgaacatgaa attggagagtgggaagccttcagccttccagaactacagaatttcttgcgcatcttggac aaggaagaagatgaacagctgcagaacctgaagaggcgctacacagcctacaggcagaag ctggaagaagccctccgtgaggtgtggaagcctgattaa >gi568815586f:64510633_64794909|GENSCAN_predicted_peptide_7|505_aa MRLLPLAPGRLRRGSPRHLPSCSPALLLLVLGGCLGVFGVAAGTRRPNVVLLLTDDQDEV LGGMYGAPDAGGLEHVPLGWSYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLTDVLA NVSLDFLDYKSNFEPFFMMIATPAPHSPWTAAPQYQKAFQNVFAPRNKNFNIHGTNKHWL IRQAKTPMTNSSIQFLDNAFRKRWQTLLSVDDLVEKLVKRLEFTGELNNTYIFYTSDNGY HTGQFSLPIDKRQLYEFDIKVPLLVRGPGIKPNQTSKMLVANIDLGPTILDIAGYDLNKT QMDGMSLLPILRGASNLTWRSDVLVEYQGEGRNVTDPTCPSLSPGVSQCFPDCVCEDAYN NTYACVRTMSALWNLQYCEFDDQEVFVEVYNLTADPDQITNIAKTIDPELLGKMNYRLMM LQSCSGPTCRTPGVFDPGSLVQVTIHFVIYQGRLQPKAQCPSMLPQRMKPMMPRLVLCTA GDNNTLACNHPGPSPSSCNANEGPV >gi568815586f:64510633_64794909|GENSCAN_predicted_CDS_7|1518_bp atgcggctcctgcctctagccccaggtcggctccggcggggcagcccccgccacctgccc tcctgcagcccagcgctgctactgctggtgctgggcggctgcctgggggtcttcggggtg gctgcgggaacccggaggcccaacgtggtgctgctcctcacggacgaccaggacgaagtg ctcggcggcatgtacggagccccagatgcaggtggactagaacacgttcctctgggttgg agttactggtatgccttggaaaagaattctaagtattataattacaccctgtctatcaat gggaaggcacggaagcatggtgaaaactatagtgtggactacctgacagatgttttggct aatgtctccttggactttctggactacaagtccaactttgagcccttcttcatgatgatc gccactccagcgcctcattcgccttggacagctgcacctcagtaccagaaggctttccag aatgtctttgcaccaagaaacaagaacttcaacatccatggaacgaacaagcactggtta attaggcaagccaagactccaatgactaattcttcaatacagtttttagataatgcattt aggaaaaggtggcaaactctcctctcagttgatgaccttgtggagaaactggtcaagagg ctggagttcactggggagctcaacaacacttacatcttctatacctcagacaatggctat cacacaggacagttttccttgccaatagacaagagacagctgtatgagtttgatatcaaa gttccactgttggttcgaggacctgggatcaaaccaaatcagacaagcaagatgctggtt gccaacattgacttgggtcctactattttggacattgctggctacgacctaaataagaca cagatggatgggatgtccttattgcccattttgagaggtgccagtaacttgacctggcga tcagatgtcctggtggaataccaaggagaaggccgtaacgtcactgacccaacatgccct tccctgagtcctggcgtatctcaatgcttcccagactgtgtatgtgaagatgcttataac aatacctatgcctgtgtgaggacaatgtcagcattgtggaatttgcagtattgcgagttt gatgaccaggaggtgtttgtagaagtctataatctgactgcagacccagaccagatcact aacattgctaaaaccatagacccagagcttttaggaaagatgaactatcggttaatgatg ttacagtcctgttctgggccaacctgtcgcactccaggggtttttgaccccggcagtctg gtgcaagtgactattcactttgtcatctatcagggcaggcttcagcctaaagctcaatgc ccatcaatgctcccacagaggatgaagcccatgatgccacgtctggtcctctgcacagca ggggataacaacactctggcctgcaatcatcctggcccctctccttcttcctgtaatgcc aacgaaggtcctgtttaa >gi568815586f:64510633_64794909|GENSCAN_predicted_peptide_8|227_aa MGNDHSLPQAQPSISPDAVAMLDLIPLLDKINMTTALTNPDCPGILQNILPIQYIDDIML IRQDEKELTSKGLYDQLKEEEKAPTWFIDKTAWFIAFTGDTSRYWKTQGDWDQAASKPQQ PYRKVIKLLKQKQKPSKGQQPQRLKIDNPIKMRKNQHKNAENSKRQNALFPPNDCITSPA RVWNQAKAVMAEMTEVEFRIWIGTNFTELKEDIVTQWKEAKNHDKTS >gi568815586f:64510633_64794909|GENSCAN_predicted_CDS_8|684_bp atgggaaatgatcatagcctaccacaagctcaaccaagtattagtcctgacgctgtggcc atgctggatttgatacctttgctagacaaaatcaatatgactacagccctgacaaatcca gactgcccaggtatcctgcagaacatcttacccatccaatacattgatgacatcatgctg atcaggcaggatgaaaaagagctcacgtccaagggcctgtatgaccagctgaaggaggag gaaaaagccccaacttggtttatagacaaaacagcatggtttatagccttcactggtgat acttccaggtactggaaaacccaaggtgactgggaccaggctgccagcaaaccacagcag ccctacagaaaagtgatcaagctgttaaaacaaaaacaaaaaccatccaaaggtcagcaa cctcaaagattgaaaatagataatcccataaagatgagaaagaatcagcacaagaatgct gaaaactcaaaaaggcaaaatgccctctttcctccaaatgactgcatcacctctccagca agggtttggaaccaggccaaggctgtgatggctgaaatgacagaagtagaattcagaata tggataggaacaaacttcactgaattaaaggaggatattgtaacccaatggaaggaagct aaaaatcatgataaaacatcatag >gi568815586f:64510633_64794909|GENSCAN_predicted_peptide_9|215_aa MDVLPTGGGRPGLRTELEFRGGGGEARLESQEEETIPAAPPAPRLRGAAERPRRSRDTWD GDEDTEPGEACGGRTSRTASLVSGLLNELYSCTEEEEAAGGGRGAEGRRRRRDSLDSSTE ASGSDVVLGGRSGAGDSRVLQELQERPSQRHQMLYLRQKGEGRAQQGPGGFLEPGCPRRL CPRSSAHLGCSAERSDTVSLDLTVVPTVAQIYVHT >gi568815586f:64510633_64794909|GENSCAN_predicted_CDS_9|648_bp atggacgtccttcccaccggcgggggccgcccggggctccggacggagctggaattccgc ggcggcggtggcgaggcgaggctggagagtcaggaggaagaaacgattcctgcagctccc ccagccccgcgcctccggggagcggcggagcggccgcggcgctcccgggacacgtgggac ggcgatgaggacacggagcccggcgaggcgtgcggcggccgcacaagccgcacggcgtcc ctggtgagcgggctgctcaacgagctgtacagctgcacagaggaggaggaggcggcgggc gggggccgcggggccgagggccgccggcggcgccgcgacagcctcgacagctccaccgag gcctcgggctccgacgtggtcctgggcggccgcagcggtgccggcgactcccgcgtgctg caggagctgcaggagcgaccgagccagcggcatcagatgctgtacctgcggcagaaaggt gagggccgggcgcagcagggtcccgggggcttcctggagccgggttgtcctcgccgtctc tgcccgcgctccagcgcgcacctgggctgctcggcggagcgctcagatactgtctctttg gatctgacggtggtacccaccgtggcacagatttatgtccacacttag >gi568815586f:64510633_64794909|GENSCAN_predicted_peptide_10|72_aa MGFLHVGQAGLQLLTSDANELKTILRELKYRIGIQSAKLLRHLKQKDRLLHKVQRNCDIV TACLQAVSQKRX >gi568815586f:64510633_64794909|GENSCAN_predicted_CDS_10|216_bp atggggtttctccatgttggtcaggctggtctccaactcctgacctcagacgctaatgaa ctgaagacgatccttcgagagctaaagtacagaattggcatccagtcggccaagttactt cggcatctgaagcagaaagataggcttctgcataaagtgcagaggaactgtgatattgtg actgcctgcttgcaggctgtgtcacagaagagaann