GENSCAN 1.0 Date run: 5-Nov-116 Time: 05:01:56 Sequence gi568815575r:24688395_24889069 : 200675 bp : 38.49% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.00 Prom + 1673 1712 40 -3.25 1.01 Init + 5568 5610 43 2 1 100 65 61 0.647 5.63 1.02 Intr + 11031 11155 125 1 2 101 100 110 0.932 12.88 1.03 Intr + 14857 14953 97 0 1 23 78 233 0.916 14.46 1.04 Intr + 15995 16075 81 0 0 85 77 202 0.949 17.49 1.05 Intr + 26160 26275 116 1 2 43 76 121 0.986 5.65 1.06 Intr + 26747 26809 63 1 0 94 84 34 0.680 1.60 1.07 Intr + 27968 28060 93 1 0 79 55 66 0.663 1.64 1.08 Intr + 28490 28577 88 1 1 53 119 64 0.877 4.62 1.09 Intr + 28896 29097 202 1 1 36 52 299 0.899 18.52 1.10 Intr + 29186 29364 179 2 2 15 89 165 0.990 7.94 1.11 Intr + 34761 34873 113 1 2 100 52 0 0.805 -3.22 1.12 Intr + 35941 36057 117 0 0 43 76 115 0.974 5.54 1.13 Intr + 37587 37661 75 2 0 14 123 94 0.689 4.29 1.14 Intr + 38539 38677 139 0 1 98 85 35 0.996 3.32 1.15 Intr + 39388 39542 155 2 2 102 -1 158 0.790 7.17 1.16 Intr + 40884 40973 90 2 0 99 20 76 0.550 1.17 1.17 Intr + 42248 42346 99 1 0 39 90 77 0.649 2.39 1.18 Intr + 43976 44060 85 1 1 103 68 49 0.886 2.87 1.19 Intr + 45361 45422 62 2 2 75 98 31 0.893 0.33 1.20 Intr + 47005 47094 90 0 0 90 89 50 0.959 4.57 1.21 Intr + 49231 49347 117 0 0 87 42 65 0.743 1.54 1.22 Intr + 50981 51156 176 1 2 89 80 127 0.988 9.82 1.23 Intr + 52981 53110 130 1 1 75 54 57 0.952 0.78 1.24 Intr + 53608 53727 120 0 0 22 68 122 0.879 3.37 1.25 Intr + 54836 54935 100 1 1 33 87 178 0.999 10.96 1.26 Intr + 57024 57148 125 1 2 89 68 57 0.998 3.18 1.27 Intr + 59917 60066 150 0 0 62 76 176 0.998 13.24 1.28 Intr + 60476 60598 123 1 0 70 84 83 0.975 5.96 1.29 Term + 73379 73540 162 1 0 82 39 79 0.129 -0.65 1.30 PlyA + 75830 75835 6 1.05 2.03 PlyA - 78386 78381 6 1.05 2.02 Term - 100696 99998 699 1 0 74 38 924 0.861 78.55 2.01 Init - 108402 108352 51 0 0 61 67 39 0.226 0.31 2.00 Prom - 108730 108691 40 -5.35 3.00 Prom + 115068 115107 40 -3.95 3.01 Init + 118187 118231 45 1 0 60 77 32 0.233 0.04 3.02 Intr + 122314 122406 93 0 0 77 100 60 0.910 5.34 3.03 Intr + 124264 124469 206 0 2 41 87 229 0.990 15.18 3.04 Intr + 126585 126717 133 0 1 106 94 97 0.935 11.83 3.05 Intr + 133058 133189 132 1 0 85 97 48 0.969 5.32 3.06 Intr + 138033 138207 175 2 1 86 106 249 0.998 25.09 3.07 Intr + 153258 153436 179 1 2 69 57 103 0.831 4.02 3.08 Intr + 155152 155283 132 0 0 57 95 120 0.809 9.52 3.09 Intr + 160484 160506 23 1 2 87 76 13 0.192 -4.58 3.10 Intr + 166933 167082 150 1 0 55 50 155 0.425 6.76 3.11 Term + 170941 171049 109 1 1 82 38 74 0.268 -1.20 3.12 PlyA + 173760 173765 6 1.05 4.04 PlyA - 174074 174069 6 1.05 4.03 Term - 186475 186394 82 0 1 70 42 87 0.202 -1.51 4.02 Intr - 188651 188555 97 0 1 79 67 93 0.305 4.45 4.01 Init - 198533 198491 43 2 1 78 111 35 0.595 5.43 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 180467 180306 162 2 0 98 36 94 0.874 2.15 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815575r:24688395_24889069|GENSCAN_predicted_peptide_1|1104_aa MAPVHGDDCEIGASALSDSGSFVSSRARREKKSKKGRQEALERLKKAKAGEKYKYEVEDF TGVYEEVDEEQYSKLVQARQDDDWIVDDDGIGYVEDGREIFDDDLEDDALDADEKGKDGK ARNKDKRNVKKLAVTKPNNIKSMFIACAGKKTADKAVDLSKDGLLGDILQDLNTETPQIT PPPVMILKKKRSIGASPNPFSVHTATAVPSGKIASPVSRKEPPLTPVPLKRAEFAGDDVQ VESTEEEQESGAMEFEDGDFDEPMEVEEVDLEPMAAKAWDKESEPAEEVKQEADSGKGTV SYLGSFLPDVSCWDIDQEGDSSFSVQEVQVDSSHLPLVKGADEEQVFHFYWLDAYEDQYN QPGVVFLFGKVWIESAETHVSCCVMVKNIERTLYFLPREMKIDLNTGKETGTPISMKDVY EEFDEKIATKYKIMKFKSKPVEKNYAFEIPDVPEKSEYLEVKYSAEMPQLPQDLKGETFS HVFGTNTSSLELFLMNRKIKGPCWLEVKSPQLLNQPVSWCKVEAMALKPDLVNVIKDVSP PPLVVMAFSMKTMQNAKNHQNEMPVASVPSSCDNQKMPTNTSSDVPGSGDVYKATTVLKS TEDMSKGLRNQLEESSTEQRWDNVRIIAMAALVHHSFALDKAAPKPPFQSHFCVVSKPKD CIFPYAFKEVIEKKNVKVEVAATERTLLGFFLAKVHKIDPDIIVGHNIYGFELEVLLQRI NVCKAPHWSKIGRLKRSNMPKLGGRSGFGERNATCGRMICDVEISAKELIRCKSYHLSEL VQQILKTERVVIPMENIQNMYSESSQLLYLLEHTWKDAKFILQIMCELNVLPLALQITNI AGNIMSRTLMGGRSERNEFLLLHAFYENNYIVPDKQIFRKPQQKLGDEDEEIDGDTNKYK KGRKKAAYAGGLVLDPKVGFYDKFILLLDFNSLYPSIIQEFNICFTTVQRVASEAQKVTE DGEQEQIPELPDPSLEMGILPREIRKLVERRKQVKQLMKQQDLNPDLILQYDIRQKALKL TANSMYGCLGFSYSRFYAKPLAALVTYKGREPGILQVIFAECVHGGKWRQMTLLVTHCLE RKAFGNDSSKTRARAWEAKLVITY >gi568815575r:24688395_24889069|GENSCAN_predicted_CDS_1|3315_bp atggcacctgtgcacggcgacgactgtgagataggggcgagtgctctgtcagattcaggg agttttgtatcttctcgagcccggcgagaaaaaaaatcaaagaaggggcgccaagaagcc ctagaaagactgaaaaaggctaaagctggtgagaagtataaatatgaagtcgaggacttc acaggtgtttatgaagaagttgatgaagaacagtattcgaagctggttcaggcacgccag gatgatgactggattgtggatgatgatggtattggctatgtggaagatggccgagagatt tttgatgatgaccttgaagatgatgcccttgatgctgatgagaaaggaaaagatggtaaa gcacgcaataaagacaagaggaatgtaaagaagctcgcagtgacaaaaccgaacaacatt aagtcaatgttcattgcttgtgctggaaagaaaactgcagataaagctgtagacttgtcc aaggatggtctgctaggtgacattctacaggatcttaacactgagacacctcaaataact ccaccacctgtaatgatactgaagaagaaaagatccattggagcttcaccgaatcctttc tctgtgcacaccgccacggcagttccttcaggaaaaattgcttcccctgtctccagaaag gagcctccattaactcctgttcctcttaaacgtgctgaatttgctggcgatgatgtacag gtcgagagtacagaagaagagcaggagtcaggggcaatggagtttgaagatggtgacttt gatgagcccatggaagttgaagaggtggacctggagcctatggctgccaaggcttgggac aaagagagtgagccagcagaggaagtgaaacaagaggcggattctgggaaagggaccgtg tcctacttaggaagttttctcccggatgtctcttgttgggacattgatcaagaaggtgat agcagtttctcagtgcaagaagttcaagtggattccagtcacctcccattggtaaaaggg gcagatgaggaacaagtattccacttttattggttggatgcttatgaggatcagtacaac caaccaggtgtggtatttctgtttgggaaagtttggattgaatcagccgagacccatgtg agctgttgtgtcatggtgaaaaatatcgagcgaacgctttacttccttccccgtgaaatg aaaattgatctaaatacggggaaagaaacaggaactccaatttcaatgaaggatgtttat gaggaatttgatgagaaaatagcaacaaaatataaaattatgaagttcaagtctaagcca gtggaaaagaactatgcttttgagatacctgatgttccagaaaaatctgagtacttggaa gttaaatactcggctgaaatgccacagcttcctcaagatttgaaaggagaaactttttct catgtatttgggaccaacacatctagcctggaactgttcttgatgaacagaaagatcaaa ggaccttgttggcttgaagtaaaaagtccacagctcttgaatcagccagtcagttggtgt aaagttgaggcaatggctttgaaaccagacctggtgaatgtaattaaggatgtcagtcca ccaccgcttgtcgtgatggctttcagcatgaagacaatgcagaatgcaaagaaccatcaa aatgagatgccagtagcatccgtcccctctagttgtgacaaccaaaaaatgcctacaaac acatcatcagatgtccccgggtcaggggatgtgtataaagcaacgacagtattaaaatct actgaagatatgtcaaaaggactcaggaatcaacttgaagagtcttctactgaacaaaga tgggacaatgtgaggattattgctatggcagctttggtccatcacagttttgcattggat aaagcagccccaaagcctccctttcagtcacacttctgtgttgtgtctaaaccaaaggac tgtatttttccatatgctttcaaagaagtcattgagaaaaagaatgtgaaggttgaggtt gctgcaacagaaagaacactgctaggttttttccttgcaaaagttcacaaaattgatcct gatatcattgtgggtcataatatttatgggtttgaactggaagtactactgcagagaatt aatgtgtgcaaagctcctcactggtccaagataggtcgactgaagcgatccaacatgcca aagcttgggggccggagtggatttggtgaaagaaatgctacctgtggtcgaatgatctgt gatgtggaaatttcagcaaaggaattgattcgttgtaaaagctaccatctgtctgaactt gttcagcagattctaaaaactgaaagggttgtaatcccaatggaaaatatacaaaatatg tacagtgaatcttctcaactgttatacctgttggaacacacctggaaagatgccaagttc attttgcagatcatgtgtgagctaaatgttcttccattagcattgcagatcactaacatc gctgggaacattatgtccaggacgctgatgggtggacgatccgagcgtaacgagttcttg ttgcttcatgcattttacgaaaacaactatattgtgcctgacaagcagattttcagaaag cctcagcaaaaactgggagatgaagatgaagaaattgatggagataccaataaatacaag aaaggacgtaagaaagcagcttatgctggaggcttggttttggaccccaaagttggtttt tatgataagttcattttgcttctggacttcaacagtctatatccttccatcattcaggaa tttaacatttgttttacaacagtacaaagagttgcttcagaggcacagaaagttacagag gatggagaacaagaacagatccctgagttgccagatccaagcttagaaatgggcattttg cccagagagatccggaaactggtagaacggagaaaacaagtcaaacagctaatgaaacag caagacttaaatccagaccttattcttcagtatgacattcgacagaaggctttgaagctc acagcgaacagtatgtatggttgcctgggattttcctatagcagattttacgccaaacca ctggctgccttggtgacatacaaaggaagggagccaggaattcttcaagtgatatttgct gagtgtgtacatggagggaaatggagacaaatgacattgcttgtgacccactgccttgag aggaaagccttcggtaatgattccagtaaaactagagctagagcttgggaagctaaactt gtgattacttactag >gi568815575r:24688395_24889069|GENSCAN_predicted_peptide_2|249_aa MEITEKQTEAKTFSRGDLSDTADTMGFRDLKSPAGLQVLNDYLADKSYIEGYVPSQADVA VFEAVSSPLPADLCHALRWYNHIKSYEKEKASLPGVKKALGKYGPADVEDTTGSGATDSK DDDDIDLFGSDDEEESEEAKRLREERLAQYESKKAKKPALVAKSSILLDVKPWDDETDMA KLEERVRSIQADGLVWGSSKLVPVGYGIKKLQIQCVVEDDKVGTDMLEEQITAFEDYVQS MDVAAFNKI >gi568815575r:24688395_24889069|GENSCAN_predicted_CDS_2|750_bp atggagatcacagagaagcagacagaagctaagacattcagcagaggggatctctcggat acagccgacaccatgggtttcagagacctgaaaagccccgccggcctccaggtgctcaac gattacctggcggacaagagctacatcgaggggtatgtgccatcacaagcagatgtggca gtatttgaagccgtgtccagcccactgcctgccgacttgtgtcatgccctacgttggtat aatcacatcaagtcttacgaaaaggaaaaggccagcctgccaggagtgaagaaagctttg ggcaagtatggtcctgccgatgtggaagacactacaggaagtggagctacagatagtaaa gatgatgatgacattgacctctttggatctgatgatgaggaggaaagtgaagaagcaaag aggctaagggaagaacgtcttgcacaatatgaatcaaagaaagccaaaaaacctgcactt gttgccaagtcttccatcttactagatgtgaaaccttgggatgatgagacagatatggcg aaattagaggagcgcgtcagaagcattcaagcagacggcttagtctggggctcatctaaa ctagttccagtgggatacggaattaagaaacttcaaatacagtgtgtagttgaagatgat aaagtcggaacagatatgctggaggagcagatcactgcttttgaggactacgtgcagtcc atggatgtggctgctttcaacaagatctaa >gi568815575r:24688395_24889069|GENSCAN_predicted_peptide_3|458_aa MEHYHQLLVRPTEKMMNLEVIYGDTDSIMINTNSTNLEEVFKLGNKVKSEVNKLYKLLEI DIDGVFKSLLLLKKKKYAALVVEPTSDGNYVTKQELKGLDIVRRDWCDLAKDTGNFVIGQ ILSDQSRDTIVENIQKRLIEIGENVLNGSVPVSQFEINKALTKDPQDYPDKKSLPHVHVA LWINSQGGRKVKAGDTVSYVICQDGSNLTASQRAYAPEQLQKQDNLTIDTQYYLAQQIHP VVARICEPIDGIDAVLIATWLGLDPTQFRVHHYHKDEENDALLGGPAQLTDEEKYRDCER FKCPCPTCGTENIYDNVFDGSGTDMEPSLYRCSNIDCKASPLTFTVQLSNKLIMDIRRFI KKYYDSYLVQVDLSVSVDKCIESYIYYHSQDTEQGHTFWHDLPSATHHTEYAANPKDAQN ETFIPCRPSACRTGELGIPQGSLMLDYLHTFTPAISSA >gi568815575r:24688395_24889069|GENSCAN_predicted_CDS_3|1377_bp atggaacattatcatcagctgctagtgaggcccactgaaaagatgatgaatcttgaagtt atttatggagatacagattcaattatgataaacaccaatagcaccaatctggaagaagta tttaagttgggaaacaaggtaaaaagtgaagtgaataagttgtacaaactgcttgaaata gacattgatggggttttcaagtctctgctactgctgaaaaaaaagaagtacgctgctctg gttgttgagccaacgtcggatgggaattatgtcaccaaacaggagctcaaaggattagat atagttagaagagattggtgtgatcttgctaaagacactggaaactttgtgattggccag attctttctgatcaaagccgggacactatagtggaaaacattcagaagaggctgatagaa attggagaaaatgtgctaaatggcagtgtcccagtgagccagtttgaaattaacaaggca ttgacaaaggatccccaggattaccctgataaaaaaagcctacctcatgtacatgttgcc ctctggataaattctcaaggaggcagaaaggtgaaagctggagatactgtgtcatatgtc atctgtcaggatggatcaaacctcactgcaagtcagagggcctatgcgcctgagcagctg cagaaacaggataatctaaccattgacacccagtactacctggcccagcagatccaccca gtcgtggctcggatctgtgaaccaatagacggaattgatgctgtcctcattgcaacgtgg ttgggacttgaccccacccaatttagagttcatcattatcataaagatgaagagaatgat gctctacttggtggcccagcacagctcactgatgaagagaaatacagggactgtgaaaga ttcaaatgtccatgccctacatgtggaactgagaatatttatgataatgtctttgatggt tcgggaacagatatggagcccagcttgtatcgttgcagtaacatcgattgtaaggcttca cctctgacctttacagtacaactgagcaacaaattgatcatggacattagacgtttcatt aaaaagtactatgatagctatcttgtgcaggtggacctttctgtgagtgttgacaaatgc atagaatcatatatctactaccacagtcaagatacagaacagggtcatactttctggcat gatcttccatctgcgacccaccatactgaatatgctgccaatcccaaagatgcacagaat gaaacttttatcccttgcagaccatctgcctgccgcactggagagcttggcattcctcag ggatcccttatgcttgactatctccacactttcactcctgccatttcctctgcgtag >gi568815575r:24688395_24889069|GENSCAN_predicted_peptide_4|73_aa MATSEPVLKNEWKFGLSVFSPVTKGDSHGEHNCNNLQNTMATHITTRILNSGGTLESPGQ RYNTAPPPECLIQ >gi568815575r:24688395_24889069|GENSCAN_predicted_CDS_4|222_bp atggcaacttctgaaccggtcttgaaaaatgagtggaagtttggattatcagttttttcg ccagttaccaagggcgactcgcatggtgaacataattgcaataatttgcaaaacaccatg gcaacccacattacaactagaattcttaactctggtggcacactggaatcaccagggcag cgttacaacacagcgccacccccagagtgtctgattcagtag