GENSCAN 1.0 Date run: 7-Nov-116 Time: 17:13:13 Sequence gi568815584f:88724708_88977407 : 252700 bp : 39.33% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.08 Intr - 1971 1834 138 2 0 66 86 114 0.145 7.66 1.07 Intr - 11858 11657 202 0 1 71 115 189 0.822 17.32 1.06 Intr - 14307 14172 136 0 1 100 85 108 0.996 10.92 1.05 Intr - 15865 15680 186 1 0 56 87 104 0.960 6.06 1.04 Intr - 21634 21478 157 0 1 50 89 62 0.445 1.59 1.03 Intr - 25349 25267 83 2 2 61 60 86 0.515 0.62 1.02 Intr - 29964 29805 160 2 1 98 91 104 0.903 10.77 1.01 Init - 42063 42041 23 0 2 73 119 2 0.549 1.17 1.00 Prom - 51801 51762 40 -5.95 2.00 Prom + 54554 54593 40 -7.15 2.01 Init + 57424 57945 522 0 0 48 87 276 0.630 18.40 2.02 Term + 67512 67916 405 2 0 54 46 442 0.216 30.80 2.03 PlyA + 69141 69146 6 1.05 3.00 Prom + 75770 75809 40 -5.35 3.01 Init + 78659 78810 152 1 2 82 45 101 0.117 4.76 3.02 Intr + 94476 94544 69 0 0 132 75 43 0.314 4.78 3.03 Intr + 95879 95907 29 2 2 92 82 25 0.212 -0.86 3.04 Intr + 99867 100114 248 1 2 94 86 163 0.173 12.96 3.05 Intr + 108986 109015 30 1 0 95 103 46 0.059 4.21 3.06 Intr + 114745 114865 121 0 1 45 113 136 0.104 10.95 3.07 Intr + 116330 116489 160 0 1 62 51 112 0.137 3.02 3.08 Intr + 132483 132570 88 0 1 62 106 91 0.668 7.35 3.09 Intr + 136515 136625 111 2 0 83 76 55 0.915 3.46 3.10 Intr + 140500 140590 91 0 1 44 81 91 0.728 2.65 3.11 Term + 141810 141952 143 1 2 86 41 117 0.949 3.91 3.12 PlyA + 142596 142601 6 1.05 4.00 Prom + 144290 144329 40 -5.75 4.01 Init + 145355 145491 137 1 2 48 55 152 0.770 7.56 4.02 Intr + 146842 147016 175 1 1 49 68 181 0.998 11.22 4.03 Intr + 147623 147745 123 1 0 56 82 160 0.973 12.06 4.04 Term + 152587 152703 117 0 0 81 38 72 0.161 -0.94 4.05 PlyA + 153258 153263 6 1.05 5.03 PlyA - 153312 153307 6 1.05 5.02 Term - 159433 159324 110 2 2 88 47 76 0.778 1.19 5.01 Init - 160101 159666 436 0 1 54 -8 303 0.312 13.67 5.00 Prom - 166815 166776 40 -1.15 6.00 Prom + 172943 172982 40 -5.25 6.01 Init + 185562 185967 406 2 1 60 79 236 0.725 16.70 6.02 Intr + 189805 189938 134 2 2 48 116 68 0.795 5.04 6.03 Intr + 190006 190075 70 0 1 -1 61 108 0.487 -2.96 6.04 Intr + 191090 191260 171 0 0 72 109 58 0.419 5.29 6.05 Intr + 196068 196108 41 1 2 135 31 44 0.173 0.42 6.06 Intr + 197420 197611 192 1 0 97 52 124 0.366 8.47 6.07 Intr + 202183 202372 190 0 1 55 78 98 0.044 3.74 6.08 Intr + 221524 221672 149 2 2 -6 94 150 0.036 5.23 6.09 Intr + 221928 222011 84 2 0 9 35 156 0.034 1.50 6.10 Intr + 227721 227799 79 2 1 80 82 69 0.036 3.71 6.11 Intr + 232261 232444 184 2 1 61 101 159 0.416 12.42 6.12 Intr + 237713 237784 72 2 0 76 80 54 0.180 1.00 6.13 Intr + 243519 243692 174 0 0 24 87 112 0.037 2.83 6.14 Intr + 249472 249646 175 1 1 38 68 145 0.300 6.52 6.15 Term + 250607 250666 60 1 0 93 45 38 0.199 -3.17 6.16 PlyA + 251172 251177 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 114781 114865 85 0 1 60 113 127 0.878 13.43 S.002 Term + 116330 116493 164 0 2 62 32 126 0.834 1.52 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815584f:88724708_88977407|GENSCAN_predicted_peptide_1|362_aa MPLKLQNCLALHPERVLVATGQVGKEPYICIWDSYTVQTISVLKDVHTHGIACLAFDLDG QCSVKAVIDNARRNECGYIPVKLYLQKHSFSLICLPFYSFLDFLDINQRLVSVGLDSKNA VCVWDWKRGKMLSMAPGHTDRFWSLCGNALTPKRGVFGKTGDLQTILCLACARDELTYSG ALNGDIYVWKGINLIRTIQGAHAAGIFSMNACEEGFATGGRDGCIRLWDLTFKPITVIDL RETDQGYKGLSVRSVCWRGDHILVGTQDSEIFEIVVQERNKPFLIMQGHCEGELWALAVH PTKPLAVTGSDDRSVRIWSLVDHALIARCNMEEPIRCAAVNADGIHLALGMKDGSFTVLR VS >gi568815584f:88724708_88977407|GENSCAN_predicted_CDS_1|1086_bp atgcccctcaagctgcaaaactgccttgcattgcatcctgaacgagtgttggtagcaaca ggacaagttgggaaagagccttatatttgtatttgggattcatacactgtgcagaccata tcagttttaaaggatgttcatacacatggtatagcttgcttggcgtttgacttagatgga cagtgcagtgtgaaagcagtcatagacaatgcacgaaggaatgagtgtggctatattcca gtaaaactttatttacaaaaacatagtttctctttgatttgtttgcctttttattccttc ctggacttcttggacataaatcagcgcttggtttcagttggacttgattcaaagaatgca gtttgtgtttgggactggaaaaggggaaaaatgttgtctatggctcctggtcatacagat agattctggagtttatgtggaaatgctctgaccccaaaacgaggtgtctttggtaagacg ggtgaccttcagacaatactgtgcctagcctgtgcaagggatgaattaacatattctggt gcactcaatggggatatatatgtttggaaaggaatcaatcttatacgaacaatacaagga gcccatgctgcaggaatttttagcatgaatgcttgtgaagaaggctttgctactggtggc agagatggttgtattcgtctttgggatttaacttttaaaccaattactgtgattgatctc agggaaacagaccagggatacaaaggtttgtctgtaaggagtgtgtgttggcgaggtgac cacattctagttggaacacaggacagtgaaatttttgaaattgtggtgcaagaaagaaat aaaccttttctaattatgcaagggcattgtgaaggtgaactttgggcacttgctgtccat cctactaaacctttggctgtgactggaagtgatgatcgttcggtcaggatatggagccta gtagatcatgcattaatagcaaggtgtaatatggaagaaccaattcgttgtgcagctgtc aatgcagatggaatccatcttgcccttggaatgaaggatggctcattcactgtacttaga gtaagn >gi568815584f:88724708_88977407|GENSCAN_predicted_peptide_2|308_aa MDNKVHPEVVSDGEEKLVGNWSKDQSYYAKRLAAFCPCPRDLWNFELKRDDLGYLVEEIS KRQRIQEEAEHESLEIFQPDDAVEKKNPFSGEKFKPATEIYIRTRSQRLITKTKRKTSPG HVRDFCGSPSHHRLGGLRGKTGFLCQVQDTSATYSLETWCPVSPPLQLWLKEAKERSRRP EEPAGANSGSGLQGDGGGPRSPVPDDVVAVAPVELLPVLPRAVHHADPRHEVYDLLGRSV VEVVAALVAPVAVHPLEPQVAARGAPGRHVGAPTRRSRSGPRRRREAAAARQRKPSRWLP GLPASRVL >gi568815584f:88724708_88977407|GENSCAN_predicted_CDS_2|927_bp atggacaataaggtccaccctgaggtggtctcagatggagaagagaaacttgttgggaac tggagtaaagatcaatcttactatgcaaagagactggcagcattttgcccctgccctaga gatctgtggaactttgaacttaagagagatgatttagggtatctggtggaagaaatttct aagcgacaaagaattcaagaggaagcagagcatgaaagtttggaaatttttcagcctgat gatgcagtagaaaagaaaaacccattttctggggagaaattcaagcctgctacagaaatc tacataagaacgaggagccaaaggctaatcacaaagacaaagaggaaaacgtctccaggc catgtcagagacttttgcggcagcccctcccatcacaggctcggaggcctcagaggaaaa actggtttcctgtgccaggtccaagacacctctgctacatacagcctagagacttggtgc cctgtgtccccaccactccagctgtggctaaaagaggccaaggagcggtcacggcgtcca gaggaacccgcgggtgcaaactccgggagcggcttgcagggtgacggcggcggcccccgc tccccggtacctgatgatgtcgtcgctgtggccccggtagaacttctgcctgtgctcccg cgggctgtacaccacgccgacccccgccacgaagtatacgatctccttggccgcagtgta gtagaggttgttgcggcactggtggccccggtagccgtacacccactcgagccgcaggtg gcagctcggggcgctccgggccgccatgtcggggcgcccacccgccgctcccgctcgggc ccgcggcggcgacgggaggcggcggcggcccggcaacgaaagccctcccgctggctgccg ggacttcccgccagccgcgtcctctaa >gi568815584f:88724708_88977407|GENSCAN_predicted_peptide_3|413_aa MKAEICKSRTVKDWHNHQKLGEGNKTDSPPQPSEGTNTANTLILDFWPLEMIACCWETFP QRLTAELFMNPTHRWHGYKNQSVGALRAPLGQGPSRRGLPGRVGRRQLFTPRPPLSWSAG PSLAAPAAMSSEMEPLLLAWSYFRRRKFQLCADLCTQMLEKSPYDQEPDPELPVHQAAWI LKARALTEMVYIDEIDVDQEGIAEMMLDENAIAQVPRQSHKLEDPLQVSSGPARRVEGQA LWNRLSEHPEPPTQPALSPAPPEDLSGWERLGMYREAEKQFKSALKQQEMVDTFLYLAKV YVSLDQPVTALNLFKQGLDKFPGEVTLLCGIARIYERSSNFTVYEITLGVDGKYDPSSSS ESEVLEAYTVAIDPLCESKITTEMRQQPSSSCIDYTAPLLYLGLGDTYLEELN >gi568815584f:88724708_88977407|GENSCAN_predicted_CDS_3|1242_bp atgaaggcagagatttgcaagtcaaggactgtcaaggactggcacaaccaccagaagcta ggagaagggaataaaactgattctcccccacaaccctcggaaggaactaacactgccaac actttgatcttggacttctggcctctagaaatgattgcatgctgttgggaaacttttcct cagcggctgactgcagagctgtttatgaaccccacacacagatggcatggttacaagaac cagagtgtaggcgcgctgcgggcacctctcggacaaggccccagccgtcgcgggttgccg ggcagagtcggacgccgccagctcttcactccacgcccacctctctcctggagcgctggg ccttcgctggccgcaccggcagccatgagctcggagatggagccgctgctcctggcctgg agctattttaggcgcaggaagttccagctctgcgccgatctatgcacgcagatgctggag aagtccccttatgaccaggaaccagatcctgaattgccagtgcatcaggcagcttggatc ttaaaagcaagagcgctaacagaaatggtatacatagatgaaattgatgtagatcaggaa ggaattgcagaaatgatgctggatgaaaatgctatagctcaagttccacgccaatcacac aagctggaagacccattacaggtttcctcaggcccagcacgcagagtggaaggccaggca ctatggaacaggctatcagaacacccagaaccgcctacacagcccgccctatcaccagct cctccggaagatttgtcaggctgggaacggttgggaatgtatcgtgaagcagaaaaacag tttaaatcagccctgaagcagcaggaaatggtagatacatttctgtacttggcaaaagtt tatgtctcattggatcaacctgtgactgctttaaatcttttcaaacaaggcttagataag tttccaggagaagtaaccctgctctgtggaattgcaagaatctatgagaggtcttcaaac tttactgtgtatgaaattaccttgggagtggatgggaagtatgacccatcctcttcctct gaatcagaagtcctggaagcctacacagtggccattgaccctctctgtgaaagtaaaatc accacagagatgagacaacaaccaagttccagctgcattgactataccgccccccttttg tacttagggcttggtgacacctatttggaagaacttaattaa >gi568815584f:88724708_88977407|GENSCAN_predicted_peptide_4|183_aa MNNMSSAAEYYKEVLKQDNTHVEAIACIGSNHFYSDQPEIALRFYRRLLQMGIYNGQLFN NLGLCCFYAQQYDMTLTSFERALSLAENEEEAADVWYNLGHVAVGIGDTNLAHQCFRLAL VNNNNHAEAYNNLAVLEMRKGHVEQIGDLQRSYVAAQKSEAAFPDHVDTQHLIKQLRQHF AML >gi568815584f:88724708_88977407|GENSCAN_predicted_CDS_4|552_bp atgaacaatatgtcatcagcagcagaatattacaaagaagttttgaaacaagacaatact catgtggaagccatcgcatgcattggaagcaaccacttctattctgatcagccagaaata gctctccggttttacaggcggctgctgcagatgggcatttataacggccagctttttaac aatctggggctgtgttgcttctatgcccagcagtatgatatgactctgacctcatttgaa cgtgccctttctttggctgaaaatgaagaagaggcagctgatgtctggtacaacttggga catgtagctgtgggaataggagatacaaatttggcccatcagtgcttcaggctggctctg gtcaacaacaacaaccacgccgaggcctacaacaacctggctgtgctggagatgcggaag ggccacgttgaacagattggagatctgcagagaagctatgttgctgcgcagaagtctgaa gcagcatttccagaccatgtggacacacaacatttaattaaacaattaaggcagcatttt gctatgctctga >gi568815584f:88724708_88977407|GENSCAN_predicted_peptide_5|181_aa MWECLELPRDLLKGFDLKADNDMDNEVQAKVVLDGNEELVGNWNKGHSCYALAKRLAAFY PCPRDLWNFELERGDLGYLVEEISKQQSIQEEAEHKSLENLQPDNVIENKNPYSEEKFKP AAEIGISNKEPNINHQDNGENISRACVQKTKIEVSEPPPRFQRMYGKVWQTCNSYYLRAS L >gi568815584f:88724708_88977407|GENSCAN_predicted_CDS_5|546_bp atgtgggaatgtttggaacttcctagagacttgttgaaaggctttgacctaaaagctgat aatgatatggacaatgaagtccaggctaaggtggtcttggatggaaatgaagaacttgtt gggaactggaacaaaggtcactcttgctatgctttagcaaagaggctggcagcattctac ccctgccctagagacctgtggaactttgaacttgagagaggtgatttagggtatctggtg gaagaaatttctaagcagcaaagcattcaagaggaggcagagcataaaagtttagaaaat ttgcagcctgacaatgtgatagaaaacaaaaacccatattctgaggagaaattcaagcct gctgcagaaattggcataagtaacaaggaaccaaatattaatcaccaagacaatggggaa aatatctccagggcatgtgtgcagaagacaaaaattgaggtttcggaacctccacctaga tttcagaggatgtatggaaaagtgtggcagacttgtaacagttactacttgagagcatca ctatga >gi568815584f:88724708_88977407|GENSCAN_predicted_peptide_6|726_aa MGASVLSKIASYDFYGSSRLSSSAGKNTTLGFHVSAVGVFISGAPLDGVCCLRSFGAGAC PECQEHALPRLSHQPCCERVWHPLESSRCRHDVVVPAIQGTATTIPGSCRFLHGMPVEDD SINHQGCLGHSLSPQAHSTLILETTCKKGGGKIGLVQGHSENCLAGEKLGDASKRGLSQK QQDERQEYKDIGTATRKDVSLGAECYHGGCSPLRAVVRYRKSNPVFPYHLFMSLIFNLTV TEKNEFGWFHDKPKKAHLKAAPPLAVTTMSLECMDLQLKVVTNSTPGTWGNKPFLKEDLN GVSQWPPQFTFVHLNLVFIYILEAASSASWGEPRSSVSKHENRLMQAYGTAMRVNKLQLY VIKWMNLINMMLGENRRKTIGSVKFHSREVQKQSRLFNALKKGGREGRRRKRRCWRGKRE KKEREDEEEKEEEEEEKGEKEEREREKNKEESALDLRAQGDGKQAYRHPLTEHLAGAQRC QPDREIIDEEEQRSERSHERDKTFSELLEATRGWGVGTNAHEQCQPVIVWARYERLFGEG SGFEKEFDLILIKFPLLNKRTLKHREIDSQNHHSVALAVGSKWPFRLMLIGPLQPTAGML RGWWKKAPGGARLSMADTRRDVLCMEGMQILTRKVLWNLEQNGKEILLLKRERPGQEHST LGNGKSPVWLQNKSLGRGRVSTQGLRPRGEGPTGIWAFMNLGFQESWPHGFLAILHTPDS LPLQGL >gi568815584f:88724708_88977407|GENSCAN_predicted_CDS_6|2181_bp atgggggcttccgttctctctaagatagcatcctatgatttctatgggtcctctagactg tcctcgagtgccggtaagaacaccactcttgggttccatgtgtcagctgttggggtcttc atttctggagccccgctggatggtgtctgctgcctgaggagttttggggctggtgcctgc cctgagtgccaagagcatgctctcccacggctttcccaccagccctgctgtgaaagggtc tggcacccactagaaagctccagatgccgtcatgacgtggttgtgcctgccatccagggc actgccaccaccatccccgggagctgtcggttcctccatggtatgccagttgaagatgac agcatcaaccaccaaggctgcctgggccacagcctgtccccacaagcacattctactctt attcttgagactacatgcaaaaaagggggagggaaaattgggttggttcaaggtcactca gagaactgtcttgcaggggagaaacttggcgatgcaagcaagagagggttatcgcagaag caacaggatgaaagacaagagtataaagatataggcactgctacccggaaagatgtctcc ttgggggcagagtgttaccacggtggttgcagtccactaagggcagttgtgagataccgg aagtcaaacccagttttcccttatcacctctttatgtcactgattttcaatttaacagtt acagagaaaaatgaatttggctggtttcacgataaaccaaagaaagcacacctgaaggca gctcctcccctggcggtcaccaccatgagtttggagtgtatggacttacagctgaaggta gtcaccaacagcactccaggcacctggggaaacaagccctttctgaaagaggatctcaat ggagtatcacagtggcccccacagttcacctttgtgcacttgaatctcgtcttcatatac attctggaagcagcttcttctgcctcttggggggaacctagaagcagtgtatcaaaacat gagaacagactaatgcaggcatacggcacagcaatgagagtgaacaaactacaactatat gtgataaaatggatgaatctcataaatatgatgttgggtgaaaacagacgcaaaaccata ggttctgtaaaattccattcacgtgaagtacagaaacagtcaagactattcaatgccctt aaaaaaggaggaagggagggaagaagaagaaagagaagatgttggagagggaagagagag aagaaggaaagagaggatgaagaggagaaggaggaagaggaggaagagaaaggggaaaag gaggaaagagaaagagagaagaacaaggaggagtcagccctagatctgcgagcccaaggt gatggaaaacaggcctatcggcacccccttactgaacaccttgcaggtgcccagaggtgt caacctgacagagaaataattgatgaagaagaacaaagaagtgaaagatcacatgaaaga gacaaaaccttctcagaacttttagaagccaccaggggctggggagtgggaacgaatgca catgaacaatgccaacctgtgattgtgtgggccagatacgaacgactctttggagaaggc tctggatttgagaaagaatttgacctaattctcattaaattcccattgttgaataagaga actctgaaacatagagaaatagactcccagaatcaccactcagtggccctggctgtgggc tctaagtggccctttaggcttatgttgattgggcccctgcaacccacagcaggcatgctg agaggttggtggaagaaggcgccaggtggagcaagactaagcatggcagacacaagaaga gacgtcctctgcatggaagggatgcagattttgacacggaaagtgctttggaatctggaa caaaatgggaaagaaatactcttgctgaagagggaacggcctggacaagagcacagtaca ttaggaaatggcaagagtccagtgtggctgcagaacaagagtctggggagaggtcgggta tcaactcaaggcctgaggccaagaggtgaaggacctacaggaatctgggctttcatgaac ctgggctttcaagaatcctggccacatgggttccttgctattcttcacaccccagattca ctccctcttcagggcctgtga