GENSCAN 1.0 Date run: 4-Nov-116 Time: 01:23:09 Sequence gi568815591f:129334473_129583066 : 248594 bp : 43.59% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 11576 11622 47 1 2 69 116 9 0.111 1.95 1.02 Intr + 33998 34079 82 2 1 80 101 20 0.003 2.14 1.03 Intr + 37297 37379 83 0 2 114 71 27 0.200 2.04 1.04 Intr + 41370 41454 85 0 1 22 94 55 0.127 -0.68 1.05 Intr + 45169 45277 109 0 1 78 109 84 0.864 9.36 1.06 Intr + 54584 54727 144 0 0 64 98 155 0.723 14.35 1.07 Intr + 55162 55262 101 2 2 75 68 34 0.890 -0.07 1.08 Intr + 62750 62852 103 1 1 97 83 119 0.881 12.05 1.09 Intr + 65818 65912 95 2 2 115 64 44 0.883 4.38 1.10 Intr + 68907 69013 107 2 2 66 110 76 0.973 6.61 1.11 Intr + 70625 70741 117 2 0 103 30 109 0.977 6.18 1.12 Intr + 71364 71427 64 0 1 104 57 52 0.983 2.42 1.13 Intr + 71906 71994 89 1 2 138 107 81 0.745 13.77 1.14 Intr + 75004 75074 71 1 2 44 80 93 0.191 2.93 1.15 Intr + 79122 79216 95 1 2 111 83 77 0.059 9.18 1.16 Intr + 88323 88466 144 2 0 108 94 70 0.670 10.08 1.17 Intr + 90402 90470 69 2 0 62 110 24 0.718 1.38 1.18 Intr + 90591 90669 79 2 1 102 106 103 0.997 12.62 1.19 Term + 91971 92203 233 1 2 68 46 152 0.788 5.64 1.20 PlyA + 92328 92333 6 1.05 2.00 Prom + 93103 93142 40 -7.56 2.01 Init + 100001 100194 194 1 2 93 45 259 0.663 20.54 2.02 Intr + 102128 102173 46 2 1 47 50 63 0.075 -3.19 2.03 Intr + 105550 105619 70 0 1 67 80 66 0.114 2.45 2.04 Intr + 109552 109626 75 2 0 108 86 68 0.076 8.09 2.05 Intr + 117141 117275 135 1 0 59 113 157 0.695 15.84 2.06 Intr + 118755 118875 121 1 1 101 24 134 0.965 7.85 2.07 Intr + 119670 119738 69 0 0 94 109 61 0.991 7.10 2.08 Intr + 119949 120055 107 0 2 79 60 71 0.982 3.26 2.09 Intr + 120772 120899 128 2 2 130 101 73 0.997 13.20 2.10 Intr + 121967 122170 204 1 0 116 51 174 0.996 15.90 2.11 Intr + 123743 123978 236 1 2 116 96 229 0.886 22.99 2.12 Intr + 124240 124305 66 1 0 59 92 94 0.865 4.92 2.13 Intr + 125045 125108 64 2 1 79 62 14 0.398 -3.38 2.14 Intr + 125829 125900 72 2 0 80 85 123 0.987 10.80 2.15 Intr + 128494 128568 75 0 0 46 91 65 0.829 2.31 2.16 Intr + 129843 129895 53 2 2 95 68 101 0.746 6.51 2.17 Intr + 130140 130266 127 0 1 91 99 82 0.868 10.28 2.18 Intr + 131395 131523 129 0 0 53 71 61 0.734 1.79 2.19 Intr + 136115 136243 129 1 0 42 107 91 0.672 7.29 2.20 Intr + 146313 146417 105 2 0 58 101 58 0.920 4.51 2.21 Intr + 148370 148574 205 1 1 48 101 184 0.952 14.47 2.22 Intr + 151107 151309 203 1 2 97 22 147 0.147 8.00 2.23 Term + 177775 177969 195 0 0 39 48 221 0.483 10.61 2.24 PlyA + 178425 178430 6 1.05 3.00 Prom + 183035 183074 40 -3.16 3.01 Init + 187649 187708 60 1 0 86 113 10 0.346 4.56 3.02 Intr + 195149 195271 123 1 0 97 26 61 0.088 1.58 3.03 Intr + 198486 198552 67 2 1 49 131 -6 0.005 -1.72 3.04 Term + 216137 216318 182 0 2 115 45 122 0.864 8.37 3.05 PlyA + 217428 217433 6 1.05 4.00 Prom + 219517 219556 40 -4.46 4.01 Init + 224032 224145 114 0 0 53 81 117 0.906 7.71 4.02 Intr + 232032 232098 67 2 1 67 33 80 0.043 -1.12 4.03 Intr + 248117 248293 177 0 0 6 116 179 0.039 12.39 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr + 248030 248293 264 0 0 55 116 199 0.877 16.88 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815591f:129334473_129583066|GENSCAN_predicted_peptide_1|638_aa MVTEKIIREDLWDKRRTYLTQACTMEKWDGNEGTSAFHMPEWMEIWLIDFHEYPASLMPD ILLARSNPFHRGGSGAGNVTMLGSKKKYIVNGNSGIKAQIQFADQKQEFNKRPTKIGRRS LSRSISQSSTDSYSSAASYTDSSDDETSPRDKQQKNSKGSSDFCVKNIKQAEFGRREIEI AEQEMPALMALRKRAQGEKPLAGAKIVGCTHITAQTAVLMETLGALGAQCRWAACNIYST LNEVAAALAESGFPVFAWKGESEDDFWWCIDRCVNVEGWQPNMILDDGGDLTHWIYKKYP NMFKKIKGIVEESVTGVHRLYQLSKAGKLCVPAMNVNDSVTKQKFDNLYCCRESILDGLK RTTDMMFGGKQVVVCGYGEVGKGCCAALKAMGSIVYVTEIDPICALQACMDGFRLVKLNE VIRQVDIVITCTGNKNVVTREHLDRMKNSCIVCNMGHSNTEIDVWGLLWLMLYLLYVFQA SLRTPELTWERVRSQVDHVIWPDGKRIVLLAEGRLLNLSCSTVPTFVLSITATTQALALI ELYNAPEGRYKQDVYLLPKKMDEYVASLHLPTFDAHLTELTDEQAKYLGLNKNGPFKPNY YRCLGLPRNQGHLGSDELPALELVFASPTTYAYMNSEK >gi568815591f:129334473_129583066|GENSCAN_predicted_CDS_1|1917_bp atggtaacagaaaagattatcagagaggatttgtgggataagagaaggacatatcttacc caagcctgcactatggagaagtgggacggtaatgagggcacctcagcttttcacatgcct gagtggatggaaatctggttgattgactttcatgagtatccagccagcttgatgcccgat attcttttggcaagaagtaatccttttcatagaggtggcagtggggctggtaatgtcact atgctgggcagcaagaagaaatacattgttaatggcaactctgggattaaggcccagatc cagtttgctgaccagaagcaagaattcaacaaacgtcccaccaaaattggacgtcgctct ttgtctcgttccatttctcagtcatctactgacagctacagctcagcggcttcatataca gatagctctgatgatgagacatcgcccagggacaagcagcaaaagaactctaagggaagc agtgacttctgtgttaagaacatcaaacaggcagagtttggacgaagagaaattgaaatt gctgaacaagaaatgcctgcattgatggctttgaggaagagagctcaaggagaaaagcct ttggctggagccaaaatcgtgggttgcacacacatcactgctcagactgctgtgcttatg gaaactctgggtgctctgggggcccagtgccgatgggctgcctgcaacatctattccact ctcaatgaagtggctgctgctctagcagaaagtggatttcctgtttttgcctggaaggga gagtcagaagatgacttttggtggtgtatcgatagatgtgtgaatgtggagggctggcag ccaaacatgatcttggatgatggaggggatcttacccactggatttataaaaagtatccc aacatgtttaagaaaatcaagggcatagtagaggagagtgttactggagttcacaggctg taccaactgtccaaagctgggaagctgtgtgttccagccatgaatgtcaatgactcagtc accaaacagaaatttgacaacctctactgttgccgtgaatcaattcttgatggacttaaa aggacaacagacatgatgtttggtggaaagcaagtggtagtctgtggctatggagaggtg gggaaagggtgctgtgctgccctgaaagccatgggctccattgtgtatgtaactgaaatt gaccccatctgtgccctgcaagcctgtatggatggatttcgactggtgaaattaaatgag gtcatccgacaagtggacattgttattacctgtacaggtaacaagaatgtggtaaccaga gagcacttggaccgtatgaagaatagctgcatcgtttgtaacatgggacattccaacaca gagattgacgtgtggggcttgctatggctaatgctgtacttgctgtatgtgttccaggcg agtctgcggacaccagaactgacctgggagcgagtgagatctcaagttgaccatgtgata tggcctgatggcaagaggatagtactgctggcagagggccgcctgctgaaccttagctgc tccacagtgcctacatttgtgctctcaatcactgctactactcaggctcttgccttgata gagctttacaatgctcctgagggtcgctataagcaggatgtctacctgttgcccaagaag atggatgagtatgtggccagcctacacctgcctacctttgatgcccacttgacagagctg acagatgaacaggccaagtatctgggactcaacaagaatgggcccttcaagcctaattac tacaggtgccttgggctccccagaaatcagggacacctgggcagtgatgagcttcctgca ctggaattggtctttgcatccccaaccacatatgcttacatgaactcagaaaaatga >gi568815591f:129334473_129583066|GENSCAN_predicted_peptide_2|935_aa MEDPAAPGTGGPPANGNGNGGGKGKQAAPKGREAFRSQRRESEVRSPESFGWGPETPAGG KRRLRTPFTCQYIPYYHSTQGSVDCPTLEFEYGDADGHAAELSELYSYTENLEFTNNRRC FEEDFKTQVQGKEWLELEEDAQKAYIMGLLDRLEVVSRERRLKVARAVLYLAQGTFGECD SEVDVLHWSRYNCFLLYQMGTFSTFLELLHMEIDNSQACSSALRKPAVSIADSTELRVLL SVMYLMVENIRLERETDPCGWRTARETFRTELSFSMHNEEPFALLLFSMVTKFCSGLAPH FPIKKVLLLLWKVVMFTLGGFEHLQTLKVQKRAELGLPPLAEDSIQVVKSMRAASPPSYT LDLGESQLAPPPSKLRGRRGSRRQLLTKQDSLDIYNERDLFKTEEPATEEEEESAGDGER TLDGELDLLEQDPLVPPPPSQAPLSAERVAFPKGLPWAPKVRQKDIEHFLEMSRNKFIGF TLGQDTDTLVGLPRPIHESVKTLKQHKYISIADVQIKNEEELEKCPMSLGEEVVPETPCE ILYQGMLYSLPQYMLRTLDLEGDTSASDWLPSITVLQSMKLGIDVNRHKEIIVKSISTLL LLLLKHFKLNHIYQKSESSLLVEEQVLRVQRGFCRVYVHGPINQGIFPLSYIRRHLWIQP IPAVAIYLKPVLISASYSISVLDYPCCTIQDLPELTTESLEAGDNSQFCWRNLFSCINLL RLLNKLTKWKHSRTMMLVVFKSAPILKRALKVKQAMLQLYVLKLLKLQTKYLGRQWRKSN MKTMSAIYQKVRHRMNDDWAYGNDIDARPWDFQAEECTLRANIEAFNSRRYDRPQDSEFS PVDNCLQSVLGQRLDLPEDFHYSYELWLEREPAKGKKGKGQGKSHGKKQKKPEVDILSPA AMLNLYYIAHNVADCLHLRGFHWPGAPKGKKGRSK >gi568815591f:129334473_129583066|GENSCAN_predicted_CDS_2|2808_bp atggaggaccccgccgcgcctgggaccgggggcccgcccgcaaatggcaatggcaacggc ggcggcaaagggaagcaggcggcgcccaagggccgcgaagcgttccgaagccagcggcgg gagtcagaggtgaggagcccggaaagcttcggctggggcccggagacgcccgccggcggg aagcggcggctgaggactccttttacctgccagtacatcccctattaccacagtacccag ggctctgtggactgtcccactctggagtttgagtatggagatgcagatgggcatgcagcc gagttgtcagaattgtatagttacactgagaacctggaattcaccaataacaggaggtgc tttgaagaagatttcaagactcaagtgcagggcaaggaatggctggagttggaagaagat gcccaaaaggcctatataatgggactcttggaccggctagaggtggtcagtagggaacgg cggctgaaggtggcccgggctgttctctacctggcccaaggtacttttggggaatgtgat tcagaggtcgatgtgctacactggtccaggtacaactgcttcctgctgtatcagatgggg accttctccaccttcctggagctactccacatggaaattgacaacagccaggcctgtagc agtgcccttcggaaaccagctgtctccatagctgatagcacagagctcagggtgctgctg agtgttatgtacctaatggtggaaaatattcgcctggagcgagagacagacccctgtggg tggagaacagcccgggagaccttccgcactgaattaagcttctccatgcataatgaggag ccttttgcccttttactcttctccatggttaccaagttctgcagtggcctggctcctcac ttccccataaagaaggtcctgctcctgctctggaaggtggtcatgtttaccctcggtgga tttgagcatctgcagactctcaaagtacagaagcgggcagaattgggcctgcctccactg gctgaagacagtatccaggtggtgaagagcatgcgtgctgcctccccgccctcttacact cttgacctgggagagtctcagctggcacccccaccctccaagctgcgaggccgccgtggc tctcgaaggcaactcctcactaagcaggacagcctggacatctacaatgaaagggatctc ttcaagactgaggagcccgccacagaggaggaagaggagtctgctggtgatggagaacga accttggatggagagctagacctgctagagcaggaccctctggtgccacctccaccctca caggcacccctctctgctgagcgggtggcttttcccaagggcctgccctgggccccaaag gtcagacagaaggacattgagcacttcttggagatgagcaggaacaagttcatcggattc accctggggcaggacacagatacattggttggattacccaggcccatccatgagagtgtg aagaccctaaagcagcacaagtatatctccatcgcagatgtgcagatcaagaatgaagag gagctggagaagtgccctatgtctttgggggaagaggtggtaccagagacgccatgtgaa atcctctaccagggaatgctgtacagccttccgcagtatatgctcagaactctggatctg gagggtgacaccagtgccagcgactggctgcccagcatcactgttctccagagcatgaag ctgggcatcgatgtgaacaggcacaaggagattattgtaaagagtatctctaccctgctt ctgctactcctcaaacacttcaaactcaaccatatctaccagaagtcagaatcaagcttg ctggtagaagagcaggtcctcagagtacagagaggcttctgtagggtctatgtccatggc cctatcaaccagggtatttttccactttcatacattcgaagacatctttggatacagcca attcctgctgttgccatttacctaaagcctgtccttatctctgcatcttacagcatctca gtcctggattatccttgctgtaccatccaggatttgccggagcttactactgaaagtctg gaagctggagacaacagccagttctgctggaggaacctcttttcctgcatcaacctcctg aggctgctcaataaactgaccaaatggaaacattcccggaccatgatgctggtagtgttt aaatcggcaccaatcttaaagcgggccctcaaggtcaaacaggccatgctgcaactttat gtcctaaagctactaaagttacagaccaagtacctggggcgccaatggaggaaaagcaac atgaaaaccatgtcagccatttaccagaaagtgcgtcaccgcatgaacgatgactgggct tacgggaatgacatcgatgccagaccatgggacttccaagcagaagaatgtaccttgagg gccaacattgaggcttttaacagccgtcgctatgacagaccccaggactctgagttttca cctgtggataactgcttgcagagcgtactggggcagaggttggatctgcctgaagatttc cactattcatatgagctctggctcgagagagagccagctaaagggaaaaaaggaaaaggc cagggcaagtctcatgggaagaaacagaagaaaccagaagtggacattctcagccccgcg gccatgctgaacctctactacatcgcccacaacgtcgctgactgcctgcatctgcgaggc ttccattggccgggtgctcccaaaggaaagaaagggagaagcaagtga >gi568815591f:129334473_129583066|GENSCAN_predicted_peptide_3|143_aa MRPRDIWSCSPEAAVYAELKSHGALEITLQPSEQCAAKVGPWKLAHRPNPDPGSCLYGCR QIFAFYPEPSIHGLYCSLCLPSAELPDDFPNGQSPGLHKTEFLIPEELKKCQQLLNYVKE GHTQVASHVQRLFMECAGFSPEV >gi568815591f:129334473_129583066|GENSCAN_predicted_CDS_3|432_bp atgaggccaagagacatatggtcctgttctcctgaggctgcagtctacgcggagctgaag agtcatggagccttggagattaccctccagccctcagagcagtgtgcagccaaagttggt ccctggaagctcgcacacaggcccaacccagaccccgggtcctgcctctatggctgcagg cagatctttgcattttacccagagccctccattcatgggctctactgcagtctctgtctc cccagtgccgagctccctgatgattttccaaatggacagtctcctgggctccacaagact gagttcctgattccagaagagctgaaaaaatgccagcaactcttgaattatgtgaaggag gggcacacgcaggtggcttcccatgtgcagaggctgtttatggagtgtgcaggattcagc ccagaagtgtga >gi568815591f:129334473_129583066|GENSCAN_predicted_peptide_4|120_aa MALIDVGFVGEKRADGSPLSGQLFTIPGRRAKLAVGNMGSKKEKNENRAKGEAPYKTIRS LDWELYVDGSSFINPQGERCAGYAVVILDAVIEAKSLPQGTSAQKAELIALIWALELSEX >gi568815591f:129334473_129583066|GENSCAN_predicted_CDS_4|360_bp atggccctcattgacgtgggcttcgtgggtgagaaacgggctgatgggtcgcctctttca gggcagttgttcactattccagggaggagggccaagctggcagtgggcaacatgggcagc aagaaggagaagaatgagaaccgagcaaagggggaagccccttataaaaccatcagatct ttagactgggagctgtacgtggacgggagcagcttcatcaacccacaaggagagaggtgt gcgggatatgcggtggtaatcctggatgctgtcattgaagccaaatcattgccccagggc acttcagcccagaaggccgaactcattgctttaatttgggccttagagctgagtgaagnn