GENSCAN 1.0    Date run: 7-Nov-116    Time: 01:40:15

Sequence gi568815575f:48583854_48791159 : 207306 bp : 47.38% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   6303   6434  132  2  0   64  113    35 0.501   3.74
 1.02 Intr +  14105  14184   80  1  2   94   78    33 0.117   1.25
 1.03 Intr +  14864  15104  241  2  1  107   83   444 0.954  43.35
 1.04 Intr +  15500  15609  110  1  2  115   85   191 0.998  20.58
 1.05 Intr +  15734  15864  131  2  2   53   99   199 0.991  17.84
 1.06 Intr +  16466  16773  308  0  2   48   25   531 0.977  38.97
 1.07 Intr +  17931  18111  181  2  1   99   96   224 0.997  23.74
 1.08 Intr +  18212  18353  142  0  1  104   94   131 0.998  14.71
 1.09 Intr +  20419  20537  119  1  2  112   92   158 0.996  18.71
 1.10 Intr +  20995  21175  181  2  1   76   22   347 0.063  25.83
 1.11 Intr +  29823  29946  124  0  1   52  105    71 0.173   5.79
 1.12 Intr +  47808  47896   89  2  2   -1    9   170 0.013  -1.03
 1.13 Intr +  52479  52714  236  0  2   83   73   121 0.087   7.43
 1.14 Intr +  92437  92776  340  2  1   42   52   234 0.320   9.73
 1.15 Intr +  99967 100132  166  1  1   45   85   135 0.424   8.86
 1.16 Intr + 100430 100570  141  1  0  122   60   224 0.990  23.55
 1.17 Intr + 101694 101780   87  2  0   93   95    77 0.732   9.07
 1.18 Intr + 101881 102022  142  0  1    1  105   112 0.965   4.13
 1.19 Intr + 102093 102134   42  1  0   82  113    56 0.961   5.71
 1.20 Intr + 102928 103102  175  2  1  114   52   198 0.998  17.80
 1.21 Intr + 104201 104243   43  2  1   85   86    12 0.984  -1.06
 1.22 Intr + 104447 104600  154  1  1  120   76   190 0.837  20.65
 1.23 Intr + 104807 105213  407  0  2   72  100   160 0.487   9.67
 1.24 Intr + 105467 105581  115  1  1  100  105    68 0.999   9.72
 1.25 Term + 107254 107309   56  2  2  108   48   106 0.990   6.32
 1.26 PlyA + 107542 107547    6                               1.05

 2.00 Prom + 109576 109615   40                              -6.46
 2.01 Init + 112932 112950   19  2  1  102   67    -3 0.657  -0.83
 2.02 Intr + 115049 115194  146  0  2   62   77   162 0.953  12.50
 2.03 Intr + 116238 116900  663  2  0   75   75  1175 0.895 106.86
 2.04 Intr + 122412 122558  147  2  0   75   15   337 0.994  25.53
 2.05 Intr + 122645 122774  130  1  1  105   81   279 0.996  29.17
 2.06 Term + 123584 123717  134  0  2   85   41   188 0.999  12.05
 2.07 PlyA + 125137 125142    6                               1.05

 3.00 Prom + 142848 142887   40                              -3.36
 3.01 Init + 167960 168053   94  1  1   87   90   106 0.946   9.24
 3.02 Intr + 168854 168988  135  0  0    6   43   187 0.975   6.54
 3.03 Intr + 172761 172866  106  1  1  111   53    47 0.550   2.77
 3.04 Intr + 174423 174486   64  0  1   49  108    74 0.670   4.22
 3.05 Intr + 177884 178000  117  1  0   95   66    22 0.187   1.36
 3.06 Intr + 178123 178191   69  0  0  131   69   -10 0.210   0.78
 3.07 Intr + 181982 182119  138  1  0  105  121   115 0.956  17.16
 3.08 Intr + 187074 187229  156  2  0   39  113    84 0.983   6.21
 3.09 Term + 189457 189582  126  0  0  122   45   152 0.999  12.58
 3.10 PlyA + 189771 189776    6                               1.05

 4.03 PlyA - 191333 191328    6                               1.05
 4.02 Term - 192616 192140  477  1  0  -11   49   343 0.575  15.34
 4.01 Init - 192835 192662  174  1  0   65   76   171 0.432  12.95

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term +   6775   6918  144  0  0   28   49   148 0.932   2.81
S.002 Term +  20995  21179  185  2  2   76   46   359 0.934  28.21
S.003 Sngl -  71194  70973  222  1  0   75   48   207 0.991  10.55

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815575f:48583854_48791159|GENSCAN_predicted_peptide_1|1313_aa
MKLAPQQQGVLPPGVWSSRPFHFGVARHKGHEPESVDVLAFVVHAAAGGHARGGGRGMAA
VWQQVLAVDARYNAYRTPTFPQFRTQYIRRRSQLLRENAKAGHPPALRRQYLRLRGQLLG
QRYGPLSEPGSARAYSNSIVRSSRTTLDRMEDFEDDPRALGARGHRRSVSRGSYQLQAQM
NRAVYEDRPPGSVVPTSAAEASRAMAGDTSLSENYAFAGMYHVFDQHVDEAVPRVRFAND
DRHRLACCSLDGSISLCQLVPAPPTVLRVLRGHTRGVSDFAWSLSNDILVSTSLDATMRI
WASEDGRCIREIPDPDSAELLCCTFQPVNNNLTVVGNAKHNVHVMNISTGKKVKGGSSKL
TGRVLALSFDAPGRLLWAGDDRGSVFSFLFDMATGKLTKAKRLVVHEGSPVTSISARSWV
SREARDPSLLINACLNKLLLYRVVDNEGTLQLKRSFPIEQSSHPVRSIFCPLMSFRQGAC
VVTGSEDMCVHFFDVERAAKAAVNKLQGHSAPVLDVSFNCDESLLASSDASGMVIVWRRE
QKSFCTLLCVTPEVVMSQYVCGQECPHSSLCGLRIVGILVPEPPTCSLLLLLLLLLLLLQ
SFGFLFESEMILRPEPTDLTIYHLVFIHIVILLTLVSLLSPGLPESLYFQNDFKCKGFFL
PEQDDEKFLHLHHLLPECASDHNHQPQKLLVARTLSCPFSGQQPHTTQRCENQQPEPQKH
GFPGARALDWHPLRMLQSGDQWEPGAQGRRRARAATEKHRDPTSVIDQSELTAPHTPSPP
ALSQLPARVREGRVKTKSKEERATRASPEKTRAESTMSGGPMGGRPGGRGAPAVQQNIPS
TLLQDHENQRLFEMLGRKCLTLATAVVQLYLALPPGAEHWTKEHCGAVCFVKDNPQKSYF
IRLYGLQAGRLLWEQELYSQLVYSTPTPFFHTFAGDDCQAGLNFADEDEAQAFRALVQEK
IQKRNQRQSGGEEATGEERKLGRDRRQLPPPPTPANEGPPVGPLSLGLATVDIQNPDITS
SRYRGLPAPGPSPADKKRSGKKKISKADIGAPSGFKHVSHVGWDPQNGFDVNNLDPDLRS
LFSRAGISEAQLTDAETSKLIYDFIEDQGGLEAVRQEMRRQEPLPPPPPPSRGGNQLPRP
PIVGGNKGRSGPLPPVPLGIAPPPPTPRGPPPPGRGGPPPPPPPATGRSGPLPPPPPGAG
GPPMPPPPPPPPPPPSSGNGPAPPPLPPALVPAGGLAPGGGRGALLDQIRQGIQLNKTPG
APESSALQPPPQSSEGLVGALMHVMQKRSRAIHSSDEGEDQAGDEDEDDEWDD

>gi568815575f:48583854_48791159|GENSCAN_predicted_CDS_1|3942_bp
atgaaattagctcctcagcaacaaggtgtcctgccacctggtgtgtggtcatctcgcccc
tttcatttcggtgttgcccggcacaagggacatgaaccagaatctgtggatgttcttgct
ttcgtggtccacgctgccgcgggcggacacgccagaggaggaggccggggaatggccgcg
gtgtggcagcaagtcttagcagtggacgcgaggtacaacgcgtaccgcacaccaacgttt
ccacagtttcggacgcagtatatccgccggcgcagccagctgctgcgggagaatgccaag
gctgggcaccccccagcgctgcgtcggcagtacctgaggcttcgggggcagctgctgggc
cagcgctacgggcccctctccgagccaggcagtgctcgtgcctatagcaacagcatcgtc
cgcagtagccgcactactcttgaccgcatggaggactttgaggatgatcctcgggccctg
ggggcccgtgggcaccgtcgttctgtcagcagaggctcctaccagctgcaggcgcagatg
aaccgtgccgtctatgaggacaggccccctggcagcgtggtgcccacgtcagcagcagag
gcaagtcgggccatggccggggacacgtcactgagcgagaactatgcctttgcgggcatg
tatcatgtttttgaccagcacgtggatgaggcagtcccaagggtgcgcttcgccaatgat
gaccgacaccgcctggcctgctgctcactcgacggcagcatctccctgtgccagctggtg
cctgccccacccacagtgcttcgcgtgctacggggccacacccgtggtgtctccgacttc
gcctggtccctctccaatgacatcctcgtgtccacctcactggatgccaccatgcgcatc
tgggcctctgaggatggtcgctgcatccgagagatccctgaccccgatagcgctgaactg
ctctgctgcaccttccagcctgtcaacaacaacctcactgtggtggggaacgccaagcac
aacgtgcatgtcatgaacatctccacaggcaagaaagtgaaggggggctccagcaagctg
acaggccgtgtccttgctctgtcctttgatgcccctggccggctgctctgggcgggtgat
gaccgtggcagtgtcttctctttcctctttgatatggccacagggaagctgaccaaagcc
aagcgtttggtggtgcatgaggggagccctgtgaccagcatctcagcccggtcctgggtc
agccgcgaggcccgggatccctcactgctcatcaatgcttgcctcaacaagttgctgctc
tacagggtggtagacaacgaggggaccctgcagctgaagagaagcttccccatcgagcag
agctcacatcctgtgcgcagcatcttctgtcccctcatgtccttccgccagggggcctgc
gtggtgacgggcagtgaggacatgtgcgtgcacttctttgatgtggagcgggcggccaag
gctgctgtcaacaagctgcagggccacagtgcacctgtgcttgatgtcagcttcaactgc
gacgagagcctactggcctccagtgacgccagcggcatggtcatcgtctggaggcgggag
cagaagagtttctgcaccctgctctgtgtgacaccagaagtagtcatgtcccagtatgta
tgtggccaggagtgtccacattctagtctctgtggcctgaggattgttggcatcctggtc
cctgaacccccaacctgttccctgctgctgctgctgctgctgctgctgctgctgctgcag
tcgtttggtttcctctttgaatctgagatgattctgaggcctgagcccacagacctgacc
atttatcacctggtcttcatccacatagtgatcctccttactctggtgtcattgttgtct
ccaggtctgcctgagtcactgtattttcagaatgacttcaagtgtaaaggatttttttta
cctgaacaagatgatgagaagtttctccatctgcaccacctgcttcctgagtgtgcttca
gaccataatcatcagccccagaagctcttggttgcgcgcaccctgtcctgccccttctcg
ggccaacaaccccacacgacgcagcggtgtgagaaccagcaaccggaaccgcaaaagcac
ggctttcccggagcccgggctctggactggcacccactgcgcatgctccagagcggcgac
caatgggagcccggcgcgcagggacggaggcgggcccgggccgctaccgagaagcacagg
gacccgacttcagtcatcgaccaatcagagctcacagccccccacaccccctcccccccg
gccctgagccaattgccagctcgtgtgcgggagggccgggtcaagactaagtcaaaggag
gagagggcaacgcgagcctcgccagagaagacaagggcagaaagcaccatgagtgggggc
ccaatgggaggaaggcccgggggccgaggagcaccagcggttcagcagaacataccctcc
accctcctccaggaccacgagaaccagcgactctttgagatgcttggacgaaaatgcttg
acgctggccactgcagttgttcagctgtacctggcgctgccccctggagctgagcactgg
accaaggagcattgtggggctgtgtgcttcgtgaaggataacccccagaagtcctacttc
atccgcctttacggccttcaggctggtcggctgctctgggaacaggagctgtactcacag
cttgtctactccacccccacccccttcttccacaccttcgctggagatgactgccaagcg
gggctgaactttgcagacgaggacgaggcccaggccttccgggccctcgtgcaggagaag
atacaaaaaaggaatcagaggcaaagtggaggtgaggaggccacaggggaggaaaggaag
ttgggcagagacagacgccagctacccccaccaccaacaccagccaatgaaggccctcca
gtgggtccgctctccctggggctggcgacagtggacatccagaaccctgacatcacgagt
tcacgataccgtgggctcccagcacctggacctagcccagctgataagaaacgctcaggg
aagaagaagatcagcaaagctgatattggtgcacccagtggattcaagcatgtcagccac
gtggggtgggacccccagaatggatttgacgtgaacaacctcgacccagatctgcggagt
ctgttctccagggcaggaatcagcgaggcccagctcaccgacgccgagacctctaaactt
atctacgacttcattgaggaccagggtgggctggaggctgtgcggcaggagatgaggcgc
caggagccacttccgccgcccccaccgccatctcgaggagggaaccagctcccccggccc
cctattgtggggggtaacaagggtcgttctggtccactgccccctgtacctttggggatt
gccccacccccaccaacaccccggggacccccacccccaggccgagggggccctccacca
ccaccccctccagctactggacgttctggaccactgccccctccaccccctggagctggt
gggccacccatgccaccaccaccgccaccaccgccaccgccgcccagctccgggaatgga
ccagcccctcccccactccctcctgctctggtgcctgccgggggcctggcccctggtggg
ggtcggggagcgcttttggatcaaatccggcagggaattcagctgaacaagacccctggg
gccccagagagctcagcgctgcagccaccacctcagagctcagagggactggtgggggcc
ctgatgcacgtgatgcagaagagaagcagagccatccactcctccgacgaaggggaggac
caggctggcgatgaagatgaagatgatgaatgggatgactga

>gi568815575f:48583854_48791159|GENSCAN_predicted_peptide_2|412_aa
MAENLKGCSVCCKSSWNQLQDLCRLAKLSCPALGISKRNLYDFEVEYLCDYKKIREQEYY
LVKWRGYPDSESTWEPRQNLKCVRILKQFHKDLERELLRRHHRSKTPRHLDPSLANYLVQ
KAKQRRALRRWEQELNAKRSHLGRITVENEVDLDGPPRAFVYINEYRVGEGITLNQVAVG
CECQDCLWAPTGGCCPGASLHKFAYNDQGQVRLRAGLPIYECNSRCRCGYDCPNRVVQKG
IRYDLCIFRTDDGRGWGVRTLEKIRKNSFVMEYVGEIITSEEAERRGQIYDRQGATYLFD
LDYVEDVYTVDAAYYGNISHFVNHSCDPNLQVYNVFIDNLDERLPRIAFFATRTIRAGEE
LTFDYNMQVDPVDMESTRMDSNFGLAGLPGSPKKRVRIECKCGTESCRKYLF

>gi568815575f:48583854_48791159|GENSCAN_predicted_CDS_2|1239_bp
atggcggaaaatttaaaaggctgcagcgtgtgttgcaagtcttcttggaatcagctgcag
gacctgtgccgcctggccaagctctcctgccctgccctcggtatctctaagaggaacctc
tatgactttgaagtcgagtacctgtgcgattacaagaagatccgcgaacaggaatattac
ctggtgaaatggcgtggatatccagactcagagagcacctgggagccacggcagaatctc
aagtgtgtgcgtatcctcaagcagttccacaaggacttagaaagggagctgctccggcgg
caccaccggtcaaagaccccccggcacctggacccaagcttggccaactacctggtgcag
aaggccaagcagaggcgggcgctccgtcgctgggagcaggagctcaatgccaagcgcagc
catctgggacgcatcactgtagagaatgaggtggacctggacggccctccgcgggccttc
gtgtacatcaatgagtaccgtgttggtgagggcatcaccctcaaccaggtggctgtgggc
tgcgagtgccaggactgtctgtgggcacccactggaggctgctgcccgggggcgtcactg
cacaagtttgcctacaatgaccagggccaggtgcggcttcgagccgggctgcccatctac
gagtgcaactcccgctgccgctgcggctatgactgcccaaatcgtgtggtacagaagggt
atccgatatgacctctgcatcttccgcacggatgatgggcgtggctggggcgtccgcacc
ctggagaagattcgcaagaacagcttcgtcatggagtacgtgggagagatcattacctca
gaggaggcagagcggcggggccagatctacgaccgtcagggcgccacctacctctttgac
ctggactacgtggaggacgtgtacaccgtggatgccgcctactatggcaacatctcccac
tttgtcaaccacagttgtgaccccaacctgcaggtgtacaacgtcttcatagacaacctt
gacgagcggctgccccgcatcgctttctttgccacaagaaccatccgggcaggcgaggag
ctcacctttgattacaacatgcaagtggaccccgtggacatggagagcacccgcatggac
tccaactttggcctggctgggctccctggctcccctaagaagcgggtccgtattgaatgc
aagtgtgggactgagtcctgccgcaaatacctcttctag

>gi568815575f:48583854_48791159|GENSCAN_predicted_peptide_3|334_aa
MVWEALAAASLVLWALSWEMCWGQQWMGKKHAILQQLNVRDFWEQLNITVTPCQMGIIPV
LLIRNLTVNAGNGLTPGITVAHDIDDIMLSRHSEQEVATTLDLLVRNFCVRGAPSSNGNG
KYVMGPKQALKTQALSQRIVGRTLGKDAYPAMLRHLPSRLPVKMWGRTLEKQRQAEGSRG
PWKGPALSHGGPCAQSWRDSSQTPPPCLIRRLDHIVMTVKSIKDTTMFYSKILGMEVMTF
KEDRKALCFGDQKFNLHEVGKEFEPKAAHPVPGSLDICLITEVPLEEMIQHLKACDVPIE
EGPVPRTGAKGPIMSIYFRDPDRNLIEVSNYISS

>gi568815575f:48583854_48791159|GENSCAN_predicted_CDS_3|1005_bp
atggtttgggaagccctggctgctgcctctctggttctctgggctttgtcctgggagatg
tgctggggccagcagtggatggggaagaagcacgctattttgcaacagctcaacgtgcgt
gacttttgggagcagctaaacatcactgtgacaccctgtcagatgggtatcatccccgta
ttactgatcaggaatctgacggtcaatgcaggcaacggactcactccaggtatcacagtg
gcccatgacattgatgatattatgctgagtagacatagtgagcaagaagtagcaactact
ctagacttattggtgagaaatttctgtgtcagaggtgctccatcgtcgaatggaaatggg
aaatatgtgatggggcctaagcaggccctgaagacacaagcgctgagccagaggattgtc
gggaggaccctgggcaaagacgcctaccctgccatgctgcgccatctgccctccaggctg
ccagtcaagatgtggggcaggactttggagaaacagagacaagcagagggcagcagaggc
ccatggaaaggcccggctctaagccatggagggccctgtgctcagtcatggagggacagc
agtcagacccctcccccatgtcttatccgtagacttgaccacatcgtgatgacggtgaag
agcatcaaagacaccaccatgttttattccaagatcctgggcatggaggtcatgactttt
aaggaagaccggaaagcactgtgttttggagaccagaaatttaacctccacgaggtggga
aaggaatttgaacccaaagccgctcacccagttcctggctccctggacatatgtctgatc
acagaggtgcctttggaggaaatgatccagcacctcaaggcttgtgatgtccctattgag
gaggggccagtccccagaacaggggcaaaagggcctatcatgtccatctacttccgagac
cccgacagaaatctgattgaggtgtccaactacatctcctcgtga

>gi568815575f:48583854_48791159|GENSCAN_predicted_peptide_4|216_aa
MAMTAENLAVKHKISREECDKYGLQSQQRWKAANDAGYFNDEMAPIEVKTKKGKQTMQKL
PPVFKKDGTVTAGNASGIADGAGAVIIASEDAVKNHNFTPLARIVGYFVSVCDPSIMGIG
PVLAISGALKKAGLSLKDMDLVEVNEAFAPQYLAVEKSLDLDISKTNVNGGAIALGHPLG
GSGLVDELRRRGGKYAVGSACIGGGQGIAVIIQSTA

>gi568815575f:48583854_48791159|GENSCAN_predicted_CDS_4|651_bp
atggcaatgactgcagagaatcttgctgtaaaacataaaataagcagagaagaatgtgac
aaatacggcctgcagtcacagcagagatggaaagctgctaatgatgctggctactttaat
gatgaaatggcaccaattgaagtgaagacaaagaaaggaaaacagacaatgcagaaactt
cctccagtattcaagaaagatggtactgtcactgcagggaatgcatcggggatagctgat
ggtgctggagctgttatcatagctagtgaagatgctgttaagaaccataatttcacacca
ctggcaagaattgtgggctactttgtatctgtatgtgatccctctatcatgggtattggt
cctgtccttgctatcagtggggcactgaagaaagcaggactgagtcttaaggacatggat
ttggtagaggtgaatgaagcttttgctccccagtacttggctgttgagaagagtttggat
cttgacataagtaaaaccaatgtgaatggaggagccattgctttgggtcacccactggga
ggatctggattggttgatgaattaaggcgtcgaggtggaaaatatgccgttggatcagct
tgcattggaggtggccaaggtattgctgtcatcattcagagcacagcctga