GENSCAN 1.0 Date run: 3-Nov-116 Time: 22:26:23 Sequence gi568815577f:32970247_33171059 : 200813 bp : 46.62% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 2970 2979 10 2 1 96 106 6 0.270 3.44 1.02 Term + 18831 18901 71 1 2 107 49 72 0.489 3.30 1.03 PlyA + 20344 20349 6 1.05 2.00 Prom + 21286 21325 40 -2.06 2.01 Init + 36649 36678 30 0 0 73 78 48 0.613 1.83 2.02 Intr + 41323 41433 111 0 0 102 78 91 0.980 10.08 2.03 Intr + 50398 50544 147 0 0 87 1 178 0.193 9.43 2.04 Intr + 53401 53563 163 0 1 64 -35 119 0.031 -3.25 2.05 Intr + 55856 56014 159 0 0 47 93 100 0.436 6.36 2.06 Term + 56555 57588 1034 0 2 55 52 1526 0.982 137.99 2.07 PlyA + 58284 58289 6 1.05 3.10 PlyA - 58683 58678 6 1.05 3.09 Term - 60439 60294 146 2 2 83 48 46 0.074 -1.83 3.08 Intr - 62472 62387 86 2 2 92 55 45 0.105 1.06 3.07 Intr - 63397 63238 160 2 1 48 69 110 0.259 4.15 3.06 Intr - 63738 63560 179 2 2 26 35 164 0.414 4.46 3.05 Intr - 65639 65542 98 2 2 28 100 70 0.139 1.01 3.04 Intr - 71368 71118 251 2 2 145 75 42 0.316 5.76 3.03 Intr - 75232 75209 24 2 0 146 58 5 0.446 1.30 3.02 Intr - 82768 82600 169 1 1 44 48 117 0.140 2.82 3.01 Init - 90140 90078 63 2 0 66 44 98 0.131 4.35 3.00 Prom - 91935 91896 40 -5.66 4.00 Prom + 95052 95091 40 -4.26 4.01 Init + 96152 96205 54 1 0 54 51 87 0.407 2.88 4.02 Term + 100001 100816 816 1 0 70 55 1267 0.432 114.74 4.03 PlyA + 102147 102152 6 1.05 5.00 Prom + 107097 107136 40 -6.56 5.01 Init + 107270 107397 128 1 2 73 87 136 0.782 9.65 5.02 Intr + 108837 108918 82 0 1 38 89 46 0.493 -0.66 5.03 Intr + 115960 116022 63 0 0 16 94 80 0.010 0.31 5.04 Intr + 128098 128204 107 0 2 91 105 42 0.759 5.21 5.05 Intr + 154353 154541 189 0 0 76 32 175 0.226 9.40 5.06 Intr + 158500 158560 61 1 1 122 68 3 0.091 0.44 5.07 Intr + 160752 160865 114 2 0 54 37 79 0.093 0.04 5.08 Intr + 164208 164299 92 2 2 84 84 42 0.175 2.19 5.09 Intr + 173641 173822 182 1 2 96 3 99 0.272 1.71 5.10 Term + 179219 179367 149 0 2 88 43 127 0.534 6.26 5.11 PlyA + 179469 179474 6 1.05 6.00 Prom + 182801 182840 40 -6.26 6.01 Init + 188581 188652 72 0 0 68 74 71 0.435 4.77 6.02 Intr + 189210 189312 103 2 1 -3 99 75 0.252 -0.75 6.03 Term + 195088 195161 74 2 2 108 54 85 0.795 5.17 6.04 PlyA + 195457 195462 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 110195 110341 147 1 0 95 48 64 0.840 1.00 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815577f:32970247_33171059|GENSCAN_predicted_peptide_1|26_aa MEMDQNRRLHGSWGYEDMEVPAGSLA >gi568815577f:32970247_33171059|GENSCAN_predicted_CDS_1|81_bp atggaaatggaccagaaccggaggttacatggcagctggggctatgaggatatggaggtt cctgctggtagcttggcttag >gi568815577f:32970247_33171059|GENSCAN_predicted_peptide_2|547_aa MIADDKPAAMAGESAGQSSESGVGANFFGITFQTTETLMSTGHLNGAECKAGPGTVKTLA VEEEASRLWRKPDPYNTRREPDLRGGALDATGAQGGPLDRARKEPETACSHRRKSVAGQT PVGSNKKRNRVPGHVPAAGLTSAAATKERAQCSDTGGLDPYSAGHFSRATLLFCPPHTHR CISLTKSEKSERQQLFLPKPQSAVFGSEGRRTLRKLRRLSSPGAMDSDASLVSSRPSSPE PDDLFLPARSKGSSGSAFTGGTVSSSTPSDCPPELSAELRGAMGSAGAHPGDKLGGSGFK SSSSSTSSSTSSAAASSTKKDKKQMTEPELQQLRLKINSRERKRMHDLNIAMDGLREVMP YAHGPSVRKLSKIATLLLARNYILMLTNSLEEMKRLVSEIYGGHHAGFHPSACGGLAHSA PLPAATAHPAAAAHAAHHPAVHHPILPPAAAAAAAAAAAAAVSSASLPGSGLPSVGSIRP PHGLLKSPSAAAAAPLGGGGGGSGASGGFQHWGGMPCPCSMCQVPPPHHHVSAMGAGSLP RLTSDAK >gi568815577f:32970247_33171059|GENSCAN_predicted_CDS_2|1644_bp atgatcgctgatgataagccagctgcgatggctggtgagtcagctggacagtcatctgag tctggagttggggccaacttctttggcatcacattccagacaacagaaacactgatgagc acagggcatctgaatggggccgaatgcaaagcaggtccaggcactgtgaagaccctggcg gtggaggaagaggcttcccggctgtggaggaagccagacccttacaacacaagacgagaa ccagacctgcgtgggggagctctggatgctacaggggctcaaggagggccactggaccga gcgcgcaaagaacctgagaccgcttgctctcaccgccgcaagtcggtcgcaggacagaca ccagtgggcagcaacaaaaaaagaaaccgggttccgggacacgtgccggcggctggacta acctcagcggctgcaaccaaggagcgcgcacaatgctccgatacagggggtctggatccc tactctgcgggccatttctccagagcgactttgctcttctgtcctccccacactcaccgc tgcatctccctcaccaaaagcgagaagtcggagcgacaacagctctttctgcccaagccc cagtcagctgttttcgggtccgagggaaggaggaccctgcgaaagctgcgacgactatct tcccctggggccatggactcggacgccagcctggtgtccagccgcccgtcgtcgccagag cccgatgacctttttctgccggcccggagtaagggcagcagcggcagcgccttcactggg ggcaccgtgtcctcgtccaccccgagtgactgcccgccggagctgagcgccgagctgcgc ggcgctatgggctctgcgggcgcgcatcctggggacaagctaggaggcagtggcttcaag tcatcctcgtccagcacctcgtcgtctacgtcgtcggcggctgcgtcgtccaccaagaag gacaagaagcaaatgacagagccggagctgcagcagctgcgtctcaagatcaacagccgc gagcgcaagcgcatgcacgacctcaacatcgccatggatggcctccgcgaggtcatgccg tacgcacacggcccttcggtgcgcaagctttccaagatcgccacgctgctgctggcgcgc aactacatcctcatgctcaccaactcgctggaggagatgaagcgactggtgagcgagatc tacgggggccaccacgctggcttccacccgtcggcctgcggcggcctggcgcactccgcg cccctgcccgccgccaccgcgcacccggcagcagcagcgcacgccgcacatcaccccgcg gtgcaccaccccatcctgccgcccgccgccgcagcggctgctgccgccgctgcagccgcg gctgtgtccagcgcctctctgcccggatccgggctgccgtcggtcggctccatccgtcca ccgcacggcctactcaagtctccgtctgctgccgcggccgccccgctggggggcgggggc ggcggcagtggggcgagcgggggcttccagcactggggcggcatgccctgcccctgcagc atgtgccaggtgccgccgccgcaccaccacgtgtcggctatgggcgccggcagcctgccg cgcctcacctccgacgccaagtga >gi568815577f:32970247_33171059|GENSCAN_predicted_peptide_3|391_aa MKDAYVNVCLAISGYQERPTVGKFGHRDRYVWKGDNMKTHGEDSPLEAKERGLEQILPSR KEPTQLTAGFYTSGFQNWSYSVAQAGKIYLSTLSVPYFMLSLVLYRLTSASDLLCIPSQA MGVYRLTSASDLLCIPSQAMGVLTGLPGDTMLIHTVASTEGMLSVTDRQSMGPPRPQCNA KDIPRELTVHHAQYQAPQGMQRRFWRALRVSFLGDLQPFPQPVDGLWPHRRCNVAKGKLR ASKPAASILADAGRAGVSSRWEPCQIVLVDRARFQGLQTEGPLACLQGPRCAVKEGLRSP WQESAHSQQPRDWRSSSLQSSTSNLFHRQTASSNHWGPRGEIRNTKHTQSLHQYTCLAVG VEEPKLQSRPLPPPARFSAPTGLRSSLLESL >gi568815577f:32970247_33171059|GENSCAN_predicted_CDS_3|1176_bp atgaaggatgcctacgtgaacgtctgcctagcaatatcaggttaccaagaaaggccaaca gtggggaaatttggacacagagacagatacgtatggaagggagataacatgaagacccat ggagaagacagccctctagaagccaaggagaggggcctggaacagatccttccctcacgg aaggaaccaacacaattgacagctggattttatacttctggcttccagaactggtcttat tctgttgcccaggctggtaaaatttacctttccacgctgtcagtgccctacttcatgtta tcactagtactttacagactaaccagtgccagtgacttgctgtgtattccttcccaggca atgggcgtttatagactaaccagtgccagtgacttgctgtgtattccttcccaggcaatg ggcgttttgacagggctccctggagacactatgcttatccacacagtggcatcaacagaa ggcatgctgagtgtgacagacaggcagagcatgggccctcctcgtccacaatgcaatgca aaggatattcccagagaactaaccgtgcatcatgctcagtaccaggcaccccagggcatg caaaggcggttctggagagctctgcgtgtctccttcttgggggacctccaaccgttccca caaccagtggacggattgtggccgcaccgacgctgcaatgtcgccaaaggaaaactgcgc gcgtccaagccagctgcttcaatcctggcggatgcgggccgtgccggggtctccagccgg tgggagccctgccagatcgtcctggtggaccgcgcccgctttcagggtttgcaaactgaa ggcccgctcgcgtgtctgcagggccctcggtgtgctgtaaaggagggtctgaggtccccc tggcaggagagcgcgcactcgcagcagccgcgggactggcgcagttcctcgctgcagtcc tccacctccaacctcttccacagacaaacagccagtagcaatcactggggccctagagga gagatacggaacaccaagcacactcagagcctgcaccagtacacctgcctggcagtgggt gtggaagagccaaaactgcaatcaagacccctacccccaccggccaggttctcagcgccc actggtctacgttccagcctcctggagtccctctga >gi568815577f:32970247_33171059|GENSCAN_predicted_peptide_4|289_aa MGSSIPKESSLSKNNYRRMYYAVSQARVNAVPGTMLRPQRPGDLQLGASLYELVGYRQPP SSSSSSTSSTSSTSSSSTTAPLLPKAAREKPEAPAEPPGPGPGSGAHPGGSARPDAKEEQ QQQLRRKINSRERKRMQDLNLAMDALREVILPYSAAHCQGAPGRKLSKIATLLLARNYIL LLGSSLQELRRALGEGAGPAAPRLLLAGLPLLAAAPGSVLLAPGAVGPPDALRPAKYLSL ALDEPPCGQFALPGGGAGGPGLCTCAVCKFPHLVPASLGLAAVQAQFSK >gi568815577f:32970247_33171059|GENSCAN_predicted_CDS_4|870_bp atgggatcaagcatccccaaggagtcctccctgtccaagaacaactaccgaaggatgtac tatgcggtttcccaggcgcgcgtgaacgcggtccccgggaccatgctgcggccacagcgg cccggagacttgcagctcggggcctccctctacgagctggtgggctacaggcagccgccc tcctcctcctcctcctccacctcctccacctcctccacttcctcctcctccacgacggcc cccctcctccccaaggctgcgcgcgagaagccggaggcgccggccgagcctccaggcccc gggcccgggtcaggcgcgcacccgggcggcagcgcccggccggacgccaaggaggagcag cagcagcagctgcggcgcaagatcaacagccgcgagcggaagcgcatgcaggacctgaac ctggccatggacgccctgcgcgaggtcatcctgccctactcagcggcgcactgccagggc gcgcccggccgcaagctctccaagatagccacgctgctgctcgcccgcaactacatccta ctgctgggcagctcgctgcaggagctgcgccgcgcgctgggcgagggcgccgggcccgcc gcgccgcgcctgctgctggccgggctgcccctgctcgccgccgcgcccggctccgtgctg ctggcgcccggcgccgtaggaccccccgacgcgctgcgccccgccaagtacctgtcgctg gcgctggacgagccgccgtgcggccagttcgctctccccggcggcggcgcaggcggcccc ggcctctgcacctgcgccgtgtgcaagttcccgcacctggtcccggccagcctgggcctg gccgccgtgcaggcgcaattctccaagtga >gi568815577f:32970247_33171059|GENSCAN_predicted_peptide_5|388_aa MVVMVVVVVVVVVVLVLFHLLASGGFTHMLPLQSWFPTAMGRRGHKLTACHSPPAERHLG VAPNSINRFVSGDVTGDVAEMQCREGTGYRQRFSEFDLPLEKNRLIICKDVAIRCQTNPI EEKGWESKGGQDNEESIGKAEKTGYIACQAPGILRRCFSCDSGHWYGKRLTIPGQNSEGG GMFLQKRSRGCTSTSPNKCRIPMAVSRVSWSNTKGLILGFFPSTCVTPFFIDEKPGSLID LNIHDVHKVYVAQLVSADYAAGRASRHCYHSDCCLRYARAGLVLADLDLAPLILVSLILL DPLADWICSYHGIGRSTRERARPHKYFPRSLTELGPLIARLICTLTASNCHHTAGSPQQL ENRSRAARAGSTTVQGAEAQGASGTVTI >gi568815577f:32970247_33171059|GENSCAN_predicted_CDS_5|1167_bp atggtggtgatggtggtggtggtggtggtggtggtcgtggtgctggttttgttccatttg cttgcttccggtggcttcactcacatgcttcctctgcagtcctggttccccacagccatg ggcagaagaggccacaagctgactgcatgtcattctccaccagcagagcgtcacctcggg gtagctccaaacagtatcaaccggtttgtgtcaggtgatgtaacaggcgatgtggctgaa atgcagtgcagagaagggaccggctaccgccagagattttctgagtttgatttgcccctt gaaaagaatagattgatcatctgtaaagacgttgcaatcagatgtcaaaccaatccaatc gaggaaaaaggctgggagagcaaaggaggacaggacaatgaagaatctattggcaaagca gagaaaacaggctatatcgcttgccaagcccctggaattctgagaaggtgcttctcctgt gatagcggtcactggtacggcaagaggttgacaatccctggccaaaactccgaagggggt ggcatgtttctgcagaaacgcagcagaggctgcacctctacttctccaaacaaatgtaga atccccatggcagtgagtcgagtgtcctggtccaacaccaaagggcttattctaggattt ttcccttccacatgtgtaactcccttcttcattgatgagaaacctggttccctcatcgac cttaacatacatgatgttcataaggtgtatgtagcacaactggtatctgcagactatgct gctggaagggctagccgtcactgttatcacagcgactgctgcctgagatatgccagggct ggcttggtcttggctgatctggacttggcgcctctgatcctggtgtctctcatcctcctg gacccactggctgactggatatgttcctatcatggcattggcagaagcacaagggagaga gcccgaccccacaagtactttccaaggtccttgactgagttgggtccactaatagcccgc ctcatctgcacactaactgcttctaattgtcatcatacggcaggatccccgcaacaactt gagaacagatcccgagcagcccgtgcaggctccacgacggtccagggagctgaggctcag ggagcatcgggcactgtgacgatttag >gi568815577f:32970247_33171059|GENSCAN_predicted_peptide_6|82_aa MRSSRGGVLLTAPGESSAKNQHNTKHKTTTEYTGLSERTRKPLPGRKNGSGVEELLQAGF IPELELTFPEGFAFQAFQVIAN >gi568815577f:32970247_33171059|GENSCAN_predicted_CDS_6|249_bp atgaggagttcccgtggaggtgtgctgctgacagccccaggtgagagctcagccaagaat cagcacaacactaagcacaaaacaaccactgaatacactggattaagtgaacgtactaga aaaccactacctggtaggaagaatggctctggtgtggaggagttactacaagcagggttc atcccagaactggagctcacattcccagaaggctttgcttttcaggcttttcaggttatc gccaattga