GENSCAN 1.0	Date run: 6-Nov-116	Time: 22:57:54

Sequence gi568815597f:145628328_145836134 : 207807 bp : 42.14% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   2360   2480  121  0  1   67   92   145 0.883  11.43
 1.02 Intr +   3337   3417   81  1  0   58  116   114 0.939   9.03
 1.03 Intr +  27232  27414  183  1  0   74   44   155 0.250   7.68
 1.04 Term +  27675  27921  247  0  1   66   53   193 0.725   7.88
 1.05 PlyA +  28698  28703    6                               1.05

 2.10 PlyA -  29137  29132    6                               1.05
 2.09 Term -  41345  41235  111  2  0   48   48    81 0.148  -2.32
 2.08 Intr -  44693  44403  291  2  0   86  100   343 0.987  31.71
 2.07 Intr -  45554  45330  225  2  0   71   64   151 0.985   8.26
 2.06 Intr -  50318  50122  197  0  2  -48  119   238 0.921  11.51
 2.05 Intr -  52780  52585  196  1  1  117  116   134 0.957  17.17
 2.04 Intr -  54309  54173  137  1  2   91   91   200 0.990  19.97
 2.03 Intr -  58399  58150  250  1  1  127  100   174 0.995  18.59
 2.02 Intr -  59696  59485  212  0  2   98   68   265 0.997  23.21
 2.01 Init -  61781  61724   58  2  1   77  105    46 0.952   6.86
 2.00 Prom -  62689  62650   40                              -7.75

 3.00 Prom +  62831  62870   40                              -7.25
 3.01 Init +  72323  72366   44  1  2   94   84    26 0.389   2.74
 3.02 Intr +  78884  79045  162  2  0  101   99   184 0.852  18.97
 3.03 Intr +  93571  93667   97  1  1   48   20   124 0.007   0.69
 3.04 Intr +  99929 100073  145  1  1   55   65   153 0.301   8.63
 3.05 Intr + 107670 107807  138  1  0   67   95   103 0.670   8.41
 3.06 Intr + 107884 108028  145  2  1   80   94    63 0.416   4.52
 3.07 Term + 113045 113300  256  2  1   69   50   111 0.227  -0.53
 3.08 PlyA + 113787 113792    6                               1.05

 4.05 PlyA - 113882 113877    6                               1.05
 4.04 Term - 118670 118539  132  2  0   49   48    82 0.864  -2.59
 4.03 Intr - 119783 119668  116  0  2   87  105    69 0.955   7.75
 4.02 Intr - 122173 122080   94  1  1   96   74    87 0.863   6.72
 4.01 Init - 123176 123111   66  2  0   63   94    27 0.474   2.01
 4.00 Prom - 123537 123498   40                              -7.15

 5.00 Prom + 124087 124126   40                              -4.65
 5.01 Init + 132527 132799  273  1  0   44   78   168 0.668   8.42
 5.02 Term + 133027 133617  591  0  0   75   43   157 0.623   3.44
 5.03 PlyA + 134179 134184    6                              -0.45

 6.07 PlyA - 134325 134320    6                               1.05
 6.06 Term - 136699 136605   95  2  2   87   49   121 0.314   5.11
 6.05 Intr - 138456 138359   98  2  2   81   59    44 0.190  -0.47
 6.04 Intr - 139273 139154  120  0  0   68   59    67 0.126   0.49
 6.03 Intr - 143592 143384  209  0  2   93   96   155 0.294  13.75
 6.02 Intr - 156269 156212   58  1  1   84  107    73 0.974   6.77
 6.01 Init - 156802 156768   35  1  2   61   98    35 0.865   1.19
 6.00 Prom - 158299 158260   40                              -9.45

 7.00 Prom + 160637 160676   40                              -2.95
 7.01 Init + 166905 167053  149  2  2   57   68   163 0.940  10.81
 7.02 Intr + 195479 195679  201  2  0    2   94   161 0.065   5.58
 7.03 Intr + 195742 195870  129  1  0   57   32   150 0.591   5.19
 7.04 Intr + 197437 197596  160  1  1   11   91   136 0.435   5.27
 7.05 Intr + 198127 198382  256  0  1   99   56   312 0.910  25.19
 7.06 Intr + 198493 198678  186  2  0   93   82   174 0.998  16.04
 7.07 Intr + 200422 200510   89  2  2   68   80    84 0.370   4.37
 7.08 Intr + 204933 205037  105  2  0   79   90    73 0.958   6.09
 7.09 Intr + 205163 205255   93  1  0   55  105    90 0.842   6.64

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815597f:145628328_145836134|GENSCAN_predicted_peptide_1|210_aa
XILSIEQLISRVGVIGVTLMALLSGFGAVNCPYTYMSYFLRNVTDTDILALERRLLQTMD
MIISKKKRAAAICWRSNPYPTRLGPSRTWRYYQWRLQNSKDGCLLRPLGGSTDLMPAGTF
LYKMSGNPWEASPSEEEWIRVPPKEAVWPQSATTTVLCCGEFLLVQTTYSPRHWQGKRAD
WTCSDGGCPGNLVVLGNLQPAAAGCNLSGH
>gi568815597f:145628328_145836134|GENSCAN_predicted_CDS_1|633_bp
nggatcttatccatagaacagctcatcagccgggttggtgtgattggagtgactctcatg
gctcttctttctggatttggtgctgtcaactgcccatacacttacatgtcttacttcctc
aggaatgtgactgacacggatattctagccctggaacggcgactgctgcaaaccatggat
atgatcataagcaaaaagaaaagggctgctgccatttgttggcggtcgaatccataccct
actcgcctgggtccctctcgcacctggaggtattaccagtggaggctgcagaacagcaaa
gatggctgcttgctccgtcctctgggaggaagcactgacctgatgccggcaggaacgttc
ctgtataagatgtctggcaacccctgggaggcctcgcctagtgaggaggaatggatccga
gtcccacctaaagaagcggtctggccacagtctgccactaccactgtgctgtgctgtggg
gaattcctcctggtccaaaccacctattctccccggcactggcaggggaaaagggcagac
tggacctgcagtgatggtggctgtcccgggaacttggtcgtcttaggcaatctccagcct
gctgctgctggctgcaacctcagtggccactga
>gi568815597f:145628328_145836134|GENSCAN_predicted_peptide_2|558_aa
MAFDKAELPSSPKVLVLTTEMTSTFNPRECKLSKQEGQNYGFFLRIEKDTEGHLVRVVEK
CSPAEKAGLQDGDRVLRINGVFVDKEEHMQVVDLVRKSGNSVTLLVLDGDSYEKAVKTRV
DLKELGQSQKEQGLSDNILSPVMNGGVQTWTQPRLCYLVKEGGSYGFSLKTVQGKKGVYM
TDITPQGVAMRAGVLADDHLIEVNGENVEDASHEEVVEKVKKSGSRVMFLLVDKETDKRH
VEQKIQFKRETASLKLLPHQPRIVEMKKGSNGYGFYLRAGSEQKGQIIKDIDSGSPAEEA
GLKNNDLVVAVNGESVETLDHDSVVEMIRKGGDQTSLLVVDKETDNMYRLAHFSPFLYYQ
SQELPNGSVKEAPAPTPTSLEVSSPPDTTEEVDHKPKLCRLAKGENGYGFHLNAIRGLPG
SFIKEVQKGGPADLAGLEDEDVIIEVNGVNVLDEPYEKVVDRIQSSGKNVTLLVCGKKAY
DYFQAKKIPIVSSLADPLDTPPDSKEGIVVESNHDSHMAKERQDNDIGGLTARDGIKELS
RKREMNLCSSVRFLCHEQ
>gi568815597f:145628328_145836134|GENSCAN_predicted_CDS_2|1677_bp
atggcttttgataaggcagagctgcctagctctcccaaggtgctggtgttgaccactgaa
atgacctccaccttcaacccccgagaatgtaaactgtccaagcaagaagggcaaaactat
ggcttcttcctgcgaattgagaaggacaccgagggccacctggtccgggtggttgagaag
tgtagcccagcagagaaggctggccttcaagatggagacagagttcttaggatcaatggt
gtctttgtggacaaagaagaacatatgcaggttgtggatctggtcagaaagagtgggaat
tcagtgactttactagttctggatggggattcctatgagaaagcagtgaaaacacgggtg
gacttgaaagagttgggtcaaagtcagaaggagcaaggtttgagtgataatatactttcc
cctgtgatgaatggaggtgtgcaaacttggacccagccccggctctgctatctcgtgaag
gaaggaggcagctatggcttctctctgaaaactgtccaaggtaaaaagggggtgtacatg
actgatattacacctcaaggtgtggctatgagagctggagttctggctgatgatcacttg
attgaagtgaatggagagaatgtagaggatgccagccatgaggaagtggttgaaaaggtg
aagaagtcaggaagccgtgtcatgttcctgctggtggacaaagaaactgacaagcgtcat
gttgagcagaagatacaattcaaaagagaaacagccagtttgaaactgttaccccaccag
ccccgaattgtggagatgaagaaaggaagcaatggctatggtttctatctgagggcaggc
tcagaacagaaaggtcaaatcatcaaggacatagattctggaagtccagcagaggaggct
ggcttgaagaacaatgatctggtagttgctgtcaacggcgagtctgtggaaaccctggat
catgacagtgtggtagaaatgattagaaagggtggagatcagacttcactgttggtggta
gacaaagagacggacaacatgtacagactggctcatttttctccatttctctactatcaa
agtcaagaactgcccaatggctctgtcaaggaggctccagctcctactcccacttctctg
gaagtctcaagtccaccagatactacagaggaagtagatcataagcctaaactctgcagg
ctggctaaaggtgaaaatggctatggctttcacttaaatgcgattcggggtctgccaggc
tcattcatcaaagaggtacagaagggcggtcctgctgacttggctgggctagaggatgag
gatgtcatcattgaagtgaatggggtgaatgtgctagatgaaccctatgagaaggtggtg
gatagaatccagagcagtgggaagaatgtcacacttctagtctgtggaaagaaggcctat
gattatttccaagctaagaaaatccctattgtttcctccctggctgatccacttgacacc
cctccagattctaaagaaggaatagtggtggagtcaaaccatgactcgcacatggcaaaa
gaacggcaggacaatgacattggaggacttactgctagagatggcataaaagaactgtca
agaaagagggagatgaatttatgcagtagtgttagattcctgtgtcacgaacagtaa
>gi568815597f:145628328_145836134|GENSCAN_predicted_peptide_3|328_aa
MARCNEIPSGKKQPRQPATGLEKEKGKRKRKKVVQAPVASTLKEDTSSSYLEMEEELLCS
FTQEFCLCSTVVLVLTAVREWMSGCEERDKGDSKRKTLITWETEAKDTSRANICFKFLGL
LTACRMLLEPGRGCCALAILLAIVDIQSGETGNYTVTGLKQRQHLEFSHNEGTLSSGFLQ
EKVWVMLVTSLVALQGGEGWKGCPGGEKNVTQALAIHRKVQTSFTDPKALRRKTDKSDFS
MGRRYERSKPYNVNGIRQGLYFDPGKHHQLYTLYSMHSILNPWSSSVARWLNTETLTLLQ
ARISPYRTATCDTYPIKVFLHLTVTEPT
>gi568815597f:145628328_145836134|GENSCAN_predicted_CDS_3|987_bp
atggccagatgcaatgaaataccatcaggaaagaaacagcccagacagccagctactggc
ttggaaaaagaaaaagggaaaaggaaaagaaagaaggtagttcaggccccggtggccagc
accctcaaggaggacacgagcagttcttacctggagatggaagaggagctgctctgttcg
ttcactcaggaattctgtctctgcagtacagtcgttcttgtgttaacagctgttcgtgaa
tggatgagcggctgtgaggagagggacaaaggggactccaagagaaagactctcataact
tgggagactgaagccaaggataccagcagagccaacatttgcttcaagttcctgggcctg
ctgacagcgtgcaggatgctgttggaacccggcagaggctgctgtgccctggccatcctg
ctggcaattgtggacatccagtctggtgagacagggaactacacagtgacgggattgaaa
caaagacaacaccttgagttcagccataatgaaggcactctcagttcaggcttcctacaa
gaaaaggtctgggtaatgctggtcaccagccttgtggcccttcaaggaggagaaggttgg
aaaggatgtccagggggagagaaaaatgttactcaagccctggctatccacaggaaagtt
caaaccagtttcactgatccaaaggcactaaggaggaagacagataaatcagacttttca
atgggtaggaggtatgagagatcgaaaccttacaatgtcaatgggatcagacaagggctc
tattttgatcctggaaagcatcaccagctgtatacattgtattctatgcattctatttta
aacccctggagcagctcagtggctcgctggctgaatactgagacactgacattactgcag
gctaggatttccccctataggacagccacttgtgatacttaccccatcaaagttttcctc
catcttactgtgacagaacctacatga
>gi568815597f:145628328_145836134|GENSCAN_predicted_peptide_4|135_aa
MLHSNPGDYAWGQTGLDAIVTQLLGQLENTGPPPADKEKITSLPTVTVTQEQVDMGLECP
VCKEDYTVEEEVRQLPCNHFFHSSCIVPWLELHDTCPVCRKSLNGEDSTRQSQSTEASAS
NRFSNDSQLHDRWTF
>gi568815597f:145628328_145836134|GENSCAN_predicted_CDS_4|408_bp
atgctgcactccaaccctggggactatgcctggggtcagacagggcttgatgccattgta
acccagcttttaggacaactggaaaacacaggccctcccccagctgacaaggaaaagatc
acatctcttccaacagtgacagtaactcaggaacaagttgatatgggtttagagtgtcca
gtatgcaaagaagattacacagttgaagaggaagtccggcagttaccttgcaatcacttc
tttcacagcagttgtattgtgccgtggctagaactgcatgacacatgtcctgtatgtagg
aagagcttaaatggtgaggactctactcggcaaagccagagcactgaggcctctgcaagc
aacagatttagcaatgacagtcagctacatgaccgatggactttctga
>gi568815597f:145628328_145836134|GENSCAN_predicted_peptide_5|287_aa
MWENLELPRDLFNGFAQNANSDMDNKVQAEVVSDGDKKLVGNWSKEDSCYVLTKRLAAFC
PCPTDLWNFELERDDSEYLVEETSKQQSIQEAQRPKRKKWFCGLGPGSPCCVQPRDLVPC
VPVAPAMAERGQHRAWVVASESASPKPWQRPCGAEPASALKTRTEVWEPPPRFQKMYGNT
RMPRQKFAAGAEPSWRTSARAVRKENVGWEPPHRVPTGELPSGSVRSGPPSSRSQNGRST
DSLHQCAWKSHRHSTPASESSREGGWTPQSYRGGAAQDHGNPPFASA
>gi568815597f:145628328_145836134|GENSCAN_predicted_CDS_5|864_bp
atgtgggaaaatttggaacttcctagagacttgttcaatggctttgcccaaaatgctaat
agtgatatggacaataaggtccaggctgaggtggtctcagatggagataagaaacttgtt
gggaactggagcaaagaagactcttgttatgttttaacaaagagactggcagcattttgc
ccctgccctacagatttgtggaactttgaacttgagagagatgattcagaatatctggtg
gaagaaacttctaagcagcaaagcattcaggaggcccagagacccaagaggaaaaagtgg
ttttgtgggctgggcccagggtccccgtgctgtgtgcagcctagggacttggtgccctgt
gtcccagttgctccagccatggctgaaaggggccaacatagagcttgggttgtggcttca
gagagtgcaagcccaaagccttggcagcgtccatgtggtgctgagcctgcgagtgcactg
aagacaagaactgaggtttgggaacctccgcctagatttcagaagatgtatggaaacacc
cggatgcccaggcaaaagtttgctgcaggggcggagccctcatggagaacctctgctagg
gcggtgcggaaggaaaatgtggggtgggagcccccacacagagtccctactggggaactg
cctagcggatctgtgagaagtgggccaccatcttccagatcccagaatggtagatccact
gacagcttgcaccaatgcgcctggaaaagccacagacactcaacaccagccagtgaaagc
agccgggagggaggttggaccccgcaaagctacaggggtggagctgcccaagaccacggg
aacccaccttttgcatcagcatga
>gi568815597f:145628328_145836134|GENSCAN_predicted_peptide_6|204_aa
MEEVAFEMGPESFLGGGGSRIDNTTTTHFAELWGHLDHTMFFQDFRPFLSSSPLDQDNRA
NERGHQTHTDFWGARPPRLPLGRRYRSRGSSRPDRSPAIEGAEIAASARPPPLLGSEERL
CLAAHRLGCEEPLCLAAQSGNRPVREVRGASARPPLLGSEEPLCPDSRPVREGDTTTIRF
LYPFPTFPPSLFHKTAIVIMARSQ
>gi568815597f:145628328_145836134|GENSCAN_predicted_CDS_6|615_bp
atggaggaggtagcatttgagatgggtcctgaaagttttttaggtggtggcggcagtcgg
atagacaataccacaacaacacattttgcagagctttggggccatttggatcacacgatg
ttttttcaagattttagaccctttctaagtagcagtccactggaccaagataatagagcc
aatgaaaggggtcaccagactcacactgacttctggggagcaagacctccacggttgcca
ttgggtcggagatacagatctcgaggaagttctcgtcctgacagatctccagctattgaa
ggtgccgagattgcagcctctgcccggccaccaccccttctgggaagtgaggagcgtctc
tgcctggccgcccatcgtctgggatgtgaggagcctctctgcctggctgcccagtctgga
aaccgccctgtccgggaggtgaggggcgcctctgcccggccgcccctactgggaagtgag
gagcccctctgcccggacagccgccccgtccgggagggagacacaacaacaatccgattt
ctctatcccttccccacctttcccccttctctgttccacaaaaccgccatcgtcatcatg
gcccgttctcaatga
>gi568815597f:145628328_145836134|GENSCAN_predicted_peptide_7|456_aa
MKPQTLQVSVTAHKSSADPKSEQQQVLLRRAKNKASTTRKGPKLLAAAGCGRKTGGRLQR
RPSPPRPPKPPPFLPSAAAVREGSRPVPRQAATSRAAVVAAAAASVRPTASEPASVTTKE
AQEGETNGPPAGSTNSPSLVLQSSSEKKERASPEAAETLGLPSTMTQAEIKLCSLLLQEH
FGEIVEKIGVHLIRTGSQPLRVIAHDTGTSLDQVKKALCVLVQHNLVSYQVHKRGVVEYE
AQCSRVLRMLRYPRYIYTTKTLYSDTGELIVEELLLNGKLTMSAVVKKVADRLTETMEDG
KTMDYAEVSNTFVRLADTHFVQRCPSVPTTENSDPGPPPPAPTLVINEKDMYLVPKLSLI
GKGKRRRSSDEDAAGEPKAKRPKYTTDNKEPIPDDGIYWQANLDRFHQHFRDQAIVSAVA
NRMDQTSSEIVRTMLRMSEITTSSSAPFTQPLSSNE
>gi568815597f:145628328_145836134|GENSCAN_predicted_CDS_7|1368_bp
atgaagccacagaccctccaggtgagtgttacagctcataaaagtagtgcagacccaaag
agtgagcagcagcaagttttactgagaagagccaagaacaaagcttccacaacgcggaag
ggacccaagctgcttgctgctgctggctgtggcagaaaaaccggtgggcggctacagcgg
cgcccgagtccgccccggccgccgaagcctccgccatttttgccctccgccgcggccgtc
cgagagggcagccggcccgtccctcgccaggccgctacctcccgagctgcagtcgtcgcc
gccgccgccgcctcggtgcggcccaccgcttcagagcccgcgtcggtcaccaccaaagag
gcgcaggaaggagagacaaacggcccgcccgccggctccacaaacagcccctcgctggtc
ctccagtcttccagcgagaagaaagaaagagcgtccccggaagccgccgaaactctggga
ctccccagtacaatgactcaagcagaaattaagctctgttctttgttgctgcaagagcat
tttggagagattgtagaaaaaattggagtccatctgataagaaccggcagccagccacta
agagtaattgcccatgacacaggaacatcactggatcaggtgaagaaagccctgtgtgtc
ctcgtccaacataacctggtgagttatcaagtgcacaaacgtggtgtggtggagtatgaa
gcccagtgcagccgggtattgcgaatgcttagatatccccggtacatctatactaccaaa
actctgtacagtgacactggagagctgattgttgaggagcttctgttgaacggcaaactg
acaatgtcagctgttgtgaagaaagtggcagaccggctcacagagaccatggaggatggc
aagaccatggactatgctgaagtatcaaacacatttgtgcgactggcagacacacacttt
gtacaacgctgcccttcggtacctaccactgagaattcagaccctgggccaccaccacct
gcccccacacttgtcattaatgaaaaggacatgtacctggttcctaaactcagcttgata
gggaaaggtaaaaggaggagatcatctgatgaagatgctgctggggagcccaaggccaag
agaccaaaatatactacagataacaaggagcccattccagatgatgggatttattggcag
gccaaccttgacagattccaccaacacttccgtgaccaagccattgtgagcgcagttgct
aacaggatggaccagacaagcagcgagattgtgcgaaccatgctccgaatgagtgagatt
accacttcctctagtgctcccttcacccagccattgtcttccaatgag