GENSCAN 1.0 Date run: 7-Nov-116 Time: 18:27:29 Sequence gi568815581f:56734796_56962788 : 227993 bp : 43.85% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 5908 6048 141 0 0 11 101 122 0.190 5.93 1.02 Intr + 14563 14744 182 0 2 112 51 69 0.204 4.27 1.03 Intr + 26371 26534 164 1 2 54 102 32 0.146 0.82 1.04 Intr + 28356 28432 77 1 2 80 81 77 0.704 5.43 1.05 Term + 31823 31870 48 1 0 106 48 44 0.552 -0.50 1.06 PlyA + 31952 31957 6 1.05 2.05 PlyA - 32459 32454 6 1.05 2.04 Term - 38027 37832 196 1 1 83 48 114 0.481 3.78 2.03 Intr - 38993 38929 65 2 2 92 65 39 0.497 -0.48 2.02 Intr - 40145 40092 54 2 0 83 115 24 0.273 3.78 2.01 Init - 45772 45611 162 1 0 84 28 137 0.593 5.05 2.00 Prom - 55482 55443 40 -5.66 3.03 PlyA - 55696 55691 6 1.05 3.02 Term - 58265 57582 684 2 0 68 35 836 0.999 69.94 3.01 Init - 62854 62783 72 1 0 82 60 121 0.972 7.77 3.00 Prom - 66349 66310 40 -3.66 4.00 Prom + 69613 69652 40 -3.96 4.01 Init + 72692 72710 19 1 1 39 110 36 0.312 1.09 4.02 Intr + 74501 74608 108 0 0 39 62 85 0.084 1.26 4.03 Intr + 78876 78919 44 1 2 89 80 35 0.070 0.76 4.04 Intr + 88184 88283 100 1 1 56 76 73 0.005 2.58 4.05 Intr + 98332 98489 158 2 2 34 69 162 0.013 8.63 4.06 Intr + 99983 100464 482 1 2 88 80 797 0.121 70.73 4.07 Intr + 109224 109383 160 0 1 129 36 53 0.884 4.19 4.08 Intr + 113127 113270 144 2 0 133 62 63 0.994 8.68 4.09 Intr + 113901 114058 158 2 2 55 123 55 0.966 4.51 4.10 Intr + 114386 114437 52 2 1 90 95 8 0.939 0.61 4.11 Intr + 121717 121830 114 0 0 84 93 43 0.913 5.04 4.12 Intr + 126996 127123 128 2 2 60 91 47 0.972 1.68 4.13 Intr + 127345 127456 112 1 1 88 73 65 0.824 5.38 4.14 Intr + 142540 142644 105 0 0 106 28 90 0.377 5.21 4.15 Term + 143233 143520 288 0 0 -61 48 457 0.796 22.18 4.16 PlyA + 147991 147996 6 1.05 5.11 PlyA - 148795 148790 6 1.05 5.10 Term - 150046 149845 202 0 1 75 49 203 0.623 11.96 5.09 Intr - 157434 156909 526 1 1 62 32 548 0.066 38.60 5.08 Intr - 160646 160548 99 0 0 100 65 95 0.984 8.48 5.07 Intr - 160809 160726 84 1 0 102 110 19 0.968 5.29 5.06 Intr - 161157 161131 27 1 0 68 99 29 0.512 0.09 5.05 Intr - 164385 164320 66 1 0 70 100 33 0.732 1.58 5.04 Intr - 166783 166624 160 1 1 61 60 198 0.998 13.86 5.03 Intr - 169693 169460 234 1 0 89 92 349 0.999 33.29 5.02 Intr - 173768 173673 96 2 0 116 100 97 0.998 13.91 5.01 Init - 179193 178597 597 0 0 95 75 920 0.987 86.68 5.00 Prom - 186464 186425 40 -2.46 6.08 PlyA - 186606 186601 6 1.05 6.07 Term - 204359 204276 84 2 0 104 48 42 0.773 -0.55 6.06 Intr - 207328 207240 89 2 2 111 100 80 0.970 11.19 6.05 Intr - 211716 211647 70 0 1 64 105 7 0.484 -1.25 6.04 Intr - 214639 214592 48 1 0 88 116 40 0.924 5.68 6.03 Intr - 214972 214886 87 1 0 79 71 36 0.630 1.17 6.02 Intr - 216201 215418 784 2 1 89 22 327 0.389 17.68 6.01 Init - 226224 225980 245 0 2 84 85 263 0.996 22.81 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 98579 98313 267 2 0 70 49 341 0.972 22.23 S.002 Term + 127817 127996 180 1 0 62 37 122 0.855 2.11 S.003 Term - 157434 156905 530 1 2 62 48 560 0.920 43.82 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581f:56734796_56962788|GENSCAN_predicted_peptide_1|203_aa MFYSNPKACINAVLIEEYQVECTFFSETFPPPNPLQEDKSTVGSWQNALACQVHAHTRSW LLSLRSLYICSPVTTLFTHNSDADQPLPPPSSRMDGKGYTHSSRTQGRFQALERSIYWLS LERAPVSELSGAKIKRYFLFTFHNGKQDPPSQATLCDCVLHKALITSKEHIGEYDNDHTL SLLPVYKKRWVNWVAGSKSSNTK >gi568815581f:56734796_56962788|GENSCAN_predicted_CDS_1|612_bp atgttttattctaatccaaaagcatgcataaatgctgtattgattgaagagtaccaggta gaatgcacgttcttctcagagaccttccccccacccaaccctctccaagaagacaaaagc actgttggcagctggcagaatgccctggcctgtcaagtgcacgcacatacacgatcatgg ctcctgagtctacgcagcctttacatttgctccccagtgaccacactgttcactcacaac tctgatgcagatcagcctctgcctccaccatcttctcggatggatggcaaaggttacacc cattccagcaggactcagggaaggttccaggccttagaaaggagcatctattggctgagc ctagaaagggcacctgtgtctgagctgtcaggagcaaagataaagaggtatttcttattc accttccacaatgggaagcaagatcctccatcccaggccaccctgtgtgactgtgtatta cacaaagccttgatcacttcaaaagagcacatcggagaatatgacaatgaccatactctt agtctgcttcctgtctataagaagagatgggtcaactgggtagcaggctccaaatcttca aacactaaataa >gi568815581f:56734796_56962788|GENSCAN_predicted_peptide_2|158_aa MGCMCARACVGAGGGDTAPATPPPVHTELLCAKSDLTATLAARSEGGRGGCEGPMRKPKW SKVTDWFKIAQMLLTLDVAVSSLILIFLLWHPGWRCVHSANSSERTLESKFKSGQKCCME RGLNPAALPRQEAGCMLDLLYDVQPMYADDGGDDILIR >gi568815581f:56734796_56962788|GENSCAN_predicted_CDS_2|477_bp atggggtgtatgtgcgcgcgtgcttgcgttggggcgggcggtggagacacagcaccggcc acacccccgccagtccacacagagctgctctgtgcaaagagcgacttgactgcgactctg gccgcccggagcgagggcgggcgcgggggctgcgagggcccgatgaggaaacctaaatgg agtaaagtgactgattggttcaagattgcacagatgctgctgacactggacgtggctgtg agctctttgatcctcattttcctcctctggcatccagggtggcgctgtgttcacagtgca aatagctcagaaaggacactggagtcaaagtttaagtcagggcagaagtgttgcatggaa agaggactcaatccggcggctttgccaagacaggaagcaggctgtatgttggatttgctc tacgatgttcaaccaatgtatgcagatgatgggggtgatgatatattgatccgctga >gi568815581f:56734796_56962788|GENSCAN_predicted_peptide_3|251_aa MALCYLCCLLPSLVVAHLPLLIILSPPFPLSSSSFLFFIIIITITTTNIITITITITITT IATIITITHTIITTTITPTTITTTTTTTTITTTTTTTTTTTTTTTTTTTTTTTTTTTTTT TTTTTTTTTTTTTTSPSPPSSSSPLPPSSPSTPSSPSPPSAPSPPPPHHHYHHHHHNHHH YYHHHNHHHHHHCHTIITTTTTTTTTIIIIITTTTIITIITITTTTITIITITATPSSPP PPLPSQSLSLS >gi568815581f:56734796_56962788|GENSCAN_predicted_CDS_3|756_bp atggccctctgttatttgtgctgtctgctcccttctctggtggtagcacacctgccactc ctcatcatcctgtctcctccttttcccctctcctcctcctccttcttgttcttcatcatc atcatcacaatcaccaccaccaacatcatcaccatcactatcaccatcaccatcaccacc atcgccaccatcatcaccattacccacaccatcatcaccaccaccatcacccccaccacc atcaccaccaccaccaccaccaccaccatcaccaccaccaccaccaccaccaccaccacc accaccaccaccaccaccaccaccaccaccaccaccaccaccaccaccaccaccaccacc accaccaccaccaccaccaccaccaccaccaccaccaccacctcaccatcaccaccatca tcatcgtcaccattgccaccatcatcaccatcaaccccatcatcaccatcaccaccatca gcaccatcaccaccaccaccacatcaccattatcaccaccaccaccataaccatcatcac tattaccatcaccacaatcaccatcatcaccatcactgccacaccatcatcaccaccacc accaccaccaccaccactatcatcattatcatcaccaccaccaccatcataaccatcatc actattaccaccaccacaatcaccatcatcaccatcactgccacaccatcatcaccacca ccaccactaccatcacaatcattatcattatcctaa >gi568815581f:56734796_56962788|GENSCAN_predicted_peptide_4|723_aa MSDGIPAPYCVQKPNSAVASKYLPTQPHNNLAMPSAIIPQNPDFEQKPTAPAPSRYQVRT AKVSIMFLMASVRLLSNLNRDKGALAQAESRRKPLSAAVRAGAVPVPLWIPCFFGGAEQR CFRVRVLRVADVLQAAARVVAAQVSSLEKMEAERRPAPGSPSEGLFADGHLILWTLCSVL LPVFITFWCSLQRSRRQLHRRDIFRKSKHGWRDTDLFSQPTYCCVCAQHILQGAFCDCCG LRVDEGCLRKADKRFQCKEIMLKNDTKVLDAMPHHWIRGNVPLCSYCMVCKQQCGCQPKL CDYRCIWCQKTVHDECMKNSLKNEKCDFGEFKNLIIPPSYLTSINQMRKDKKTDYEVVFD VTKTPPIKALQLCTLLPYYSARVLVCGGDGTVGWVLDAVDDMKIKGQEKYIPQVAVLPLG TGNDLSNTLGWGTGYAGEIPVAQVLRNVMEADGIKLDRWKVQVTNKGYYNLRKPKEFTMN NYFSVGPDALMALNFHAHREKAPSLFSSRILNKLELDGERVALPSLEGIIVLNIGYWGGG CRLWEGMGDETYPLARHDDGLLEVVGVYGSFHCAQIQVKLANPFRIGQAHTVRQTATSVQ MTAKYSEKVTHPNSGVTNMRIPFSLQLELLHDDDEEERRGRGRGRGRRRRKKRKKKKKKK NKKKKKKKKKKKKKKKKKKKEKEKKKKKKSGWILEGQNAKVKIAEILDLGEHSPLGETGQ RMK >gi568815581f:56734796_56962788|GENSCAN_predicted_CDS_4|2172_bp atgtctgacgggatcccagctccctactgtgtacagaaacccaactctgctgtggcatcc aagtaccttccaacccagcctcacaacaacctggcgatgccatctgccatcattccacaa aacccagactttgagcagaaacctacagccccggcacctagcaggtaccaggtgaggaca gccaaggtcagcatcatgttcctgatggccagtgtcaggcttctctccaacttaaacagg gacaaaggagcattggcacaggcagaatcacgaaggaagccgctgagtgcagctgtgcgc gccggggcggtgcctgtgcctctctggattccgtgtttcttcgggggtgctgagcagcgg tgcttccgcgtccgcgttctccgggtagctgatgtgctgcaggctgcagcccgcgtggtc gcggctcaggtatcgtccttggagaagatggaagcggagaggcggccggcgccgggctcg ccctccgagggcctgtttgcggacgggcacctgatcttgtggacgctgtgctcggtcctg ctgccggtgttcatcaccttctggtgtagcctccagcggtcgcgccggcagctgcaccgc agggacatcttccgcaagagcaagcacgggtggcgcgacacggacctgttcagccagccc acctactgctgcgtgtgcgcgcagcacattctgcagggcgccttctgcgactgctgcggg ctccgcgtggacgagggctgcctcaggaaggccgacaagcgcttccagtgcaaggagatt atgctcaagaatgacaccaaggtcctggacgccatgccccaccactggatccggggcaac gtgcccctgtgcagttactgtatggtttgcaagcagcagtgtggctgtcaacccaagctt tgcgattacaggtgcatttggtgccagaaaacagtacatgatgagtgcatgaaaaatagt ttaaagaatgaaaaatgtgattttggagaattcaaaaacctaatcattccaccaagttat ttaacatccattaatcagatgcgtaaagacaaaaaaacagattatgaagtggtttttgat gtaactaaaactcctcctatcaaagccctacaactctgtactcttctcccatattattca gctcgagtacttgtttgtggaggggatgggactgtagggtgggtcctggatgcagttgat gacatgaagattaagggacaagaaaagtacattccacaagttgcagttttgcctctggga acaggcaacgatctatccaatacattgggttggggtacaggttatgctggagaaattcca gttgcgcaggttttgcgaaatgtaatggaagcagatggaattaaactagatcgatggaaa gttcaagtaacaaataaaggatactacaacttaagaaaacccaaggaattcacaatgaac aactatttttctgttggacctgatgctctcatggctctcaattttcatgctcatcgtgag aaggcaccatctctgttttctagcagaattcttaataagctagaactggatggtgagcga gtagcactgcccagcttggaaggtattatagttctgaacatcggatactggggcggtggc tgcagactatgggaagggatgggggacgagacttaccctctagccaggcatgacgatggt ctgctggaagtcgttggagtatatgggtctttccactgtgctcagattcaagtaaaactg gctaatccttttcgaataggacaggcacatacagtgaggcagactgccacgtcggtccaa atgacagccaagtactcagagaaggtgacccaccccaactcaggagtaaccaacatgcga atcccattctctttgcagttggagctattacatgatgatgatgaagaagaaagaagagga agaggaagaggaagaggaagaagaagaagaaagaagaggaagaagaagaagaagaagaag aacaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaag gagaaggagaagaagaagaaaaaaaaaagtgggtggattcttgaagggcaaaatgctaag gtgaaaatagcagagatcctggaccttggagaacactcacctttaggggagactggacag aggatgaaatga >gi568815581f:56734796_56962788|GENSCAN_predicted_peptide_5|696_aa MAELCPLAEELSCSICLEPFKEPVTTPCGHNFCGSCLNETWAVQGSPYLCPQCRAVYQAR PQLHKNTVLCNVVEQFLQADLAREPPADVWTPPARASAPSPNAQVACDHCLKEAAVKTCL VCMASFCQEHLQPHFDSPAFQDHPLQPPVRDLLRRKCSQHNRLREFFCPEHSECICHICL VEHKTCSPASLSQASADLEATLRHKLTVMYSQINGASRALDDVRNRQQDVRMTANRKVEQ LQQEYTEMKALLDASETTSTRKIKEEEKRVNSKFDTIYQILLKKKSEIQTLKEEIEQSLT KRDEFEFLEKASKLRGISTKPVYIPEVELNHKLIKGIHQSTIDLKNELKQCIGRLQEPTP SSGDPGEHDPASTHKSTRPVKKVSKEEKKSKKPPPVPALPSKLPTFGAPEQLVDLKQAGL EAAAKATSSHPNSTSLKAKVLETFLAKSRPELLEYYIKVILDYNTAHNKVALSECYTVAS VAEMPQNYRPHPQRFTYCSQVLGLHCYKKGIHYWEVELQKNNFCGVGICYGSMNRQGPES RLGRNSASWCVEWFNTKISAWHNNVEKTLPSTKATRVGVLLNCDHGFVIFFAVADKVHLM YKFRVDFTEALYPAFWVFSAGATLSICSPKNVKGHELIQDLLSSLHLDSSYPPDAGLSDD DEPPNASLPPDPPLLTVPQMHSVCDQWLQDAFHISL >gi568815581f:56734796_56962788|GENSCAN_predicted_CDS_5|2091_bp atggcagagctgtgccccctggccgaggagctgtcgtgctccatctgcctggagcccttc aaggagccggtcaccactccgtgcggccacaacttctgcgggtcgtgcctgaatgagacg tgggcagtccagggctcgccatacctgtgcccgcagtgccgcgccgtctaccaggcgcga ccgcagctgcacaagaacacggtgctgtgcaacgtggtggagcagttcctgcaggccgac ctggcccgggagccacccgccgacgtctggacgccgcccgcccgcgcctctgcacccagc ccgaatgcccaggtggcctgcgaccactgcctgaaggaggccgccgtgaagacgtgcttg gtgtgcatggcctccttctgtcaggagcacctgcagccgcacttcgacagccccgccttc caggaccacccgctgcagccgcccgttcgcgacctgttgcgccgcaaatgttcccagcac aatcggctgcgggaatttttctgccccgagcacagcgagtgcatctgccacatctgcctg gtggagcataagacctgctctcccgcgtccctgagccaggccagcgccgacctggaggcc accctgaggcacaaactaactgtcatgtacagtcagatcaacggggcgtcgagagcactg gatgatgtgagaaacaggcagcaggatgtgcggatgactgcaaacagaaaggtggagcag ctacaacaagaatacacggaaatgaaggctctcttggacgcctcagagaccacctcgaca aggaagataaaggaagaggagaagagggtcaacagcaagtttgacaccatttatcagatt ctcctcaagaagaagagtgagatccagaccttgaaggaggagattgaacagagcctgacc aagagggatgagttcgagtttctggagaaagcatcaaaactgcgaggaatctcaacaaag ccagtctacatccccgaggtggaactgaaccacaagctgataaaaggcatccaccagagc accatagacctcaaaaacgagctgaagcagtgcatcgggcggctccaggagcccaccccc agttcaggtgaccctggagagcatgacccagcgtccacacacaaatccacacgccctgtg aagaaggtctccaaagaggaaaagaaatccaagaaacctccccctgtccctgccttaccc agcaagcttcccacgtttggagccccggaacagttagtggatttaaaacaagctggcttg gaggctgcagccaaagccaccagctcacatccgaactcaacatctctcaaggccaaggtg ctggagaccttcctggccaagtccagacctgagctcctggagtattacattaaagtcatc ctggactacaacaccgcccacaacaaagtggctctgtcagagtgctatacagtagcttct gtggctgagatgcctcagaactaccggccgcatccccagaggttcacatactgctctcag gtgctgggcctgcactgctacaagaaggggatccactactgggaggtggagctgcagaag aacaacttctgtggggtaggcatctgctacggaagcatgaaccggcagggcccagaaagc aggctcggccgcaacagcgcctcctggtgcgtggagtggttcaacaccaagatctctgcc tggcacaataacgtggagaaaaccctgccctccaccaaggccacgcgggtgggcgtgctt ctcaactgtgaccacggctttgtcatcttcttcgctgttgccgacaaggtccacctgatg tataagttcagggtggactttactgaggctttgtacccggctttctgggtattttctgct ggtgccacactctccatctgctcccccaaaaatgtgaaaggtcatgagctcatccaggac ttgctatcctccctgcatttagacagttcctacccacctgatgctggcctgtctgatgat gatgagcctcccaatgccagcctgccccccgacccgccactcctcactgtgccccagatg cacagtgtttgtgaccagtggctgcaggatgccttccacatcagcctctga >gi568815581f:56734796_56962788|GENSCAN_predicted_peptide_6|468_aa MAASETVRLRLQFDYPPPATPHCTAFWLLVDLNRCRVVTDLISLIRQRFGFSSGAFLGLY LEGGLLPPAESARLVRDNDCLRVKLEERGVAENSVVISNGDINLSLRKAKKRAFQLEEGE ETEPDCKYSKKHWKSRENNNNNEKVLDLEPKAVTDQTVSKKNKRKNKATCGTVGDDNEEA KRKSPKKKEKCEYKKKAKNPKSPKVQAVKDWANQRCSSPKGSARNSLVKAKRKGSVSVCS KESPSSSSESESCDESISDGPSKVTLEARNSSEKLPTELSKEEPSTKNTTADKLAIKLGF SLTPSKGKTSGTTSSSSDSSAESDDQCLMSSSTPECAAGFLKTNPVETPKKDYSLLPLLA AAPQVGEKIAFKLLELTSSYSPDVSDYKEGRILSHNPETQQVDIEILSSLPALREPGKFD LVYHNENGAEVVEYAVTQESKITVFWKELIDPRLIIESPSNTSSTEPA >gi568815581f:56734796_56962788|GENSCAN_predicted_CDS_6|1407_bp atggcagcttccgagacggttaggctacggcttcaatttgattacccgccgccagctacc ccgcactgtacggccttctggcttctggtcgacttgaacagatgccgagtcgtcacagat ctcattagtctcatccgccagcgcttcggcttcagttctggggccttcctaggcctctac ctggagggggggctcttgccccccgccgagagcgcgcgccttgtgagagacaacgactgc ctcagagttaaattagaagagagaggagttgctgagaattctgtagtcatcagtaatggt gacattaatttatctcttagaaaagcaaagaagcgggcatttcagttagaggagggtgaa gaaactgaaccagattgcaaatattcaaagaagcattggaagagtcgagagaacaataac aataatgagaaggtcttggatctggaaccaaaagctgtcacagatcagactgtcagcaaa aaaaacaagagaaaaaataaagcaacctgtggcacagtgggtgatgataacgaagaggcc aaaagaaaatcaccaaagaaaaaggagaaatgtgaatataaaaaaaaggctaagaatccc aagtctccgaaagtacaggcagtgaaagactgggccaatcagagatgtagttctccaaaa ggttctgctagaaacagccttgttaaagccaaaaggaaaggtagtgtaagcgtttgctca aaagagagtcccagttcctcctcggagtctgaatcttgtgatgaatctatcagtgatggt cccagcaaagtcactttggaggccagaaattcctcagagaaattaccaactgagttatca aaggaagaaccctctaccaaaaatacaactgcagacaaactggctataaaacttggcttt agccttacccccagcaagggcaagacctctggaacaacatcttccagttcagactctagt gcagagtcagacgaccaatgcttgatgtcatcgagcaccccggagtgtgctgcgggtttc ttaaagacaaatccagtagagacacccaagaaggactatagtctgttaccactgttagca gctgcccctcaagttggagaaaagattgcatttaagcttttggagctaacatccagttac tctcctgatgtctctgactacaaggaaggaagaatattaagccacaatccagagacccag caagtagatatagaaattctttcatccttacctgccttgagagaacctgggaaatttgat ttagtttatcacaatgaaaatggagccgaggtagtggagtacgctgtgacacaggagagc aagatcactgtattttggaaagagttgattgacccaagactgattattgaatctccaagt aacacatcaagtacagaacctgcctga