GENSCAN 1.0 Date run: 7-Nov-116 Time: 21:22:07 Sequence gi568815583f:64492979_64693323 : 200345 bp : 42.71% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 6442 7188 747 0 0 51 116 716 0.960 65.46 1.02 Intr + 8019 8121 103 2 1 17 77 94 0.321 0.03 1.03 Term + 11260 11330 71 2 2 113 47 -2 0.052 -4.68 1.04 PlyA + 12104 12109 6 1.05 2.05 PlyA - 13177 13172 6 1.05 2.04 Term - 14557 14275 283 0 1 46 33 154 0.080 -0.39 2.03 Intr - 35833 35693 141 0 0 32 46 154 0.093 4.15 2.02 Intr - 36219 35917 303 2 0 7 36 310 0.544 12.48 2.01 Init - 36635 36430 206 2 2 91 16 235 0.777 14.97 2.00 Prom - 44750 44711 40 -4.95 3.02 PlyA - 45437 45432 6 1.05 3.01 Sngl - 51939 51658 282 0 0 79 49 148 0.809 5.24 3.00 Prom - 83909 83870 40 -3.95 4.00 Prom + 98506 98545 40 -5.65 4.01 Init + 100001 100344 344 1 2 75 75 376 0.485 31.75 4.02 Term + 125538 125847 310 0 1 17 43 245 0.025 6.45 4.03 PlyA + 126656 126661 6 1.05 5.00 Prom + 129284 129323 40 -4.75 5.01 Init + 129849 130074 226 2 1 62 115 285 0.891 27.28 5.02 Intr + 133272 133303 32 1 2 88 47 21 0.424 -5.07 5.03 Intr + 133530 133654 125 2 2 71 80 120 0.621 7.96 5.04 Intr + 143317 143371 55 1 1 76 81 38 0.435 -0.04 5.05 Intr + 147728 147853 126 1 0 72 37 94 0.200 2.66 5.06 Intr + 176055 176265 211 2 1 59 66 102 0.285 2.66 5.07 Intr + 177368 177455 88 0 1 96 121 26 0.969 4.81 5.08 Intr + 180938 183278 2341 2 1 54 93 2554 0.976 238.88 5.09 Intr + 185138 185504 367 1 1 49 92 557 0.902 45.99 5.10 Intr + 186770 186853 84 0 0 42 94 63 0.657 1.17 5.11 Intr + 187207 187382 176 2 2 137 107 147 0.999 20.24 5.12 Intr + 187668 187761 94 2 1 104 18 73 0.626 0.52 5.13 Intr + 187803 187884 82 1 1 90 115 21 0.891 2.78 5.14 Intr + 188331 188409 79 0 1 88 115 -18 0.630 -0.47 5.15 Term + 188714 188842 129 1 0 112 49 177 0.949 13.40 5.16 PlyA + 192261 192266 6 1.05 6.04 PlyA - 193059 193054 6 1.05 6.03 Term - 195856 195726 131 2 2 108 38 148 0.882 9.16 6.02 Intr - 196191 196106 86 2 2 115 78 83 0.828 8.54 6.01 Intr - 197571 197396 176 0 2 115 65 163 0.999 14.52 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 35833 35689 145 0 1 32 47 162 0.836 2.90 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815583f:64492979_64693323|GENSCAN_predicted_peptide_1|306_aa MSLSSGASGGKGVDANPVETYDSGDEWDIGVGNLIIDLDADLEKDQQKLEMSGSKEVGIP APNAVATLPDNIKFVTPVPGPQGKEGKSKSKRSKSGKDTSKPTPGTSLFTPSEGAASKKE VQGRSGDGANAGGLVAAIAPKGSEKAAKASRSVAGSKKEKENSSSKSKKERSEGVGTCSE KDPGVLQPVPLGGRGGQYDGSAGVDTGAVEPLGSIAIEPGAALNPLGTKPEPEEGENECR LLKKVKSEKEMELAGLEENGCESRVSSGSIKLQQLIPAKQWPCGVVIQGSSRERLSCPIS SFYRRN >gi568815583f:64492979_64693323|GENSCAN_predicted_CDS_1|921_bp atgtccttgagcagtggagcctccggagggaaaggagtggatgcaaacccggttgagaca tacgacagtggggatgaatgggacattggagtagggaatctcatcattgacctggacgcc gatctggaaaaggaccagcagaaactggaaatgtcaggctcaaaggaggtggggataccg gctcccaatgctgtggccacactaccagacaacatcaagtttgtgaccccagtgccaggt cctcaagggaaggaaggcaaatcaaaatccaaaaggagtaagagtggcaaagacactagc aaacccactccagggacttccctgttcactccaagtgagggggcagctagcaagaaagag gtgcaggggcgctcaggagatggtgccaatgctggaggcctggttgctgctattgctccc aagggctcagagaaggcggctaaggcatcccgcagtgtagccggttccaaaaaggagaag gagaacagctcatctaagagcaagaaggagagaagcgaaggagtggggacttgttcagaa aaggatcctggggtcctccagccagttcccttgggaggacggggtggtcagtatgatgga agtgcaggggtggatacaggagctgtggagccacttgggagtatagctattgagcctggg gcagcgctcaatcctttgggaactaaaccggagccagaggaaggggagaatgagtgtcgc ctgctaaagaaagtcaagtctgaaaaggaaatggaacttgcagggctggaggagaatggg tgtgagtctagggtcagctcaggcagcattaaacttcagcagcttattcctgcaaaacag tggccttgtggtgttgtaattcaaggcagttccagagaaagattatcttgtccaatttcc tcattttatagaagaaactaa >gi568815583f:64492979_64693323|GENSCAN_predicted_peptide_2|310_aa MTAFNSGKVDIVAINDPFIDLNYMVYMFLYDSTHGKFHGTVKAENGTLVINGNPITHFPG PRSLQNQMGAPAKVIHDNFGIIEGFMTTVHTITATQTINGPSGKLSRDGCRALQDIIPVS TGAAKPVGKVIPELNGKLTGMAFHVPTANVSVADLTCRLEKPANMMTSRSNTHSSTFDAG AAIVLKDHSVKLISWYDNEFGYSNRVVHLMAHNASKESGDDKEMNSVISKASEHSPVSSS GGGLLKHGLKSENHRERAQTASSGRISAVYVTATSQALLPPWMSSAGKSSYTQMAKIQVH TGGGRLRAKT >gi568815583f:64492979_64693323|GENSCAN_predicted_CDS_2|933_bp atgactgcttttaactctggtaaagtggatattgttgccatcaatgaccccttcattgac ctcaactacatggtctacatgttcctgtatgattctacacatggcaaattccatggcacc gtcaaggctgagaacgggacgcttgtcatcaatggaaaccccatcacccattttccagga ccaagatccctccaaaatcaaatgggcgccccagccaaggtcatccatgacaactttggt atcatagaaggattcatgaccacagttcacaccatcactgccacccagactataaatggc ccctccgggaaactgtcacgtgatggctgcagggctctccaggacatcatccctgtctct actggcgctgccaagcctgtgggtaaggtcatccctgagctgaacgggaagctcactggc atggccttccatgtccccactgccaatgtgtcagtggcggacctgacctgtcgtctggaa aaacctgccaatatgatgacatcaagaagcaacacccactcttctaccttcgatgctggg gcagccattgtcctcaaggaccactctgtcaagctaatttcctggtatgacaatgaattt ggctacagcaacagggtggtgcacctcatggcccacaatgcctccaaggagtcaggggat gataaggaaatgaacagtgtaatttccaaggcctctgaacacagccctgtcagcagctct ggaggcgggttactcaagcatggcttgaagagtgaaaaccacagagagagggctcagaca gccagcagtggcaggatctcagctgtgtatgttactgccacatcccaagccctgctgcct ccctggatgagctctgctggtaaaagctcttatacccaaatggcaaaaatacaagttcac actgggggaggaagattaagggctaagacttag >gi568815583f:64492979_64693323|GENSCAN_predicted_peptide_3|93_aa MQHNTYLQLDGLTAEEKALTWNPETENQTPFNNQCTCPVFLAHGFHARLVKRGSLADSLE ELREGMSPGFPVERTLDSLTVTAGFGKAFQIRL >gi568815583f:64492979_64693323|GENSCAN_predicted_CDS_3|282_bp atgcagcacaacacctacctacaacttgatggtctgacagcagaggagaaagcactgact tggaatccagagactgaaaaccaaacgccctttaacaatcaatgtacctgccctgtattc ctggctcatggctttcatgctagattggtaaaaaggggttcgcttgctgacagcttagag gagttaagagaagggatgagtcctggttttcctgttgaaagaactctggacagcctgaca gtcactgctggctttggaaaggcatttcagataaggctgtag >gi568815583f:64492979_64693323|GENSCAN_predicted_peptide_4|217_aa MTKKRRNNGRAKKGRGHMQPIRCTNCARCVPKDKAIKKFVIRNIVESAAVRDISEASVFD AYVLPKLYVKLHYCVSCAIHSKVVRNRSREARKDRTPPPRFRPAGAAPRPPPKPMFLSWC TGRIGSNVGLENECKVLLSESNAQQIGEPEGRWFSPGVQLLSGPGSSPNAPAKLCFVPRP AGICHVLFRHVCSSAGMLLSTPSRGPAAVSSSANVFL >gi568815583f:64492979_64693323|GENSCAN_predicted_CDS_4|654_bp atgacaaagaaaagaaggaacaatggtcgtgccaaaaagggccgcggccacatgcagcct attcgctgcactaactgtgcccgatgcgtgcccaaagacaaggccattaagaaattcgtc attcgaaacatagtggagtccgcagcagtcagggacatttctgaagcgagcgtcttcgat gcctatgtgcttcccaagctgtatgtgaagctgcattactgtgtgagttgtgcaattcac agcaaagtagtcaggaatcgatctcgtgaagcccgcaaggaccgaacacccccaccccga tttagacctgcgggtgctgccccacgtcccccaccaaagcccatgttcttgtcttggtgt actggaagaattggatcaaatgtgggcttggagaatgagtgtaaggttttattgagtgaa agtaatgctcagcagattggggagccagaagggagatggttttcccctggagttcagcta ctcagcggcccaggctcttctccaaatgccccagccaaactctgcttcgttccacggcct gctggcatttgtcatgtgctcttccgccacgtgtgctcttctgccggcatgctcctctca acgccctctcgaggtccagccgctgtgtcttcttctgccaatgtgttcctctag >gi568815583f:64492979_64693323|GENSCAN_predicted_peptide_5|1404_aa MESPVSTPAVLPIHLLVPVVNNDISSPCEQIMVRTRSVGVNTCDVALATEPECLGPCEPG TSVNLEGIVWQETEDEGLHLNCQLAKQTTLSSNRKVLTAQREAVNAWPLRGMEDTEVPVE LSHLAENSCCLHGLLHSNKLYDKKAMWFSEDSSCIAKGMMTFAEPPSPAMAAGYGYLSKR KGRIYDLRIGKEQNENIMLPKEKGKRYSWATINGYMNEYVLFEAIWRWFGRICENSKYKC TVINSFTYRNSSYRHICVGMLVVNVTWRNKTYVGTLLDCTRHDWAPPRFCDSPTSDLEMR NGRGRGKRMRPNSNTPVNETATASDSKGTSNSSKTRAGANSKGRRGSQNSSEHRPPASST SEDVKASPSSANKRKNKPLSDMELNSSSEDSKGSKRVRTNSMGSATGPLPGTKVEPTVLD RNCPSPVLIDCPHPNCNKKYKHINGLKYHQAHAHTDDDSKPEADGDSEYGEEPILHADLG SCNGASVSQKGSLSPARSATPKVRLVEPHSPSPSSKFSTKGLCKKKLSGEGDTDLGALSN DGSDDGPSVMDETSNDAFDSLERKCMEKEKCKKPSSLKPEKIPSKSLKSARPIAPAIPPQ QIYTFQTATFTAASPGSSSGLTATVAQAMPNSPQLKPIQPKPTVMGEPFTVNPALTPAKD KKKKDKKKKESSKELESPLTPGKVCRAEEGKSPFRESSGDGMKMEGLLNGSSDPHQSRLA SIKAEADKIYSFTDNAPSPSIGGSSRLENTTPTQPLTPLHVVTQNGAEASSVKTNSPAYS DISDAGEDGEGKVDSVKSKDAEQLVKEGAKKTLFPPQPQSKDSPYYQGFESYYSPSYAQS SPGALNPSSQAGVESQALKTKRDEEPESIEGKVKNDICEEKKPELSSSSQQPSVIQQRPN MYMQSLYYNQYAYVPPYGYSDQSYHTHLLSTNTAYRQQYEEQQKRQSLEQQQRGVDKKAE MGLKEREAALKEEWKQKPSIPPTLTKAPSLTDLVKSGPGKAKEPGADPAKSVIIPKLDDS SKLPGQAPEGLKVKLSDASHLSKEASEAKTGAECGRQAEMDPILWYRQEAEPRMWTYVYP AKYSDIKSEDERWKEERDRKLKEERSRSKDSVPKEDGKESTSSDCKLPTSEESRLGSKEP RPSVHVPVSSPLTQHQSYIPYMHGYSYSQSYDPNHPSYRSMPAVMMQNYPVISIDIFIFD DEETEAQNSYLFAQGYQASSYLPSSYSFSPYGSKVSGGEDADKARASPSVTCKSSSESKA LDILQQHASHYKSKSPTISDKTSQERDRGGCGVVGGGGSCSSVGGASGAPDVHTPPPPPL GVLIAPSTVQLTLCSRAFFYSHCCQPTRLNSLTLPTPQEVRMTPSARIKSASRARTGLPK EVLKVPFRHQLNGVDHPVCRFHHD >gi568815583f:64492979_64693323|GENSCAN_predicted_CDS_5|4215_bp atggagtcccctgtttccacaccagcagtgctgccaatacaccttttggtgccagtggtc aacaatgacatctcatctccctgtgagcagatcatggttcgtacccgatcagttggggtc aacacatgtgatgtggctctggccacagagcctgagtgcttgggcccctgtgaacctgga actagcgtcaaccttgaaggcatcgtgtggcaggaaacagaagatgagggccttcacctg aactgtcagttggcgaagcaaactacactgtcttcaaacagaaaagtccttacagctcag agagaagctgtgaatgcttggcctctgaggggaatggaggacactgaggtccctgtagaa ctgagccaccttgctgagaacagttgctgccttcacggccttctccatagcaacaagcta tatgacaaaaaggctatgtggttttcagaggactccagctgtatagccaaaggcatgatg acctttgctgagccaccttctccagccatggctgcaggatatggttacctcagcaagagg aaaggaaggatatacgatttaaggattggcaaagaacaaaatgagaacatcatgttgcca aaggaaaagggcaagaggtattcatgggctactataaatgggtatatgaatgagtatgtc ctctttgaggcaatttggaggtggtttggccgtatttgtgagaactccaaatacaagtgc actgtgattaacagttttacttacaggaattcatcttacagacatatttgtgtagggatg ttggtggtaaatgtaacgtggaggaacaagacatatgtaggtacactccttgactgcaca cgacatgattgggcacccccaaggttctgtgactccccgaccagtgacctggaaatgcgc aatggccggggtagaggcaaacgcatgcgtcccaacagtaatacacctgtcaatgagaca gccacagcctctgacagcaaagggaccagtaacagcagcaaaacccgggcaggagccaat agcaaaggccgtcggggcagccagaattcttcagagcaccgcccacctgccagcagcact tctgaggatgtcaaggccagcccttcctcagctaataagcggaaaaacaaacccctttca gacatggagctgaattctagctcagaggactccaaagggagcaagcgtgtccgtactaat tccatgggctcagccactggcccccttcctgggacaaaggtagaacccactgttctggac agaaactgcccctcccccgtcctaattgactgtccccacccaaactgcaacaagaagtac aagcacatcaatggacttaagtaccaccaagctcatgcccatacagatgatgacagcaag ccggaagcggatggggacagtgagtacggagaggaacctattctccatgcagatcttggg agctgcaacggtgcatctgtctcacaaaaaggttccttgtcccctgcccgctcagctacc cccaaagttcgacttgtagagccccatagcccttctccttcaagcaaattcagcacaaaa ggcctctgtaagaaaaagttgagtggggaaggggacacagaccttggggccttatccaat gatggctctgatgatggaccctcagtgatggatgaaacaagcaatgatgcctttgattct ttagaaaggaagtgtatggaaaaagaaaaatgtaaaaaaccctctagtttaaaacctgaa aagattccttccaagagcctaaagtcagcccgtcccattgcccctgccatccccccacag caaatctacaccttccagacagccaccttcacagcagcgagcccaggctcttcctcaggc ttgaccgccacagtggcacaagccatgcccaacagtccccaactcaagcccattcagccc aagcccactgttatgggagaacctttcacagtcaaccctgccttgactccagccaaggac aagaaaaagaaagacaaaaaaaagaaggaatcttcaaaggaacttgaaagtcctctgacc cctgggaaggtgtgtcgagcagaggaaggcaaaagcccattcagggaatcttcaggagat gggatgaaaatggaggggctcctaaatggctcatcagacccccaccaaagccgactggct agcatcaaggctgaagccgacaagatctacagtttcacggacaatgcccccagcccttcc attggaggcagtagccgccttgaaaacactacccctactcagcccctgactcccttacat gtggtgacccagaatggagctgaagccagctcagtcaaaaccaacagccctgcatactct gacatctctgatgctggggaggatggggagggcaaggtagacagtgtcaaatcaaaggac gccgaacagttggttaaagaaggggctaagaaaactctttttccccctcagcctcagagc aaagactcaccatattaccaaggctttgagagttactattctccaagttatgcacagtcc agccctggggctctgaaccccagcagccaggcaggagtggagagccaggccctgaagaca aaaagggatgaggaacctgagagcatagaagggaaagtgaagaacgatatctgtgaagaa aagaagcccgagctgagcagttccagtcagcagccctcggtcatccagcagcgtcccaat atgtacatgcagtccctgtactacaaccagtatgcctatgtacccccctatggctacagc gaccagagttaccacacccaccttctgagcactaacacggcttaccggcagcagtacgaa gaacagcagaaacgccagagcttagagcagcagcagcggggagtggacaagaaggcagag atgggcctgaaggagcgggaggcagcactcaaggaagagtggaagcaaaagccgtcaatt ccaccaactctcaccaaggcccccagcctgacagacctggtgaaatcaggacctggcaag gccaaggagccaggggctgacccagccaaatcagtcatcattcccaagttagatgactct tcaaaactcccgggccaggcccctgaaggccttaaagtgaagctgagtgatgccagccac ctaagcaaggaggcctctgaggccaagacaggtgctgagtgtggtcgacaggcagagatg gatccaatactctggtaccgacaggaggcagagccccggatgtggacatatgtttatcct gccaagtactcagacatcaagtcagaggatgagcggtggaaggaggagcgggaccgcaaa ttgaaggaggaaaggagtcggagtaaggactctgtccccaaggaagatgggaaggaaagc acaagtagtgactgcaagctgcccacgtcagaggagtctcgccttgggagcaaggagccc cggccaagtgtccatgtgcctgtgtcctccccacttacccagcaccagtcctacatcccc tacatgcacggctattcctacagtcagtcctacgaccccaaccaccccagctaccggagc atgcctgctgtgatgatgcagaactacccagtaataagtattgatatcttcatttttgat gatgaagaaactgaagctcaaaacagttatttatttgctcaaggttaccaagctagttcc tacctgccttccagctactctttttccccatatggcagcaaggtctcaggtggtgaagat gctgacaaggcacgagccagccccagtgtgacttgtaaatccagctcagagtccaaagcc ctggacatcttgcagcagcatgccagtcactacaagagcaagtctcccacgataagtgat aaaacttctcaggagagagatcgaggaggctgtggggtggttgggggtggtggcagctgt agcagcgtcgggggagcaagtggggcgcctgatgtccacacaccaccaccaccaccactt ggggtactcattgctcccagcacagtacaacttaccctatgcagcagggctttcttctac agccattgttgccagccaacaaggctcaactccctcactctacccaccccccaggaggtg agaatgacaccaagtgcccggataaagtcagcttcacgggcccggactggcttacccaag gaggtgctgaaggtgccgtttagacatcagttaaatggtgttgatcatcctgtttgccgt ttccaccatgactga >gi568815583f:64492979_64693323|GENSCAN_predicted_peptide_6|130_aa DEKLTVTQDLPVNDGKPHIVHFQYEVTEVKVSSWDAVLSSQSLFVEIPDGLLADGSKEGL LALLEFAEEKMKVNYVFICFRKGREDRAPLLKTFSFLGFEIVRPGHPCVPSRPDVMFMVY PLDQNLSDED >gi568815583f:64492979_64693323|GENSCAN_predicted_CDS_6|393_bp gacgagaagctcactgtgacccaggacctccctgtgaatgatggaaaacctcacatcgtc cacttccagtatgaggtcaccgaggtgaaggtctcttcttgggatgcagtcctgtccagc cagagcctgtttgtagaaatcccagatggattattagctgatgggagcaaagaaggattg ttagcactgctagagtttgctgaagagaagatgaaagtgaactatgtcttcatctgcttc aggaagggccgagaagacagagctccactcctgaagaccttcagcttcttgggctttgag attgtacgtccaggccatccctgtgtcccctctcggccagatgtgatgttcatggtttat cccctggaccagaacttgtccgatgaggactaa