GENSCAN 1.0    Date run: 8-Nov-116    Time: 00:56:48

Sequence gi568815592r:131600472_131801401 : 200930 bp : 38.00% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.14 PlyA -     22     17    6                               1.05
 1.13 Term -    339    194  146  1  2   45   54    76 0.214  -3.01
 1.12 Intr -   1910   1747  164  1  2   61   96   161 0.648  12.90
 1.11 Intr -   2733   2559  175  1  1  110   85    20 0.987   1.98
 1.10 Intr -   3849   3707  143  2  2  113   52    96 0.966   7.58
 1.09 Intr -   6153   6008  146  0  2   66   80    47 0.591  -0.14
 1.08 Intr -   9775   9575  201  1  0   61   95   166 0.930  13.16
 1.07 Intr -  18048  17936  113  1  2   52   95    48 0.587   1.08
 1.06 Intr -  19425  19356   70  0  1  110   46    39 0.596  -0.26
 1.05 Intr -  21508  21410   99  1  0   97   76    99 0.991   8.99
 1.04 Intr -  22991  22880  112  1  1   90   71    59 0.948   3.86
 1.03 Intr -  27201  27170   32  0  2  106   92    38 0.704   2.01
 1.02 Intr -  27637  27540   98  2  2   77   89    38 0.570   1.61
 1.01 Init -  27915  27720  196  0  1   75  103   133 0.607  10.75
 1.00 Prom -  29277  29238   40                              -6.05

 2.00 Prom +  32892  32931   40                              -7.95
 2.01 Init +  36914  36991   78  1  0   67   87    42 0.585   3.11
 2.02 Intr +  40984  41059   76  0  1   82   69    59 0.594   1.57
 2.03 Intr +  49556  49678  123  0  0   92   18   140 0.533   7.04
 2.04 Intr +  52071  52196  126  1  0   81  105    37 0.528   4.43
 2.05 Intr +  70777  70856   80  2  2   87  127    32 0.063   5.35
 2.06 Intr +  77224  77345  122  0  2   44   42    86 0.069  -2.03
 2.07 Intr +  77397  77469   73  0  1   84   89    97 0.748   7.89
 2.08 Intr +  82583  82691  109  1  1   72   68   118 0.995   7.14
 2.09 Intr +  84893  85024  132  0  0   72   81    62 0.921   3.60
 2.10 Term +  89432  89583  152  0  2   -1   43   191 0.004   2.79
 2.11 PlyA +  91417  91422    6                               1.05

 3.02 PlyA -  92720  92715    6                               1.05
 3.01 Sngl - 100930  99998  933  1  0   47   48   435 0.917  31.60
 3.00 Prom - 101074 101035   40                              -7.15

 4.02 PlyA - 101478 101473    6                               1.05
 4.01 Sngl - 110546 108213 2334  2  0   79   49  2065 0.738 194.12
 4.00 Prom - 114057 114018   40                              -6.45

 5.00 Prom + 114186 114225   40                              -4.95
 5.01 Init + 118262 118267    6  1  0   84   78     0 0.211  -0.47
 5.02 Intr + 119821 119908   88  0  1   62  103    10 0.541  -1.48
 5.03 Intr + 123569 123620   52  0  1  102  106    44 0.540   4.65
 5.04 Intr + 125814 125973  160  0  1   62   87    86 0.370   4.97
 5.05 Intr + 133117 133252  136  0  1   99   53    60 0.583   2.82
 5.06 Intr + 137560 137692  133  2  1   79  108   104 0.955  10.28
 5.07 Intr + 139753 139909  157  1  1    9   76   133 0.782   3.29
 5.08 Intr + 142327 142372   46  0  1  120   98   -19 0.032  -0.64
 5.09 Intr + 159850 159954  105  2  0  100   76    43 0.312   3.57
 5.10 Intr + 162053 162159  107  0  2  113   53     8 0.133  -1.19
 5.11 Term + 177992 178090   99  1  0  135   49    38 0.403   1.75
 5.12 PlyA + 178405 178410    6                               1.05

 6.04 PlyA - 178452 178447    6                               1.05
 6.03 Term - 180725 180634   92  0  2  100   48   112 0.531   5.30
 6.02 Intr - 183157 183053  105  2  0   59  109    63 0.472   4.77
 6.01 Init - 192461 192413   49  2  1   77   78    47 0.541   3.76

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term +  89847  89989  143  1  2   83   42   186 0.965  10.61
S.002 Term + 146315 146485  171  1  0   98   32    95 0.830   1.74

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815592r:131600472_131801401|GENSCAN_predicted_peptide_1|564_aa
MPPGSLGSRFRLGALSSGLRAALLHRRNLLVAPLVSAGFANGAYRALPVFRTGSCCWLGS
GGNAPELESDPGLAGHPGKAGVIVQMETQLQSIFEEVVKTEVIEEAFPGLVCESLINSDT
LEWERTQLWALTFKLVRKIIGGVDYKGVRDLLKVILEKILTIPNTVSSAVVQQLLAAREL
LGNLVSDFVDTFRPTARINSICGRCSLLPVVNNSGAICNSWKLDPATLRFPLKGLLPYDK
HKQRCPVLEDQLVDLVVYAMERSETEEKFDDGGTSQLLWQHLSSQLIFFVLFQFASFPHM
VLSLHQKYIPVPDINKPQSTHAFAMTCIWIHLNRKAQNDNSKLQIPIPHSLRLHHDLIHS
IATRVIKLAHAKSSVALAPALVETYSRLLVYMEIESLGIKGFISQLLPTVFKSHAWGILH
TLLEMFSYRMHHIQPHYRVQLLSHLHTLAAVAQTNQNQLHLCVESTALRLITALGSSEVQ
PQFTRFLSDPKTVLSAESEELNRALILTLARATHVTGPAVLPPLLLYRQCSYQVKKPDIF
LLLFDLIESLKGYWGIPKRSTDHT
>gi568815592r:131600472_131801401|GENSCAN_predicted_CDS_1|1695_bp
atgcctccggggtccttgggctcaaggttccgtttgggggcgctctcgtccggcttgcgg
gccgcccttctccacagacggaaccttctggtggcgccgctggtttctgcggggtttgca
aacggcgcgtaccgggcccttcccgtcttccgtacaggttcctgctgttggctgggctcg
ggagggaacgctccagagctcgagtctgatccgggccttgccgggcaccctggaaaggcg
ggggtgatagtacagatggagacgcaactgcagagcattttcgaagaggtggtgaaaacg
gaagttatagaagaggcttttcctgggctggtttgtgaatccctgataaactctgacact
cttgagtgggaaagaacacagctttgggccttaacatttaaactggttcggaaaataatt
gggggagtggattacaagggtgttcgagatctcttaaaagtgattttggagaagattttg
acaattcctaatacagtgagctctgctgttgtacagcagcttctggcagcaagagagtta
cttggaaacctagtatcagactttgtggataccttcaggcccacagcaaggataaactcc
atttgtggtcgctgtagtcttctgccagttgtaaataattcgggtgccatttgtaattca
tggaaactggatcctgctactcttcgttttcctttgaaaggccttttgccatatgataag
cacaagcagcgctgccctgtgctggaggaccagttggtggatctggttgtttatgccatg
gagcgatctgagaccgaggagaagtttgacgatgggggaacaagccaactcctgtggcag
catctctcaagtcagctcattttctttgtgcttttccagtttgcaagttttccacatatg
gtgctttctcttcatcagaagtatatcccagttcctgatattaacaaaccccagtcaacc
catgcctttgcaatgacctgtatttggattcatctcaatagaaaagctcaaaatgacaac
tccaagctacagattccaatacctcattccctaagacttcaccatgaccttattcacagc
attgcaaccagggtgataaaacttgctcatgcaaagtccagtgtggccttggctccagcc
ctagtggaaacttacagtcgtttattggtctatatggaaatagagtctttgggcatcaaa
ggatttatcagtcagcttttgccaactgttttcaaatcacatgcatgggggatcttacac
acactccttgagatgtttagctaccggatgcatcatattcagcctcattacagagttcag
ctcctgagtcatcttcatactttggctgcagttgcacaaacaaaccagaaccagctccat
ctttgtgtcgagagcactgctctcaggcttataacagcattaggtagctcagaggtacaa
ccgcagtttacacgcttccttagtgatcccaaaacagtgctctcagcagaatctgaagaa
ctgaaccgagccttgatattgaccttggctagagcaactcatgtaacaggaccagcagtt
ttaccaccattgcttttgtaccgtcagtgctcataccaagtgaaaaagcctgatatcttt
ttattattatttgacctcattgaatctttgaagggctattggggaatccccaagaggtct
acagaccacacttga
>gi568815592r:131600472_131801401|GENSCAN_predicted_peptide_2|356_aa
MESTLTLATEQPVKKNTLKKYKIACIVLLALLVIMSLGLGLGLGLRKLEKQGSCRKKCFD
ASFRGLENCRCDVACKDRGDCCWDFEDTCVESTRIWMCNKFRCGETRLEASLCSCSDDCL
QRKDCCADYKSVCQETCGIHSKYMRAMYPTKTFPNHYTIVTETYPCWSQVELLGLNKEEE
VREKEKAMASKPKIYKIPQSSLPRFYTMYFEEPDSSGHAGGPVSARVIKALQVVDHAFGM
LMEGLKQRNLHNCVNIILLADHGMDQTYCNKMEYMTDYFPRINFFYMYEGPAPRIRAHNI
PHDFFSWFMTSAEEVTTVVVKIERELESEMEPEAVTELLQSHEKTGTDEELLLTDE
>gi568815592r:131600472_131801401|GENSCAN_predicted_CDS_2|1071_bp
atggaatctacgttgactttagcaacggaacaacctgttaagaagaacactcttaagaaa
tataaaatagcttgcattgttcttcttgctttgctggtgatcatgtcacttggattaggc
ctggggcttggactcaggaaactggaaaagcaaggcagctgcaggaagaagtgctttgat
gcatcatttagaggactggagaactgccggtgtgatgtggcatgtaaagaccgaggtgat
tgctgctgggattttgaagacacctgtgtggaatcaactcgaatatggatgtgcaataaa
tttcgttgtggagagaccagattagaggccagcctttgctcttgttcagatgactgtttg
cagaggaaagattgctgtgctgactataagagtgtttgccaagaaacatgtggaattcat
tcaaaatacatgagagctatgtatcctaccaaaaccttcccaaatcattacaccattgtc
acggaaacctatccttgctggagccaagtagaactactgggtctgaacaaggaggaggag
gttagagagaaagaaaaggcaatggctagcaagcccaagatctacaaaatcccccaatcc
agcctacccaggttttataccatgtattttgaagaacctgattcctctggacatgcaggt
ggaccagtcagtgccagagtaattaaagccttacaggtagtagatcatgcttttgggatg
ttgatggaaggcctgaagcagcggaatttgcacaactgtgtcaatatcatccttctggct
gaccatggaatggaccagacttattgtaacaagatggaatacatgactgattattttccc
agaataaacttcttctacatgtacgaagggcctgccccccgcatccgagctcataatata
cctcatgacttttttagttggttcatgacttcagcagaggaagtaactacagtcgtggta
aaaatagaaagagaactagaatcagaaatggagcctgaagctgtgactgagttgctgcaa
tctcatgagaaaactggaacagatgaagagctgcttcttacggatgagtaa
>gi568815592r:131600472_131801401|GENSCAN_predicted_peptide_3|310_aa
MGDNITSIREFLLLGFPVGPRIQMLLFGLFSLFYVFTLLGNGTILGLISLDSRLHAPMYF
FLSHLAVVDIAYACNTVPRMLVNLLHPAKPISFAGRMMQTFLFSTFAVTECLLLVVMSYD
LYVAICHPLRYLAIMTWRVCITLAVTSWTTGVLLSLIHLVLLLPLPFCRPQKIYHFFCEI
LAVLKLACADTHINENMVLAGAISGLVGPLSTIVVSYMCILCAILQIQSREVQRKAFRTC
FSHLCVIGLVYGTAIIMYVGPRYGNPKEQKKYLLLFHSLFNPMLNPLICSLRNSEVKNTL
KRVLGVERAL
>gi568815592r:131600472_131801401|GENSCAN_predicted_CDS_3|933_bp
atgggagacaatataacatccatcagagagttcctcctactgggatttcccgttggccca
aggattcagatgctcctctttgggctcttctccctgttctacgtcttcaccctgctgggg
aacgggaccatactggggctcatctcactggactccagactgcacgcccccatgtacttc
ttcctctcacacctggcggtcgtcgacatcgcctacgcctgcaacacggtgccccggatg
ctggtgaacctcctgcatccagccaagcccatctcctttgcgggccgcatgatgcagacc
tttctgttttccacttttgctgtcacagaatgtctcctcctggtggtgatgtcctatgat
ctgtacgtggccatctgccaccccctccgatatttggccatcatgacctggagagtctgc
atcaccctcgcggtgacttcctggaccactggagtccttttatccttgattcatcttgtg
ttacttctacctttacccttctgtaggccccagaaaatttatcactttttttgtgaaatc
ttggctgttctcaaacttgcctgtgcagatacccacatcaatgagaacatggtcttggcc
ggagcaatttctgggctggtgggacccttgtccacaattgtagtttcatatatgtgcatc
ctctgtgctatccttcagatccaatcaagggaagttcagaggaaagccttccgcacctgc
ttctcccacctctgtgtgattggactcgtttatggcacagccattatcatgtatgttgga
cccagatatgggaaccccaaggagcagaagaaatatctcctgctgtttcacagcctcttt
aatcccatgctcaatccccttatctgtagtcttaggaactcagaagtgaagaatactttg
aagagagtgctgggagtagaaagggctttatga
>gi568815592r:131600472_131801401|GENSCAN_predicted_peptide_4|777_aa
MEEPGATPQPYLGLVLEELGRVVAALPESMRPDENPYGFPSELVVCAAVIGFFVVLLFLW
RSFRSVRSRLYVGREQKLGATLSGLIEEKCKLLEKFSLIQKEYEGYEVESSLEDASFEKA
AAEEARSLEATCEKLSRSNSELEDEILCLEKDLKEEKSKHSQQDELMADISKSIQSLEDE
SKSLKSQIAEAKIICKTFKMSEERRAIAIKDALNENSQLQTSHKQLFQQEAEVWKGQVSE
LNKQKITFEDSKVHAEQVLNDKENHIKTLTGHLPMMKDQAAVLEEDTTDDDNLELKVNSQ
WENGANLDDPPKGALKKLIHAAKLNVSLKSLEGERNHIIIQLSEVDKTKEELTEHIKNLQ
TQQASLQSENIYFESENQKLQQKLKIMTEFYQENEMKLYRKLTVEENYRIEEEEKLSRVE
EKISHATEELETYRKLAKDLEEELERTVHFYQKQVISYEKRGHDNWLAARTAERNLSDLR
KENAHNKQKLTERELKFELLEKDPNALDVSNTAFGREHSPCSPSPLGRPSSETRAFPSPQ
TLLEDPLRLSPVLPGGGGRGPSSPGNPLDHQITNERGEPSYDRLIDPHRAPSDTGSLSSP
VEQDRRMMFPPPGQSYPDSTLPPQREDRFYSNSERLSGPAEPRSFKMTSLDKMDRSMPSE
MESSRNDAKDDLGNLNVPDSSLPAENEATGPGLIPPPLAPISGPLFPVDTRGPFMRRGPP
FPPPPPGTMFGASRGYFPPRDFPGPPHAPFAMRNIYPPRGLPPYLHPRPGFYPNPTF
>gi568815592r:131600472_131801401|GENSCAN_predicted_CDS_4|2334_bp
atggaggagcctggtgctacccctcagccctacctggggctggtcctggaggagctaggc
agagttgtggcagcactacctgagagtatgagaccagatgagaatccttatggttttcca
tcggaactggtggtatgtgcagctgttattggattttttgttgttctcctttttttgtgg
agaagttttagatcggttaggagtcggctttatgtgggaagagagcaaaaacttggtgca
acgctttctggactaattgaagaaaaatgtaaactacttgaaaagtttagccttattcaa
aaagagtatgaaggctatgaagtagagtcatctttagaggatgccagctttgagaaggcg
gcagcagaagaagcacgaagtttggaggcaacctgtgaaaagctgagcaggtccaattct
gaacttgaggatgaaatcctctgtctagaaaaagacttaaaagaagagaaatctaaacat
tctcaacaagatgaattgatggcggatatttcaaaaagtatacagtctctagaagatgag
tcaaaatccctcaaatcacaaatagctgaagccaaaatcatctgcaagacatttaaaatg
agtgaagaacgacgggctatagcaataaaagatgctttgaatgaaaattctcaacttcag
acaagccataaacagctttttcagcaagaagctgaagtatggaaaggacaagtgagtgaa
cttaataaacagaaaataacatttgaagactccaaagtacacgcagaacaagttctgaat
gataaagaaaatcacatcaagaccctgactggacacttgccaatgatgaaagatcaggct
gctgtgcttgaagaagacacaacggatgatgataacctggaattaaaagtgaacagtcaa
tgggaaaatggtgctaacttagatgatcctccgaaaggagctttgaagaaactgattcat
gctgctaagttaaatgtttctttaaaaagcttagaaggagaaagaaaccacattattatt
cagttatctgaagtggacaaaacaaaggaagagcttacagagcatattaaaaatcttcag
actcaacaagcatctttgcaatcagaaaacatatattttgaaagtgagaatcagaagctt
caacagaaacttaaaataatgactgaattctatcaagaaaatgaaatgaaactctacagg
aaattaacagtggaggaaaattaccgaatagaggaagaagagaagctttctagagtggaa
gaaaagatcagccatgccactgaagagctggagacctatagaaagctagccaaagatctt
gaagaagaattggagagaactgttcatttttatcaaaagcaggttatttcctacgagaaa
agaggacatgataattggttggcagctcggactgctgaaagaaacctcagtgatttaagg
aaagaaaatgctcacaacaaacaaaaattaactgaaagagagttgaaatttgaactttta
gaaaaagatcctaatgcactcgatgtttcaaatacagcatttggcagagagcattcccca
tgtagtccctcaccattgggtcggccttcatctgaaacgagagcttttccctctcctcaa
actttgttggaggatccactcagactctcacctgtgcttccagggggaggaggaagaggc
ccaagcagcccagggaatcccctggaccatcagattaccaatgaaagaggagaaccaagc
tatgacaggttaatcgatcctcacagggctccttctgacactgggtccctgtcatctccg
gtggaacaggaccgtaggatgatgtttcctccaccagggcaatcatatcctgattcaact
cttcctccacaaagggaagacagattttattctaattctgaaagactgtctggaccagca
gaacccagaagttttaaaatgacttctttggataaaatggataggtcaatgccttcagaa
atggaatccagtagaaatgatgccaaagatgatcttggtaatttaaatgtgcctgattca
tctctccctgctgaaaatgaagcaactggccctggccttattcctccacctcttgctcca
atcagcggtccattgtttccagtggatacaaggggcccattcatgagaagaggacctcct
ttccccccacctcctccaggaaccatgtttggagcttctcgaggttattttccaccaagg
gatttcccaggtccaccacatgctccatttgcaatgagaaacatctatccaccgaggggt
ttacctccttaccttcatccgagacctggattttaccccaaccccacattctga
>gi568815592r:131600472_131801401|GENSCAN_predicted_peptide_5|362_aa
MEAIFLAHGPSFKEKTEVEPFENIEVYNLMCEYSAGTSESDAKSHPRRNRDSKGGNSLAC
SLQDNVQVLDKGCYRCYTGAKEEHIVFYLEKMENVYRPIGQLGDTSPLPPTVPDCLRADV
RVPPSESQKCSFYLADKNITHGFLYPPEMWDYFHSVLLIKHATERNGVNVVSGPIFDYNY
DGHFDAPDEITKHLANTDVPIPTHYFVVLTSCKNKSHTPENCPGWLDVLPFIIPHRPTNV
ESCPNFILFHLLKIPSGRQEFLFSKIDIINQVTVVLDLFGRCGKMLSLASERAQEMVLWA
DLCSTQNSYVEVLTPIPQNVTVFECRVFTEEIGVENHSVLGWQLSLYPDLSRAPSAPSRH
MT
>gi568815592r:131600472_131801401|GENSCAN_predicted_CDS_5|1089_bp
atggaggctatctttctggcacatggacccagttttaaagagaagactgaagttgaacca
tttgaaaatattgaagtctataacctaatgtgtgagtactcagctggaacaagtgaatca
gatgctaaatctcacccaagaagaaacagagattcaaagggagggaactcactggcctgt
agcttgcaagacaatgtgcaagtattagataaaggttgttataggtgctacacgggtgca
aaagaggaacacattgtgttttacctggagaaaatggaaaatgtttacaggccaattggc
caattgggagacacatcgcctctgcctcccactgtcccagactgtctgcgggctgatgtc
agggttcctccttctgagagccaaaaatgttccttctatttagcagacaagaatatcacc
cacggcttcctctatcctcctgaaatgtgggactacttccacagtgttcttcttataaaa
catgccacagaaagaaatggagtaaatgtggttagtggaccaatatttgattataattat
gatggccattttgatgctccagatgaaattaccaaacatttagccaacactgatgttccc
atcccaacacactactttgtggtgctgaccagttgtaaaaacaagagccacacaccggaa
aactgccctgggtggctggatgtcctaccctttatcatccctcaccgacctaccaacgtg
gagagctgtcctaatttcattcttttccatctgctaaaaattccatctgggcgtcaagaa
ttcctcttcagtaaaattgatattattaaccaagtcactgtggtgttggatctatttggg
agatgtggcaaaatgctgagtttggcctcagaacgggcacaggagatggtgttatgggct
gacttgtgttccactcaaaattcatatgttgaagtcctaaccccaataccccagaatgtg
actgtatttgaatgtagggtctttacagaggaaattggtgtggaaaatcactctgtcctg
ggatggcaactctccttgtatccagacttatccagagctccctctgctccctctaggcac
atgacttga
>gi568815592r:131600472_131801401|GENSCAN_predicted_peptide_6|81_aa
MQLVTVSLPFYQEKTGGEKLQMSILTTTKYVHSYHAFRDWGTGHFVNPWAPGLPTPPALI
KARRLGYKAKQDCRIRVRRGG
>gi568815592r:131600472_131801401|GENSCAN_predicted_CDS_6|246_bp
atgcagctggtgactgtgtctcttcccttttatcaagagaaaactggaggtgaaaaactc
caaatgagtattctgacaactaccaagtatgtccactcctatcatgcattcagagactgg
ggaactggacactttgtcaatccatgggctccagggctccccaccccacctgccctgata
aaagcacgccgactgggctacaaggccaagcaagattgtaggattcgtgttcgccgtggt
ggctga