GENSCAN 1.0 Date run: 8-Nov-116 Time: 12:07:47 Sequence gi568815586r:85704705_85936201 : 231497 bp : 35.25% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.06 PlyA - 166 161 6 1.05 1.05 Term - 4032 3830 203 1 2 17 54 174 0.539 3.47 1.04 Intr - 7530 7290 241 0 1 70 75 150 0.790 8.10 1.03 Intr - 8746 8587 160 0 1 86 51 46 0.450 -0.23 1.02 Intr - 28704 28555 150 2 0 -1 44 258 0.045 10.86 1.01 Init - 45497 45346 152 2 2 84 75 75 0.114 5.36 1.00 Prom - 45837 45798 40 -5.15 2.00 Prom + 60856 60895 40 -2.75 2.01 Sngl + 81738 81956 219 2 0 88 42 285 0.971 18.81 2.02 PlyA + 82615 82620 6 1.05 3.00 Prom + 83150 83189 40 -6.15 3.01 Init + 84099 84262 164 2 2 70 72 67 0.366 2.55 3.02 Term + 84959 85478 520 2 1 30 38 263 0.433 8.28 3.03 PlyA + 85611 85616 6 1.05 4.02 PlyA - 87617 87612 6 1.05 4.01 Sngl - 101242 99998 1245 1 0 75 32 1105 0.948 99.17 4.00 Prom - 103318 103279 40 -5.95 5.02 PlyA - 103455 103450 6 1.05 5.01 Sngl - 118772 118518 255 2 0 36 42 227 0.951 7.76 5.00 Prom - 119837 119798 40 -7.55 6.00 Prom + 121624 121663 40 -2.15 6.01 Init + 126340 126423 84 0 0 65 82 -24 0.184 -4.43 6.02 Term + 132215 132484 270 1 0 44 45 249 0.733 10.70 6.03 PlyA + 133075 133080 6 1.05 7.00 Prom + 133390 133429 40 -5.45 7.01 Init + 137089 137204 116 0 2 36 78 70 0.443 0.53 7.02 Intr + 139449 139741 293 0 2 79 80 127 0.367 6.85 7.03 Term + 140087 140325 239 0 2 62 42 121 0.413 0.25 7.04 PlyA + 141180 141185 6 1.05 8.05 PlyA - 143179 143174 6 1.05 8.04 Term - 148012 147923 90 1 0 124 38 78 0.523 3.14 8.03 Intr - 150745 150583 163 0 1 77 64 43 0.425 -0.14 8.02 Intr - 151570 151074 497 1 2 71 31 264 0.201 9.86 8.01 Init - 152662 152516 147 1 0 96 46 105 0.406 6.19 8.00 Prom - 153535 153496 40 -5.75 9.00 Prom + 155454 155493 40 -4.55 9.01 Init + 161682 161743 62 2 2 62 80 85 0.019 3.87 9.02 Intr + 169363 169494 132 1 0 48 57 96 0.012 1.34 9.03 Intr + 173694 173865 172 0 1 31 65 111 0.170 2.12 9.04 Term + 177519 177671 153 2 0 90 54 137 0.998 7.44 9.05 PlyA + 177675 177680 6 1.05 10.02 PlyA - 177689 177684 6 1.05 10.01 Sngl - 189696 189388 309 0 0 91 50 188 0.916 10.95 10.00 Prom - 191823 191784 40 -2.95 11.00 Prom + 199626 199665 40 -4.15 11.01 Init + 204183 204307 125 2 2 71 115 117 0.892 12.39 11.02 Intr + 208303 208491 189 1 0 76 80 74 0.711 3.18 11.03 Term + 220854 220986 133 0 1 65 40 88 0.134 -1.72 11.04 PlyA + 221072 221077 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 28704 28551 154 2 1 -1 54 276 0.904 11.71 S.002 Sngl - 38250 37825 426 0 0 48 43 217 0.970 9.24 S.003 Init + 62882 62891 10 1 1 106 111 2 0.834 4.99 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586r:85704705_85936201|GENSCAN_predicted_peptide_1|301_aa MSTKTMTADTKLAHLRKMESSAPEFTNSSRSKTYRCQYWASQYQSDNKIIRIGCLQAENR EADSAAFSLRAPEKRLVQGPESKDRRPKNPESDIQGQKKKNWSNGLNLLIIYHVLGSYML LHLIHRSLIAEPYYHFTDVENEARDVKYLAHVHTHLHGEEDGHMGGGHMAQSPGKKDSDY CVISGKGVCQEQGEGVCVSKGLWRPRILPNSTAPEGSRSSGEGSLPAPTMGSTADVMVLP DMIQKAVVDQTGSRPPTVTMTFFCANLALGSALELLLSPTAELVIGSCHIKSTCCHTSQS N >gi568815586r:85704705_85936201|GENSCAN_predicted_CDS_1|906_bp atgagcactaaaactatgactgcagacaccaagttggctcatctaaggaaaatggaaagc tctgcaccggaatttactaattcaagcaggtccaaaacatacagatgtcaatactgggca agtcagtatcagtcagacaacaaaattatcaggataggttgtctgcaagctgaaaacagg gaagctgacagtgcagccttcagtctgagagcccctgagaagcggctggtgcaaggccca gagtccaaagaccgaagaccaaagaacccggagtctgatatacaagggcagaagaagaag aattggagtaacggcttaaatttattaattatttaccatgtgctaggctcctacatgctg ctgcatttaatccatcgatccttgattgctgaaccttattatcatttcacagatgtagaa aatgaggccagagatgttaaatatcttgcacatgttcacacacacttacatggagaggag gatggacatatgggtggaggacacatggcccaatcccctggaaagaaggactctgactat tgtgtaatctcggggaagggagtgtgccaggaacagggagaaggagtgtgtgtgagcaaa ggcctctggaggcccaggattctgcccaactcaacagccccagagggcagtagatcctca ggggagggctctctccctgcccccacaatgggctccacagcagatgtaatggttttgcca gacatgattcagaaagcagtagtggatcagactggcagcagaccaccaacagtaaccatg acctttttctgtgcaaatttggctttgggaagtgctttggagcttcttctcagcccaact gctgagctggtcatcggcagttgtcatataaaatccacctgttgtcatacatcacaatcc aattga >gi568815586r:85704705_85936201|GENSCAN_predicted_peptide_2|72_aa MGKKQSRKTENSKNQSASPPPKERSFSPTMEQSCKENDFDKLREEGFRQSNFSELKEEVR THRKEAKNHEKD >gi568815586r:85704705_85936201|GENSCAN_predicted_CDS_2|219_bp atggggaaaaaacagagcagaaaaactgaaaattctaaaaatcagagtgcctctccccct ccaaaggaacgaagcttctcgccaacaatggaacaaagctgtaaggagaatgactttgac aagttgagagaagaaggcttcagacaatcaaacttctccgagctaaaggaggaagttcga acacatcgcaaagaagctaaaaaccatgaaaaagattag >gi568815586r:85704705_85936201|GENSCAN_predicted_peptide_3|227_aa MDKFLDTYTLPRLNQEEVESLNRQITGSEIEAIINSLQTKKSPGPDGFTAEFYQSPKSPK LISNFSKVSGYKINVQKSQAFLYTNNRLTESQTMSELPFTIVSKRIKYLGIQLKKDVKDL LKENYKTLLNEIKEDTNKWKNIPCSWVGRINVVKMAILPKLIYRFNAIPIKLPMTFFTEL EKTTLKFIWNQKRACIAKSILSQKNKAGGITLPDFKLHYKATVSKTA >gi568815586r:85704705_85936201|GENSCAN_predicted_CDS_3|684_bp atggataaattcctggacacatacactctcccaagactaaaccaggaagaagttgaatct ctgaatagacaaataacaggctctgaaattgaggcaataattaatagcttacaaaccaaa aaaagtccaggaccggacggattcacagccgagttctaccagagcccaaaatctcctaag ctgataagcaacttcagcaaagtctcaggatacaaaatcaatgtgcaaaaatcacaagca ttcttatacaccaataacagactaacagagagccaaaccatgagtgaactcccgttcaca attgtttcaaagagaataaaatacctaggaatccaacttaaaaaggatgtgaaggacctc cttaaggagaactacaaaacactgctcaacgaaataaaagaagacacaaacaaatggaag aatattccatgctcatgggtaggaagaatcaatgtcgtgaaaatggccatactgcccaaa ttaatttatagattcaatgccatccccatcaagctaccaatgactttcttcacagaattg gaaaaaactactttaaagttcatatggaaccaaaaaagagcctgcattgccaagtcaatc ctaagccaaaagaacaaagctggaggcatcacgctacctgacttcaaactacactacaag gctacagtatccaaaacagcatag >gi568815586r:85704705_85936201|GENSCAN_predicted_peptide_4|414_aa MDSEEKEIVVWVCQEEKLVCGLTKRTTSADVIQALLEEHEATFGEKRFLLGKPSDYCIIE KWRGSERVLPPLTRILKLWKAWGDEQPNMQFVLVKADAFLPVPLWRTAEAKLVQNTEKLW ELSPANYMKTLPPDKQKRIVRKTFRKLAKIKQDTVSHDRDNMETLVHLIISQDHTIHQQV KRMKELDLEIEKCEAKFHLDRVENDGENYVQDAYLMPSFSEVEQNLDLQYEENQTLEDLS ESDGIEQLEERLKYYRILIDKLSAEIEKEVKSVCIDINEDAEGEAASELESSNLESVKCD LEKSMKAGLKIHSHLSGIQKEIKYSDSLLQMKAKEYELLAKEFNSLHISNKDGCQLKENR AKESEVPSSNGEIPPFTQRVFSNYTNDTDSDTGISSNHSQDSETTVGDVVLLST >gi568815586r:85704705_85936201|GENSCAN_predicted_CDS_4|1245_bp atggattcagaagagaaggaaattgtggtttgggtttgccaagaagagaagcttgtctgt gggctgactaaacgcaccacctctgctgatgtcatccaggctttgcttgaggaacatgag gctacgtttggagagaaacgatttcttctggggaagcccagtgattactgcatcatagag aagtggagaggctccgaaagggttcttcctccactaactagaatcctgaagctttggaaa gcgtggggagatgagcagcccaatatgcaatttgttttggttaaagcagatgcttttctt ccagttcctttgtggcggacagctgaagccaaattagtgcaaaacacagaaaaattgtgg gagctcagcccagcaaactacatgaagactttaccaccagataaacaaaaaagaatagtc aggaaaactttccggaaactggctaaaattaagcaggacacagtttctcatgatcgagat aatatggagacattagttcatctgatcatttcccaggaccatactattcatcagcaagtc aagagaatgaaagagctggatctggaaattgaaaagtgtgaagctaagttccatcttgat cgagtagaaaatgatggagaaaactatgttcaggatgcatatttaatgcccagtttcagt gaagttgagcaaaatctagacttgcagtatgaggaaaaccagactctggaggacctgagc gaaagtgatggaattgaacagctggaagaacgactgaaatattaccgaatactcattgat aagctctctgctgaaatagaaaaagaggtaaaaagtgtttgcattgatataaatgaagat gcggaaggggaagctgcaagtgaactggaaagctctaatttagagagtgttaagtgtgat ttggagaaaagcatgaaagctggtttgaaaattcactctcatttgagtggcatccagaaa gagattaaatacagtgactcattgcttcagatgaaagcaaaagaatatgaactcctggcc aaggaattcaattcacttcacattagcaacaaagatgggtgccagttaaaggaaaacaga gcgaaggaatctgaggttcccagtagcaatggggagattcctccctttactcaaagagta tttagcaattacacaaatgacacagactcggacactggtatcagttctaaccacagtcag gactccgaaacaacagtaggagatgtggtgctgttgtcaacatag >gi568815586r:85704705_85936201|GENSCAN_predicted_peptide_5|84_aa MWKKQGACGIWEEAAGKAKEDPCENGIVTVKERKAARMGGGGHNALCYKEDMSDQFCKWL LGVAVKRFLITLRRKVATEDEWRQ >gi568815586r:85704705_85936201|GENSCAN_predicted_CDS_5|255_bp atgtggaagaagcaaggtgcatgtgggatctgggaagaagcagctggaaaagcaaaggaa gatccatgtgagaatggcatcgtaacagtcaaggagaggaaagctgcaagaatgggagga ggaggtcacaatgcattatgttacaaagaggatatgtcagatcagttctgcaagtggcta ttgggtgtggcagttaagagatttctgataaccctaagaagaaaagtggctactgaggat gagtggaggcagtag >gi568815586r:85704705_85936201|GENSCAN_predicted_peptide_6|117_aa MRFHFRKLICKRQRDSLSRTGIHIHLQEAPAPHSQVHLAKVEGPEGPQGSRADASQFGLG TSWSLVFDSQGVYVDPYPIFDIHWAFRALGISEEKQTAPAPQGVKKTHRGDLLELEM >gi568815586r:85704705_85936201|GENSCAN_predicted_CDS_6|354_bp atgaggtttcattttagaaagctgatatgtaagaggcaaagagattccttgtcaaggact ggtattcatatacacctgcaagaggcccctgctccacactctcaagttcatcttgcaaaa gtcgaaggacctgaaggaccacagggctccagggctgatgcatcacagtttggtctagga accagctggagtttagtgtttgactcccaaggcgtttatgttgacccctaccccatcttt gacattcactgggcctttagagccctgggaatttcagaggaaaagcaaactgctcctgcc cctcaaggagtaaagaagacacaccgaggggatctgctggagctcgaaatgtga >gi568815586r:85704705_85936201|GENSCAN_predicted_peptide_7|215_aa MYLSSAVRSCLKFAHGNRKLSLKIGMDEASLTKPFKYGCPLWTLGSDKHKREAKWGLREA RHWLAWTFFMNSLGTIESSRRQTGSKEERDGSLVKPHLPAGESLKPGGQAPVSWTTEGTC GVFSCARPWQPMDQSADEGERRTMALRGPQTWEAPRARAVTLSLGPCRSWSLQASRHHHI PQCQPWKLLAVYLVQPQPLSELVPMPAPGAAHPLQ >gi568815586r:85704705_85936201|GENSCAN_predicted_CDS_7|648_bp atgtacctatcttcagcagttcgttcctgtcttaaatttgctcatggtaataggaaactg tctctaaaaattggaatggatgaggcttctctcacaaagccttttaaatatggatgcccc ctctggactttgggctctgataagcacaagagggaggccaagtggggactgagggaagct cggcactggcttgcctggaccttcttcatgaacagcctgggcaccatagaaagcagtaga aggcagacaggctccaaggaagaaagggatgggtcccttgtgaagccccaccttccggct ggggaaagcctaaagcctgggggccaggcgccagtctcctggaccacagagggaacttgt ggtgttttttcctgtgcccgcccatggcagcccatggaccaatcagcagatgaaggggag agaagaactatggcccttcggggaccccagacctgggaagctccccgagccagggctgtt actctctctttggggccctgcaggtcctggagtctccaagcttctcggcaccaccacatt ccccagtgtcagccatggaagctgcttgcagtgtacctggtccagccacagcctctcagt gagctggtgcccatgccagcacctggagctgcccatccactgcagtag >gi568815586r:85704705_85936201|GENSCAN_predicted_peptide_8|298_aa MGFLYVVQAGLELLDSSNPPASASQNSEIDGGGGPSEAAAVGTSAAARELPTYLILSGCR TSTQDPLNGRTERAVTQTGLKHPPAHNFVGDKKERRGKERKAAVVLVYFHTADKDIPETG KKKRFNGLTVAHRWGGLTIMVEGKEELVTSYMDGSGQRQRARAGKLPFLKPSHHVRLIHC HKNSAEMTCPHDSIISHWVPPTTHENYKSYKIRFGHGMWTGSMSRIQAARPCGQNKLSGR KKNSGKSATGHRSFKLEKQHPNHPMTALQPTQHKDDEDEDVYDDHFYLIKNNYIFYSV >gi568815586r:85704705_85936201|GENSCAN_predicted_CDS_8|897_bp atggggtttctctatgttgtccaggctggtctcgaactcctcgattcaagcaatccacct gcctcagcttcccaaaattctgagattgatggtggtggtggcccatctgaagcagctgct gtgggaacatcagctgcagcaagggagttgcccacatacctcattctttctggatgcagg acaagtactcaggacccattgaatggcagaactgaaagagctgtcacacaaacaggactg aaacacccccctgctcataactttgtgggtgacaagaaggagagaagagggaaagagaga aaagctgcagttgtcttagtttatttccacactgctgataaagacatacctgagactggg aagaaaaagaggtttaatggacttacagttgcacatcgctggggaggcctcacaatcatg gtggaaggaaaggaggagctagtcacatcttacatggatggcagtgggcaaagacagaga gctcgtgcagggaaactcccgtttttaaaaccatcacatcacgtcagacttattcactgt cacaagaacagcgcagaaatgacctgcccccatgattcaatcatctcccactgggtccct cccacaacacatgaaaattataagagctacaagataagatttgggcatgggatgtggact ggtagcatgagccgaatacaggctgccaggccgtgtgggcagaacaagctcagtgggcgc aagaaaaactcaggcaaaagtgccaccggccacagaagtttcaagctggaaaagcaacac cctaaccatcctatgacagcattacagcctactcaacataaagatgatgaggatgaagat gtttatgatgatcacttctacttaataaagaataactatattttttattctgtgtga >gi568815586r:85704705_85936201|GENSCAN_predicted_peptide_9|172_aa MLTHRLPWLGVGGPLPHVALRNPKQQQQQLGKIVTFTQGSEMGEESRGDKGKGEEKAGQR GEGWSLVNNLNSPAEETGEVHEEELVARRKLPTALDGFSLEAMLTIYQLHKICHSRAFQH WELIQEDILDTGNDKNGKEEVIKRKIPYILKRQLYENKPRRPYILKRDSYYY >gi568815586r:85704705_85936201|GENSCAN_predicted_CDS_9|519_bp atgctcactcaccgcctcccttggctgggggtagggggtcccctgccccatgtggctctc agaaatccaaagcagcagcagcagcaattagggaagatcgtcactttcactcaaggttca gaaatgggggaggagagcaggggggacaaaggaaaaggggaggagaaagcagggcaaaga ggggagggatggagtcttgtaaataatttgaacagcccagctgaggaaacaggagaagtt catgaagaggagcttgttgcaagaaggaaacttcctactgctttagatggctttagcttg gaagcaatgttgacaatataccagctccacaaaatctgtcacagcagggcttttcaacac tgggagttaatccaggaagatattcttgatactggaaatgacaaaaatggaaaggaagaa gtcataaagagaaaaattccttatattctgaaacggcagctgtatgagaataaacccaga agaccctacatactcaaaagagattcttactattactga >gi568815586r:85704705_85936201|GENSCAN_predicted_peptide_10|102_aa MDKWDHIKLKNFYIAKKTINKVKRQFTGEKIFAHYPSEIGLITTICKELKQLYTRKKSNN LIKRWTIDLNAHFSKEDIQMASRYMKRCSTSLIIREMQIKLQ >gi568815586r:85704705_85936201|GENSCAN_predicted_CDS_10|309_bp atggacaaatgggatcacatcaagttaaaaaacttctacatagcaaagaagacaatcaac aaagtgaagagacaattcacaggagagaaaatatttgcacattacccatctgaaatcgga ttaattaccacaatatgtaaggagctcaagcaactctatacgagaaaaaaatctaataat ttgattaaaagatggacaatagatctgaatgcacatttctcaaaagaagacatacaaatg gcaagcaggtatatgaaaaggtgctcaacatcattgatcatcagagaaatgcaaatcaaa ctacaatga >gi568815586r:85704705_85936201|GENSCAN_predicted_peptide_11|148_aa MGKNFMTKTQKAIATKAKIDKWDLIKRKSFCTAKEAIIRENSLHVLCFFFFFEGIAENFL NLEKDINIQVQEEYRTPNRFNPNKTKSRHLIIKLPKNKDKEKMARHCMEKHCENPCPVLF RSNYRCMQPPVMHPLLAQLITTLSRRPP >gi568815586r:85704705_85936201|GENSCAN_predicted_CDS_11|447_bp atgggcaagaacttcatgactaaaacacaaaaagcaattgcaacaaaagccaaaattgac aaatgggatctaattaaacgaaagagcttctgcacagcaaaagaagctatcatcagggag aacagtttacatgtcctttgtttctttttcttttttgaagggatagcagagaacttctta aacctagagaaagacataaatatccaagtacaagaagaatatagaacaccaaacagattt aacccaaataagactaaatcaagacatttaataatcaaactccccaagaacaaggataaa gaaaagatggccagacattgtatggaaaagcactgtgaaaatccctgtcctgttctgttc cgttctaattaccggtgcatgcagcccccagtcatgcaccccctgcttgctcaattgatc acaaccctctcccgcagacccccttag