GENSCAN 1.0 Date run: 5-Nov-116 Time: 20:11:02 Sequence gi568815586f:85774404_85982372 : 207969 bp : 35.11% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Sngl + 12039 12257 219 2 0 88 42 285 0.970 18.81 1.02 PlyA + 12916 12921 6 1.05 2.00 Prom + 13451 13490 40 -6.15 2.01 Init + 14400 14563 164 2 2 70 72 67 0.366 2.55 2.02 Term + 15260 15779 520 2 1 30 38 263 0.433 8.28 2.03 PlyA + 15912 15917 6 1.05 3.02 PlyA - 17918 17913 6 1.05 3.01 Sngl - 31543 30299 1245 1 0 75 32 1105 0.948 99.17 3.00 Prom - 33619 33580 40 -5.95 4.02 PlyA - 33756 33751 6 1.05 4.01 Sngl - 49073 48819 255 2 0 36 42 227 0.951 7.76 4.00 Prom - 50138 50099 40 -7.55 5.00 Prom + 51925 51964 40 -2.15 5.01 Init + 56641 56724 84 0 0 65 82 -24 0.184 -4.43 5.02 Term + 62516 62785 270 1 0 44 45 249 0.733 10.70 5.03 PlyA + 63376 63381 6 1.05 6.00 Prom + 63691 63730 40 -5.45 6.01 Init + 67390 67505 116 0 2 36 78 70 0.443 0.53 6.02 Intr + 69750 70042 293 0 2 79 80 127 0.367 6.85 6.03 Term + 70388 70626 239 0 2 62 42 121 0.413 0.25 6.04 PlyA + 71481 71486 6 1.05 7.05 PlyA - 73480 73475 6 1.05 7.04 Term - 78313 78224 90 1 0 124 38 78 0.523 3.14 7.03 Intr - 81046 80884 163 0 1 77 64 43 0.425 -0.14 7.02 Intr - 81871 81375 497 1 2 71 31 264 0.201 9.86 7.01 Init - 82963 82817 147 1 0 96 46 105 0.406 6.19 7.00 Prom - 83836 83797 40 -5.75 8.00 Prom + 85755 85794 40 -4.55 8.01 Init + 91983 92044 62 2 2 62 80 85 0.019 3.87 8.02 Intr + 99664 99795 132 1 0 48 57 96 0.012 1.34 8.03 Intr + 103995 104166 172 0 1 31 65 111 0.170 2.12 8.04 Term + 107820 107972 153 2 0 90 54 137 0.998 7.44 8.05 PlyA + 107976 107981 6 1.05 9.02 PlyA - 107990 107985 6 1.05 9.01 Sngl - 119997 119689 309 0 0 91 50 188 0.916 10.95 9.00 Prom - 122124 122085 40 -2.95 10.00 Prom + 129927 129966 40 -4.15 10.01 Init + 134484 134608 125 2 2 71 115 117 0.892 12.39 10.02 Intr + 138604 138792 189 1 0 76 80 74 0.714 3.18 10.03 Intr + 163762 163834 73 1 1 51 108 40 0.780 0.79 10.04 Term + 163906 164103 198 0 0 122 48 175 0.908 13.32 10.05 PlyA + 165836 165841 6 1.05 11.03 PlyA - 166909 166904 6 1.05 11.02 Term - 173164 173070 95 2 2 86 44 59 0.244 -1.69 11.01 Init - 176497 176410 88 1 1 48 72 106 0.591 5.85 11.00 Prom - 179119 179080 40 -3.95 12.03 PlyA - 179260 179255 6 1.05 12.02 Term - 203155 203094 62 2 2 85 54 63 0.603 -0.41 12.01 Intr - 206027 204993 1035 0 0 78 44 545 0.523 38.66 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586f:85774404_85982372|GENSCAN_predicted_peptide_1|72_aa MGKKQSRKTENSKNQSASPPPKERSFSPTMEQSCKENDFDKLREEGFRQSNFSELKEEVR THRKEAKNHEKD >gi568815586f:85774404_85982372|GENSCAN_predicted_CDS_1|219_bp atggggaaaaaacagagcagaaaaactgaaaattctaaaaatcagagtgcctctccccct ccaaaggaacgaagcttctcgccaacaatggaacaaagctgtaaggagaatgactttgac aagttgagagaagaaggcttcagacaatcaaacttctccgagctaaaggaggaagttcga acacatcgcaaagaagctaaaaaccatgaaaaagattag >gi568815586f:85774404_85982372|GENSCAN_predicted_peptide_2|227_aa MDKFLDTYTLPRLNQEEVESLNRQITGSEIEAIINSLQTKKSPGPDGFTAEFYQSPKSPK LISNFSKVSGYKINVQKSQAFLYTNNRLTESQTMSELPFTIVSKRIKYLGIQLKKDVKDL LKENYKTLLNEIKEDTNKWKNIPCSWVGRINVVKMAILPKLIYRFNAIPIKLPMTFFTEL EKTTLKFIWNQKRACIAKSILSQKNKAGGITLPDFKLHYKATVSKTA >gi568815586f:85774404_85982372|GENSCAN_predicted_CDS_2|684_bp atggataaattcctggacacatacactctcccaagactaaaccaggaagaagttgaatct ctgaatagacaaataacaggctctgaaattgaggcaataattaatagcttacaaaccaaa aaaagtccaggaccggacggattcacagccgagttctaccagagcccaaaatctcctaag ctgataagcaacttcagcaaagtctcaggatacaaaatcaatgtgcaaaaatcacaagca ttcttatacaccaataacagactaacagagagccaaaccatgagtgaactcccgttcaca attgtttcaaagagaataaaatacctaggaatccaacttaaaaaggatgtgaaggacctc cttaaggagaactacaaaacactgctcaacgaaataaaagaagacacaaacaaatggaag aatattccatgctcatgggtaggaagaatcaatgtcgtgaaaatggccatactgcccaaa ttaatttatagattcaatgccatccccatcaagctaccaatgactttcttcacagaattg gaaaaaactactttaaagttcatatggaaccaaaaaagagcctgcattgccaagtcaatc ctaagccaaaagaacaaagctggaggcatcacgctacctgacttcaaactacactacaag gctacagtatccaaaacagcatag >gi568815586f:85774404_85982372|GENSCAN_predicted_peptide_3|414_aa MDSEEKEIVVWVCQEEKLVCGLTKRTTSADVIQALLEEHEATFGEKRFLLGKPSDYCIIE KWRGSERVLPPLTRILKLWKAWGDEQPNMQFVLVKADAFLPVPLWRTAEAKLVQNTEKLW ELSPANYMKTLPPDKQKRIVRKTFRKLAKIKQDTVSHDRDNMETLVHLIISQDHTIHQQV KRMKELDLEIEKCEAKFHLDRVENDGENYVQDAYLMPSFSEVEQNLDLQYEENQTLEDLS ESDGIEQLEERLKYYRILIDKLSAEIEKEVKSVCIDINEDAEGEAASELESSNLESVKCD LEKSMKAGLKIHSHLSGIQKEIKYSDSLLQMKAKEYELLAKEFNSLHISNKDGCQLKENR AKESEVPSSNGEIPPFTQRVFSNYTNDTDSDTGISSNHSQDSETTVGDVVLLST >gi568815586f:85774404_85982372|GENSCAN_predicted_CDS_3|1245_bp atggattcagaagagaaggaaattgtggtttgggtttgccaagaagagaagcttgtctgt gggctgactaaacgcaccacctctgctgatgtcatccaggctttgcttgaggaacatgag gctacgtttggagagaaacgatttcttctggggaagcccagtgattactgcatcatagag aagtggagaggctccgaaagggttcttcctccactaactagaatcctgaagctttggaaa gcgtggggagatgagcagcccaatatgcaatttgttttggttaaagcagatgcttttctt ccagttcctttgtggcggacagctgaagccaaattagtgcaaaacacagaaaaattgtgg gagctcagcccagcaaactacatgaagactttaccaccagataaacaaaaaagaatagtc aggaaaactttccggaaactggctaaaattaagcaggacacagtttctcatgatcgagat aatatggagacattagttcatctgatcatttcccaggaccatactattcatcagcaagtc aagagaatgaaagagctggatctggaaattgaaaagtgtgaagctaagttccatcttgat cgagtagaaaatgatggagaaaactatgttcaggatgcatatttaatgcccagtttcagt gaagttgagcaaaatctagacttgcagtatgaggaaaaccagactctggaggacctgagc gaaagtgatggaattgaacagctggaagaacgactgaaatattaccgaatactcattgat aagctctctgctgaaatagaaaaagaggtaaaaagtgtttgcattgatataaatgaagat gcggaaggggaagctgcaagtgaactggaaagctctaatttagagagtgttaagtgtgat ttggagaaaagcatgaaagctggtttgaaaattcactctcatttgagtggcatccagaaa gagattaaatacagtgactcattgcttcagatgaaagcaaaagaatatgaactcctggcc aaggaattcaattcacttcacattagcaacaaagatgggtgccagttaaaggaaaacaga gcgaaggaatctgaggttcccagtagcaatggggagattcctccctttactcaaagagta tttagcaattacacaaatgacacagactcggacactggtatcagttctaaccacagtcag gactccgaaacaacagtaggagatgtggtgctgttgtcaacatag >gi568815586f:85774404_85982372|GENSCAN_predicted_peptide_4|84_aa MWKKQGACGIWEEAAGKAKEDPCENGIVTVKERKAARMGGGGHNALCYKEDMSDQFCKWL LGVAVKRFLITLRRKVATEDEWRQ >gi568815586f:85774404_85982372|GENSCAN_predicted_CDS_4|255_bp atgtggaagaagcaaggtgcatgtgggatctgggaagaagcagctggaaaagcaaaggaa gatccatgtgagaatggcatcgtaacagtcaaggagaggaaagctgcaagaatgggagga ggaggtcacaatgcattatgttacaaagaggatatgtcagatcagttctgcaagtggcta ttgggtgtggcagttaagagatttctgataaccctaagaagaaaagtggctactgaggat gagtggaggcagtag >gi568815586f:85774404_85982372|GENSCAN_predicted_peptide_5|117_aa MRFHFRKLICKRQRDSLSRTGIHIHLQEAPAPHSQVHLAKVEGPEGPQGSRADASQFGLG TSWSLVFDSQGVYVDPYPIFDIHWAFRALGISEEKQTAPAPQGVKKTHRGDLLELEM >gi568815586f:85774404_85982372|GENSCAN_predicted_CDS_5|354_bp atgaggtttcattttagaaagctgatatgtaagaggcaaagagattccttgtcaaggact ggtattcatatacacctgcaagaggcccctgctccacactctcaagttcatcttgcaaaa gtcgaaggacctgaaggaccacagggctccagggctgatgcatcacagtttggtctagga accagctggagtttagtgtttgactcccaaggcgtttatgttgacccctaccccatcttt gacattcactgggcctttagagccctgggaatttcagaggaaaagcaaactgctcctgcc cctcaaggagtaaagaagacacaccgaggggatctgctggagctcgaaatgtga >gi568815586f:85774404_85982372|GENSCAN_predicted_peptide_6|215_aa MYLSSAVRSCLKFAHGNRKLSLKIGMDEASLTKPFKYGCPLWTLGSDKHKREAKWGLREA RHWLAWTFFMNSLGTIESSRRQTGSKEERDGSLVKPHLPAGESLKPGGQAPVSWTTEGTC GVFSCARPWQPMDQSADEGERRTMALRGPQTWEAPRARAVTLSLGPCRSWSLQASRHHHI PQCQPWKLLAVYLVQPQPLSELVPMPAPGAAHPLQ >gi568815586f:85774404_85982372|GENSCAN_predicted_CDS_6|648_bp atgtacctatcttcagcagttcgttcctgtcttaaatttgctcatggtaataggaaactg tctctaaaaattggaatggatgaggcttctctcacaaagccttttaaatatggatgcccc ctctggactttgggctctgataagcacaagagggaggccaagtggggactgagggaagct cggcactggcttgcctggaccttcttcatgaacagcctgggcaccatagaaagcagtaga aggcagacaggctccaaggaagaaagggatgggtcccttgtgaagccccaccttccggct ggggaaagcctaaagcctgggggccaggcgccagtctcctggaccacagagggaacttgt ggtgttttttcctgtgcccgcccatggcagcccatggaccaatcagcagatgaaggggag agaagaactatggcccttcggggaccccagacctgggaagctccccgagccagggctgtt actctctctttggggccctgcaggtcctggagtctccaagcttctcggcaccaccacatt ccccagtgtcagccatggaagctgcttgcagtgtacctggtccagccacagcctctcagt gagctggtgcccatgccagcacctggagctgcccatccactgcagtag >gi568815586f:85774404_85982372|GENSCAN_predicted_peptide_7|298_aa MGFLYVVQAGLELLDSSNPPASASQNSEIDGGGGPSEAAAVGTSAAARELPTYLILSGCR TSTQDPLNGRTERAVTQTGLKHPPAHNFVGDKKERRGKERKAAVVLVYFHTADKDIPETG KKKRFNGLTVAHRWGGLTIMVEGKEELVTSYMDGSGQRQRARAGKLPFLKPSHHVRLIHC HKNSAEMTCPHDSIISHWVPPTTHENYKSYKIRFGHGMWTGSMSRIQAARPCGQNKLSGR KKNSGKSATGHRSFKLEKQHPNHPMTALQPTQHKDDEDEDVYDDHFYLIKNNYIFYSV >gi568815586f:85774404_85982372|GENSCAN_predicted_CDS_7|897_bp atggggtttctctatgttgtccaggctggtctcgaactcctcgattcaagcaatccacct gcctcagcttcccaaaattctgagattgatggtggtggtggcccatctgaagcagctgct gtgggaacatcagctgcagcaagggagttgcccacatacctcattctttctggatgcagg acaagtactcaggacccattgaatggcagaactgaaagagctgtcacacaaacaggactg aaacacccccctgctcataactttgtgggtgacaagaaggagagaagagggaaagagaga aaagctgcagttgtcttagtttatttccacactgctgataaagacatacctgagactggg aagaaaaagaggtttaatggacttacagttgcacatcgctggggaggcctcacaatcatg gtggaaggaaaggaggagctagtcacatcttacatggatggcagtgggcaaagacagaga gctcgtgcagggaaactcccgtttttaaaaccatcacatcacgtcagacttattcactgt cacaagaacagcgcagaaatgacctgcccccatgattcaatcatctcccactgggtccct cccacaacacatgaaaattataagagctacaagataagatttgggcatgggatgtggact ggtagcatgagccgaatacaggctgccaggccgtgtgggcagaacaagctcagtgggcgc aagaaaaactcaggcaaaagtgccaccggccacagaagtttcaagctggaaaagcaacac cctaaccatcctatgacagcattacagcctactcaacataaagatgatgaggatgaagat gtttatgatgatcacttctacttaataaagaataactatattttttattctgtgtga >gi568815586f:85774404_85982372|GENSCAN_predicted_peptide_8|172_aa MLTHRLPWLGVGGPLPHVALRNPKQQQQQLGKIVTFTQGSEMGEESRGDKGKGEEKAGQR GEGWSLVNNLNSPAEETGEVHEEELVARRKLPTALDGFSLEAMLTIYQLHKICHSRAFQH WELIQEDILDTGNDKNGKEEVIKRKIPYILKRQLYENKPRRPYILKRDSYYY >gi568815586f:85774404_85982372|GENSCAN_predicted_CDS_8|519_bp atgctcactcaccgcctcccttggctgggggtagggggtcccctgccccatgtggctctc agaaatccaaagcagcagcagcagcaattagggaagatcgtcactttcactcaaggttca gaaatgggggaggagagcaggggggacaaaggaaaaggggaggagaaagcagggcaaaga ggggagggatggagtcttgtaaataatttgaacagcccagctgaggaaacaggagaagtt catgaagaggagcttgttgcaagaaggaaacttcctactgctttagatggctttagcttg gaagcaatgttgacaatataccagctccacaaaatctgtcacagcagggcttttcaacac tgggagttaatccaggaagatattcttgatactggaaatgacaaaaatggaaaggaagaa gtcataaagagaaaaattccttatattctgaaacggcagctgtatgagaataaacccaga agaccctacatactcaaaagagattcttactattactga >gi568815586f:85774404_85982372|GENSCAN_predicted_peptide_9|102_aa MDKWDHIKLKNFYIAKKTINKVKRQFTGEKIFAHYPSEIGLITTICKELKQLYTRKKSNN LIKRWTIDLNAHFSKEDIQMASRYMKRCSTSLIIREMQIKLQ >gi568815586f:85774404_85982372|GENSCAN_predicted_CDS_9|309_bp atggacaaatgggatcacatcaagttaaaaaacttctacatagcaaagaagacaatcaac aaagtgaagagacaattcacaggagagaaaatatttgcacattacccatctgaaatcgga ttaattaccacaatatgtaaggagctcaagcaactctatacgagaaaaaaatctaataat ttgattaaaagatggacaatagatctgaatgcacatttctcaaaagaagacatacaaatg gcaagcaggtatatgaaaaggtgctcaacatcattgatcatcagagaaatgcaaatcaaa ctacaatga >gi568815586f:85774404_85982372|GENSCAN_predicted_peptide_10|194_aa MGKNFMTKTQKAIATKAKIDKWDLIKRKSFCTAKEAIIRENSLHVLCFFFFFEGIAENFL NLEKDINIQVQEEYRTPNRFNPNKTKSRHLIIKLPKNKDKEKMASHKLVHAQQHFFINVM GLRQVLKAQPAPMAPWGVPYDQLTEEEKTRAWFTDDSAQYAGITRKWIAAALQPLSRTSL KDSSEGKSSQWTEL >gi568815586f:85774404_85982372|GENSCAN_predicted_CDS_10|585_bp atgggcaagaacttcatgactaaaacacaaaaagcaattgcaacaaaagccaaaattgac aaatgggatctaattaaacgaaagagcttctgcacagcaaaagaagctatcatcagggag aacagtttacatgtcctttgtttctttttcttttttgaagggatagcagagaacttctta aacctagagaaagacataaatatccaagtacaagaagaatatagaacaccaaacagattt aacccaaataagactaaatcaagacatttaataatcaaactccccaagaacaaggataaa gaaaagatggccagccacaaattggttcatgcacagcagcatttcttcatcaatgtaatg gggctcaggcaggttctgaaggcacaacctgcaccaatggccccatggggagttccctat gatcagttgacagaggaagagaagactagggcctggttcacagatgattctgcacaatat gcaggcatcacccgaaagtggatagctgcagcactgcagcccctttctaggacatctctg aaggacagcagtgaagggaaatcttcccagtggacagaactttga >gi568815586f:85774404_85982372|GENSCAN_predicted_peptide_11|60_aa MKERYKENYKTLVKEIIDETNKWENIPSSGIPTCGEHHGDLHLRIKRLRWVGQLFHWLRE >gi568815586f:85774404_85982372|GENSCAN_predicted_CDS_11|183_bp atgaaagaacgctacaaggagaactacaaaacactagtgaaagaaatcatcgatgaaaca aacaaatgggaaaacattccaagctcaggtatccccacgtgtggagaacatcatggcgac ctgcatttgcgtattaaaagactgcggtgggtgggccagctttttcactggctacgtgaa tga >gi568815586f:85774404_85982372|GENSCAN_predicted_peptide_12|365_aa XYLTIGLSSVKRKKGNYLLETIKSIFEQSSYEELKEISVVVHLADFNSSWRDAMVQDITQ KFAHHIIAGRLMVIHAPEEYYPILDGLKRNYNDPEDRVKFRSKQNVDYAFLLNFCANTSD YYVMLEDDVRCSKNFLTAIKKVIASLEGTYWVTLEFSKLGYIGKLYHSHDLPRLAHFLLM FYQEMPCDWLLTHFRGLLAQKNVIRFKPSLFQHMGYYSSYKGTENKLKDDDFEEESFDIP DNPPASLYTNMNVFENYEASKAYSSVDEYFWGKPPSTGDVFVIVFENPIIIKKIKVNTGT EDRQNDILHHGALDVGENVMPSKQRRQCSTYLRLGEFKNGNFEMSATKVLEEKRTFHKRL NINIL >gi568815586f:85774404_85982372|GENSCAN_predicted_CDS_12|1098_bp nggtatcttacaattggactttcttcagtaaagcgaaaaaaaggaaactatttacttgag acaattaagtcaatttttgagcaatccagctatgaagagctgaaggaaatttcagtggtg gttcacctagcagactttaattcttcctggcgtgatgccatggtccaggatattacacag aaatttgcgcaccatattattgcaggaagattaatggttatacatgctccagaggagtat tacccaatcctagatggccttaaaagaaattacaatgatccagaagatagagtcaaattt cgttccaagcaaaatgtagattatgcttttctgcttaatttttgtgccaatacttcagac tattatgtaatgcttgaagatgatgttcgatgttcaaaaaatttcttaactgccatcaag aaagtcattgcatccctagaaggaacttactgggtaactcttgaattctctaagcttggc tacattggtaaactctatcattctcatgatctcccacgtttggcccattttttattaatg ttttatcaagaaatgccttgtgattggctattgactcatttccgtggtctgttggctcag aaaaatgtgatccgttttaaaccatctctctttcagcacatgggctattattcatcatac aaagggacggagaataagctgaaggatgatgattttgaagaggagtcatttgacattcct gataacccccctgcaagtctgtacaccaacatgaatgtgtttgaaaattatgaagcaagc aaggcttacagtagtgttgatgagtacttttgggggaaaccaccttcaacaggagatgtt tttgtgattgtatttgaaaatccaattataataaaaaaaattaaagtaaatactggaaca gaagatcggcaaaatgatattttgcatcatggagccctagatgttggggaaaacgttatg cctagcaaacaaaggagacaatgttctacttacttaagactaggagaattcaaaaatgga aactttgaaatgtcagctactaaagtgttagaagaaaaacgaacctttcataagaggcta aatattaatattttgtga