GENSCAN 1.0    Date run: 8-Nov-116    Time: 12:46:25

Sequence gi568815592f:122897778_123163758 : 265981 bp : 36.59% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.02 Intr -    807    577  231  1  0   18   88   147 0.075   4.52
 1.01 Init -   3589   3172  418  1  1   57   36   512 0.293  39.54
 1.00 Prom -   9785   9746   40                              -3.55

 2.02 PlyA -  10231  10226    6                               1.05
 2.01 Sngl -  16564  16208  357  1  0   74   37   202 0.553   9.51
 2.00 Prom -  22110  22071   40                              -4.25

 3.09 PlyA -  22459  22454    6                               1.05
 3.08 Term -  29637  29198  440  1  2   37   38   209 0.350   5.25
 3.07 Intr -  32366  32089  278  1  2   37  110   125 0.408   5.84
 3.06 Intr -  32644  32519  126  0  0    1   80   130 0.433   2.37
 3.05 Intr -  43609  42238 1372  2  1   71   53   357 0.028  17.40
 3.04 Intr -  44452  44222  231  2  0   46   72   131 0.298   4.12
 3.03 Intr -  47033  46765  269  1  2   25   71   216 0.347   9.85
 3.02 Intr -  72375  72333   43  1  1   15  113    31 0.018  -5.32
 3.01 Init -  72662  72509  154  2  1   89   96    85 0.514   9.59
 3.00 Prom -  74733  74694   40                              -8.85

 4.00 Prom +  74767  74806   40                              -4.15
 4.01 Init +  80068  80234  167  0  2   78   83   105 0.798   8.15
 4.02 Intr +  80942  81140  199  2  1   35   18   166 0.025   2.73
 4.03 Term +  98470  98946  477  0  0  -18   44   453 0.263  24.15
 4.04 PlyA +  99330  99335    6                               1.05

 5.00 Prom +  99859  99898   40                             -12.03
 5.01 Init + 100001 100389  389  1  2   72   99   490 0.995  44.82
 5.02 Intr + 113208 113382  175  0  1   95   91   151 0.925  15.12
 5.03 Intr + 123911 124008   98  1  2   30   73    87 0.000  -0.71
 5.04 Intr + 132805 132829   25  1  1  107   90    16 0.030   0.81
 5.05 Intr + 139586 139693  108  1  0   53   75    75 0.049   2.26
 5.06 Intr + 158029 158249  221  0  2  100   69   206 0.471  16.08
 5.07 Term + 165897 165984   88  0  1  132   38    81 0.501   3.65
 5.08 PlyA + 166939 166944    6                               1.05

 6.02 PlyA - 167221 167216    6                               1.05
 6.01 Sngl - 184403 183888  516  2  0   57   29   242 0.252  10.99
 6.00 Prom - 206390 206351   40                              -2.95

 7.03 PlyA - 207160 207155    6                               1.05
 7.02 Term - 209793 209635  159  0  0   76   41    89 0.167  -0.04
 7.01 Init - 244612 244460  153  1  0   73  107   123 0.946  12.73
 7.00 Prom - 251659 251620   40                              -5.75

 8.03 PlyA - 252243 252238    6                               1.05
 8.02 Term - 259192 259081  112  0  1   83   37   131 0.957   4.55
 8.01 Init - 259821 259742   80  0  2   78  100    57 0.665   6.48

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl +  89735  90067  333  1  0   58   44   261 0.821  14.47
S.002 Term - 121299 121162  138  0  0  104   47   132 0.902   7.68

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815592f:122897778_123163758|GENSCAN_predicted_peptide_1|217_aa
MRLRRKQRPYPEGSREAKSDKVRKADSGITKKKKEEEAEGGGGGGKKEERRRGRGRGRRR
RRRKGEGGEEEEEEEEEEEEEEGEGGGGGGGGGGKKKEEEGREKEGRKRGREEVKKEGGR
EGRKERRKEGSEGGRKERRSICTELGSLENRGISKILGKCLNSRKEERRRETKGSRLLAP
WLLGLIIDFVVTEGLAVWELLGVDEDPTASYGGMAGX

>gi568815592f:122897778_123163758|GENSCAN_predicted_CDS_1|651_bp
atgaggctaagaaggaagcagaggccatatcctgagggatctagagaggccaagtcagat
aaagtcagaaaagctgacagtggcattacaaagaagaaaaaagaagaggaagcagaagga
ggaggaggaggaggaaagaaggaagaaagaagaagaggaagaggaagaggaagaaggagg
aggaggaggaagggagaaggaggagaagaggaagaagaagaggaagaagaagaagaagaa
gaagaaggagaaggaggaggaggaggtggtggaggaggaggaaagaagaaagaagaagaa
ggaagagagaaggaaggaaggaagcgagggagggaggaagtaaagaaggagggagggaga
gagggaaggaaggagagaaggaaggaaggaagcgagggagggaggaaggaaagaaggagc
atatgtactgaacttgggtctctggaaaatagaggcatctccaaaatccttggaaagtgt
ctgaattcaagaaaagaggagagaagaagagagactaaaggctcaaggctgctggcaccc
tggcttctgggcttgattatagactttgttgtgacagaagggcttgcagtctgggagctc
ctgggggtggatgaagatcctacagcatcttatggaggcatggctggagnn

>gi568815592f:122897778_123163758|GENSCAN_predicted_peptide_2|118_aa
MISFDSMSHITVMLMQDIGSHSLRQLCPFGFAEYSPPPSCFLRLTLSVCGFSRCTVQAVS
GSTILGSGECGPLHTVPLGSAPVGALYKGSNHTFPFCTALAEALHEGFTPAAHLCLNI

>gi568815592f:122897778_123163758|GENSCAN_predicted_CDS_2|357_bp
atgatctcctttgactccatgtctcacatcacggttatgctgatgcaagacataggctcc
cacagcctcaggcagctctgtccctttggctttgcagagtacagtccccctcctagctgc
tttctcaggctgacattgagtgtctgtggcttttccagatgcacggtgcaagctgtcagt
ggatctactattctggggtctggagaatgtggtcctcttcacacagttccactaggcagt gccccagttggggctctgtataagggatccaatcacacatttcccttctgtactgcccta gcagaggctctccatgagggcttcacccctgcagcacacctctgcctgaacatctag >gi568815592f:122897778_123163758|GENSCAN_predicted_peptide_3|970_aa MTHTEWKSIDSICIISMLPPSLLEKEPCRQLGKEMGTVCHISAILEFRKDEARPRASPKT KKIELRGRLTPYTAGYSSETKLPVERSGSNICCSPISTVLQPLLPIARQTGSGVGLQQTP TDLQLRDLTVRRKTNKQKGHPHQNPICTSSSSKTKEIQTTIREYYKHLYANKLENLEEMD KFLNTYTLPTLNQEEVESLNRPITGSEIEAIINSLPIKTVPEQTDSQPNSIRVLEVLATA IMQKKEINGIQLGNEEVKLSLFADDMIVYLENPIISAQNLLKLISFSKVSGYKIKVQKSQ AFLYANNRQTESQIMSELPFTIASKRIEYLGIQLIRDVKDLFKENYKPLLKEIKEDTNKW KNIPCSWVGRLNIVKMAILLKIIYRFNAIPINLPMTFFTELEKITLKFIWNQKRARIAKS ILSQKNKAGGITLPDFKLYYKATVTKTEWYWYQNRYTDQWNRTEPSEIMPHIYNHLIFDK SDKNKKWGRDSLFNKWCWENWLAICRKLKQDPFLTPYTKINSRWIKDLNVRPKTIKTLEE NLGHTIQDIGMGKDFMSKTPKAMATKDKIDKWDLIKLKSFCTAKETTIRVNRQLIEWEKI FAIYSLDKGLISRIHNELKQIYKKKTNNPIKKWAKDMNRHFSKEDIYAAKRHMKKCSSSL TIREMQIKTTMRYHLIPVRMAIIKKSGNNSLLALGSLGGLNKEESPTRAAVPDHGQTASL SGTPMHPSSLDSLLALESPGVLDKEGFPSMQHTCSTKKQPDCFFKQVSDPVSPDWVRPPN RGLQTPPIGVSRPATGQYTPGMELPEEGAGCHLCCFVAFIGDTPEIQTTIRECYKHLYVY KLENPEEIDKFLDTYTLPSLNQEEMESLKRPITSSEIEAVINSLRPDKFTAEFYQRYKEE LIPFLLKPFQKIKKEGLLPNSFYEASIILTPKRGRDTTTTITTTTTTKKNFRLISLTNID AKILNKILAN >gi568815592f:122897778_123163758|GENSCAN_predicted_CDS_3|2913_bp atgactcacactgaatggaaatcaatcgactctatctgtataataagcatgctcccccct tcactcttggagaaggagccatgcagacagctgggaaaagagatgggcactgtgtgtcac attagtgctatactggagttcaggaaagatgaggccagacccagggcttctccaaagaca aagaagattgaactaaggggcagactgacaccttacacggccgggtactcctctgagaca aaacttccagtggaacgatcaggcagcaacatttgctgttcaccaatatccactgttctg cagcctctgctgccgattgccaggcaaacagggtctggagtgggcctccagcaaactcca acagacctgcagctgagggacctgactgttagaaggaaaactaacaaacagaaaggacat ccacaccaaaaccccatctgtacatcatcatcatcaaagaccaaagaaatacaaactacc atcagagaatactataaacacctctatgcaaataaactagaaaatctagaagaaatggat aaattcctcaacacatacaccctcccaacactaaaccaggaagaagttgaatccctgaat aggccaataacaggctctgaaattgaggcaataattaatagcctaccaatcaaaacagtc 
ccggaacagacggattcacagccgaattctatcagagtgttggaagttctggccacggca atcatgcagaagaaggaaataaacggtattcaattaggaaatgaggaagtcaaattgtcc ctgtttgcagatgacatgattgtatatctagaaaaccccatcatctcagcccaaaatctc cttaagctgataagcttcagcaaagtctcaggatacaaaatcaaagtgcaaaaatcacaa gcatttctatacgcaaataacagacaaacagagagccaaatcatgagtgaactcccattc acaattgcttcaaagagaatagaatacctaggaatccaacttataagggatgtgaaggac ctcttcaaggagaactacaaaccactgctcaaggaaattaaagaggatacaaacaaatgg aagaacattccatgctcatgggtaggaagactcaatattgtgaaaatggccatactgctc aagataatttatagattcaatgccatccccatcaatctaccaatgactttcttcacagaa ttggaaaaaattactttaaagttcatatggaaccaaaaaagagcccgcattgccaagtca atcctaagccaaaagaacaaagctggaggcatcacgctacctgacttcaaactatactac aaggctacagtaaccaaaacagaatggtactggtaccaaaacagatatacagaccaatgg aacagaacagagccctcagaaataatgccacatatctacaaccatctgatctttgacaaa tctgacaaaaacaagaaatggggaagggattccttatttaataaatggtgctgggaaaac tggctagccatatgtagaaagctgaaacaggatcccttccttacaccttatacaaaaatt aattcaagatggattaaagacttaaatgttagacctaaaaccataaaaaccctagaagaa aacctaggccacaccattcaggacataggcatgggcaaggacttcatgtctaaaacacca aaagcaatggcaacaaaagacaaaattgacaaatgggatctaattaaactaaagagcttc tgcacagcaaaagaaactaccatcagagtgaacaggcagcttatagaatgggagaaaatt tttgcaatctactcattggacaaagggctaatatccagaatccacaatgaactcaaacaa atttacaagaaaaaaacaaacaaccccatcaaaaagtgggcgaaggatatgaacagacac ttctcaaaagaagacatttatgcagccaaaagacacatgaaaaaatgctcatcatcactg accatcagagaaatgcaaatcaaaaccacaatgagataccatctcataccagttagaatg gcgatcattaaaaagtcaggaaacaacagcctgctggctctggggagtctaggcggtctg aataaggaggagtccccaacaagagctgctgtgccagatcatggccagactgcttcttta agtgggaccccaatgcatccctcctcactggacagcctgctggctctggagagtccaggt gttctggacaaggaagggttcccctcaatgcagcacacctgctctaccaaaaagcaacca gactgcttctttaagcaggtttctgatcctgtctctcctgactgggtgagacctcccaat aggggtctccagacacctcctataggagtgtccaggcctgcaacaggtcagtatacccct gggatggagctcccagaggaaggagcaggctgccatctttgctgttttgtggccttcatt ggtgatactccagaaatacaaacaactatcagagaatgctataaacacctctatgtatat aaactagaaaatccagaagaaattgataaattcctggacacatacaccctcccaagtctg 
aaccaggaagaaatggaatccctgaaaagaccaataaccagttctgaaattgaggcagta ataaatagcctacgaccagataaatttacagcagaattctaccagaggtacaaagaagag ctgataccatttctactgaaaccattccaaaaaattaaaaaggagggactcctccctaac tcattttatgaggccagcatcattctgacaccaaaacgtggcagagatacaacaacaaca ataacaacaacaacaacaacaaaaaaaaacttcaggttaatatccctgacgaacatcgat gcaaaaatcctcaataaaatactggcaaattga >gi568815592f:122897778_123163758|GENSCAN_predicted_peptide_4|280_aa MVKYDLPGRFHQKKEESLRWSRVQLPKHTARAHFPRVRRTLLMRAAHPKRRIMGKRCPAG PLMTLGTFGLHITFVLVDLESKEGPESSNAANTEFSYANLGQSEAEKVKLLLTLLKEKLS SKQQQPEPPPQPGGATLTRTARESPRRGQRRAAAAPLRPPPRLLPEGSQRRWPPEWPSPS RRSRPPPHLLSRRRKWQGQARTWDSGGPSPAILTPPARRGCGLPADDVAAAAPERIFGPG PRRHPSSLLLPSRPEELMPLSPPPPLLKPRLMDARECRIA >gi568815592f:122897778_123163758|GENSCAN_predicted_CDS_4|843_bp atggtgaagtatgatcttccaggtaggttccaccagaaaaaggaagaaagccttagatgg tcacgtgtacaactccctaaacacactgcacgtgctcacttcccaagggtaaggaggact ctgctcatgcgggcagcccaccctaagcgaagaatcatgggaaagagatgccctgctggg cccttaatgacactggggacctttggtttacacatcacttttgttttggttgatctggaa agcaaagaaggaccagaatcttctaatgcagccaacactgaattcagctatgcaaacctg ggtcagtcagaggcagagaaagtaaagttgctgctgacactgctaaaggaaaaactgtca tcgaaacagcagcagccggagccgccgccgcagcccggtggggcaaccctgactcggacc gctcgggagagccccaggagaggccagcgccgcgcagcagccgccccgctgcgcccacct ccccggctgctcccggagggctcacaaaggcggtggccgcccgagtggccttctccatcc aggcgttcgcgtcctcctccccaccttctctcccgaaggcgaaaatggcagggccaggcg agaacctgggacagcggtggccctagccctgcgatcctcacccctcctgctaggagaggc tgcgggctgcccgcggacgatgtggccgcggctgctcccgagcgcatcttcggcccgggt ccccgccgccacccctcttctctgctccttccatcccgcccagaggagttgatgccgctg tcgccgccgccgccgctgctgaagccgcggctgatggatgccagggagtgccgcattgct tag >gi568815592f:122897778_123163758|GENSCAN_predicted_peptide_5|367_aa MTHLQAGLSPETLEKARLELNENPDTLHQDIQEVRDMVITRPDIGFLRTDDAFILRFLRA RKFHHFEAFRLLAQYFEYRQQNLDMFKSFKATDPGIKQALKDGFPGGLANLDHYGRKILV LFAANWDQSRYTLVDILRAILLSLEAMIEDPELQVNGFVLIIDWSNFTFKQASKLTPSML RLAIEGLQEIGIGTAGMKDLVAYAAFSGITQMGMRCLITACSKLDFAVQLTLWYDSDLGS GYPASTNSFDLTSPTSLSLYLKSSKIFLHGNNLNSLHQLIHPEILPSEFGGMLPPYDMGT 
WARTLLDHEYDDDSEYNVDSYSMPVKEVEKELSPKSMKRSQSVVDPTVLKRMDKNEEENM QPLLSLD >gi568815592f:122897778_123163758|GENSCAN_predicted_CDS_5|1104_bp atgactcatttgcaagccggtctctcccctgagaccctggagaaagctcgcctggagctc aatgaaaacccagacacgctgcaccaggacatccaggaggtgagggatatggtcatcacc aggccggacattggctttctgcgcacggatgatgccttcatcttacgcttcttgcgggct aggaagtttcatcactttgaggccttccgcctcctggcgcagtactttgagtaccggcag cagaacctggacatgttcaaaagctttaaggccaccgaccctggcatcaagcaggcactg aaggatggcttccctgggggcctggccaatctggaccactatggcaggaagattctagtc ctttttgctgccaattgggatcagagcaggtacacactggtggatattttgcgtgccatc ttactttctttagaagccatgattgaagatcctgagcttcaagtgaatgggtttgttttg atcatagactggagtaacttcactttcaagcaagcctctaaactcacaccaagtatgctg cgattagctattgaaggcctgcaggaaattggcattggtacagcaggaatgaaggacctg gttgcctacgcagccttctctggcatcactcagatggggatgagatgccttattacagct tgctccaaactggactttgcagtccagttaaccctttggtatgactcagatttgggctct ggttaccctgccagtactaacagctttgacttgacatctccaacttcactttctctctac ttgaaatcaagtaagatatttttgcatggtaacaacctgaacagtctacaccaactaatt catcctgagatcctgccctctgagtttggaggaatgctgcctccttatgacatggggaca tgggcaagaacactgctagaccatgaatatgacgatgacagcgagtacaatgtagactcc tacagcatgcctgtgaaggaagtagagaaggaactctccccaaagtccatgaagagatct caatcagtagtggatcctacagtactaaaacgcatggataaaaatgaggaagaaaacatg caaccattgctttctctggactaa >gi568815592f:122897778_123163758|GENSCAN_predicted_peptide_6|171_aa MHGNALMPRQKFAAGAEPSWRTSARAVQKGNVGSEPPQRIPTGALPTGAVRRGPQSSRSH NGRSTGSLHCMPGKATDTQQQPMKAARIEAIPCKATGTELPKTMGTHLLHQCELDVRPRV KGDHFGVLKFDIPAGFCTCMGPVTPSFWPFFPIWNFCIYPIPVPPLYLGSN >gi568815592f:122897778_123163758|GENSCAN_predicted_CDS_6|516_bp atgcatggaaatgccttgatgcctaggcaaaagtttgctgcaggggcagagccctcatgg agaacctctgctagggcagtgcagaaaggaaatgtggggtcagagcccccacagagaatc cctactggggcactgcctactggagctgtgagaagagggccacagtcttccagatcccat aatggtagatccactggcagcttgcactgcatgcctggaaaagccacagacactcaacag cagccaatgaaagcagccaggattgaggctataccctgcaaagccacagggacagagctt cccaagaccatgggaacccatcttttgcatcagtgtgaactggatgtgagacctagagtc aaaggagatcattttggagttttaaaatttgacatccctgctggattttgcacttgcatg 
ggccctgtaaccccttcgttttggccattttttcccatttggaacttctgcatttaccca atacctgtacccccattgtatctaggaagtaactag >gi568815592f:122897778_123163758|GENSCAN_predicted_peptide_7|103_aa MNTSTCFGECGRGVKWTLWESLIVVLFSAQVFLDAGYVNSEVVTWMDSGPLAMGELFFSF KLWHKLGERLEDNEREEYQKNLQAFSHTWVWQQDTIFNPGLYK >gi568815592f:122897778_123163758|GENSCAN_predicted_CDS_7|312_bp atgaataccagcacctgctttggtgaatgtggcagaggagtgaagtggactctgtgggag tccttgattgtagttttgtttagtgcacaggttttcttggatgctggttatgttaacagt gaagttgtcacatggatggactcaggacctctggccatgggagaactcttcttttctttc aagctctggcacaaacttggggagaggcttgaagacaatgagcgggaagaataccagaaa aatctacaggcattttcccacacctgggtctggcagcaggataccatttttaatccaggt ttatacaaataa >gi568815592f:122897778_123163758|GENSCAN_predicted_peptide_8|63_aa MGKTCLHDSIISHRMSSTTRGGYYNSSTTRSSGHRIDVSSASDESADRMHLALQNSSWEK GPE >gi568815592f:122897778_123163758|GENSCAN_predicted_CDS_8|192_bp atgggaaaaacctgccttcatgattcaattatttcccaccggatgtcttccacgacacgt gggggttattacaattcaagtaccacaaggagttctggacacagaattgatgtctcctca gcatcggatgaatctgcagacagaatgcacttggctctccagaattccagctgggaaaaa ggtcctgaatga