GENSCAN 1.0 Date run: 6-Nov-116 Time: 11:18:52 Sequence gi568815592f:126240274_126446322 : 206049 bp : 36.06% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 156 324 169 1 1 66 39 91 0.157 0.08 1.02 Term + 2146 2845 700 1 1 75 48 239 0.471 10.62 1.03 PlyA + 3657 3662 6 1.05 2.02 PlyA - 4520 4515 6 1.05 2.01 Sngl - 17866 17648 219 1 0 81 55 355 0.975 26.41 2.00 Prom - 20261 20222 40 -7.15 3.03 PlyA - 20391 20386 6 1.05 3.02 Term - 21937 21809 129 1 0 76 46 128 0.260 4.60 3.01 Init - 42558 42319 240 0 0 82 87 119 0.696 9.12 3.00 Prom - 79375 79336 40 -4.25 4.00 Prom + 79394 79433 40 -5.35 4.01 Sngl + 87720 88376 657 2 0 71 43 381 0.934 27.82 4.02 PlyA + 88604 88609 6 1.05 5.00 Prom + 88773 88812 40 -6.15 5.01 Init + 89723 90491 769 1 1 70 28 263 0.272 13.40 5.02 Term + 90619 91634 1016 2 2 -20 48 330 0.220 9.47 5.03 PlyA + 91691 91696 6 -0.45 6.00 Prom + 91987 92026 40 -3.95 6.01 Init + 100001 100126 126 1 0 82 77 162 0.394 14.71 6.02 Intr + 105932 106045 114 1 0 77 93 55 0.487 4.62 6.03 Term + 108193 108219 27 0 0 125 43 2 0.350 -3.50 6.04 PlyA + 109475 109480 6 1.05 7.03 PlyA - 109548 109543 6 1.05 7.02 Term - 120673 120453 221 2 2 73 43 152 0.582 5.42 7.01 Init - 131603 131486 118 2 1 62 72 93 0.714 5.51 7.00 Prom - 135469 135430 40 -3.75 8.00 Prom + 139570 139609 40 -2.65 8.01 Sngl + 143637 144068 432 2 0 38 43 210 0.907 7.53 8.02 PlyA + 144269 144274 6 1.05 9.00 Prom + 144382 144421 40 -0.95 9.01 Init + 163341 163472 132 2 0 36 98 145 0.784 10.49 9.02 Intr + 175776 176024 249 2 0 -23 47 249 0.822 6.41 9.03 Term + 176323 176751 429 0 0 48 41 160 0.853 1.62 9.04 PlyA + 177663 177668 6 1.05 10.00 Prom + 177692 177731 40 -4.85 10.01 Init + 178513 178592 80 0 2 81 47 35 0.022 -0.72 10.02 Intr + 193278 193352 75 0 0 97 113 41 0.735 5.21 10.03 Term + 193871 194030 160 2 1 85 36 86 0.759 -0.47 10.04 PlyA + 194174 194179 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 12989 13080 92 0 2 122 54 83 0.818 5.20 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815592f:126240274_126446322|GENSCAN_predicted_peptide_1|289_aa XRTLSWRSLGVCWRSTPDPVCLGISSGGCRTANIGEHLMLLPDCSSGSFVSEEYPAVIGK NYFKVHMVPKKAHIAKSILSQKNKAGGITLPDFKLYYKATVTKTAWYWYQNRVIDKWNRT EPSEIMPHIYNYLIFEKPDKNKKWGKDSLFNKWFWENWLAICRKLKLDPFLTPHTKINSR WTKDLNVRPKNIKTLEENLGNTIQDIGMGKDFMSKTPKAMATKAKIDKWDLTKLKSFCTG KETTIRVNRQPTEWDKIFAIYSSDKGLISRIYKELKQIYKKKSNDPIKN >gi568815592f:126240274_126446322|GENSCAN_predicted_CDS_1|870_bp ntcaggaccctcagctggaggtctcttggagtttgctggaggtccactccagaccctgtt tgcctgggtatcagcagtggaggctgcagaacagcaaatattggtgaacatctaatgttg ctgcctgattgttcctctggaagttttgtctcagaggagtacccggccgtaattggaaaa aactactttaaagttcatatggtaccaaaaaaagcccacattgccaagtcaatcctaagc caaaagaacaaagctggaggcatcacgctacctgacttcaaactatactacaaggctaca gtaaccaaaacagcatggtactggtaccaaaacagagttatagacaaatggaacagaaca gagccctcagaaataatgccacacatctacaactatctgatctttgagaaacctgacaaa aacaagaaatggggaaaggattccctatttaataaatggttctgggaaaactggttagcc atatgtagaaagctgaaactggaccccttccttacacctcatacaaaaattaattcaaga tggactaaagacttaaatgttagacctaaaaacataaaaaccctagaagaaaacctaggc aataccattcaggacataggcatgggcaaggacttcatgtctaaaacaccaaaagcaatg gccacaaaagccaaaattgacaaatgggatctaactaaactaaagagcttctgcacagga aaagaaactaccatcagagtgaacaggcaacctacagagtgggataaaatttttgctatc tactcatctgacaaagggctaatatccagaatctacaaagaactcaaacaaatttacaag aaaaaatcaaacgaccccatcaaaaactag >gi568815592f:126240274_126446322|GENSCAN_predicted_peptide_2|72_aa MEQLTQLPDCQEKEIPELEIDVDELLDMESEDAQAARVEELLVDCYKPTEAFIPDLLDKI RGMQKLSTPQKK >gi568815592f:126240274_126446322|GENSCAN_predicted_CDS_2|219_bp atggagcagctgacacaactccccgactgccaggaaaaggagatcccagaactggagatt gacgtggatgaactcctggacatggagagtgaggatgcccaggctgccagggtcgaggag ctgctggttgactgttacaaacccaccgaggccttcatccctgacctgctggacaagatc cggggcatgcagaagctgagcacaccccagaagaagtga >gi568815592f:126240274_126446322|GENSCAN_predicted_peptide_3|122_aa MGKDFMTKTPKAMATKAKIDKWDLIKLKSFCTAKETIRVNRQPTECETIFAIYPSDKGLI SRIYKELKQIYKEKKQPHPKAVASPPLHYTHTESILGLTAGMVEEPSLHTYCCKTSGSTT PA >gi568815592f:126240274_126446322|GENSCAN_predicted_CDS_3|369_bp atgggcaaagacttcatgactaaaacaccaaaagcaatggcaacaaaagccaaaattgac aagtgggatctaattaaactaaagagcttctgcacagcaaaagaaactatcagagtgaac aggcaacctacagaatgtgagacaatttttgcaatctatccatctgacaaagggctaata tccagaatctacaaggaacttaaacaaatttacaaggaaaaaaaacaaccccatccaaaa gctgttgcaagccctccgctccactacactcacactgaatccattcttggcctgacagct ggtatggtggaagagccatccttgcacacatactgctgcaaaacttcaggttccactaca ccagcatga >gi568815592f:126240274_126446322|GENSCAN_predicted_peptide_4|218_aa MEDEMNEMKREGKFKEKRIKRNEQSLQEIWDHVKRPNLRLIAVSESDRENGTKLENTLQD IIQENFLNLARQAKIQIQEIQRTPQRYSSRRATPRHIIVRFTKVEMKENMLRAAREKGQV THKGKPTRLTADLSAETLQARREWGPIFNILKEKNFQPRISYPAKLSFISEGEIKYFTDK QMLRDFVTTRPALKELLKEALNMERNNRYQPLQNHAKM >gi568815592f:126240274_126446322|GENSCAN_predicted_CDS_4|657_bp atggaagatgaaatgaatgaaatgaagcgagaagggaagtttaaagaaaaaagaataaaa agaaatgaacaaagcctccaagaaatatgggaccatgtgaaaagaccaaatctacgtctg attgctgtatctgaaagtgacagggagaatggaaccaagttggaaaacactctgcaggat attatccaggagaacttcctcaatctagcaaggcaggccaaaattcagattcaggaaata cagagaacgccacaaagatactcctcgagaagagcaactccaagacacataattgtcaga ttcaccaaagttgaaatgaaggaaaatatgttaagggcagccagagagaaaggtcaggtt acccacaaagggaagcccaccagactaacagctgatctctcggcagaaactctgcaagcc agaagagagtgggggccaatattcaacattcttaaagaaaagaattttcaacccagaatt tcatatccagccaaactaagcttcataagtgaaggagaaataaaatactttacagacaag caaatgctgagagattttgtcaccaccaggcctgccctaaaagagctcctgaaggaagca ctaaacatggaaaggaacaaccggtaccagccgctgcaaaatcatgccaaaatgtaa >gi568815592f:126240274_126446322|GENSCAN_predicted_peptide_5|594_aa MDKFLDTYTLPRLNQEEVESLNRPITGSEIVAIINSLPTKKSPGPDGFIAEFYQRYKEEL VPFLVKLFQSIEKEGILPNSFYEASIILIPKLGRDRTKKENFGPISLMNIDAKFLNKILA NQIQQHIKKLIHHDQVGFIPGMQGWYNIRKSINVIQHINRTKDKNHISIDAEKAFDKIQQ LFMLKTLNKLGIDGTYLKIIRAIYDKPTANIILNGQKLEAFPLKTGTRQGCPLSPLLFDI VLEVLARAIRQEKEIKGYKINVQKSQAFLYTNNRQTESQIMSELPFTIASKRIKYLGIHL TRDVKDLFKENYKPLLNEIKEDTNKWKNIPCSWVERINIVKMAILPKVICRFNAIPIKLP MTFFTELEKTTLKFIWNQKRARIAKSILSQKNKAGGITLPDFKLYYKATVTKTAWYWYQN RDIDQWNRTEPSEITPHIYNYLIFDKPEKNKQWGKDSLFNKWCWENWLGIYRKLKLDPCL TPYTKTNSRWIKDLNIRPKTIKTLEENLGITIQDIGTGKGFMSKTPKAMATKAKIDKWDL IKLKSFCTAKETTIRVNRQPTKWEKIFVTYSSDKGLISRIYNELKQIYKKKTTP >gi568815592f:126240274_126446322|GENSCAN_predicted_CDS_5|1785_bp atggataaattccttgacacatacaccctcccaagattaaaccaggaagaagttgaatct ctgaatagaccaataacaggctctgaaattgtggcaataatcaatagcttaccaaccaaa aagagtccaggaccagatggattcatagccgaattctaccagaggtacaaggaggaactg gtaccattccttgtgaaactattccaatcaatagaaaaagagggaatcctccctaactca ttttatgaggccagcatcatcctgataccaaagctgggcagagacagaaccaaaaaagag aattttggaccaatatccttgatgaacattgatgcaaaattcctcaataaaatactggca aaccaaatccagcagcacatcaaaaagcttatccaccatgatcaagtgggcttcatccct ggaatgcaaggctggtacaatatacgcaaatcaataaatgtaatccagcatataaacaga accaaagacaaaaaccacatctcaatagatgcagaaaaggcctttgacaaaattcaacaa ctcttcatgctaaaaactctcaataaattaggtattgatgggacgtatctcaaaataata agagctatctatgacaaacccacagccaatatcatactgaatgggcaaaagctggaagca ttccctttgaaaactggcacaagacagggatgccctctctcaccactcctattcgacata gtgttggaagttctggccagggcaattaggcaggagaaggaaataaagggatacaaaatc aatgtacaaaaatcacaagcattcttatacaccaataacagacaaacagagagccaaatc atgagtgaactcccattcacaattgcttcaaagagaataaaatacctaggaatccacctt acaagggacgtgaaggacctcttcaaggagaactacaaaccactgctcaatgaaataaaa gaggacacaaacaaatggaagaacattccatgttcatgggtagaaagaatcaatatcgtg aaaatggccatactgcccaaggtaatttgtagattcaatgccatccccatcaagctacca atgactttcttcacagaattggaaaaaactactttgaagttcatatggaaccaaaaaaga gcccgcatcgccaagtcaatcctaagccaaaagaacaaagctggaggcatcacgctacct gacttcaaactatactacaaggctacagtaaccaaaacagcatggtactggtaccaaaac agagatatagatcaatggaaccgaacagagccctcagaaataacgccgcatatctacaac tatctgatctttgacaaacctgagaaaaacaagcaatggggaaaggattccctatttaat aaatggtgctgggaaaactggctaggcatatatagaaagctgaagctggatccctgcctt acaccttatacaaaaactaattcaagatggattaaagacttaaacattagacctaaaacc ataaaaaccctagaagaaaatctaggcattaccattcaggacataggcacgggcaagggc ttcatgtctaaaacaccaaaagcaatggcaacaaaagccaaaattgacaaatgggatcta attaaactaaagagcttctgcacagcaaaagaaactaccatcagagtgaacaggcaacct acaaaatgggagaaaattttcgtaacctactcatctgacaaagggctaatatccagaatc tacaatgaactcaaacaaatttacaagaaaaaaacaaccccatga >gi568815592f:126240274_126446322|GENSCAN_predicted_peptide_6|88_aa MALSTIVSQRKQIKRKAPRGFLKRVFKRKKPQLRLEKSGDLLVHLNCLLFVHRLAEESRT NACASKCRVINKEHVLAAAKVILKKSRG >gi568815592f:126240274_126446322|GENSCAN_predicted_CDS_6|267_bp atggcgctgtcgaccatagtctcccagaggaagcagataaagcggaaggctccccgtggc tttctaaagcgagtcttcaagcgaaagaagcctcaacttcgtctggagaaaagtggtgac ttattggtccatctgaactgtttactgtttgttcatcgattagcagaagagtccaggaca aacgcttgtgcgagtaaatgtagagtcattaacaaggagcatgtactggccgcagcaaag gtaattctaaagaagagcagaggttag >gi568815592f:126240274_126446322|GENSCAN_predicted_peptide_7|112_aa MNIFTHITRKPRGDKKFLEIYNACRLNQEEIETLNGPITGMVHNQIEMTKMTHTEFRIWM AKKLIEIYQKVETQSKESSKMIQELKDEITIIRKKQTDLLELKIHYGNFIIQ >gi568815592f:126240274_126446322|GENSCAN_predicted_CDS_7|339_bp atgaacatctttacgcacataactagaaaacctagaggagataagaaattcttggaaata tacaacgcttgtagattaaaccaggaagaaatagaaacgctgaacggaccaataacagga atggttcataaccagattgaaatgaccaaaatgacacacacagaattcagaatctggatg gcaaagaagctcattgagatttatcagaaagttgagacccaatccaaggaatccagtaaa atgatccaagagctgaaagatgaaataaccattataagaaagaaacaaactgaccttctg gaactaaaaattcactatggaaatttcataatacaatag >gi568815592f:126240274_126446322|GENSCAN_predicted_peptide_8|143_aa MDLIDIYRATEYRFFSGAHETYSEIDHTIRHKTILSKCRRTEVIAITLLDHSAIKIEFKT KKITQNHVITSKLNNLLLSDFQVNNEIKAEIKKFVETNESKDTAYQNFWDTAKAVLRGKF VALLKNLYGTKKESNSEGHHKQR >gi568815592f:126240274_126446322|GENSCAN_predicted_CDS_8|432_bp atggacctgatagacatctacagagcaacagaatatagattcttctctggtgcacatgaa acatactctgaaattgatcacacaatcagacataaaacaatcctcagcaaatgcagaaga actgaggtcatagcaatcactctcttggaccacagtgcaataaaaatagaattcaagacg aagaaaatcactcaaaaccatgtaattacatcaaaattgaacaacctgctcctgagtgac tttcaagttaataatgaaattaaggcagaaatcaagaagttcgttgaaactaatgagagc aaagatacagcataccagaatttctgggacacagccaaggcagtgttaagagggaaattt gtagcacttttaaaaaatttatatggaaccaaaaaagagtctaatagcgaaggccatcat aagcaaagatag >gi568815592f:126240274_126446322|GENSCAN_predicted_peptide_9|269_aa MRFKEIRHLCNIKVQDETVSADIEAKVQDETVSADIEAKIQDETRLVEWFDQYADNDTDN EIQAEVVSDGNEKLVGNWNKGDDSSYVLAKRLAAFCLCSRDLWNVEIERDDLGYLVEEIS KQQSIREPRDLVPCVPAAPSMAERGQCRGQAMASEGASLKPWQIPCGVEPASAQKSRIGI WEPSPRFQRMYGNAWMSRQKFAAGSGVSWRTSARAVQKGNMGLEPTHRVLTGALPSGTVR RRPPSSRPQNCRSNDSLHCAPGKATDTQC >gi568815592f:126240274_126446322|GENSCAN_predicted_CDS_9|810_bp atgaggtttaaggaaataagacatctctgtaacataaaagtacaagatgaaacagtaagt gctgatatagaagctaaagtacaagatgaaacagtaagtgctgatatagaagctaaaata caagatgaaacaagacttgttgaatggtttgaccaatatgctgataatgatacggacaat gaaatccaggctgaggtggtctcagatggaaatgagaaacttgttgggaactggaacaaa ggtgatgactcttcttatgttttagcaaagagactggcagcattttgcctgtgctctagg gatttatggaacgttgaaattgagagagatgatttagggtatctggtggaagaaatttct aagcagcaaagcattcgagagcctagggacttggtgccctgtgtcccagctgctccatcc atggctgaaaggggccaatgtagaggtcaggccatggcttcagagggtgcaagcctcaag ccttggcagattccatgtggtgttgagcctgccagtgcacagaagtcaagaattgggatt tgggaaccttcacctagatttcaaaggatgtatggaaatgcctggatgtccaggcagaag tttgctgcaggatcaggggtctcatggagaacctctgctagggcagtgcagaagggaaat atggggttggagcccacacacagagtccttactggggcactacctagtggaactgtgaga agaagaccaccatcttccagaccccaaaattgtagatccaatgacagcttgcactgtgca cctggaaaagccacagacactcaatgctag >gi568815592f:126240274_126446322|GENSCAN_predicted_peptide_10|104_aa MGEAGNHHSQQTIARTKSRTPHVLTHRYLKTEEWRICSAGNKRILLEIRGRRKFTFGFQY NSRVMVVLSNTFLNGSLLDTNMKLAKLSRPNPSFGICGNQEISS >gi568815592f:126240274_126446322|GENSCAN_predicted_CDS_10|315_bp atgggtgaagctggaaaccatcattctcagcaaactatcgcaaggacaaaaagccgaaca ccgcatgttctcactcataggtatctgaaaactgaagagtggagaatatgttcagcaggg aacaagaggattcttttagaaataagaggtagaaggaaattcacatttggatttcagtac aactcaagagttatggtagtgctatccaacacttttttgaatggtagtttgcttgatacc aacatgaaacttgccaaactcagcaggccaaatcctagttttggaatatgtgggaaccaa gaaatcagcagttaa