GENSCAN 1.0    Date run: 4-Nov-116    Time: 16:05:10

Sequence gi568815580f:70224669_70426273 : 201605 bp : 40.86% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +    254    402  149  2  2   17   25   164 0.086   1.96
 1.02 Term +  13170  13951  782  1  2   38   43   501 0.256  33.13
 1.03 PlyA +  13971  13976    6                               1.05

 2.00 Prom +  14506  14545   40                              -5.45
 2.01 Init +  16849  17089  241  0  1   73   86   119 0.927   8.28
 2.02 Term +  20836  21137  302  2  2   59   38   179 0.957   4.40
 2.03 PlyA +  21975  21980    6                               1.05

 3.00 Prom +  32193  32232   40                              -5.05
 3.01 Init +  36886  37161  276  0  0   91   63    95 0.053   4.33
 3.02 Intr +  38443  38652  210  0  0    2   85   213 0.137  10.59
 3.03 Intr +  38814  38971  158  2  2   39   49   130 0.869   2.09
 3.04 Intr +  40422  40494   73  0  1   93   99    49 0.821   4.99
 3.05 Term +  52867  52986  120  0  0   -9   43   137 0.011  -3.01
 3.06 PlyA +  53420  53425    6                               1.05

 4.05 PlyA -  54753  54748    6                               1.05
 4.04 Term -  66052  65944  109  0  1   35   50   149 0.185   2.80
 4.03 Intr -  71847  71762   86  0  2   61   43   111 0.218   1.60
 4.02 Intr -  72626  72448  179  0  2   37   60    77 0.591  -1.48
 4.01 Init -  74964  74793  172  0  1   60   84   102 0.794   6.65
 4.00 Prom -  77431  77392   40                              -4.65

 5.00 Prom +  78083  78122   40                              -3.35
 5.01 Sngl + 100001 101608 1608  1  0   76   48  1310 0.967 120.88
 5.02 PlyA + 102047 102052    6                               1.05

 6.04 PlyA - 102131 102126    6                               1.05
 6.03 Term - 121045 120378  668  2  2    6   42   270 0.008   7.20
 6.02 Intr - 123578 123416  163  2  1   78   71   103 0.055   6.23
 6.01 Init - 142170 142114   57  0  0   79  103    12 0.240   3.16
 6.00 Prom - 150233 150194   40                              -4.55

 7.04 PlyA - 150316 150311    6                               1.05
 7.03 Term - 158624 158449  176  0  2   85   43   122 0.219   4.34
 7.02 Intr - 160967 160758  210  0  0   50    8   139 0.015   0.06
 7.01 Init - 168087 167925  163  0  1   90   81   142 0.338  13.64
 7.00 Prom - 170208 170169   40                              -5.45

 8.00 Prom + 170579 170618   40                              -5.45
 8.01 Init + 174295 174355   61  0  1   81   60    80 0.701   5.96
 8.02 Intr + 175351 175476  126  2  0   14   91   166 0.568   9.23
 8.03 Term + 187437 187639  203  1  2   32   48   143 0.381   1.27
 8.04 PlyA + 188703 188708    6                               1.05

 9.00 Prom + 189388 189427   40                              -5.05
 9.01 Init + 194210 194361  152  1  2   79   66   113 0.071   7.76
 9.02 Term + 198968 199187  220  2  1   25   45   148 0.037  -0.27
 9.03 PlyA + 200187 200192    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init +  38473  38652  180  0  0   40   85   204 0.836  14.53
S.002 Sngl - 121031 120378  654  2  0   70   42   246 0.940  14.14

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815580f:70224669_70426273|GENSCAN_predicted_peptide_1|310_aa
XTGPTAGRPEVWVVDTRPQHHQAQYKGWFEAEEQWLSNWQNNPDKCSDTFSREQNWMENE
FDKVTEVGFRRWVITNFSKLKEHVLTQFKEAKNFDKRLREMLTRITSLKKNINDLMGLKN
TARELCEAYTSINSQIDQAEERISEIENQLNEIKCEDKIREKRMKRNEQSLQEIWDYVKR
PNLCLIGVPESHGENGIKLENTLQDITQENFPNLARQASIPIQEIQRTPQRYSLRRTTPR
HTIIRFTKVKMKEKLLRAAREKGWVTHKGKPIRLTVDSLQKPYKPEENGDQHPTLLKKRI
FNSEFHIQPN

>gi568815580f:70224669_70426273|GENSCAN_predicted_CDS_1|933_bp
nngacaggaccaacagccggcagaccagaagtgtgggttgtagacactcggccccaacat
caccaagcccagtacaaagggtggtttgaagcggaggagcagtggcttagtaactggcag
aacaaccctgacaaatgcagtgacaccttttcaagggaacaaaactggatggagaatgag
tttgacaaagtgacagaagtaggcttcagaaggtgggtaataacaaacttctccaagcta
aaggagcatgttctaacccaattcaaggaagctaagaactttgataaaaggttacgggaa
atgctaactagaataactagtttaaagaagaacataaatgacctgatggggctgaaaaac
acagcacgagaactttgcgaagcatatacaagtatcaatagccaaattgatcaagcggaa
gaaaggatatcagagattgaaaatcaacttaatgaaataaagtgtgaagacaagattaga
gaaaaaagaatgaaaaggaacgaacaaagcctccaagaaatatgggactatgtgaagaga
ccaaacctatgtttgattggtgtacctgaaagtcatggggagaatggaatcaagttggaa
aacactcttcaggatattacccaggagaacttccccaacctagcaagacaggccagcatt
ccaattcaggaaatacagagaacaccacaaagatactccttgagaagaacaaccccaaga
cacacaatcatcagatttaccaaggttaaaatgaaggaaaaattgttaagggcagccaga
gagaaaggttgggttacccacaaagggaagcccatcagactaacagtggattctctgcag
aaaccctacaagccagaagagaatggggaccaacatccaacattgttaaagaaaagaatt
ttcaactcagaatttcatatccagccaaactaa

>gi568815580f:70224669_70426273|GENSCAN_predicted_peptide_2|180_aa
MDKDFITKTPKAMATKAKNEKWDLTKLKSFCTAKETIIRVNRQPTEWEKMFAIYPSDKGL
ISRIYKELKQIYKKKTTPSKECTPSDRDRQQRNSRQKKVSPWQSPILKPKSLRPQPKVKT
YIPVSAQMLPFPKLPVAHPASHPTLNRQREEKQLDIGLGPFGLVAVILLLFWKHPFLSSV

>gi568815580f:70224669_70426273|GENSCAN_predicted_CDS_2|543_bp
atggacaaagacttcattactaaaacaccaaaagcaatggcaacaaaagccaaaaatgaa
aaatgggatctaactaaactaaagagcttctgcacagcaaaagaaactatcatcagggtg
aacaggcaacctacagaatgggagaaaatgtttgcaatctatccttctgacaaagggcta
atatccagaatctacaaggaacttaaacaaatttacaagaaaaaaacaaccccatcaaaa
gagtgtactccttcggatagggacaggcagcagagaaattctaggcagaaaaaggtgagt
ccctggcaaagtcccatactcaagccaaaaagcctaagaccacagcccaaggtaaaaact
tacatccctgtttctgctcaaatgttgccttttcctaaactacccgtggcccaccccgcc
tcccaccccacactcaaccgacagagagaggagaagcagctggacatcggactgggcccc
tttggattggttgctgttattctcctcctgttttggaaacacccatttctaagttcagtt
tga

>gi568815580f:70224669_70426273|GENSCAN_predicted_peptide_3|278_aa
MEGDRGRSQQSLKRRKAASVQPAPSKEGAKGIKHPKFFLLWIFHHILFTIIAKPTWTPAG
KEAPGNAIHRQQPPRPQSRVEMSEGWIGVRNRISSILILPRDMWNFELERDYLRYLLEEI
SKRQSIQEVRVHKSLENLQPDDSVEKEKPFSGEKFKPVAEICPWDIVPCIPAVSAPAMAK
RGQSVAQAMASEGASPKSWRLTCGVGPVGNCDSPGIQAQLTSININTDLKTDRTDSLSLQ
RIPPPSLLFRDHLDLINRMSGEPQGEEDEKALGLQESH

>gi568815580f:70224669_70426273|GENSCAN_predicted_CDS_3|837_bp
atggagggtgacagagggagatctcaacaaagcctaaagcggaggaaggcagcctctgtc
caacctgcacctagcaaagagggggcaaagggaatcaaacaccccaaattctttctgctc
tggatcttccatcacatactatttaccatcattgctaaacccacatggacgccagcaggc
aaggaagccccaggcaatgccatccatagacagcagcctcctagaccacagagtagagtg
gagatgagtgaagggtggattggagtgagaaacagaattagcagcattttgatcctgccc
agagatatgtggaactttgaacttgagagagattatttgcggtatctgttagaagaaatt
tctaagcggcaaagcattcaagaggtgagagtgcataaaagtttggaaaatttgcagcct
gatgattcagtagaaaaggaaaaaccattttctggggagaaattcaagccagtggcagaa
atttgcccttgggacatagtgccctgcatcccagctgtttcagctccagccatggctaaa
agaggccaaagtgtagctcaggccatggcttcagagggtgcaagccccaagtcttggcgg
cttacatgtggtgttgggcctgtgggcaactgtgacagcccaggaattcaggcccagctt
accagcattaacatcaacacagaccttaagactgatagaacagactctttaagtctgcaa
agaatcccacctccaagcctgctctttagggaccacttagacctgattaataggatgtca
ggggaaccacaaggagaggaagatgagaaggccctgggtctgcaggaatcacattag

>gi568815580f:70224669_70426273|GENSCAN_predicted_peptide_4|181_aa
MASGWGWSPRDPVPRVWNFQHRPPVPILKRNEEKRKGLEIEFYRNSNYKIRGPCTHKGTK
KITKRITADDNSVKWLKQTGVNQNLRLLERKTTKNQPEKRFHKSELRGGCYLFKKEGGLG
LVLAYGDQRATCRYTHIRHGDNRQELHKRKALAKLQIEVLLGILIKSTVHEPGSSKAPRQ
A

>gi568815580f:70224669_70426273|GENSCAN_predicted_CDS_4|546_bp
atggcctctggatggggctggtcacccagagacccagtgcctagagtgtggaactttcag
caccgaccccccgtccctatcctgaagagaaatgaagagaaaaggaaaggattagagatt
gagttctacagaaactctaactacaagattcggggaccttgcacacacaagggaacgaag
aaaatcacaaagcgaattacggcagatgacaacagtgttaagtggctaaaacaaactgga
gtaaaccaaaatctaagactgctggaaaggaagacgacgaaaaaccagcctgagaaaagg
tttcacaaatctgaactgagaggcggatgctatttgtttaaaaaagaggggggactaggc
ctggttctggcctatggtgaccagcgtgcgacctgcagatacacccacatccgccatggg
gacaacagacaggaactgcacaagcggaaagccctggccaaactccagatagaagttctt
ctaggaatccttatcaaatccactgtccacgaaccaggttcctccaaggcaccaaggcag
gcttag

>gi568815580f:70224669_70426273|GENSCAN_predicted_peptide_5|535_aa
MKKISLKTLRKSFNLNKSKEETDFMVVQQPSLASDFGKDDSLFGSCYGKDMASCDINGED
EKGGKNRSKSESLMGTLKRRLSAKQKSKGKAGTPSGSSADEDTFSSSSAPIVFKDVRAQR
PIRSTSLRSHHYSPAPWPLRPTNSEETCIKMEVRVKALVHSSSPSPALNGVRKDFHDLQS
ETTCQEQANSLKSSASHNGDLHLHLDEHVPVVIGLMPQDYIQYTVPLDEGMYPLEGSRSY
CLDSSSPMEVSAVPPQVGGRAFPEDESQVDQDLVVAPEIFVDQSVNGLLIGTTGVMLQSP
RAGHDDVPPLSPLLPPMQNNQIQRNFSGLTGTEAHVAESMRCHLNFDPNSAPGVARVYDS
VQSSGPMVVTSLTEELKKLAKQGWYWGPITRWEAEGKLANVPDGSFLVRDSSDDRYLLSL
SFRSHGKTLHTRIEHSNGRFSFYEQPDVEGHTSIVDLIEHSIRDSENGAFCYSRSRLPGS
ATYPVRLTNPVSRFMQVRSLQYLCRFVIRQYTRIDLIQKLPLPNKMKDYLQEKHY

>gi568815580f:70224669_70426273|GENSCAN_predicted_CDS_5|1608_bp
atgaagaaaattagtcttaaaaccttacggaaatcttttaacttgaataaaagtaaagaa
gaaactgatttcatggtagtacaacaaccatcgctagccagtgactttggaaaagatgat
tccttatttggtagctgctatggtaaagatatggccagctgcgatatcaacggtgaagat
gaaaaaggcggaaaaaacagatcaaaaagcgagagcctgatgggtacgctaaaaaggcgg
ctttctgcaaaacagaagtcaaaaggcaaggcgggcacaccctctgggagctctgccgac
gaggacaccttctcctcctcctcagcacccatagtctttaaagacgtgagagctcagagg
ccgataaggtccacgtcgctccgcagccatcactacagtcccgcgccgtggcctctgcgg
cccacaaactccgaggagacctgcatcaagatggaggtgagagtcaaggccttggttcac
tcttccagcccgagtccagccctgaatggcgtccggaaggatttccacgacctccagtct
gagaccacgtgccaggagcaagccaattcactgaagagctcggcttctcataatggagac
ctgcatcttcacctggatgaacatgtgcctgtcgttattggacttatgcctcaggactac
attcagtatactgtgcctttagatgaggggatgtatcctttggaaggatcacggagctat
tgtctggacagctcttctcccatggaagtctctgcggttcctcctcaagtgggagggcgc
gctttccccgaggatgagagtcaggtagaccaggacctagttgtcgccccagagatcttc
gtggatcagtccgtgaatggcttgttgattggcaccacgggagtcatgttgcagagcccg
agagcgggtcacgatgatgtccctccactctcaccattgctacctccaatgcagaataat
caaatccaaaggaacttcagtggactcactggcacagaagcccacgtggctgaaagtatg
cgctgtcatttgaattttgatccgaactctgctcctggggttgcaagagtttatgactca
gtgcaaagtagtggtcccatggttgtgacaagccttacagaggagctgaaaaaacttgca
aagcaaggatggtactggggaccaatcacacgttgggaggcagaagggaagctagcaaac
gtgccagatggttcttttcttgttcgggacagttctgacgaccgttaccttttaagcttg
agctttcgctcccatggtaaaacacttcacactagaattgagcactcaaatggtaggttt
agcttttatgaacagccagatgtggaaggacatacgtccatagttgatctaattgagcat
tcaatcagggactctgaaaatggagctttttgttattcaaggtctcggctgcctggatct
gcaacttaccccgtcagactgaccaacccagtgtcccggttcatgcaggtgcgctcgttg
cagtacctgtgtcgttttgttatacgtcagtataccagaatagacttaattcagaaactg
cctttgccaaacaaaatgaaggattatttacaggagaagcactactga

>gi568815580f:70224669_70426273|GENSCAN_predicted_peptide_6|295_aa
MAAGMYLHLCVNFESRILHPLLLIPRQTGSGVDLQQTPTDLQLKVLTVRRKTNKQKGYPH
QNAICTSPSSKIKENLEEMDKFLDTYTLPRLNQEEAESLNRPITGSEIVAIINSLPTKKS
PGPDRFTAEFYQRYREELVQFFLKLLQSTEKKGILPNSFYEASIILIPKPGRDTTKKENL
RPISLMNIDAKILNKILANQIQQHIKKLIHHDQVGFIPGMQGWFNMRKSINVIQHINRTK
DKNHMIISIDAEKAFDKIQQSFMLKTLNKLGIDGTYLTIIRAIYDKPTANIILNG

>gi568815580f:70224669_70426273|GENSCAN_predicted_CDS_6|888_bp
atggcagctggaatgtacttacatctttgtgtgaactttgaaagcagaatattacatcct
ctgctgttgatacccaggcaaacagggtctggagtggacctccagcaaactccaacagac
ctgcagctgaaggtcctgactgttagaaggaaaactaacaaacagaaaggatatccacac
caaaacgccatctgtacgtcaccatcatcaaagatcaaagaaaatctagaagaaatggat
aaattcctggacacatacaccctcccaagactaaaccaggaagaagctgaatccctgaat
agaccaataacaggctctgaaattgtggcaataattaatagcctaccaaccaaaaaaagt
ccaggaccagacagattcacagccgaattctaccagaggtacagggaggagctggtacaa
ttctttctgaaattactccaatcaacagaaaaaaagggaatcctccctaactcattttat
gaggccagcatcatcctgataccaaagccgggcagagatacaacaaaaaaagagaatctt
agaccaatatccctgatgaacatcgatgcaaaaatcctcaataaaatactggcaaaccaa
atccagcagcacatcaaaaagcttatccaccatgatcaagtgggcttcatccctgggatg
caaggctggttcaacatgcgaaaatcaataaacgtaatccagcatataaatagaaccaaa
gacaaaaaccacatgattatctcaatagatgcagaaaaggcctttgacaaaattcaacag
tccttcatgctaaaaactctcaataaattaggtattgatgggacgtatctcacaataata
agagctatttatgacaaacccacagccaatatcatactgaatgggtaa

>gi568815580f:70224669_70426273|GENSCAN_predicted_peptide_7|182_aa
MEAEISVMKLQARNTRNHQDSPEATEGKESFFPGGYSGSMPPPTPSFHISGLHNADLDSP
GLMLGGNRITVPESLAMDRKKGPDHIGVLVQKKNFLEGKEEEKKGNGFSTVERNSLKFKD
YFSFSGQCCQRPGRAVQAAVTFPTCGKPRQSMSSAVSLPCSEATRGGRRFVPAASPGNKQ
QA

>gi568815580f:70224669_70426273|GENSCAN_predicted_CDS_7|549_bp
atggaggcagagatcagtgtgatgaagctacaggccaggaacaccaggaatcaccaggat
tcaccagaagctacagaaggcaaagaaagcttcttccctggaggctatagtgggagcatg
cccccaccaacgccttcatttcacatttctggactccataatgctgacttagactcacca
ggacttatgcttgggggaaataggatcactgtccctgagagcctggccatggataggaaa
aaaggccctgatcacattggggttctagtacaaaagaagaacttccttgaaggcaaggaa
gaagaaaagaagggcaatggcttctccacagtcgaaaggaattcccttaagtttaaggat
tacttctcattcagtggtcagtgttgccaaagaccaggacgagcagtccaggcagctgtc
acatttccaacatgtggaaagcctcgccaaagcatgagctctgctgtctcccttccatgc
agcgaggccacacgaggcggacgcagatttgttccagcagcttcaccagggaataaacag
caggcatag

>gi568815580f:70224669_70426273|GENSCAN_predicted_peptide_8|129_aa
MRRHNEKAAICKPEGEPSLGEEGNLNTDDTERRMPQDEAGRDWTDASTGQGYRQPREARR
EAEGDTECHARFKTPENVQAVINAHAEIKQKHCLKLKVLSGGQNRHHREISADRQARLDQ
PQEKLAKRH

>gi568815580f:70224669_70426273|GENSCAN_predicted_CDS_8|390_bp
atgagaagacacaatgagaaggctgccatctgcaagccagaaggagagccctcgctggga
gaagagggaaatttgaacacagatgacacagagaggagaatgccacaggatgaagcaggt
agagactggactgacgcgtctacaggccagggctaccggcaaccacgagaagctaggaga
gaagcagaaggagatacagaatgccatgctaggtttaaaactcctgagaatgttcaagca
gtaataaatgcacacgcagaaattaaacagaaacactgcttgaaactcaaggtcctttct
ggtggtcagaacaggcatcaccgggagatttcagctgatagacaggcccgacttgatcag
cctcaggaaaagcttgccaaaaggcactga

>gi568815580f:70224669_70426273|GENSCAN_predicted_peptide_9|123_aa
MPKGGDCEGKPGFCPAAPRYQGWSWAQERQVSIDVSERIFFQQHEGQLMDRIVKPPERQS
VPCQQGPRCQSIKNAENIDNTSFICRSVHLGTVFLGDALKPEDSVLYFGNCCLWVPSGSV
LAD

>gi568815580f:70224669_70426273|GENSCAN_predicted_CDS_9|372_bp
atgcccaaaggaggagactgtgaaggcaagcctggattctgccctgcagcccctagatac
cagggctggtcctgggcacaggagagacaggtgagcattgatgtttcagaaagaatattc
ttccagcagcatgaagggcagcttatggatagaattgtcaagcccccagagaggcagtct
gtcccctgtcagcaaggccccagatgccagagcatcaagaatgcagaaaatatagataat
acaagcttcatttgcaggagtgtgcacctcggaacagttttcctgggagatgctctgaag
ccagaggattctgtcctctattttgggaactgctgcctgtgggtcccctctggcagtgtg
ttagcagattaa