GENSCAN 1.0 Date run: 3-Nov-116 Time: 17:24:53 Sequence gi568815595r:49260212_49475589 : 215378 bp : 48.52% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 1515 1626 112 0 1 57 32 142 0.098 5.88 1.02 Term + 16981 17193 213 0 0 104 55 116 0.159 7.13 1.03 PlyA + 17381 17386 6 1.05 2.14 PlyA - 17937 17932 6 -0.45 2.13 Term - 18240 18082 159 0 0 90 38 115 0.992 4.64 2.12 Intr - 18691 18603 89 2 2 18 89 87 0.993 1.49 2.11 Intr - 20636 20533 104 1 2 74 93 132 0.995 12.12 2.10 Intr - 23925 23776 150 2 0 84 94 174 0.999 16.88 2.09 Intr - 24373 24255 119 1 2 54 80 205 0.999 15.56 2.08 Intr - 24708 24638 71 1 2 85 92 67 0.987 5.70 2.07 Intr - 26114 25887 228 0 0 48 67 219 0.976 13.64 2.06 Intr - 32387 32299 89 1 2 61 113 93 0.953 8.71 2.05 Intr - 37753 37659 95 1 2 70 67 49 0.943 -0.24 2.04 Intr - 38424 38341 84 0 0 125 73 106 0.999 12.82 2.03 Intr - 42331 42173 159 1 0 68 68 236 0.989 19.78 2.02 Intr - 45634 45504 131 2 2 41 56 114 0.084 3.91 2.01 Init - 57236 56894 343 2 1 79 6 689 0.035 57.20 2.00 Prom - 62310 62271 40 -7.96 3.08 PlyA - 63361 63356 6 1.05 3.07 Term - 64461 64326 136 2 1 90 43 55 0.772 -1.31 3.06 Intr - 64552 64491 62 1 2 106 106 -1 0.872 1.03 3.05 Intr - 64828 64683 146 2 2 65 91 226 0.996 20.60 3.04 Intr - 65634 65508 127 0 1 92 77 105 0.996 10.05 3.03 Intr - 67605 67475 131 1 2 113 87 99 0.999 12.71 3.02 Intr - 75385 75258 128 0 2 103 80 77 0.951 8.72 3.01 Init - 79813 79713 101 1 2 89 77 84 0.990 7.13 3.00 Prom - 89990 89951 40 -6.76 4.04 PlyA - 90521 90516 6 1.05 4.03 Term - 97536 97177 360 0 0 95 49 622 0.999 53.54 4.02 Intr - 97893 97816 78 0 0 16 85 130 0.974 5.25 4.01 Init - 98067 97954 114 0 0 66 74 244 0.946 19.01 4.00 Prom - 99337 99298 40 -7.16 5.05 PlyA - 99421 99416 6 1.05 5.04 Term - 102415 102279 137 2 2 85 39 167 0.999 9.68 5.03 Intr - 108337 108217 121 1 1 109 95 83 0.999 11.17 5.02 Intr - 115405 115223 183 1 0 109 109 246 0.445 28.78 5.01 Init - 132051 131986 66 0 0 71 75 35 0.160 -0.40 5.00 Prom - 138713 138674 40 -3.76 6.00 Prom + 149196 149235 40 -6.86 6.01 Init + 152216 152429 214 1 1 96 94 318 0.997 30.11 6.02 Intr + 154609 154754 146 2 2 111 7 104 0.530 4.60 6.03 Term + 154828 154842 15 0 0 82 49 11 0.416 -5.16 6.04 PlyA + 155304 155309 6 -0.45 7.14 PlyA - 155319 155314 6 -0.45 7.13 Term - 157507 157329 179 2 2 77 48 194 0.999 12.15 7.12 Intr - 157762 157607 156 2 0 77 74 95 0.932 6.98 7.11 Intr - 158940 158760 181 0 1 65 80 146 0.971 10.94 7.10 Intr - 159182 159049 134 0 2 33 85 187 0.580 13.26 7.09 Intr - 159577 159499 79 1 1 106 100 53 0.993 7.42 7.08 Intr - 160131 160000 132 0 0 48 74 146 0.997 10.04 7.07 Intr - 161361 161281 81 0 0 73 80 57 0.891 3.23 7.06 Intr - 162060 161893 168 0 0 16 99 187 0.687 12.74 7.05 Intr - 162350 162150 201 2 0 70 75 86 0.516 4.98 7.04 Intr - 164842 164738 105 1 0 52 95 74 0.904 4.91 7.03 Intr - 165785 165672 114 2 0 46 115 113 0.940 10.44 7.02 Intr - 166217 166041 177 2 0 56 76 85 0.541 4.22 7.01 Init - 169028 168897 132 2 0 92 83 101 0.967 10.46 7.00 Prom - 183625 183586 40 -4.36 8.03 PlyA - 184447 184442 6 1.05 8.02 Term - 199261 199240 22 0 1 122 55 11 0.092 -1.02 8.01 Init - 210692 210058 635 2 2 78 81 344 0.275 25.52 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 11234 10969 266 0 2 53 49 242 0.957 12.47 S.002 Init - 13892 13838 55 2 1 75 97 12 0.862 2.25 S.003 Intr - 45677 45504 174 2 0 61 56 129 0.880 7.14 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595r:49260212_49475589|GENSCAN_predicted_peptide_1|108_aa XFGEGDTAQASKHVPGTINIFGSSDVPGTVCGYPGADVRSDADPSNGHRMRVPRAGPKPP RIRLKVRQKIPDPYVRFLKALPTRSGWPRGGSGDGAFEWGPREESWQG >gi568815595r:49260212_49475589|GENSCAN_predicted_CDS_1|327_bp nngtttggtgagggtgacacagcccaagcatccaagcatgtccctggaacaatcaacatc tttggctcctcagatgttcctggaactgtatgtggctacccaggggcagatgtgcgcagt gacgccgacccatccaacggtcatcgcatgcgcgtgccccgcgcaggccccaaaccccca cggattaggttgaaggtcagacaaaaaatcccggacccatacgtccggttccttaaggcc ttgcccacacgcagcggctggccccgcggtgggagtggggacggggctttcgaatggggg ccccgggaagagtcttggcagggatga >gi568815595r:49260212_49475589|GENSCAN_predicted_peptide_2|606_aa MALQNYDNKLVKCIEELCQKQEELCWQIQQEEDKKQRLQNEVRQLTEKLACVNEKLARVN ENLARKIASCSKFYQTIAETEATYLKMLESSQTLLSVLKREAGNLTKATASDQKNEYEAE INRDNPLGMKGEIAEAYAELIKQMWSGRDAHVAPRMFKTQVGRFAPQFSGYQQQDSQELL AFLLDGLHEDLNRVKKKPYLELKDANGRPDAYRVTVPLMGAVSDLCEALSRLSGIAAENM VVADVYNHRFHKIFQMDEGLNHIMPRDDIFVRYVKQPLPDEFGSSPLEPGACNGSRNSCE GEDEEEMEHQEEGKEQLSETEGSGEDEPGNDPSETTQKKIKGQPCPKRLFTFSLVNSYGT ADINSLAADGKLLKLNSRSTLAMDWDSETRRLYYDEQESEAYEKHVSMLQPQKKKKTTVA LRDCIELFTTMETLGEHDPWYCPNCKKHQQATKKFDLWSLPKILVVHLKRFSYNRYWRDK LDTVVEFPIRGLNMSEFVCNLSARPYVYDLIAVSNHYGAMGVGHYTAYAKNKLNGKWYYF DDSNVSLASEDQIVTKAAYVLFYQRRDDEFYKTPSLSSSGSSDGGTRPSSSQQGFGDDEA CSMDTN >gi568815595r:49260212_49475589|GENSCAN_predicted_CDS_2|1821_bp atggcgctgcagaactacgacaacaagctggtcaaatgcatagaggagctatgccagaag caggaggagctgtgctggcagatccagcaggaggaggacaagaaacagcggctgcagaat gaggtgaggcagctgacagagaagctggcctgcgtcaacgagaagctggcccgcgtcaac gagaacctggcacgcaagattgcctcttgcagtaagttctaccagaccatcgcggagacg gaggccacctacctcaagatgctggagagctcccagactttgctcagtgtcctgaagagg gaagctgggaacctgaccaaggctacagcctcagaccagaaaaatgagtatgaagccgaa atcaacagagacaaccctctggggatgaaaggggaaattgcagaagcctatgctgaactc attaagcagatgtggtctggaagggacgcccatgtggcacctcgcatgttcaaaactcaa gtaggacgttttgctcctcaattttctggctaccagcaacaagattctcaggagctgctg gcctttcttctagatggattgcatgaagatctgaaccgggtaaagaaaaagccctacttg gagctgaaggatgccaatgggcggccagatgcgtaccgtgtgactgtgccgctgatgggg gctgtgtccgacctgtgcgaggctctctccaggctgtctggcattgctgcagaaaatatg gtggtcgcagatgtgtataatcaccgattccacaaaattttccaaatggatgaaggttta aaccacatcatgcctcgggatgacattttcgtccgctatgtgaaacagcctttacctgat gagtttggcagctcacccttggagccaggggcctgcaatggctccaggaacagctgtgaa ggagaagatgaggaagaaatggagcatcaggaagaaggcaaagagcagctttcagaaaca gaaggcagtggggaagatgagccaggaaatgaccccagtgagaccacccaaaagaagatc aaaggccagccctgcccaaaaaggctttttaccttcagtcttgtgaactcctatggaaca gctgacataaattcacttgcagctgatggaaaactacttaaactcaactctcgatctaca ctggccatggattgggacagtgaaactcggagactttactatgatgagcaagaatctgag gcctacgagaagcatgtgagcatgttgcagcctcagaagaagaagaagaccacagtggcc ctgagagactgcatcgagctcttcaccaccatggagacccttggggagcatgacccctgg tactgtcccaactgtaagaagcatcaacaggccacaaaaaagtttgacctatggtccttg cccaagatcctggtggtccacctcaaacgtttctcctacaacagatactggagggataag ctcgacacagtcgtagaattcccaatcagagggctgaacatgtccgagtttgtctgtaac ctgtcagcaaggccttatgtgtacgacctcattgccgtgtccaatcattatggagccatg ggggttggccactacactgcatatgcgaagaacaaactgaatggtaaatggtattacttt gatgatagcaacgtgtccctggcctctgaggatcagatagtgactaaagcagcttatgtg ctattttaccaacgtcgagatgatgaattttataagacaccttcacttagcagttctggt tcctctgatggagggacacgaccaagcagctctcagcagggctttggggatgatgaggct tgcagcatggacaccaactaa >gi568815595r:49260212_49475589|GENSCAN_predicted_peptide_3|276_aa MAEGGGCRERPDAETQKSELGPLMRTTLQRGAQWYLIDSRWFKQWKKYVGFDSWDMYNVG EHNLFPGPIDNSGLFSDPESQTLKEHLIDELDYVLVPTEAWNKLLNWYGCVEGQQPIVRK VVEHGLFVKHCKVEVYLLELKLCENSDPTNVLSCHFSKADTIATIEKEMRKLFNIPAERE TRLWNKYMSNTYEQLSKLDNTVQDAGLYQGQVLVIEPQNEDGTWPRQTLQSKLTCAKETV FPKAVLVLRKIVLIQMNNAYFVYPEFMLLDEDRGFW >gi568815595r:49260212_49475589|GENSCAN_predicted_CDS_3|831_bp atggcggaaggtggaggctgccgtgagcgaccggatgcggagactcagaagtccgagctt ggacccttaatgaggaccacactccaacgcggggcgcagtggtatcttattgacagccgg tggttcaagcagtggaagaagtatgtgggctttgacagctgggacatgtacaatgtgggt gaacataacctatttcctggcccaatagacaactctgggctattttcagatcctgagagt cagaccttgaaagaacacttaattgatgaattggactatgtattggtccctaccgaggcg tggaataaactactaaactggtacggctgtgtagaaggccagcaacccatcgtcagaaaa gttgtggagcatggcctgtttgtcaagcactgcaaagtcgaggtgtatttgctggaactg aagctctgtgagaacagtgaccccaccaatgtgctgagttgccatttcagcaaggcagac accattgcaaccatcgagaaagagatgcggaagctattcaacatccctgcggagcgtgaa acacggctctggaacaaatacatgagcaacacctacgagcagttgagcaagctagacaac actgtccaggatgctgggctataccagggtcaggtgctagtaattgagcctcaaaatgaa gatggcacatggcccaggcagaccttgcagtcaaagctcacatgtgcaaaagagacagtc tttcctaaggcagtacttgtactaagaaaaattgttctgattcagatgaacaatgcctat tttgtatatccagagttcatgcttctggatgaagaccgtggcttttggtag >gi568815595r:49260212_49475589|GENSCAN_predicted_peptide_4|183_aa MCAARLAAAAAAAQSVYAFSARPLAGGEPVSLGSLRGKMNELQRRLGPRGLVVLGFPCNQ FGHQENAKNEEILNSLKYVRPGGGFEPNFMLFEKCEVNGAGAHPLFAFLREALPAPSDDA TALMTDPKLITWSPVCRNDVAWNFEKFLVGPDGVPLRRYSRRFQTIDIEPDIEALLSQGP SCA >gi568815595r:49260212_49475589|GENSCAN_predicted_CDS_4|552_bp atgtgtgctgctcggctagcggcggcggcggcggcggcccagtcggtgtatgccttctcg gcgcgcccgctggccggcggggagcctgtgagcctgggctccctgcggggcaagatgaac gagctgcagcggcgcctcggaccccggggcctggtggtgctcggcttcccgtgcaaccag tttgggcatcaggagaacgccaagaacgaagagattctgaattccctcaagtacgtccgg cctggtggtgggttcgagcccaacttcatgctcttcgagaagtgcgaggtgaacggtgcg ggggcgcaccctctcttcgccttcctgcgggaggccctgccagctcccagcgacgacgcc accgcgcttatgaccgaccccaagctcatcacctggtctccggtgtgtcgcaacgatgtt gcctggaactttgagaagttcctggtgggccctgacggtgtgcccctacgcaggtacagc cgccgcttccagaccattgacatcgagcctgacatcgaagccctgctgtctcaagggccc agctgtgcctag >gi568815595r:49260212_49475589|GENSCAN_predicted_peptide_5|168_aa MRSHCIAQAGLELLASDILGLQVISVFCVSAMAAIRKKLVIVGDGACGKTCLLIVFSKDQ FPEVYVPTVFENYVADIEVDGKQVELALWDTAGQEDYDRLRPLSYPDTDVILMCFSIDSP DSLENIPEKWTPEVKHFCPNVPIILVGNKKDLRNDEHTRRELAKMKQA >gi568815595r:49260212_49475589|GENSCAN_predicted_CDS_5|507_bp atgaggtctcactgtattgcccaggctggtctcgagctcctggcctcagacatcctggga ttacaggtaatatctgtgttttgtgtttcagcaatggctgccatccggaagaaactggtg attgttggtgatggagcctgtggaaagacatgcttgctcatagtcttcagcaaggaccag ttcccagaggtgtatgtgcccacagtgtttgagaactatgtggcagatatcgaggtggat ggaaagcaggtagagttggctttgtgggacacagctgggcaggaagattatgatcgcctg aggcccctctcctacccagataccgatgttatactgatgtgtttttccatcgacagccct gatagtttagaaaacatcccagaaaagtggaccccagaagtcaagcatttctgtcccaac gtgcccatcatcctggttgggaataagaaggatcttcggaatgatgagcacacaaggcgg gagctagccaagatgaagcaggcatga >gi568815595r:49260212_49475589|GENSCAN_predicted_peptide_6|124_aa MAESWSGQALQALPATVLGALGSEFLREWEAQDMRVTLFKLLLLWLVLSLLGIQLAWGFY GNTVTGLYHRPGKWQQTNLSKPTENKGRQQRVSKGITGSAGFYTGFCYSPDLRDNCRGFR ALGK >gi568815595r:49260212_49475589|GENSCAN_predicted_CDS_6|375_bp atggcggagtcctggtctgggcaggccttgcaggctctgccggccacggtgctgggcgcg ctgggcagcgagttcttgcgggagtgggaggcgcaggacatgcgcgtgaccctcttcaag ctgctgctgctgtggttggtgttaagtctcctgggcatccagctggcgtgggggttctac gggaatacagtgaccgggttgtatcaccgtccagggaaatggcagcaaacgaacctctca aaacccacagagaataagggaaggcagcagagggtctccaagggcatcactgggtctgct ggcttctacactgggttctgctactccccagacctcagggacaactgccgggggttcagg gcactgggcaagtga >gi568815595r:49260212_49475589|GENSCAN_predicted_peptide_7|612_aa MSRVLVPCHVKGSVALQVGDVRTSQGRPGVLVIDVTFPSVAPFELQEITFKNYYTAFLSI RVRQYTSAHTPAKWVTCLRDYCLMPDPHSEEGAQEYVSLFKHQMLCDMARISELRLILRQ PSPLWLSFTVEELQIYQQGPKGLPDPSRVSSEVQQMWALTEMIRASHTSARIGRFDILCS GERPVRAEAWTTSPGVPGSVVASVRRLHSLQATMQRAVSVVARLGFRLQAFPPALCRPLS CAQEVLRRTPLYDFHLAHGGKMVAFAGWSLPVQYRDSHTDSHLHTRQHCSLFDVSHMLQT KILGSDRVKLMESLVVGDIAELRPNQGTLSLFTNEAGGILDDLIVTNTSEGHLYVVSNAG CWEKDLALMQDKVRELQNQGRDVGLEVLDNALLALQAQVLQAGVADDLRKLPFMTSAVME VFGVSGCRVTRCGYTGEDGVEISVPVAGAVHLATAILKNPEVKLAGLAARDSLRLEAGLC LYGNDIDEHTTPVEGSLSWTLGKRRRAAMDFPGAKVIVPQLKGRVQRRRVGLMCEGAPMR AHSPILNMEGTKIGTVTSGCPSPSLKKNVAMGYVPCEYSRPGTMLLVEVRRKQQMAVVSK MPFVPTNYYTLK >gi568815595r:49260212_49475589|GENSCAN_predicted_CDS_7|1839_bp atgtcccgcgttttggtgccttgccatgtgaaaggctccgtagccctccaggtgggcgac gtgcggacctcccaaggccggcctggcgtgctggtcatcgatgtcaccttccccagcgtc gctcccttcgagttgcaggaaatcacgtttaagaattactacacagcttttttgagcatc cgtgtccgtcagtacacctcagcacacacacctgccaagtgggtgacctgcctgcgggac tactgcctaatgcctgacccacacagtgaggagggagcccaggagtatgtatcgctgttc aagcatcagatgctgtgtgacatggctagaatatcggagctacgcctgattctgcggcag ccatcaccactgtggctgtctttcacagtggaggagctgcagatctatcagcagggacca aagggtctcccagaccccagcagggtatcctccgaggtgcagcagatgtgggcactgaca gagatgatccgggccagtcacacctccgcaaggatcggccgctttgatatcctttgctcc ggagagagacctgtccgagcagaggcctggactacatctcccggcgtgcctggcagtgtg gtggcctctgtgcgccgtctgcactcgttgcaggcgacgatgcagagggctgtaagtgtg gtggcccgtctgggctttcgcctgcaggcattccccccggccttgtgtcgtccacttagt tgcgcacaggaggtgctccgcaggacaccgctctatgacttccacctggcccacggcggg aaaatggtggcgtttgcgggttggagtctgccagtgcagtaccgggacagtcacactgac tcgcacctgcacacacgccagcactgctcgctctttgacgtgtctcatatgctgcagacc aagatacttggtagtgaccgggtgaagctgatggagagtctagtggttggagacattgca gagctaagaccaaaccaggggacactgtcgctgtttaccaacgaggctggaggcatctta gatgacttgattgtaaccaatacttctgagggccacctgtatgtggtgtccaacgctggc tgctgggagaaagatttggccctcatgcaggacaaggtcagggagcttcagaaccagggc agagatgtgggcctggaggtgttggataatgccctgctagctctgcaagcccaggtacta caggccggcgtggcagatgacctgaggaaactgcccttcatgaccagtgctgtgatggag gtgtttggcgtgtctggctgccgcgtgacccgctgtggctacacaggagaggatggtgtg gagatctcggtgccggtagcgggggcagttcacctggcaacagctattctgaaaaaccca gaggtgaagctggcagggctggcagccagggacagcctgcgcctggaggcaggcctctgc ctgtatgggaatgacattgatgaacacactacacctgtggagggcagcctcagttggaca ctggggaagcgccgccgagctgctatggacttccctggagccaaggtcattgttccccag ctgaagggcagggtgcagcggaggcgtgtggggttgatgtgtgagggggcccccatgcgg gcacacagtcccatcctgaacatggagggtaccaagattggtactgtgactagtggctgc ccctccccctctctgaagaagaatgtggcgatgggttatgtgccctgcgagtacagtcgt ccagggacaatgctgctggtagaggtgcggcggaagcagcagatggctgtagtcagcaag atgccctttgtgcccacaaactactataccctcaagtga >gi568815595r:49260212_49475589|GENSCAN_predicted_peptide_8|218_aa MTEPTKALSSSDRAVAAAVPFPASSGARLPTVGIRPNWATTPIGSQGGGPQAAHDRGQGA GSTFPGARAELRPGPSPRRAARALQPNAPSLCLRDPEQRGGSARAAPPPHPQGRPRPCPR PGPNSEQQQVFRGRLSRQQGWGPKQPGTADRAGGALTQRRREDRGRCSLLLAPGRRHRRS PGIRREAAAAATAKPKRRERRAAAFARARTGRCSQTAG >gi568815595r:49260212_49475589|GENSCAN_predicted_CDS_8|657_bp atgacggaacccaccaaggctctgagcagctctgacagagcagtggcggcagcggtgccc ttccctgcctctagcggggctcgactccccacagtcggaatccggcccaactgggccaca acccctattggaagccagggaggcggcccgcaggcggctcacgaccgcggccaaggcgca ggctccacgttcccgggagcgcgggccgagctaaggcctggcccatcgccgcggcgcgcg gcccgggccttgcagccgaacgctccgagcctttgcctgcgggacccagagcaaaggggc ggctccgcccgggcggcaccgcctcctcacccacaaggacgtccgcggccgtgtccccgc cccgggccgaacagcgagcagcagcaggtgttccgcggaaggctcagccggcagcaggga tgggggcccaagcagccaggcacagcggatagggcgggcggtgcactcacccagcgccgg cgggaggaccgcggcaggtgttcgctcctcctggccccaggacgccgccaccgccgctcc ccggggattcggcgcgaagctgccgccgccgccaccgccaagcctaagaggcgcgagcgc cgcgccgcagccttcgcgcgagcacgtaccgggcgctgttcccagacagctggatga