GENSCAN 1.0	Date run: 4-Nov-116	Time: 16:08:40

Sequence gi568815577r:46499366_46702415 : 203050 bp : 44.32% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   5106   5124   19  2  1   78   89    11 0.346   0.61
 1.02 Intr +   9892  10011  120  2  0  128  116   144 0.969  21.67
 1.03 Term +  12052  12260  209  2  2   73   42   101 0.704   1.50
 1.04 PlyA +  14459  14464    6                               1.05

 2.03 PlyA -  14679  14674    6                               1.05
 2.02 Term -  18321  18139  183  0  0   61   42    73 0.272  -2.46
 2.01 Init -  19887  19840   48  0  0   51  101    42 0.436   2.75
 2.00 Prom -  20190  20151   40                              -0.86

 3.00 Prom +  22141  22180   40                              -1.66
 3.01 Init +  24580  24588    9  0  0   75  100    10 0.329   0.84
 3.02 Intr +  34159  34282  124  0  1  108  100    62 0.997   9.56
 3.03 Intr +  34639  34820  182  2  2  116  101    40 0.545   7.69
 3.04 Intr +  35220  35322  103  2  1   38  116    71 0.380   4.65
 3.05 Intr +  37859  37923   65  0  2   99   79     0 0.936  -1.36
 3.06 Intr +  38081  38174   94  1  1   98   92   162 0.875  17.14
 3.07 Intr +  39118  39237  120  2  0   94   67   126 0.954  11.57
 3.08 Intr +  40512  40626  115  1  1   73   91   123 0.685  10.61
 3.09 Intr +  42391  42530  140  1  2  112   94   109 0.901  14.01
 3.10 Intr +  45772  45908  137  2  2  101   68    53 0.987   4.89
 3.11 Intr +  46516  46596   81  0  0   95   76    82 0.712   7.53
 3.12 Intr +  47550  47677  128  2  2   68   64   260 0.901  21.08
 3.13 Intr +  50406  50520  115  0  1   99   80   233 0.995  24.05
 3.14 Intr +  51178  51379  202  0  1   83   92   107 0.992   9.46
 3.15 Intr +  52269  52378  110  1  2   60   56    55 0.484  -0.50
 3.16 Intr +  52459  52539   81  0  0   55  100    85 0.756   6.23
 3.17 Intr +  54804  54927  124  2  1   35   80    34 0.612  -2.54
 3.18 Intr +  55210  55331  122  2  2  106   69   201 0.607  20.21
 3.19 Intr +  55457  55568  112  1  1   94   96   130 0.992  14.35
 3.20 Intr +  55622  55656   35  0  2  118   66   -12 0.485  -2.46
 3.21 Intr +  56514  56726  213  2  0   87  100    78 0.786   7.81
 3.22 Intr +  57574  57704  131  0  2   59  109   123 0.998  10.99
 3.23 Intr +  58220  58388  169  2  1   80   96   262 0.999  26.15
 3.24 Intr +  58858  59028  171  0  0  109   94   219 0.992  24.74
 3.25 Intr +  61357  61418   62  0  2  102   95   130 0.997  12.63
 3.26 Intr +  63078  63219  142  0  1   56   64    34 0.591  -1.84
 3.27 Intr +  64493  64567   75  1  0   79   83    39 0.534   2.21
 3.28 Intr +  66348  66522  175  2  1   46   78   189 0.997  13.21
 3.29 Intr +  67195  67318  124  2  1   51   83   195 0.997  14.94
 3.30 Intr +  68005  68253  249  1  0  114   58   587 0.449  54.85
 3.31 Intr +  68407  68533  127  1  1   97   40    35 0.026   0.28
 3.32 Intr +  83354  83482  129  1  0   73   98    31 0.006   3.49
 3.33 Intr +  86626  86771  146  0  2   38   47   133 0.007   3.28
 3.34 Term +  99444  99573  130  0  1   42   43   150 0.075   3.55
 3.35 PlyA +  99591  99596    6                              -3.24

 4.03 PlyA -  99619  99614    6                              -0.45
 4.02 Term - 100138  99998  141  1  0   69   50   229 0.920  15.13
 4.01 Init - 103050 102913  138  0  0   43  105   186 0.721  15.94
 4.00 Prom - 115048 115009   40                              -1.56

 5.00 Prom + 121429 121468   40                              -3.46
 5.01 Init + 125204 125268   65  1  2   80  116    52 0.690   5.96
 5.02 Intr + 126251 126410  160  2  1  101   47    24 0.147  -0.41
 5.03 Intr + 144170 144274  105  1  0   65   68   157 0.761  11.81
 5.04 Intr + 144941 145123  183  1  0   86   32   126 0.937   6.78
 5.05 Intr + 149093 149254  162  1  0   64  101   158 0.904  14.87
 5.06 Intr + 150210 150374  165  2  0  108   81   372 0.975  38.66
 5.07 Intr + 159380 159555  176  1  2   72  100   242 0.972  22.54
 5.08 Intr + 161468 161597  130  2  1   96   78    34 0.994   3.90
 5.09 Intr + 162435 162571  137  2  2   47   86   233 0.943  18.37
 5.10 Intr + 164018 164189  172  2  1   87   69    67 0.810   4.65
 5.11 Term + 164542 164628   87  0  0   89   38    63 0.692  -0.94
 5.12 PlyA + 164728 164733    6                               1.05

 6.00 Prom + 167724 167763   40                              -6.76
 6.01 Init + 167853 168142  290  2  2   82  105   247 0.494  20.39
 6.02 Term + 191436 191880  445  0  1  -16   35   477 0.102  26.71
 6.03 PlyA + 198216 198221    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 80427 80071 357 0 0 67 48 166 0.987 6.56 S.002 Sngl + 191518 191880 363 0 0 27 35 431 0.847 27.78 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815577r:46499366_46702415|GENSCAN_predicted_peptide_1|115_aa MLETADGVPVNSRVSSKIQQLLNTLKRPKRPPLKEFFVDDFEELLEVQQPDPNQPKPEGS ETSVLRGEPLTAGVPRPPSLLATLQRWGTTQPKSPCLTALDTTGKAVYTLTYGKC >gi568815577r:46499366_46702415|GENSCAN_predicted_CDS_1|348_bp atgctggaaacagcagatggtgtccctgtgaacagcagagtgtcctccaaaatccagcag cttctgaacaccctgaagaggccaaagcgccctccactgaaggagttctttgtggatgat tttgaggaattgttggaagttcagcaaccagatccaaatcagccaaagcctgagggaagc gagacgagtgtgctgagaggggagcctctcactgcaggtgtcccccgaccgccgtcgctg ttggccaccttgcagcgctggggcacaacacagcccaaatccccctgtctgactgccttg gatacaactgggaaagccgtctacactctcacctatggcaagtgttaa >gi568815577r:46499366_46702415|GENSCAN_predicted_peptide_2|76_aa MCGEEAGQVPANEDGNQETKSPCGKPLKSRMGLLRSHEETNATAGKPSEERDTWTHQALA ESPGEPEESKGRLGQA >gi568815577r:46499366_46702415|GENSCAN_predicted_CDS_2|231_bp atgtgtggtgaagaagctggccaagtccccgcaaatgaagatggcaatcaggagaccaag agcccatgtggaaaacctctgaaaagcagaatggggttgttaagaagccacgaggagacc aacgccacggctgggaagccatcagaggaaagagacacatggacacaccaggcactggca gaaagcccaggagagccagaggagagcaaaggccggctgggccaggcatga >gi568815577r:46499366_46702415|GENSCAN_predicted_peptide_3|1413_aa MGKDAGSQQVGFLLGSCGVFLALTTDACQKGLPKAQTGEVAAFKGWPPLSWLVIDGKHLA KPPKDWHPLAQDTGTGTAYIEVMTVPNLENVKTCPTGLSLAFHKNYKTSKEGSTVGVTVS HASLLAQCRALTQACGYSEAETLTNVLDFKRDAGLWHGVLTSVMNRMHVVSVPYALMKAN PLSWIQKVCFYKARAALVKSRDMHWSLLAQRGQRDVSLSSLRMLIVADGANPWSISSCDA FLNVFQSRGLRPEVICPCASSPEALTVAIRRPPDLGGPPPRKAVLSMNGLSYGVIRVDTE EKLSVLTVQDVGQVMPGANVCVVKLEGTPYLCKTDEVGEICVSSSATGTAYYGLLGITKN VFEAVPVTTGGAPIFDRPFTRTGLLGFIGPDNLVFIVGKLDGLMVTGVRRHNADDVVATA LAVEPMKFVYRGRIAVFSVTVLHDDRIVLVAEQRPDASEEDSFQWMSRVLQAIDSIHQVG VYCLALVPANTLPKAPLGGIHISETKQRFLEGTLHPCNVLMCPHTCVTNLPKPRQKQPEV GPASMIVGNLVAGKRIAQASGRELAHLEDSDQARKFLFLADVLQWRAHTTPDHPLFLLLN 
AKGTVTSTATCVQLHKRAERVAAALMEKGRLSVGDHVALVYPPGVDLIAAFYGCLYCGCV PVTVRPPHPQNLGTTLPTVKMIVEVSKSACVLTTQAVTRLLRSKEAAAAVDIRTWPTILD TGVVWPGCRPKTHDSQGQIMCRLLPVKAQSVQNTCGNTNVAGVSCLTDDIPKKKIASVFR PPSPDVLAYLDFSVSTTGILAGVKMSHAATSALCRSIKLQCELYPSRQIAICLDPYCGLG FALWCLCSVYSGHQSVLVPPLELESNVSLWLSAVSQYKARVTFCSYSVMEMCTKGLGAQT GVLRMKGVNLSCVRTCMVVAEERPRIALTQSFSKLFKDLGLPARAVSTTFGCRVNVAICL QGTAGPDPTTVYVDMRALRHDRGKGRSLATLTPFTAMATATCGESERGPERDPAKPCDVQ DTDSCRAAGILPGVKVIIAHTETKGPLGDSHLGEIWVSSPHNATGYYTVYGEEALHADHF SARLSFGDTQTIWARTGYLGFLRRTELTDASGGRHDALYVVGSLDETLELRGMRYHPIDI ETSVIRAHRSIAECAVFTWTNLLVVVVELDGLEQDALDLVALVTNVVLEEHYLVVGVVVI VDPGVIPINSRGEKQRMHLRDGFLADQLDPIYVAYNIKHFLNYLKKYFESAKYIYKNTDA GILTDGETRKGKERPGMGIWEPPMAAASSRPSFHFAGNFFNNQVNMPNGNLDPQKKQREP VKETTSLGKGGEYYIKGISHGTKESEQQPSALDLPSERGYPSEKEPENQLRSAASAERSP GDQAHPGKQAEATTPLQGHTGQPAVICMDEERI >gi568815577r:46499366_46702415|GENSCAN_predicted_CDS_3|4242_bp atggggaaggatgcaggcagccagcaggttgggtttctgctgggcagctgtggagtcttc ttggccctgaccacagacgcttgtcagaaaggcctccccaaggcacagacaggagaggtg gcagctttcaaaggttggcccccgctctcctggctagtgattgatgggaagcatctagcc aagcccccaaaggactggcaccctctggcccaggacacagggactgggactgcctacatt gaggtaatgactgttcctaacttagagaatgtaaaaacatgtcccacaggcttaagtctt gcttttcataagaattataaaaccagcaaagaaggcagtacggtgggggtcacagtgtcc cacgcatccctgctggcacagtgccgggctctgacccaggcgtgcgggtactcagaagct gaaacattaacaaacgtgctggatttcaaaagggatgctggtctgtggcatggcgtgtta acaagcgtcatgaacaggatgcacgtggtcagcgtcccctacgcgctgatgaaggcgaac ccactctcctggatccagaaagtgtgcttctataaagctcgggccgcgctggtgaagtcg cgagacatgcactggtctctcctagctcagcggggccagagggacgtcagcctcagctca ctgcgcatgctgattgtggccgatggtgccaacccgtggtcgatctcctcctgtgacgcc ttcctcaacgtcttccagtccagaggtctgaggccagaggtcatctgtccttgtgcaagt tctcctgaggcgctgactgtcgccatccgcaggccacctgatctgggaggaccacctcca agaaaagcagtcctgtcgatgaacggtctaagttatggtgttatcagagtggatactgaa gaaaagttgtcagtccttactgttcaggacgttggtcaggtgatgcctggagctaatgta tgtgttgtgaagttagaaggtaccccttatctttgtaaaactgatgaagtgggagaaata tgcgtcagttccagtgcaactggcacagcgtactatggattgcttggaatcacgaagaat 
gtgtttgaggcagttccggtcaccacaggaggagcacccatctttgacaggccattcacc aggacaggcctgctgggcttcatcgggcctgacaacctggtcttcatcgtgggcaaactg gacgggctgatggtcactggagttcgcagacacaatgcagatgacgttgtggccaccgca ctggccgtggagcccatgaagtttgtctacagaggcaggatcgctgtgttctctgtgacc gtgctgcacgacgaccggattgtcctggtggctgagcagcggccggatgcctcggaggag gacagcttccagtggatgagccgtgtgctgcaggccattgatagcatccaccaggtgggc gtgtactgtctggccctggttcctgccaacaccttgcccaaggctcctctcggagggatt cacatttctgaaaccaaacagcgctttctggaagggacgctgcacccgtgtaatgtgctg atgtgccctcacacctgtgttaccaacctccccaaacctcgtcagaaacaaccagaggtt ggaccagcctcaatgatcgtggggaacctggttgctgggaagagaatcgctcaggcttcc gggagagagctcgcccacctggaggacagcgaccaggcacggaagttcctgttcctggct gacgtgctgcagtggcgtgcccacaccactcctgaccacccgctgttcttgctgctgaac gccaagggcaccgtcacaagcactgcaacctgtgtccagctgcacaaaagggctgagaga gtggccgcggctctgatggagaagggaagactgagtgttggggaccatgtggctctggtc tacccaccaggggtggacctcattgccgcgttctatggctgcttgtactgtggctgcgtg cctgtcaccgtgcggcccccgcaccctcagaacctcggcaccacactgcccaccgtcaag atgatcgtggaggtcagcaagtctgcatgcgtcctcaccacgcaggctgtcacacggctg ctcaggtccaaggaggctgctgctgccgtggacatcaggacctggcccaccatcctagac acaggtgtggtgtggcctggctgccgtccaaaaacacacgactcccaaggacaaatcatg tgtcgcctcttgcctgtgaaagcccagagcgttcagaatacatgtgggaacactaatgtt gctggtgtctcctgtttaacagatgacatcccaaaaaagaagatagcaagcgttttcagg cccccctcccccgatgtcctcgcatacttggacttcagcgtgtcaaccactgggatatta gcgggagtgaagatgtcgcacgcggccacaagcgccttatgccgctccataaagctgcag tgtgagctgtacccctcgcggcagatcgccatctgcctcgacccctactgtggccttggt tttgccctgtggtgtctgtgcagtgtctactcgggacaccaatcagtgctggtgcccccg ctggagctggagagcaacgtgtccctgtggctgtcggccgtcagccagtacaaggcccgc gtcaccttctgctcctactctgtgatggagatgtgcaccaagggcctaggcgcacagacg ggtgtcctcaggatgaagggggtgaacctgtcatgtgtgcgcacgtgcatggtggtcgcc gaggagcggcccaggattgcgctgacccagtccttctccaagctcttcaaggacctgggc ctgccggcccgcgccgtaagcaccacgttcgggtgcagggtcaacgtggccatctgcctc cagggcacagctggcccggaccccacaaccgtctacgtggacatgcgggcactgcgccat gacagaggcaagggaaggagcttggccaccctgactccgttcactgccatggccacagcc 
acctgtggggagtcagagaggggtccagagagggacccagccaagccatgtgatgtccag gatacagactcctgcagggcagcagggatcctccccggcgtgaaggtcatcatcgcacac accgagaccaaaggacccttgggagactcacacctgggagagatctgggtaagcagcccc cacaatgccaccgggtactacaccgtttacggggaggaggcgcttcatgccgaccacttc agtgcccggctgagttttggagacacacagaccatctgggcaaggaccggctaccttggc ttccttcggcgaacagagctcactgatgccagtggagggcggcacgatgcactgtatgtg gttgggtctctggatgaaactctggagctcagaggcatgcggtaccaccccatcgacatt gagacctctgtcatccgagcacacaggagcatcgctgagtgtgccgtattcacctggacc aacctgctggtggtggtggtggagctggatgggctagagcaggatgccctggacctggtg gccctggtgaccaacgtggtgctggaggagcactacctggtcgtgggagtggtggtcatc gtggacccaggggtgatccctatcaactctcggggtgagaagcagcgcatgcacctgcgg gacggcttcctggctgaccagctggaccccatctatgtcgcctacaacataaagcacttc ctgaattatttaaagaaatattttgaatctgccaagtacatttacaaaaacacggatgct ggtattttaacagatggagagacaaggaaaggaaaggaaaggcctggcatgggcatttgg gagccaccaatggcagctgcttctagtaggccatctttccactttgctggtaattttttt aacaaccaagtgaatatgccaaatggtaacttggatccacagaaaaagcaaagagaacca gtaaaggaaaccacatcattaggaaaagggggagagtactacatcaagggaatatcccat gggacaaaagaatctgaacaacagccttcagctctagaccttccctctgagagaggctac ccaagtgagaaggaaccagaaaaccaactccggagcgctgccagcgctgagcgttcacct ggtgatcaggcgcatcccgggaagcaggccgaagccaccacgcccttgcagggccacacc ggccaaccagctgttatctgcatggatgaggaacgcatttaa >gi568815577r:46499366_46702415|GENSCAN_predicted_peptide_4|92_aa MSELEKAMVALIDVFHQYSGREGDKHKLKKSELKELINNELSHFLEEIKEQEVVDKVMET LDNDGDGECDFQEFMAFVAMVTTACHEFFEHE >gi568815577r:46499366_46702415|GENSCAN_predicted_CDS_4|279_bp atgtctgagctggagaaggccatggtggccctcatcgacgttttccaccaatattctgga agggagggagacaagcacaagctgaagaaatccgaactgaaggagctcatcaacaatgag ctttcccatttcttagaggaaatcaaagagcaggaggttgtggacaaagtcatggaaaca ctggacaatgatggagacggcgaatgtgacttccaggaattcatggcctttgttgccatg gttactactgcctgccacgagttctttgaacatgagtga >gi568815577r:46499366_46702415|GENSCAN_predicted_peptide_5|513_aa MLAVRCPARPCVFSVLWFLGGRLVTSRAKHLAQPLHPMSNGDGDRTDTLAPATQKGETEA KSRQSAEILKSNLTQGEEPAECSEAGLLQEGVQPEEFVAIADYAATDETQLSFLRGEKIL 
ILRQTTADWWWGERAGCCGYIPANHVGKHVDEYDPEDTWQDEEYFGSYGTLKLHLEMLAD QPRTTKYHSVILQNKESLTDKVILDVGCGTGIISLFCAHYARPRAVYAVEASEMAQHTGQ LVLQNGFADIITVYQQKVEDVVLPEKVDVLVSEWMGTCLLFEFMIESILYARDAWLKEDG VIWPTMAALHLVPCSADKDYRSKVLFWDNAYEFNLSALKSLAVKEFFSKPKYNHILKPED CLSEPCTILQLDMRTVQISDLETLRGELRFDIRKAGTLHGFTAWFSVHFQSLQEGQPPQV LSTGPFHPTTHWKQTLFMMDDPVPVHTGDVVTGSVVLQRNPVWRRHMSVALSWAVTSRQD PTSQKASVASQAAVSLGVSSVMRVLTGSLRIRV >gi568815577r:46499366_46702415|GENSCAN_predicted_CDS_5|1542_bp atgttggccgtcaggtgccctgccaggccctgcgtgttcagtgtgctctggttcctggga ggaagacttgtgacctcaagagccaagcacctggcccaacctcttcaccctatgtctaat ggggatggggataggacagacaccttggcccctgccactcagaagggggaaactgaggca aaaagcaggcagtctgcagagattctgaaatccaacctgacccagggagaagagcctgct gagtgcagtgaggccggtctcctgcaggagggagtacagccagaggagtttgtggccatc gcggactacgctgccaccgatgagacccagctcagttttttgagaggagaaaaaattctt atcctgagacaaaccactgcagattggtggtggggtgagcgtgcgggctgctgtgggtac attccggcaaaccatgtggggaagcacgtggatgagtacgaccccgaggacacgtggcag gatgaagagtacttcggcagctatggaactctgaaactccacttggagatgttggcagac cagccacgaacaactaaataccacagtgtcatcctgcagaataaagaatccctgacggat aaagtcatcctggacgtgggctgtgggactgggatcatcagtctcttctgtgcacactat gcgcggcctagagcggtgtacgcggtggaggccagtgagatggcacagcacacggggcag ctggtcctgcagaacggctttgctgacatcatcaccgtgtaccagcagaaggtggaggat gtggtgctgcccgagaaggtggacgtgctggtgtctgagtggatggggacctgcctgctg tttgagttcatgatcgagtccatcctgtatgcccgggatgcctggctgaaggaggacggg gtcatttggcccaccatggctgcgttgcaccttgtgccctgcagtgctgataaggattat cgtagcaaggtgctcttctgggacaacgcgtacgagttcaacctcagcgctctgaaatct ttagcagttaaggagtttttttcaaagcccaagtataaccacattttgaaaccagaagac tgtctctctgaaccgtgcactatattgcagttggacatgagaaccgtgcaaatttctgat ctagagaccctgaggggcgagctgcgcttcgacatcaggaaggcggggaccctgcacggc ttcacggcctggtttagcgtccacttccagagcctgcaggaggggcagccgccgcaggtg ctcagcaccgggcccttccaccccaccacacactggaagcagacgctgttcatgatggac gacccagtccctgtccatacaggagacgtggtcacgggttcagttgtgttgcagagaaac ccagtgtggagaaggcacatgtctgtggctctgagctgggctgtcacttccagacaagac cccacatctcaaaaagcctctgtggcttcccaggcagcagttagtctaggcgtgtcgtct 
gtgatgcgtgttctgactggcagccttcggatccgtgtgtga >gi568815577r:46499366_46702415|GENSCAN_predicted_peptide_6|244_aa MQRALCAWRALFACWALCACWALCAREGAATLTRKAASGSRCAQLPALGFQRPASSYSNQ RDLPSPGAPRGVCARFFPKREGAGGSAARGTGSGSVRRSQSEGLKGQEGSVERCPQPHAK KKIRMSLTFRRPKTLRLRRQPRYPRKSTPRRNKLGHYAIIKFPLTTESAVKKIEENNTLV FTVDVKANKHQIRQAVKKLYDSDVAKVTTLICPDKEKAYVRLAPDYDAFDVVTKLGSPKL SPAG >gi568815577r:46499366_46702415|GENSCAN_predicted_CDS_6|735_bp atgcagagggctctgtgcgcctggcgggctctgtttgcctgctgggctctgtgcgcctgc tgggctctgtgcgcccgggaaggtgcggccaccctcacgcggaaggcggccagcggatcc cggtgcgcgcagctcccagcgctggggttccagcgccccgcctcttcctatagcaaccag cgggacctgccgtcccccggggcaccccgaggggtctgcgcccgcttctttccgaaacgg gaaggcgctgggggctcggcagccagagggacgggttcagggagcgtccgccgaagccaa agcgaaggccttaaaggccaagaaggcagtgttgaaaggtgtccgcagccacacgcaaaa aagaagatccgcatgtcactcaccttcaggcggcccaagacactgcgactccggaggcag cccagatatcctcggaagagcacccccaggagaaacaagcttggccactatgctatcatc aagtttccgctgaccactgagtcggccgtgaagaagatagaagaaaacaacacgcttgtg ttcactgtggatgttaaagccaacaagcaccagatcagacaggctgtgaagaagctctat gacagtgatgtggccaaggtcaccaccctgatttgtcctgataaagagaaggcatatgtt cgacttgctcctgattatgatgctttcgatgttgtaacaaaattgggatcacctaaactg agtccagctggctaa