GENSCAN 1.0 Date run: 3-Nov-116 Time: 17:35:11 Sequence gi568815597r:58682078_58883070 : 200993 bp : 42.00% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.05 Intr - 468 57 412 0 1 11 16 396 0.997 17.53 1.04 Intr - 3174 3076 99 0 0 46 121 60 0.956 4.49 1.03 Intr - 5757 5623 135 0 0 42 37 118 0.760 1.94 1.02 Intr - 7039 6961 79 0 1 117 59 66 0.915 5.23 1.01 Init - 17975 17908 68 2 2 107 72 121 0.985 13.00 1.00 Prom - 18689 18650 40 -7.25 2.00 Prom + 22906 22945 40 -2.85 2.01 Init + 34354 34409 56 0 2 83 81 68 0.045 6.41 2.02 Intr + 53593 53762 170 1 2 0 87 194 0.479 9.17 2.03 Intr + 68754 68874 121 1 1 86 98 -12 0.105 -1.77 2.04 Intr + 69189 69284 96 0 0 57 81 128 0.130 7.21 2.05 Intr + 72290 72380 91 2 1 91 84 21 0.049 1.08 2.06 Intr + 75566 75723 158 1 2 93 84 53 0.184 3.29 2.07 Intr + 76202 76228 27 2 0 96 106 21 0.135 1.01 2.08 Intr + 84210 84420 211 0 1 60 115 60 0.259 3.89 2.09 Term + 85308 85460 153 2 0 20 42 141 0.320 -0.36 2.10 PlyA + 85468 85473 6 1.05 3.00 Prom + 91812 91851 40 -6.35 3.01 Init + 94035 94129 95 2 2 79 92 85 0.957 7.90 3.02 Intr + 95627 95746 120 2 0 110 95 65 0.957 8.09 3.03 Term + 96898 97000 103 1 1 38 43 79 0.159 -4.83 3.04 PlyA + 97171 97176 6 1.05 4.14 PlyA - 97803 97798 6 1.05 4.13 Term - 101061 99998 1064 1 2 34 49 1397 0.969 121.69 4.12 Intr - 101427 101220 208 0 1 49 71 232 0.544 15.33 4.11 Intr - 101816 101510 307 1 1 59 22 238 0.626 9.83 4.10 Intr - 102866 102680 187 0 1 -11 14 216 0.832 2.13 4.09 Intr - 103595 103278 318 0 0 121 23 167 0.831 8.51 4.08 Intr - 119714 119639 76 2 1 37 60 91 0.039 -0.63 4.07 Intr - 120793 120682 112 0 1 57 -3 182 0.107 5.46 4.06 Intr - 124602 124459 144 2 0 84 23 115 0.047 3.08 4.05 Intr - 128248 128137 112 2 1 53 82 41 0.290 -1.48 4.04 Intr - 129590 129524 67 2 1 99 100 58 0.234 5.66 4.03 Intr - 134244 134013 232 2 1 46 70 137 0.149 4.55 4.02 Intr - 150903 150825 79 1 1 83 101 18 0.244 0.29 4.01 Init - 153498 153345 154 0 1 57 72 138 0.569 9.29 4.00 Prom - 153826 153787 40 -7.55 5.00 Prom + 157489 157528 40 -5.85 5.01 Init + 157544 157580 37 1 1 64 103 16 0.526 0.92 5.02 Intr + 159360 159515 156 1 0 83 5 131 0.258 3.36 5.03 Intr + 164144 164309 166 0 1 82 74 117 0.672 7.80 5.04 Intr + 166751 167204 454 2 1 106 74 153 0.730 8.03 5.05 Term + 168504 168593 90 2 0 28 53 88 0.514 -3.96 5.06 PlyA + 168726 168731 6 1.05 6.04 PlyA - 169635 169630 6 1.05 6.03 Term - 171746 171622 125 0 2 90 43 119 0.679 5.17 6.02 Intr - 184028 183949 80 1 2 71 111 48 0.210 3.68 6.01 Init - 195142 195099 44 1 2 91 51 58 0.013 2.34 6.00 Prom - 197257 197218 40 -4.05 7.02 PlyA - 198001 197996 6 1.05 7.01 Term - 200757 200487 271 2 1 25 43 223 0.924 5.47 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597r:58682078_58883070|GENSCAN_predicted_peptide_1|265_aa MAAEEADVDIEGDVVAAAGAQPGVHSPTKPASYSVKWTIEEKELFEQGLPGAIYRYRQKR SGQYGLLPSRTLIRCSHDEAVLALVPAGCRCGVQAKFGRRWTKISKLIGSRTVLQVKSYA RQYFKNKVKCGLDKETPNQKTGHNLQVKNEDKGTKAWTPSCLRGRADPNLNAVKIEKLSD DEEVDITDEVDELSSQTPQKNSSSDLLLDFPNSKMHETNQGEFITSDSQEALFSKSSRGC LQNEKQDETLSSSEITLWTEKQSNX >gi568815597r:58682078_58883070|GENSCAN_predicted_CDS_1|795_bp atggcggctgaagaggcggatgtggatatcgaaggggacgtggtagcggcggcgggggca cagccaggggtacactctcctacaaaaccagccagttactcagtaaagtggacgatagaa gaaaaagagctgtttgaacaagggctgcctggggcaatttataggtatcggcagaagagg tctgggcagtatggcttgctgcccagcaggacattgataagatgttcccatgatgaggca gttctggcccttgttccggcgggatgtcgttgtggtgttcaggctaaatttggccgaaga tggaccaaaatttcaaagctaattggaagccgcactgttttacaagtgaagagttatgca agacagtattttaaaaataaggtcaaatgcggtctggataaagaaacaccaaatcagaag accggccataatcttcaagttaaaaatgaagataaagggacaaaggcatggacaccatca tgtttaaggggacgtgctgatcccaacttgaatgctgtaaaaattgaaaagttatctgat gatgaagaagtagacatcacagatgaggtggacgagttgtcttctcaaacaccccagaag aattctagcagtgatctcttgttagactttcctaatagtaaaatgcatgaaaccaatcaa ggagaattcattacttctgacagccaggaagctctcttttctaagtcttccaggggctgt cttcaaaatgaaaagcaagatgaaacactttcaagctcagaaattacactgtggactgag aaacagagcaatgnn >gi568815597r:58682078_58883070|GENSCAN_predicted_peptide_2|360_aa MAESETEAGTSYMAAVGGRKLEAYKTIRRTGRTKVRRANCQCSGRQEAQPWQEAAANALG CLQQLNGLCSRAWLEAKLLAVFVYTTCLYFLTSHSILLMVQPLKTVLNTWSLEEASLLCN KEGTHGFSEAQAIPRREEGYEGKTSTELAGCCCSAHSAVHAEKLAPGLHQQKGLCLWRKN TINRISDVHFIDEEGRFSKAIQGYTARGRDGIEVPLSASFSFLMPFHFPSWKAFPLQEKK NRTKEVVGELHAGYQRINPGGFQKFQRNYLEGIIPRRKEAKASHRLHKTRDLAYLIPHCI PSTKNYVWLMCRHDSWRAPCNHEVTLGWKPYAKNGGTETEKEPMTLMAVELLQCLPPDFL >gi568815597r:58682078_58883070|GENSCAN_predicted_CDS_2|1083_bp atggcagaaagtgaaacggaagcaggcacatcttacatggctgcagtaggaggaaggaaa ttagaagcttacaaaaccattagaagaaccggcagaacaaaggttaggcgagccaactgc cagtgttcaggacgtcaggaagcgcagccatggcaggaagctgctgctaatgctttgggc tgtctgcagcagctaaatgggttatgttcaagagcgtggctggaagccaaactccttgca gtatttgtctacacaacctgtctctacttcctcacctcccactcaatcctcctgatggtg cagcctttaaaaaccgtgttgaatacttggagtctggaagaagcaagccttctctgcaat aaggaggggacccatggcttcagtgaggcccaagcaataccaagaagagaagaaggatat gaaggtaaaacatctacggagttggcagggtgctgctgtagtgcacattctgctgtgcat gctgagaagctggctccgggtctgcaccagcagaaaggtctgtgcttatggaggaaaaac accataaataggatttctgatgtccattttatagatgaagaaggaaggtttagcaaggcc attcaaggatacacagctagaggacgagatggaattgaagtccctttgtcagcatccttt tcattcttgatgccatttcatttcccctcctggaaagccttcccactccaagagaaaaag aataggactaaagaggtggtgggtgagttgcatgcaggataccaaaggataaatcctggg ggcttccagaagttccagaggaattacctagaaggcatcattccaaggaggaaagaggcc aaagcatcacatagacttcataaaacaagggaccttgcctatcttattccccactgtatc cccagtacaaaaaattatgtctggctcatgtgtagacatgatagctggagggcaccttgt aaccatgaagtaactttaggatggaagccatatgcgaagaatggtggaacagaaacagag aaggagcctatgaccctgatggctgtggagctgctacagtgccttcctccagacttttta taa >gi568815597r:58682078_58883070|GENSCAN_predicted_peptide_3|105_aa MTTINGHPENSGENNMRQKRQKSFGNYKSMDGFLLVSFFRASLSSLQVPKHKTFSHTSLP LHARFFPIFSAWELFIPFEDGVTKGSFTEKVSLTPRLEEWVGFIS >gi568815597r:58682078_58883070|GENSCAN_predicted_CDS_3|318_bp atgactactatcaatgggcatcccgagaacagtggtgaaaataacatgagacagaagagg caaaagagctttggaaactacaagagtatggatggttttcttctcgtatctttcttccga gcttctctaagcagccttcaggtccctaaacacaagacgttttctcatacttctctgcct ttgcatgcacgtttctttccaatcttttctgcctgggaattattcattccttttgaagat ggagttactaagggaagctttacagagaaggtgtcacttacgccaagacttgaagaatgg gtaggatttatcagttga >gi568815597r:58682078_58883070|GENSCAN_predicted_peptide_4|1019_aa MTSERHRGHEIDPRALMCKEVGERGAQEKGLDRTSLVLSKGEDIPEGSAIPVRSIGSGSE ITHEPIHFTAFSVLSLTIIAPTWLAERQKSAVAVRGKLFANRRPHPAETPQAGSQKSQSS PNPGSASLSQWDSSGSQFAHLCRRCEEFLHNREKKFQHLHGLHLWGKDLASEAIVVPSSS RPQVPAFLIMAGATSAALYYSDGYGGVRDILKEARTVSEYISVALSYKACGHLLQQPQGT NSSSNYASRWLDAPPTPESSTLLVVIVTFIIIITIIIIIIIIITEIAKGGEQEQPISFSL GSGVHTALKASDLAQVFSSGEAEAPGKYTWSCQIALLYPGILSRLCEDGNCEAGEGSGLF GHHLPSAAGDPVTLTPGSPVRWTAPAPPADEGNEARRSSVICLLHTVVAWPRKNQDSRLR DSFHHTLSSLRGKAVNPRALSPTVPEDEVGCNGDSAERPVSGNTNLSASTSSQTTPRTVA PELDRDANFGVFSTASACGSRAGVSGVSREPGLQPPPGGEPCCPGAVDSGGKQRYPRARR GKSASGCSSKELSRLGGPETSGRVPEPTFASLSCVLGFSTAVKTRRRRRATQEKKDRRRG QVVGVRAAKTRRRPATAGSALIRSAGRAAALGSEFACGLRGTAAHEERSVSDRDFSKPGS ARESTTLLRPRNLCAQPKLTSREVTDCSMTAKMETTFYDDALNASFLPSESGPYGYSNPK ILKQSMTLNLADPVGSLKPHLRAKNSDLLTSPDVGLLKLASPELERLIIQSSNGHITTTP TPTQFLCPKNVTDEQEGFAEGFVRALAELHSQNTLPSVTSAAQPVNGAGMVAPAVASVAG GSGSGGFSASLHSEPPVYANLSNFNPGALSSGGGAPSYGAAGLAFPAQPQQQQQPPHHLP QQMPVQHPRLQALKEEPQTVPEMPGETPPLSPIDMESQERIKAERKRMRNRIAASKCRKR KLERIARLEEKVKTLKAQNSELASTANMLREQVAQLKQKVMNHVNSGCQLMLTQQLQTF >gi568815597r:58682078_58883070|GENSCAN_predicted_CDS_4|3060_bp atgacctctgagagacacagaggtcatgaaatagatccaagagctctgatgtgcaaggag gtaggtgaaagaggggcccaagagaaaggcctggaccgcacgtccctagtcctgtccaag ggagaggatattccggagggcagtgctattccagtaagaagtatcgggtctggatcagag atcacccatgagccaattcattttactgcattctctgtactaagtctgacaataatcgcc cccacttggctggctgagcgacaaaaatcagcagtagctgtccgaggaaagctcttcgca aaccggcgcccccacccagcggaaacaccccaagccggatcccagaaatctcagtcttct cccaaccccggctctgcctctctctctcaatgggactcgtccgggtctcagtttgcccat ctgtgcagacggtgtgaagaatttctgcacaatcgagaaaaaaagttccagcatctgcat gggctccacctctggggaaaagacctggcaagtgaagccatcgttgttcccagctcatcc aggccacaggtccctgccttccttatcatggctggtgccacctcagcagccttatactat agtgatgggtatgggggagtgagagatatcttgaaagaggccagaactgtgagcgaatac atttctgtggctttaagctacaaagcttgtggtcatttgttacagcagccacagggcacc aatagcagcagtaactatgcctctcgctggcttgatgctccacccacccctgaatcttcc accttgctggttgttatagttaccttcatcatcatcatcaccatcatcatcatcatcatc atcatcatcacagaaattgcaaagggtggagagcaggaacagcccatttctttttctctg ggctctggagtccacactgcattgaaagccagtgatctggcacaagtcttcagcagtgga gaagctgaggccccaggaaagtacacctggtcctgccaaatcgcactcttatatcctggc atcctatccaggctctgcgaggatggaaactgcgaggcaggggagggaagcgggctgttt ggccaccacctccctagtgctgcaggcgaccctgtcacactaactcctggcagcccagtg aggtggacggcaccggccccacctgcagatgagggaaatgaagctcggaggagttccgtg atttgcttgcttcacactgtggtagcctggccacgaaagaaccaggattcccgacttcgg gattctttccaccacacactttcgtccctaaggggcaaagctgtgaacccccgcgccctt tcccccacggtcccggaggatgaagtggggtgcaacggagactcagctgagcgtccagtt tcgggcaatacaaatctctcggcttctacgagcagccagacgaccccgcggaccgtcgct cctgaacttgaccgagatgcaaacttcggagtgttctcaacagctagcgcctgtggctcc cgggctggtgtttcgggagtgtccagagagcctggtctccagccgcccccgggaggagag ccctgctgcccaggcgctgttgacagcggcggaaagcagcggtacccacgcgcccgccgg gggaagtcggcgagcggctgcagcagcaaagaactttcccggctgggaggaccggagaca agtggcagagtcccggagccaacttttgcaagcctttcctgcgtcttaggcttctccacg gcggtaaagaccagaaggcggcggagagccacgcaagagaagaaggaccggaggagggga caagtcgtcggagtccgggcggccaagacccgccgccggccggccactgcagggtccgca ctgatccgctccgcggggagagccgctgctctgggaagtgagttcgcctgcggactccga ggaaccgctgcgcacgaagagcgctcagtgagtgaccgcgacttttcaaagccgggtagc gcgcgcgagtcgacaaccctgttgcggccccgaaacttgtgcgcgcagcccaaactaacc tcacgtgaagtgacggactgttctatgactgcaaagatggaaacgaccttctatgacgat gccctcaacgcctcgttcctcccgtccgagagcggaccttatggctacagtaaccccaag atcctgaaacagagcatgaccctgaacctggccgacccagtggggagcctgaagccgcac ctccgcgccaagaactcggacctcctcacctcgcccgacgtggggctgctcaagctggcg tcgcccgagctggagcgcctgataatccagtccagcaacgggcacatcaccaccacgccg acccccacccagttcctgtgccccaagaacgtgacagatgagcaggagggcttcgccgag ggcttcgtgcgcgccctggccgaactgcacagccagaacacgctgcccagcgtcacgtcg gcggcgcagccggtcaacggggcaggcatggtggctcccgcggtagcctcggtggcaggg ggcagcggcagcggcggcttcagcgccagcctgcacagcgagccgccggtctacgcaaac ctcagcaacttcaacccaggcgcgctgagcagcggcggcggggcgccctcctacggcgcg gccggcctggcctttcccgcgcaaccccagcagcagcagcagccgccgcaccacctgccc cagcagatgcccgtgcagcacccgcggctgcaggccctgaaggaggagcctcagacagtg cccgagatgcccggcgagacaccgcccctgtcccccatcgacatggagtcccaggagcgg atcaaggcggagaggaagcgcatgaggaaccgcatcgctgcctccaagtgccgaaaaagg aagctggagagaatcgcccggctggaggaaaaagtgaaaaccttgaaagctcagaactcg gagctggcgtccacggccaacatgctcagggaacaggtggcacagcttaaacagaaagtc atgaaccacgttaacagtgggtgccaactcatgctaacgcagcagttgcaaacattttga >gi568815597r:58682078_58883070|GENSCAN_predicted_peptide_5|300_aa MMCNALKECIAHAEGSSEQDSKTGLGVTSGETGTSGRKVAEREKRRWNRVQKPGGQGPSQ NGASRKIDTCELRDLWLPPLKWQLPPFIYLSAPPGRWRALYASDFVSLTPDTEPGTQKAR LVLGPQEFSIICMQCLLMTHYHNKGHLSLCRGKLPSWEAYLNGCGLQEALGAEAQPLTHT ELSPIRLRPAQEPPTSPAPAAFLAQTSGCSLWAPGCRITHSQILPLASARIQEEGFVVSL SHLAFSLPPLSVELLKPKLNFSPISSKDQYPSLGAKPFYGPLISMNEAGAEELELARTTL >gi568815597r:58682078_58883070|GENSCAN_predicted_CDS_5|903_bp atgatgtgcaatgctcttaaggaatgcatagcacatgcagagggatccagtgagcaggac agcaaaacgggccttggtgtaacttctggagaaacagggacaagtggaaggaaggtggca gaaagggagaagaggagatggaacagggtgcagaaacctggaggccaagggcccagccag aatggagcctcacggaagatcgacacctgtgagctcagggacctgtggctgcctccatta aagtggcaactgccaccttttatatatttgtctgcaccccctggtagatggagagctctt tatgctagcgactttgtgtccctaacacccgacacggaacctgggacacagaaggctcga cttgtgcttggaccgcaggaattttctatcatctgcatgcaatgcctactgatgacacac tatcacaataaaggccacctgagtctatgcaggggaaagcttccttcgtgggaggcctat ctgaatggatgtggcctgcaggaggcccttggagcagaggctcagcctctgactcacact gagctgtcccccatccggttaaggccagcccaggagccgcccacctctcctgcacccgct gccttcttggcacagacctctggctgcagcctttgggctccaggctgcagaattactcac tcgcagattcttcctttggcatctgccaggatccaggaagagggctttgtagtctctctg tctcacttagctttttccctgcctcctttatcagttgaattgctaaagccaaagctgaat ttctctcccatttcctccaaggaccagtatccgagcttgggagccaaaccattttatggt cccctaatttctatgaatgaagctggtgctgaagaattagaactggccagaacaacactt tga >gi568815597r:58682078_58883070|GENSCAN_predicted_peptide_6|82_aa MAAPLSKKDEEQDKGISTHTTFDDDRKALFSSFYLKGKFDPGFEELNEKEYVKQDLAGRT HSTGDSPHNLHDFYHKAVGEVN >gi568815597r:58682078_58883070|GENSCAN_predicted_CDS_6|249_bp atggctgctccattaagcaagaaggatgaagagcaggacaaaggcatttccacacatact acatttgacgatgatagaaaagcattattttcttcattttacctgaaagggaagtttgac ccaggatttgaagagttaaatgaaaaagaatatgtaaagcaggacctggcaggcaggacc cactcaacaggagatagccctcataatctccatgacttttaccacaaagctgttggagaa gtaaattag >gi568815597r:58682078_58883070|GENSCAN_predicted_peptide_7|90_aa XIYSASVHQEAAGAAGRCGGKNTDANKSLSTGSGSVGKDAPVDQLNTLQETRSGSAFGRL SSHPLHPPPAAHTPASGPFLSWELLRLRSV >gi568815597r:58682078_58883070|GENSCAN_predicted_CDS_7|273_bp nncatttattcagcaagtgttcaccaagaggctgccggggcagctgggcgctgtggcgga aagaacacagatgctaataaaagcctgagcacaggatcaggctctgtgggtaaagatgct ccagtggaccagctgaacactttacaggagactcgttctggctctgcatttggacggctg agctcccacccacttcatcccccacccgcagcacacacaccagcttctgggccatttctg agttgggagcttctcagattacgctcagtgtga