GENSCAN 1.0	Date run: 8-Nov-116	Time: 12:45:17

Sequence gi568815583r:89110288_89318694 : 208407 bp : 43.19% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   6035   6234  200  1  2   90   70   189 0.294  15.45
 1.02 Intr +  15631  15662   32  1  2   42   58    34 0.107  -6.43
 1.03 Intr +  15822  15922  101  1  2   52  127   101 0.516  10.23
 1.04 Term +  16063  16164  102  0  0  117   43    24 0.457  -0.92
 1.05 PlyA +  16347  16352    6                               1.05

 2.03 PlyA -  16604  16599    6                               1.05
 2.02 Term -  25382  25256  127  1  1  -80   38   266 0.373   2.56
 2.01 Init -  25490  25423   68  2  2   68   69    87 0.222   5.34
 2.00 Prom -  27487  27448   40                              -7.46

 3.00 Prom +  31042  31081   40                              -0.36
 3.01 Init +  39839  39849   11  1  2   72  102     6 0.666   0.02
 3.02 Intr +  41390  41565  176  2  2   83   83   128 0.820  11.38
 3.03 Intr +  45080  45247  168  0  0   54  103   234 0.782  21.42
 3.04 Intr +  65525  65708  184  0  1  107  105   211 0.619  23.55
 3.05 Intr +  72102  72222  121  0  1   99   72    39 0.923   3.90
 3.06 Intr +  73468  73549   82  0  1  101   90     8 0.789   1.51
 3.07 Intr +  75109  75229  121  2  1   62   96   211 0.765  18.85
 3.08 Intr +  77906  78016  111  2  0   91   85   125 0.999  11.99
 3.09 Intr +  80793  80862   70  0  1  108   96    99 0.980  11.88
 3.10 Intr +  82948  83032   85  0  1   96  111    24 0.999   4.89
 3.11 Intr +  84940  85132  193  2  1  102   36   286 0.637  23.35
 3.12 Intr +  85977  86145  169  0  1   63   76   128 0.560   9.05
 3.13 Term +  88592  88699  108  1  0   49   55    61 0.178  -2.59
 3.14 PlyA +  89894  89899    6                              -1.75

 4.02 PlyA -  90316  90311    6                               1.05
 4.01 Sngl -  91599  91207  393  0  0   65   41   356 0.642  22.84
 4.00 Prom -  98301  98262   40                              -5.46

 5.11 PlyA -  99607  99602    6                               1.05
 5.10 Term - 100156  99998  159  1  0  155   48   299 0.999  30.54
 5.09 Intr - 100522 100412  111  1  0   87   96   120 0.998  13.28
 5.08 Intr - 101614 101456  159  1  0   48   94   229 0.468  19.68
 5.07 Intr - 104951 104773  179  0  2  123   88   320 0.775  35.14
 5.06 Intr - 107037 106833  205  0  1   93   78   467 0.792  44.97
 5.05 Intr - 108406 108278  129  1  0  110  100   163 0.998  20.59
 5.04 Intr - 108769 108677   93  1  0   68   98    20 0.518   1.16
 5.03 Intr - 111319 111248   72  1  0   45  111    89 0.728   6.50
 5.02 Intr - 129548 129510   39  2  0   81  123    16 0.213   2.82
 5.01 Init - 131071 131069    3  1  0  113   81     0 0.167   1.80
 5.00 Prom - 131755 131716   40                              -4.06

 6.00 Prom + 139935 139974   40                              -3.36
 6.01 Init + 144474 144527   54  2  0   53  115     2 0.670   0.68
 6.02 Intr + 148417 148489   73  0  1  105   85    42 0.937   4.58
 6.03 Intr + 151314 151454  141  1  0   59   86    74 0.923   4.62
 6.04 Intr + 151534 151591   58  2  1   85  109     8 0.960   0.54
 6.05 Intr + 153616 153739  124  1  1   73   82   100 0.927   8.49
 6.06 Intr + 154235 154320   86  1  2   85   72    44 0.992   1.12
 6.07 Intr + 158112 158238  127  0  1   77   64   134 0.665  10.58
 6.08 Intr + 163090 163182   93  0  0  126   68    -4 0.446   1.56
 6.09 Intr + 163899 164017  119  2  2   43   97    26 0.192  -1.74
 6.10 Intr + 166424 166604  181  2  1   71  100    30 0.522   2.37
 6.11 Intr + 170984 171013   30  1  0  120  111     0 0.400   3.83
 6.12 Intr + 174809 174931  123  1  0  102  107    31 0.922   7.18
 6.13 Intr + 179926 179994   69  0  0   83   98    39 0.862   3.78
 6.14 Intr + 182401 182577  177  0  0   94   51    88 0.515   5.82
 6.15 Intr + 182655 182776  122  2  2   98  110   -21 0.995   0.39
 6.16 Intr + 183546 183710  165  0  0   75  110    19 0.779   1.88
 6.17 Intr + 184628 184807  180  2  0  130  111   107 0.978  16.18
 6.18 Intr + 189513 189679  167  0  2   79   94    30 0.995   2.30
 6.19 Intr + 190013 190098   86  0  2   33   94    94 0.824   4.04
 6.20 Intr + 191039 191155  117  1  0   86   79    36 0.766   3.16
 6.21 Intr + 195054 195122   69  2  0  114   75    16 0.825   2.28
 6.22 Intr + 195318 195411   94  2  1   67  110    42 0.972   3.84
 6.23 Intr + 195720 195907  188  1  2   67   73   123 0.949   8.11
 6.24 Intr + 202617 202685   69  2  0   40  103    69 0.781   2.98
 6.25 Intr + 204325 204420   96  0  0   99   92    87 0.999  10.41
 6.26 Term + 204995 205144  150  1  0   45   37   147 0.686   3.21
 6.27 PlyA + 205332 205337    6                              -3.64

 7.04 PlyA - 206053 206048    6                               1.05
 7.03 Term - 206540 206464   77  0  2   77   42    74 0.851  -0.30
 7.02 Intr - 207249 207089  161  2  2   71   84   113 0.767   8.83
 7.01 Init - 208390 208254  137  1  2   92   85   314 0.589  31.11

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815583r:89110288_89318694|GENSCAN_predicted_peptide_1|144_aa
IKMNAMLETPELPAVFDGVKLAAVAAVLYVIVRCLNLKSPTAPPDLYFQDSGLSRFLLKS
CPLLTKENCRSLKDLKKGPYYVSKKGVYVFQKEVYATTDSITWYQIQYMWTDLHGIRSTF
ILLLNVWLTENCSTIREQEMDLVF

>gi568815583r:89110288_89318694|GENSCAN_predicted_CDS_1|435_bp
atcaagatgaatgccatgctggagactcccgaactcccagccgtgtttgatggagtgaag
ctggctgcagtggctgctgtgctgtacgtgatcgtccggtgtttgaacctgaagagcccc
acagccccacctgacctctacttccaggactcggggctctcacgctttctgctcaagtcc
tgtcctcttctgaccaaagagaattgccgaagcctgaaagatctaaaaaaagggccctac
tatgtctccaaaaaaggagtatatgtcttccaaaaggaagtatatgctacaactgattcc
atcacgtggtatcaaattcagtacatgtggacggacttacatggaatcagatctactttt
atactccttctcaatgtgtggttgactgaaaattgctccactatccgggagcaggaaatg
gaccttgttttctag

>gi568815583r:89110288_89318694|GENSCAN_predicted_peptide_2|64_aa
MWNNIAADDRQPYKKKAAKLKENLLQQKKGVVKAEKSKKKKEEEEDEEDEEDKNEEEEDE
DDDE

>gi568815583r:89110288_89318694|GENSCAN_predicted_CDS_2|195_bp
atgtggaataacattgctgcagatgacaggcagccttataaaaagaaggctgcgaagctg
aaggaaaacctgctgcagcaaaaaaagggagttgtcaaggctgaaaaaagcaagaaaaag
aaggaagaggaggaagatgaggaagatgaagaggataagaatgaggaggaggaagatgaa
gatgatgatgaataa

>gi568815583r:89110288_89318694|GENSCAN_predicted_peptide_3|532_aa
MIRRYIPPLIWGKSGHIQTALYGKMGRVRSPHPYGHRKFITMSDGATSTFDLFEPLAEHC
VGDDITMVICPGIANHSEKQYIRTFVDYAQKNGYRCAVLNHLGALPNIELTSPRMFTYGC
TWEFGAMVNYIKKTYPLTQLVVVGFSLGGNIVCKYLGETQANQEKVLCCVSVCQGYSALS
QFILVDQYLAQPGEQFLPPLLSSGEHSYLEASQELGYSGQPVCNEGGTNKSKNIPSVRRS
AKGEMNQVTLTRCGSSRAQETFMQWDQCRRFYNFLMADNMKKIILSHRQALFGDHVKKPQ
SLEDTDLSRLYTATSLMQIDDNVMRKFHGYNSLKEYYEEESCMRYLHRIYVPLMLVNAAD
DPLVHESLLTIPKSLSEKRENVMFVLPLHGGHLGFFEGSVLFPEPLTWMDKLVVEYANAI
CQWERNKLQCSDTEQVEADLEAARPAMELPSCDHSVMSQKGSGWAEHLGCALALLFTLDK
VAVDFNFFTSKMGDLDQPCVFDWLLFRVPVVIVRPLLFGGWSCDCGSCWPLE

>gi568815583r:89110288_89318694|GENSCAN_predicted_CDS_3|1599_bp
atgatcagaagatacattccaccgttgatctgggggaaaagtggacacatccagacagcc
ttgtatgggaagatgggaagggtgaggtcgccacatccttatgggcaccggaagttcatc
actatgtctgatggagccacttctacattcgacctcttcgagcccttggctgagcactgt
gttggagatgatatcaccatggtcatctgccctggaattgccaatcacagcgagaagcaa
tacatccgcactttcgttgactacgcccagaaaaatggctatcggtgcgccgtgctgaac
cacctgggtgccctgcccaacattgaattgacctcgccacgcatgttcacctatggctgc
acgtgggaatttggagccatggtgaactacatcaagaagacatatcccctgacccagctg
gtcgtcgtgggcttcagcctgggtggtaacattgtgtgcaaatacttgggggagactcag
gcaaaccaagagaaggtcctgtgctgcgtcagcgtgtgccaggggtacagtgcactgagc
cagtttatcctggtggaccaatatctggcccagcctggggagcagttcctgccacccctt
ctctcttctggagaacatagctatctagaggcaagtcaagaacttggatattcaggccag
cctgtctgcaatgaaggaggtactaacaaaagcaagaacatcccttcagttaggaggtca
gcaaagggagaaatgaaccaggtcaccctcactcgctgtggttcttccagggcccaggaa
accttcatgcaatgggatcagtgccggcggttctacaacttcctcatggctgacaacatg
aagaagatcatcctctcgcacaggcaagctctttttggagaccatgttaagaaaccccag
agcctggaagacacggacttgagccggctctacacagcaacatccctgatgcagattgat
gacaatgtgatgaggaagtttcacggctataactccctgaaggaatactatgaggaagaa
agttgcatgcggtacctgcacaggatttatgttcctctcatgctggttaatgcagctgac
gatccgttggtgcatgaaagtcttctaaccattccaaaatctctttcagagaaacgagag
aacgtcatgtttgtgctgcctctgcatgggggccacttgggcttctttgagggctctgtg
ctgttccccgagcccctgacatggatggataagctggtggtggagtacgccaacgccatt
tgccaatgggagcgtaacaagttgcagtgctctgacacggagcaggtggaggccgacctg
gaagcagcaaggccggctatggagctgccgtcgtgtgaccacagtgtgatgtctcagaag
ggctctgggtgggctgagcatctgggctgtgccctggctctgcttttcaccctggacaaa
gtcgctgtggacttcaatttcttcacctctaaaatgggggacttggaccagccctgtgtc
tttgactggcttctcttcagagtccccgttgtcatcgtaagacccttgctgtttggaggg
tggtcttgtgactgtggcagctgctggccgctggaatga

>gi568815583r:89110288_89318694|GENSCAN_predicted_peptide_4|130_aa
MRTRSPSPLAIVPRPQRASRPLLCAVSPMASASGATAKHEQILVLDPPIDLKFKGPFTDV
VTTNLKLRNPSDRKVCFKVKTTVPHRYCVRPNSGIIDPGSTVTVSVMLQPFDYDPNEKSK
HKFMVQFLLH

>gi568815583r:89110288_89318694|GENSCAN_predicted_CDS_4|393_bp
atgcgaactcgctccccctcccctctcgccatcgtcccccgcccccagcgagcaagccgc
cccctgctctgcgctgtctctccaatggcgtctgcctccggggccacggcgaagcacgag
cagatcctggtcctcgacccgcccatagacctcaaattcaaaggccccttcacagatgta
gtcactacaaatcttaaattgcgaaatccatcggatagaaaagtgtgtttcaaagtgaag
actaccgtgcctcaccggtactgtgtgaggcccaacagtggaattattgacccagggtca
actgtgactgtttcagtaatgctacagccctttgactatgatccgaatgaaaagagtaaa
cacaagtttatggtacaatttttgctccactaa

>gi568815583r:89110288_89318694|GENSCAN_predicted_peptide_5|382_aa
MRCWVRVSTTELVSRREPLELQDIQVPGSPKEELPTWQGRGRLERPGQRKQIPSFSKDSV
SSIGNMSEGVGTFRMVPEEEQELRAQLEQLTTKDHGPVFGPCSQLPRHTLQKAKDELNER
EETREEAVRELQEMVQAQAASGEELAVAVAERVQEKDSGFFLRFIRARKFNVGRAYELLR
GYVNFRLQYPELFDSLSPEAVRCTIEAGYPGVLSSRDKYGRVVMLFNIENWQSQEITFDE
ILQAYCFILEKLLENEETQINGFCIIENFKGFTMQQAASLRTSDLRKMVDMLQDSFPARF
KAIHFIHQPWYFTTTYNVVKPFLKSKLLERVFVHGDDLSGFYQEIDENILPSDFGGTLPK
YDGKAVAEQLFGPQAQAENTAF

>gi568815583r:89110288_89318694|GENSCAN_predicted_CDS_5|1149_bp
atgcgctgctgggttagggtctccacgaccgagctggtctcgcggagggaacctctagag
ctccaggacattcaggtaccaggtagccccaaggaggagctgccgacctggcagggaagg
ggccggctggagaggccaggacagagaaagcagatcccttctttttccaaggactctgtg
tcttccataggcaacatgtcagaaggggtgggcacgttccgcatggtacctgaagaggaa
caggagctccgtgcccaactggagcagctcacaaccaaggaccatggacctgtctttggc
ccgtgcagccagctgccccgccacaccttgcagaaggccaaggatgagctgaacgagaga
gaggagacccgggaggaggcagtgcgagagctgcaggagatggtgcaggcgcaggcggcc
tcgggggaggagctggcggtggccgtggcggagagggtgcaagagaaggacagcggcttc
ttcctgcgcttcatccgcgcacggaagttcaacgtgggccgtgcctatgagctgctcaga
ggctatgtgaatttccggctgcagtaccctgagctctttgacagcctgtccccagaggct
gtccgctgcaccattgaagctggctaccctggtgtcctctctagtcgggacaagtatggc
cgagtggtcatgctcttcaacattgagaactggcaaagtcaagaaatcacctttgatgag
atcttgcaggcatattgcttcatcctggagaagctgctggagaatgaggaaactcaaatc
aatggcttctgcatcattgagaacttcaagggctttaccatgcagcaggctgctagtctc
cggacttcagatctcaggaagatggtggacatgctccaggattccttcccagcccggttc
aaagccatccacttcatccaccagccatggtacttcaccacgacctacaatgtggtcaag
cccttcttgaagagcaagctgcttgagagggtctttgtccacggggatgacctttctggt
ttctaccaggagatcgatgagaacatcctgccctctgacttcgggggcacgctgcccaag
tatgatggcaaggccgttgctgagcagctctttggcccccaggcccaagctgagaacaca
gccttctga

>gi568815583r:89110288_89318694|GENSCAN_predicted_peptide_6|985_aa
MTVPLQSSQGDKEDSVSKLTNLLQNQAVKGKVAGALLRAIFKGPLLVELANEFISAVREG
SLVNGKSLELLPIILTALATKKENLAYGKGVLSGEECKKQLINTLCSGRDVPLTAEEVEF
VVEKALSMFSKMNLQEIPPLVYQLLVLSSKGSRKSVLEGIIAFFSALDKQHNEEQSGDEL
LDVVTVPSGELRHVEGTIILHIVFAIKLDYELGRELVKHLKVGQQGDSNNNLSPFSIALL
LSVTRIQRFQDQTSVVKSFKDLQLLQGSKFLQNLVPHRSYVSTMILEVVKNSVHSWDHVT
QGLVELGFILMDSYGPKKVLDGKTIETSPSLSRMPNQHACKLGANILLETFKTVQRLLKA
VQVHVDVHSHYNSVANETFCLEIMDSLRRCLSQQADVRLMLYEGFYDVLRRNSQLANSVM
QTLLSQDYLLCCIQHCLAWYKNTVIPLQQGEEEEEEEEAFYEDLDDILESITNRMIKSEL
EDFELDKSADFSQSTSIGIKNNICAFLVMGVCEVLIEYNFSISSFSKNRFEDILSLFMCY
KKLSDILNEKAGKAKTKMANKTSDSLLSMKFVSSLLTALFRDSIQSHQESLSVLRSSNEF
MRYAVNVALQKVQQLKETGHVSGPDGQNPEKIFQNLCDITRVLLWRYTSIPTSVEESGKK
EKGKSISLLCLEGLQKIFSAVQQFYQPKIQQFLRALDVTDKEGEEREDADVSVTQRTAFQ
IRQFQRSLLNLLSSQEEDFNSKEALLLVTVLTSLSKLLEPSSPQDVEVEKTNHFAIVNLR
TAAPTVCLLVLSQAEKVLEEVDWLITKLKGQVSQETLSEEASSQATLPNQPVEKAIIMQL
GTLLTFFHELVQTALPSGSCVDTLLKDLCKMYTTLTALVRYNKSKSLNYTGEKKEKPAAV
ATAMARVLRETKPIPNLIFAIEQYEKFLIHLSKKSKVNLMQHMKLSTSRDFKIKGNILDM
VLREDGEDENEEVSAGFCLEPSHSS

>gi568815583r:89110288_89318694|GENSCAN_predicted_CDS_6|2958_bp
atgactgtgccactgcagtccagccagggtgacaaagaagactctgtctctaagttgact
aatctccttcagaatcaagcagtgaaaggaaaagttgctggagcactcctgagagccatc
ttcaaaggaccattattggttgaattagccaatgagtttattagtgctgtcagagaaggc
agcctagtgaatggaaaatctttggagttactacctatcattctcactgccctggctacg
aaaaaggaaaatctggcttatggaaaaggtgtactgagtggggaagaatgtaagaaacag
ttgattaacaccctgtgttctggcagggatgtccctctgactgcagaagaggtggaattt
gtggtggaaaaagcattgagcatgttctccaagatgaatcttcaagaaataccacctttg
gtctatcagcttctggttctctcctccaagggaagcagaaagagtgttttggaaggaatc
atagccttcttcagtgcactagataagcagcacaatgaggaacagagtggtgacgagcta
ttggatgttgtcactgtgccatcaggtgaacttcgtcatgtggaaggcaccattattcta
cacattgtgtttgccatcaaattggactatgaactaggcagagaactcgtgaaacactta
aaggtaggacagcaaggagattccaataataacttaagtcccttcagcattgctcttctt
ctgtctgtaacaagaatacaaagatttcaggaccagacttcggttgtaaagagctttaag
gatcttcaactcctccaaggctcaaaatttcttcagaatctagttcctcatagatcttat
gtttcaaccatgatcttggaagtagtgaagaatagcgttcatagctgggaccatgttact
cagggcctcgtagaacttggtttcattttgatggattcatatgggccaaagaaggttctt
gatggaaaaactattgaaaccagcccaagtctttctagaatgccaaaccagcatgcatgt
aagctcggagctaatatcctgttggaaacttttaagactgtacaaaggctgcttaaggca
gtgcaggttcatgtggatgttcacagccattacaattctgtcgccaatgaaactttttgc
cttgagatcatggatagtttgaggagatgcttaagccagcaagctgatgttcgactcatg
ctttatgaggggttttatgatgttcttcgaaggaactctcagctggctaattcagtcatg
caaactctgctctcacaggattatctgctgtgttgtattcagcattgtttggcctggtat
aagaatacagtcatacccttacagcagggagaggaggaagaggaggaggaagaggcattc
tacgaagacctagatgatatattggagtccattactaatagaatgattaagagtgagctg
gaagactttgaactggataaatcagcagatttttctcagagcaccagtattggcataaaa
aataatatctgtgcttttcttgtgatgggagtttgtgaggttttaatagaatacaatttc
tccataagtagtttcagtaagaataggtttgaggacattctgagcttatttatgtgttac
aaaaaactctctgacattcttaatgaaaaagcgggtaaagccaaaactaaaatggccaac
aagacaagtgatagtcttttgtccatgaaatttgtgtccagtcttctcactgctcttttc
agggatagtatccaaagccaccaagaaagcctttctgttctcaggtccagcaatgagttt
atgcgctatgcagtgaatgtagctctgcagaaagtacagcagctaaaggaaacagggcat
gtgagtggccctgatggccaaaacccagaaaagatctttcagaacctctgtgacataact
cgagtcttgctatggagatacacttcaattcctacttcagtggaagagtcgggaaagaaa
gagaaaggaaagagcatctcactgctgtgcttggagggtttacagaaaatattcagtgct
gtgcaacagttctatcagcccaagattcagcagtttctcagagctctggatgtcacagat
aaggaaggagaagagagagaagatgcagatgtcagtgtcactcagagaacagcattccag
atccggcaatttcagaggtccttgttgaatttacttagcagtcaagaggaagattttaat
agcaaagaagccctcctgctagtcacggttcttaccagtttgtccaagttactggagccc
tcctctcctcaggatgtagaggtggagaaaacaaaccactttgcaatagtgaatttgaga
acggctgcccccactgtctgtttacttgttctgagtcaggccgagaaggttctagaagaa
gtggactggctaatcaccaagcttaagggacaagtgagccaagaaaccttatcagaagag
gcctcttctcaggcaaccctaccaaatcagcctgttgagaaagctatcatcatgcaactg
ggaactctgcttacatttttccacgagctggtgcagacagctctgccatcaggcagctgt
gtggacaccttgttaaaggacttgtgcaaaatgtacaccacacttacagcccttgtcaga
tataataagagtaagagcctgaactatacgggagagaaaaaggagaaacctgctgccgtt
gccacagccatggccagagttcttcgggaaaccaagccaatccctaacctcatctttgcc
atagaacagtatgaaaaatttctcatccacctttctaagaagtccaaggtgaacctgatg
cagcacatgaagctcagcacctcacgagacttcaagatcaaaggaaacatcctagacatg
gttcttcgagaggatggtgaagatgaaaatgaagaggtcagtgctggcttctgtctggag
cccagccactcttcctag

>gi568815583r:89110288_89318694|GENSCAN_predicted_peptide_7|124_aa
MKWLFEEFAIDGRFCISIHDEVRYLVREEDRYRAALALQITNLLTRCMFAYKLGLNDLPQ
SVAFFSAVDIDRCLRKEVTMDCKTPSNPTGMERRYGIPQGEALDIYQIIELTKGSLEKRS
QPGP

>gi568815583r:89110288_89318694|GENSCAN_predicted_CDS_7|375_bp
atgaagtggctgtttgaagagtttgccatagatgggcgcttctgcatcagcatccatgac
gaggttcgctacctggtgcgggaggaggaccgctaccgcgctgccctggccttgcagatc
accaacctcttgaccaggtgcatgtttgcctacaagctgggtctgaatgacttgccccag
tcagtcgcctttttcagtgcagtcgatattgaccggtgcctcaggaaggaagtgaccatg
gattgtaaaaccccttccaacccaactgggatggaaaggagatacgggattccccagggt
gaagcgctggatatttaccagataattgaactcaccaaaggctccttggaaaaacgaagc
cagcctggaccatag