GENSCAN 1.0 Date run: 3-Nov-116 Time: 16:14:09 Sequence gi568815596f:20347466_20548053 : 200588 bp : 47.26% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Term + 2798 3329 532 2 1 86 53 148 0.364 4.82 1.02 PlyA + 6270 6275 6 1.05 2.00 Prom + 7837 7876 40 -5.06 2.01 Init + 13056 13239 184 2 1 83 67 77 0.470 4.38 2.02 Intr + 20448 20555 108 1 0 88 63 29 0.167 0.66 2.03 Intr + 22826 22915 90 0 0 86 61 40 0.035 1.07 2.04 Intr + 24342 24503 162 1 0 35 63 99 0.049 2.05 2.05 Intr + 39241 39452 212 2 2 34 3 185 0.017 3.23 2.06 Intr + 43117 43295 179 0 2 55 70 101 0.021 3.72 2.07 Intr + 43675 43763 89 1 2 23 121 31 0.047 -0.49 2.08 Term + 48903 49093 191 1 2 95 37 141 0.550 7.31 2.09 PlyA + 49752 49757 6 1.05 3.05 PlyA - 50128 50123 6 1.05 3.04 Term - 51993 51973 21 0 0 104 44 21 0.010 -2.39 3.03 Intr - 64177 64077 101 2 2 76 88 74 0.775 6.03 3.02 Intr - 67366 67180 187 1 1 37 67 81 0.042 0.16 3.01 Init - 75053 74709 345 2 0 56 86 144 0.220 8.22 3.00 Prom - 89979 89940 40 -5.86 4.00 Prom + 99570 99609 40 -2.46 4.01 Sngl + 100001 100591 591 1 0 69 55 1572 0.975 148.19 4.02 PlyA + 101863 101868 6 1.05 5.00 Prom + 110670 110709 40 -5.96 5.01 Init + 112180 112325 146 0 2 64 98 87 0.087 6.89 5.02 Intr + 115293 115653 361 0 1 53 63 110 0.062 0.22 5.03 Term + 121015 121227 213 0 0 30 49 196 0.690 7.13 5.04 PlyA + 121584 121589 6 1.05 6.00 Prom + 122941 122980 40 -4.66 6.01 Init + 132437 132520 84 1 0 61 29 151 0.052 5.45 6.02 Intr + 143135 143283 149 1 2 58 87 55 0.249 1.43 6.03 Intr + 145022 145095 74 2 2 79 75 29 0.075 -0.25 6.04 Intr + 168063 168171 109 1 1 84 109 43 0.477 5.34 6.05 Intr + 169208 169267 60 2 0 103 50 60 0.408 1.55 6.06 Intr + 174316 174416 101 1 2 59 53 69 0.114 0.25 6.07 Term + 178139 178299 161 0 2 114 49 67 0.594 3.50 6.08 PlyA + 179059 179064 6 -0.45 7.04 PlyA - 181951 181946 6 1.05 7.03 Term - 182496 182002 495 0 0 33 42 258 0.078 10.27 7.02 Intr - 191246 191141 106 1 1 70 57 73 0.145 2.62 7.01 Init - 193873 193722 152 1 2 111 71 24 0.263 2.53 7.00 Prom - 198569 198530 40 -2.16 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 43568 43763 196 1 1 92 121 54 0.862 8.52 S.002 Sngl - 182511 182002 510 0 0 82 42 238 0.886 14.76 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596f:20347466_20548053|GENSCAN_predicted_peptide_1|177_aa XADSSVSNLQRAESEGEPGRRPTPPRRRPRRGKEAGKRPARRPLPRTRTPPSGTPPPALC ERAPGRTGPGTQQALSALPLAIGSRRGRRTKPGEPSSTAPKDRRKSERGLTYRAAAAALP QPGTPTVGHTAASLLAVLPLLRLRWRQCLLSPPTTLPPHPTSSFSSPSSSEPPKYRG >gi568815596f:20347466_20548053|GENSCAN_predicted_CDS_1|534_bp nnggccgacagctcggtctctaacctccagcgcgccgaatcggagggagaaccgggtcgg cgccccacgcccccgaggcggcggccccgcagagggaaggaagcgggaaagcggccggcg cggcgccccctcccccgcacgcgcacacccccttccggcacccctccccctgcattgtgc gagcgggccccaggccgcaccggcccgggcactcagcaggccctatccgccctcccgctg gcgatcggctcccgccgaggccgccgcacaaagcctggggagccatcctcgacggcgcca aaggaccggagaaagagcgaacgcggactgacttacagggctgctgcggccgcgctgcct cagccggggacaccgaccgttgggcacacggcggcgtcgctcttggcggtcctccccctc ctccgccttcggtggcggcaatgtcttctttctccacctaccaccctccccccccacccc acctcctccttctcctccccctcctcctccgaaccaccgaagtaccgagggtga >gi568815596f:20347466_20548053|GENSCAN_predicted_peptide_2|404_aa MGLSYFHELQLIVLTKSALEDGKLGGLLFFLAYPVIAAKSVKSPSSVLESSKSSEAASVS YWYTVPSSSFHVLALSACSFSRCIVQAVSESTILGSGGSCFPNNWVMCLSLFSSVIRLIL MVDNRVPGCTGSVVLASASGKGPKKLTVMAEGGEETGMSYGEREQERVRTRSQTPLNNQL STFLVIFTYLLVARLATIASVSLSFAEHAMVPFYPGCASLPPAVLKNVAVTSILLLMVAN CWSLKLATMLTNPTRQCDWQGRAAQRGFLGEESGAKPPRADAACGNTAMWSSMNNPDTKK GKACLRHKEICHNAVFECTVWYRAQTAAKEKALQMRAWLSQGSHLHSRHHTPGLSLLVLA PVINHPQTEFLNIFLFLLSGFLVYFPLVHFQCQPRCLQLAALHL >gi568815596f:20347466_20548053|GENSCAN_predicted_CDS_2|1215_bp atgggtctttcctacttccatgagcttcagcttattgtgttgactaagagtgccctggaa gatggcaagctaggaggactgcttttcttcctggcttatcctgtcatagcagccaagtct gttaagagtcctagctcagtgctggaatcaagcaagagctctgaagcagcctccgtaagc tactggtacacggtcccttctagctcctttcatgtgctcgcattgagtgcctgcagcttt tccaggtgtatagtgcaagctgtcagtgaatctaccattctggggtctggaggctcctgc ttccctaacaactgggtcatgtgtctctcactgttcagctcagtcatccggctcattctc atggttgataacagagttccaggctgtacaggaagtgtggtgctggcatctgcttctggc aagggccccaagaagcttacagtcatggcagaaggtggagaggaaacaggcatgtcatat ggcgaaagggagcaagagagagtgaggacgaggagccagactcctttaaacaaccagctc tcaaccttcctggtcatcttcacatatttgctggtggccagactagctaccattgcttct gtttctctgagctttgctgagcatgcaatggttcccttttaccctggctgtgcctcactg ccccctgctgtgctcaagaatgtggctgtcaccagcatcctcctgctgatggtggctaac tgctggagcttgaagctggccaccatgctgacaaatcccaccaggcagtgtgactggcaa ggcagagcagcccaacgaggcttcctgggagaggagtctggagccaagcctccaagggca gatgcagcctgtggcaacacagctatgtggagcagcatgaacaatccagacaccaaaaag ggcaaggcatgtctgcgccacaaggaaatatgtcacaacgctgtgttcgaatgcacagtt tggtaccgtgcccaaacagcagcgaaggaaaaagccttgcaaatgcgtgcctggctttcc caaggttcccaccttcattcccgccatcacactcctggcttatctctactggtgctggcg cccgtcatcaaccatccccagacagagttcctaaacatcttcctcttcctgctcagtggc ttcctggtgtacttcccacttgtccacttccagtgccagcccaggtgtttgcagctggcc gctctacatctctag >gi568815596f:20347466_20548053|GENSCAN_predicted_peptide_3|217_aa MEVGADSKGLALSSEYLGAIMDQGLTPILGKFFKNSSVQCNLDSPVWDGSIVPQIWEVLG HFPGPEVPCPHNPSESWGSAIQASLSRSPAFPNSHMCNGNAASQKQGPLALCLHRSTQMK RNQKNNSGNMKKQGSLAPTKDQTSSPAMDPNQDEISELPEKEFRMLIIKLIKKAPKKEKV VDTVLAPVWPMQLLHYVILELGLKLEYYAAGACVHDS >gi568815596f:20347466_20548053|GENSCAN_predicted_CDS_3|654_bp atggaagttggtgctgacagcaaggggctggctctatcctctgagtatctgggagccatc atggatcaaggtctcactcccatattaggaaaattctttaaaaattcctccgtgcagtgc aatcttgattctccagtgtgggatgggagcattgtgccccagatatgggaagtcctgggc cacttccctggaccagaagtgccgtgtccccacaacccatcagagagctggggaagtgcc atccaggcgagcctgagcaggtccccggcctttcccaattcccacatgtgcaatgggaat gcagccagtcagaagcaagggcctctggctctgtgcctccacaggtcaacccaaatgaaa aggaaccagaaaaacaattctggtaatatgaaaaaacaaggttctttagcacccacaaaa gatcaaaccagctcaccagcaatggatccaaaccaagatgaaatatctgaattgccagaa aaagaattcagaatgttgattattaagctaatcaagaaggcaccaaagaaagagaaggta gttgatactgtgctcgcccccgtgtggcccatgcagctactgcactatgtcatcttggaa ctgggactaaagctggagtattatgctgcaggcgcatgtgtccatgacagctag >gi568815596f:20347466_20548053|GENSCAN_predicted_peptide_4|196_aa MAAIRKKLVVVGDGACGKTCLLIVFSKDEFPEVYVPTVFENYVADIEVDGKQVELALWDT AGQEDYDRLRPLSYPDTDVILMCFSVDSPDSLENIPEKWVPEVKHFCPNVPIILVANKKD LRSDEHVRTELARMKQEPVRTDDGRAMAVRIQAYDYLECSAKTKEGVREVFETATRAALQ KRYGSQNGCINCCKVL >gi568815596f:20347466_20548053|GENSCAN_predicted_CDS_4|591_bp atggcggccatccgcaagaagctggtggtggtgggcgacggcgcgtgtggcaagacgtgc ctgctgatcgtgttcagtaaggacgagttccccgaggtgtacgtgcccaccgtcttcgag aactatgtggccgacattgaggtggacggcaagcaggtggagctggcgctgtgggacacg gcgggccaggaggactacgaccgcctgcggccgctctcctacccggacaccgacgtcatt ctcatgtgcttctcggtggacagcccggactcgctggagaacatccccgagaagtgggtc cccgaggtgaagcacttctgtcccaatgtgcccatcatcctggtggccaacaaaaaagac ctgcgcagcgacgagcatgtccgcacagagctggcccgcatgaagcaggaacccgtgcgc acggatgacggccgcgccatggccgtgcgcatccaagcctacgactacctcgagtgctct gccaagaccaaggaaggcgtgcgcgaggtcttcgagacggccacgcgcgccgcgctgcag aagcgctacggctcccagaacggctgcatcaactgctgcaaggtgctatga >gi568815596f:20347466_20548053|GENSCAN_predicted_peptide_5|239_aa MRKSQHKKAKNSKNQNTSSLPKDHNSSPAREQNWMENEFDELTEVGSRSIGSSGQDNQAR EKIKDIQIGREEVKLSLFADDMIVYLENPIVSAPNLLKLISNFSQVSKYKINVRKSQAFL YTNNRQTESQIMSELPFTIATKRIKYLGIQLTRDVRDLFKENYEPLLKEAELRRKFKRQI LALKKEEHYRYQDIPKAHESEALSYLSLIMFIPQLDDLDSDVLEGIQVSDGQGAQGKTC >gi568815596f:20347466_20548053|GENSCAN_predicted_CDS_5|720_bp atgaggaaaagccagcacaaaaaggctaaaaattccaaaaaccagaacacctcttctctt ccaaaggatcacaactcctcaccagcaagggaacaaaactggatggagaatgagtttgat gaattgacagaagtaggctccagaagtattggaagttctggtcaggacaatcaggcaaga gaaaaaataaaggatattcaaataggaagagaggaagtcaaattgtctctgtttgcagat gatatgattgtatatttggaaaaccccatcgtctcagccccaaatctccttaagctgata agcaacttcagccaagtctcaaaatataaaatcaatgtgcgaaaatcacaagcattccta tacaccaataatagacaaacagagagccaaatcatgagtgaactcccattcacaattgct acaaagagaataaaatacctaggaatacaacttacaagggatgtgagggacctcttcaag gagaactacgaaccactgctcaaggaagcagaactgagaaggaagttcaagaggcagatt ttggctctaaagaaggaagagcattatcgctatcaggatattccgaaagctcacgagtca gaggcccttagctacctgtcgttgatcatgttcataccacagctggatgacctggactca gatgttctggagggaattcaagtatcagatgggcagggtgctcaaggcaagacctgctga >gi568815596f:20347466_20548053|GENSCAN_predicted_peptide_6|245_aa MEGLNLQLWRLLLLLLLLASLGTAALPKTPGTETPGELTLRTLQASGLMLASRHVGDRMV NGDGLHPLSRRAKPGGPRPVALGTLAADRTSRVSLTPHGEEQVNPNSSMRAQQPESPRVG HPVMARLRGHPTVREEKSSWLKWSVELRGILDVPPGMSSLSSTCLLEQTAVEKFISSLSS FQARLYPFDTGPGEAWEPGTGKPLQGTARHCVAQQIWTQGLRGQPDTPLSMRAKEASESS VRGHS >gi568815596f:20347466_20548053|GENSCAN_predicted_CDS_6|738_bp atggaaggtcttaatctccagttatggagactgctgctgctgctgctgctgctggcctcc ctggggacagctgccttacccaagactcctggtacagagacccctggggagctcacactg cgcaccctgcaggcatcagggctgatgctagcttccaggcatgttggagaccggatggtg aatggcgatgggctgcatcctctctccaggagggctaagcctgggggcccgaggcctgtg gctctgggcacactcgctgcagacagaacttcccgagtcagtctcacccctcatggtgag gagcaagtgaatcctaacagcagcatgagagcacagcagccagaatccccccgggttgga caccctgtgatggcaaggttaaggggtcacccgacagtcagagaggaaaagagcagctgg ctgaagtggtctgtggagctgcggggcatcttagatgtgcctcctggcatgagcagcctc agttccacctgcttgctggagcaaacggcagtggaaaagttcatcagttctcttagctcc ttccaggctcgtctctacccatttgacaccggccctggggaagcatgggagccagggaca ggaaagcctctgcagggcacagccaggcactgtgtcgcccagcagatctggacacaaggg ctgcggggccaaccagacactccactgagcatgcgggccaaggaggcctctgagagctca gtgagaggacacagctag >gi568815596f:20347466_20548053|GENSCAN_predicted_peptide_7|250_aa MGKRRNGHPFLGERFRDHAVRRVFDVSPSPQTSAVGLPASARKVITSFRLKGSLQPPSGK LPPWGQSLLPPVLPGVKDVLLLKGEQDLFRALWRSLSREVKEHVGTDQFGNKYYYILQNK NWRGQTIQEKRIVEAANKKEVDYETGDIPTEWEAWIRRTRKTPPTMEEILKNEKHREEIK IKSQDFYEKEKLLSKETSEELLPPPVQTQIKGRASAPYFGKEKPSVAPSSTGKTFQPGSW MPRDGKSHNE >gi568815596f:20347466_20548053|GENSCAN_predicted_CDS_7|753_bp atggggaagaggcgcaatggtcatcccttcctaggagagcgtttcagagatcatgctgtg cggcgggtgtttgatgtctcaccatcccctcagaccagtgcggttgggctaccagcatct gcccgcaaggtgatcactagtttcaggcttaaggggtccctgcagccaccatcaggaaag ctcccaccctggggacagtctctcctgcctcctgttcttcctggggtcaaagacgtgctc ctgctcaagggtgagcaggatttgttccgcgccttgtggagatcgctgtcaagggaagtg aaggagcacgtgggcacggaccaattcgggaacaaatactactacatcctgcagaacaag aactggagaggacaaactattcaagagaaaagaattgtagaagcagcaaataaaaaagaa gtagactatgaaacaggggatattccaacagaatgggaagcttggattagaagaacaaga aagactccacctactatggaggaaatactaaagaatgaaaaacacagagaagaaatcaaa ataaaaagccaagatttttatgaaaaagaaaaactccttagtaaagagaccagtgaggaa ctcctgcctccaccagttcaaactcaaattaaaggccgtgcctctgctccatacttcgga aaggaaaaaccctcagtggctcccagcagcactggtaaaacctttcagccaggatcctgg atgccacgagatggcaagagccacaatgaatga