GENSCAN 1.0	Date run: 8-Nov-116	Time: 11:53:17

Sequence gi568815575r:30971405_31171953 : 200549 bp : 39.74% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Term +  50299  50487  189  0  0  102   47   153 0.917   9.07
 1.02 PlyA +  51490  51495    6                               1.05

 2.03 PlyA -  51850  51845    6                               1.05
 2.02 Term -  59207  59057  151  1  1  151   38    89 0.775   6.60
 2.01 Init -  61230  61175   56  0  2   70   94    26 0.770   2.21
 2.00 Prom -  64712  64673   40                              -5.55

 3.00 Prom +  64869  64908   40                              -5.75
 3.01 Init +  65053  65122   70  0  1   81   91    69 0.837   7.77
 3.02 Intr +  65811  65989  179  1  2   26   51   109 0.212  -0.28
 3.03 Intr +  78403  78501   99  0  0   66  103    52 0.040   3.89
 3.04 Term +  82365  82388   24  2  0  146   47    15 0.054   0.45
 3.05 PlyA +  82497  82502    6                               1.05

 4.04 PlyA -  83373  83368    6                               1.05
 4.03 Term -  88882  88644  239  2  2   58   42   197 0.418   7.45
 4.02 Intr -  90218  90176   43  2  1   56   93    35 0.309  -2.31
 4.01 Init -  92226  92224    3  0  0   88  101     0 0.177   1.35
 4.00 Prom -  95939  95900   40                              -4.25

 5.05 PlyA -  99492  99487    6                               1.05
 5.04 Term - 100611  99998  614  1  2   68   52   794 0.271  67.45
 5.03 Intr - 106223 106105  119  1  2   88   49    44 0.104  -0.31
 5.02 Intr - 107664 107294  371  0  2   27   50   194 0.113   2.48
 5.01 Init - 109529 108492 1038  2  0   49   41   452 0.345  30.63
 5.00 Prom - 109622 109583   40                              -6.15

 6.02 PlyA - 109791 109786    6                               1.05
 6.01 Sngl - 111035 110019 1017  2  0   83   43   788 0.988  70.57
 6.00 Prom - 120945 120906   40                              -4.25

 7.03 PlyA - 121660 121655    6                               1.05
 7.02 Term - 126158 125716  443  0  2   41   44   263 0.505  11.63
 7.01 Init - 136540 136462   79  1  1   92   14   124 0.313   6.76
 7.00 Prom - 150084 150045   40                              -4.95

 8.00 Prom + 152789 152828   40                              -3.85
 8.01 Init + 155798 155847   50  1  2   61  107    40 0.492   3.67
 8.02 Term + 155919 156051  133  0  1   66   32   104 0.575  -0.82
 8.03 PlyA + 157541 157546    6                               1.05

 9.07 PlyA - 157580 157575    6                               1.05
 9.06 Term - 158190 157987  204  0  0   81   49    73 0.354  -0.91
 9.05 Intr - 161413 161250  164  2  2   96   75    90 0.577   7.27
 9.04 Intr - 162790 162698   93  2  0   66   99    97 0.977   7.62
 9.03 Intr - 174950 174887   64  2  1   80  106    89 0.533   7.27
 9.02 Intr - 176114 175871  244  1  1   58   61   358 0.998  26.68
 9.01 Intr - 198197 198039  159  1  0   70  109    72 0.126   5.68

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815575r:30971405_31171953|GENSCAN_predicted_peptide_1|62_aa
EIALPIALMKQQEFPEGNALLTSIITISSCGASGSMKVTHPPVRVCSRTAKAAPSSRSST
LS
>gi568815575r:30971405_31171953|GENSCAN_predicted_CDS_1|189_bp
gaaattgcactgccaatagccctaatgaagcagcaggagtttcctgaggggaatgcattg
ctgacaagtataattactattagcagctgtggggcctctggctccatgaaagtaacccat
ccacctgtcagagtgtgcagcaggactgctaaagctgcacctagcagccggtccagcact
ctaagctag
>gi568815575r:30971405_31171953|GENSCAN_predicted_peptide_2|68_aa
MPASQGWISNCSLMTGAQRSDPSTSHGNNMHCIKDSILISGQWKAQCEDGETPWNIAPED
LGSNPNAH
>gi568815575r:30971405_31171953|GENSCAN_predicted_CDS_2|207_bp
atgcctgcttcccagggatggatatcaaactgttccttgatgactggagcccagaggtct
gatccctccaccagccatggtaacaatatgcactgcattaaggactctatattgatctca
gggcaatggaaagcacagtgtgaagatggggaaactccatggaatattgcaccagaagac
ttaggttcaaatcccaatgcccattag
>gi568815575r:30971405_31171953|GENSCAN_predicted_peptide_3|123_aa
MAAFDEAIYLGASRRVLKVCAREGSQAQEEGKPQNARVFKPLLESCCLAVVPLVKANHLV
QIQKIEASAPRLDGKSCDINLAREKILMDLLLLLQEGYGQILFEGVYSMTFFSPSHKLVT
TIM
>gi568815575r:30971405_31171953|GENSCAN_predicted_CDS_3|372_bp
atggctgcctttgatgaagccatttatcttggtgccagcaggagggtgttgaaagtgtgt
gcacgggaaggatctcaagcacaggaagagggcaaaccccaaaatgcaagagtcttcaag
cctctgcttgagtcatgttgcttggctgttgttccactggtcaaagcaaatcacttagtc
cagattcagaaaatagaggcaagtgccccgcgtcttgatgggaagagctgtgacatcaat
cttgcaagggaaaaaatactcatggatctcctgctgctgcttcaagaaggctatgggcag
attctgtttgaaggagtatatagcatgacttttttcagtcccagtcataaacttgtgacc
actattatgtga
>gi568815575r:30971405_31171953|GENSCAN_predicted_peptide_4|94_aa
MVPNRDLLEEKSTRKEEYKPRCQDVRVLEEHHQSPWCCQRKCLRATSKKSRACTLPFRER
TSGSALAAFTSLEKPFTVFVRRLEEGSLLATPEV
>gi568815575r:30971405_31171953|GENSCAN_predicted_CDS_4|285_bp
atggttcccaacagagacctactggaagaaaaatctactagaaaagaagaatacaagccc
agatgtcaggatgtcagagtgctagaagaacatcatcaaagcccctggtgttgccagcgc
aagtgtctcagagcaacatcaaagaaatccagggcttgtacccttcctttcagagaaaga
acatcaggctcagctctggcagcttttacctcccttgaaaaaccgttcactgtttttgtc
agacgtttggaagaaggaagtttattggcaacccctgaagtttaa
>gi568815575r:30971405_31171953|GENSCAN_predicted_peptide_5|713_aa
MGDFNTPLSTLDRSTRQKVNKDIQEFNSALHQADLIDIYRTLHPKSTEYTFFSAPHHTYS
KTDHIVGSKALLSKCKRTEIITNCLSDHSAIKLELRIKKLTQNRSTTWKLNNLLLNDYWV
HNKMKAEIKMFFETNENKDTTYQNLWDTFKAVCRGKFIALNAHKRKQGRSKIDTLTSQLK
ELEKQEQTHSKASRRQEITKIRAELKEIETQKNLQKINESRGWFFEKINKIDRLLARLIK
KKREKNQIDAIKNDKGDITTNPTEIQTTIREYYKHLYANKLENLEEMDKFLDTYTLPRLN
QEEVESLNRPITGSEIEAIINSLPTKKSPGPDGFTAEFYQRYKEELKIKYLGIQLTRDVK
DLFKENNKPLLYEIKEDTNKWKNIPCSWVGRINIVKMAILPKEIYRFNAIPIKLPMTFFT
ELEKTTLKFIWNQKRAHIAKSILSQKNKAGGITLPDFKLYYKATVTKTAWNMDEAGNHHS
EQTIARIENQTPHVLTHRWEFNNENTWTQAAALATAPTSPVCSPALALPAMATAQPSQVR
QKYDTNCDAAINSHITLELYTSYLYLSMAFYFNRDDVALENFFRYFLRLSDDKMEHAQKL
MRLQNLRGGHICLHDIRKPECQGWESGLVAMESAFHLEKNVNQSLLDLYQLAVEKGDPQL
CHFLESHYLHEQVKTIKELGGYVSNLRKICSPEAGLAEYLFDKLTLGGRVKET
>gi568815575r:30971405_31171953|GENSCAN_predicted_CDS_5|2142_bp
atgggagactttaacaccccactgtcaacattagacagatcaacgagacagaaagttaac
aaggatatccaggaattcaactcagctctgcaccaagcggacctaatagacatctacaga
actctccaccccaaatcaacagaatatacattcttttcagcaccacaccacacctattcc
aaaactgaccacatagttggaagtaaagccctcctcagcaaatgtaaaagaacagaaatt
ataacaaactgtctctcagaccacagtgcaatcaaactagaactcaggattaagaaactc
actcaaaaccgctcaactacatggaaactgaacaacctgctcctgaatgactactgggta
cataacaaaatgaaggcagaaataaagatgttctttgaaaccaatgagaacaaagacaca
acataccagaatctctgggacacattcaaagcagtgtgtagagggaaatttatagcacta
aatgcccacaagagaaagcagggaagatctaaaattgacaccctaacatcacaattaaaa
gaactagaaaagcaagagcaaacacattcaaaagctagcagaaggcaagaaataactaag
atcagagcagaactgaaggaaatagagacacaaaaaaaccttcaaaaaatcaatgaatcc
aggggctggttttttgaaaagatcaacaaaattgatagactgctagcaagactaataaag
aagaaaagagagaagaatcaaatagatgcaataaaaaatgataaaggggatatcaccacc
aatcccacagaaatacaaactaccatcagagaatactataaacacctctacgcaaataaa
ctagaaaatctagaagaaatggataaattcctggacacatacaccctcccaagactaaac
caggaagaagttgaatctctgaatagaccaataacaggctctgaaatcgaggcaataatt
aatagcttaccaaccaaaaaaagtccaggaccagatggattcacagccgaattctaccag
aggtacaaggaggagctgaaaataaaatacctaggaatccaacttacaagggacgtgaag
gacctcttcaaggagaacaacaaaccactgctctatgaaataaaagaggatacaaacaaa
tggaagaacattccatgctcatgggtaggaagaatcaatattgtgaaaatggccatactg
cccaaggaaatttacagattcaatgccatccccatcaagctaccaatgactttcttcaca
gaattggaaaaaactactttaaagttcatatggaaccaaaaaagagcccacattgccaag
tcaatcctaagccaaaagaacaaagctggaggtatcacactacctgacttcaaactatac
tacaaggctacagtaaccaaaacagcatggaatatggatgaagctggaaaccatcattct
gagcaaactatcgcaaggatagaaaaccaaacaccgcatgttctcactcacaggtgggaa
tttaacaatgagaacacttggacacaggctgctgccctcgctaccgcaccgacttcgccc
gtgtgctcgcctgcacttgcgctgcccgccatggccaccgcccagccgtcgcaggtgcgc
cagaagtacgacaccaactgcgacgccgccatcaacagccacatcacgctggagctctac
acctcctacctgtacctgtctatggccttctacttcaaccgggacgacgtggccctggag
aacttcttccgctacttcctgcgcctgtcggacgacaaaatggagcatgcccagaagctg
atgaggctgcagaacctgcgcggtggccacatctgccttcacgatatcaggaagccagag
tgccaaggctgggagagcgggctcgtggccatggagtccgccttccacctggagaagaac
gtcaaccagagcctgctggatctgtaccagctggccgtggagaagggcgacccccagctg
tgccacttcctggagagccactacctgcacgagcaagtcaagaccatcaaagagctgggt
ggctacgtgagcaacctgcgcaagatttgttccccggaagccggcctggctgagtacctg
ttcgacaagctcaccctgggcggccgcgtcaaagagacttga
>gi568815575r:30971405_31171953|GENSCAN_predicted_peptide_6|338_aa
MGKKQSRKTGNSKNQSTSPPPKERSSSPAMEQSWTENDFDELREEGFRRSNYSELKEEVR
THGKEVKNLEKRLDEWLTRIINAEKSLKDLMELKTTARELRDECTSLSSRFDQREERVSV
MEDQMNETKREEKFREKRIKRNEQSLQEIWDYLKRPNLRLIGVPESDGESGTKLENTLQD
IIQENFPNLARQDNIQIQEIQRMPQRYSSRRATPRYIIVRFTKVEMKEKMLRAAREKGRV
THKGKPIRLTADLLAETLQARREWGPIFNILKEKNFQPRIAYPAKLSFISEGEIQYFADK
QMLRDFVTTRPALKELLKEALNMERNNWYQPLQKHAKW
>gi568815575r:30971405_31171953|GENSCAN_predicted_CDS_6|1017_bp
atggggaaaaaacagagcagaaaaactggaaactccaaaaatcagagcacctctcctcct
ccaaaggaacgcagctcctcaccagcaatggaacaaagctggacggagaatgactttgac
gagttgagagaagaaggcttcagaagatcaaactactccgagctaaaggaggaagttcga
acccatggcaaagaagttaaaaaccttgaaaaaagattagacgaatggctaactagaata
atcaatgcagagaagtccttaaaggacctgatggagctgaaaaccacagcacgagaacta
cgtgacgaatgcacaagcctcagtagccgattcgatcaacgggaagaaagggtatcagtg
atggaagatcaaatgaatgaaacgaagcgagaagagaagtttagagaaaaaagaataaaa
agaaatgaacaaagcctccaagaaatatgggactatctgaaaagaccaaatctacgtctg
attggtgtacctgaaagtgatggggagagtggaaccaagttggaaaacactctgcaggat
attatccaggagaacttccccaatctagcaaggcaggacaacattcaaattcaggaaata
cagagaatgccacaaagatactcctcgagaagagcaactccaagatacataattgtcaga
ttcaccaaagttgaaatgaaggaaaaaatgttaagggcagccagagagaaaggtcgggtt
acccacaaagggaagcccatcagactaacagctgatctcttggcagaaactctacaagcc
agaagagagtgggggccaatattcaacattcttaaagaaaagaattttcaacccagaatt
gcatatccagccaaactaagcttcataagtgaaggagaaatacaatactttgcagacaag
caaatgctgagagattttgtcaccactaggcctgccctaaaagagctcctgaaggaagca
ctaaacatggaaaggaacaactggtaccagccactgcaaaaacatgccaaatggtaa
>gi568815575r:30971405_31171953|GENSCAN_predicted_peptide_7|173_aa
MALLDAAHVAAVPMALLDAAHVAALSEGSHRTRRGKGPTERITLHCPRTVELREQCNMSF
EVSGIAGTPTWAPPQGPHGACSCQQPKQLARFCTCSPMPGLATGSAQSLLLQVHNAASWI
PHSLKCSLPQKVKHGGPSKQATLRPKSDKQVEKIPAPTSPWGRISILDFGLLT
>gi568815575r:30971405_31171953|GENSCAN_predicted_CDS_7|522_bp
atggctttgctggatgcagcccatgtcgctgctgtccccatggctttgctggatgcagcc
catgtggctgctctctcagaaggcagccaccgcacacgacgaggcaaggggcccactgaa
cggataacactccactgtccacggacggtggagctaagagagcagtgtaacatgtccttt
gaggtttcggggattgcaggcacccctacgtgggcaccaccgcagggccctcatggagcc
tgctcctgccagcagccaaagcagctggccagattctgcacttgttcgcccatgcctggt
ctggccacggggtctgcacagagcctgctcctgcaggtgcacaatgcggccagctggatt
ccacactcactcaagtgctctctccctcaaaaggttaagcatggtgggccaagtaaacaa
gccaccctccgtcccaagtccgacaaacaggttgagaaaattcctgcaccaacatcaccc
tggggaagaataagcatcttggactttggacttttaacttag
>gi568815575r:30971405_31171953|GENSCAN_predicted_peptide_8|60_aa
MKLEAFEKTERTDEGKSKYQEPYEAMAKDPLALIHRFLHLYAFICISKVDSDEKVFYDKS
>gi568815575r:30971405_31171953|GENSCAN_predicted_CDS_8|183_bp
atgaaattggaggcatttgagaaaacagaaagaacagatgaggggaagagcaaatatcag
gagccttacgaggcaatggctaaagacccactggccttgattcatagattcctacatctg
tatgcatttatttgtatttcaaaagtagactctgatgagaaagtgttctatgataagtca
taa
>gi568815575r:30971405_31171953|GENSCAN_predicted_peptide_9|309_aa
XDDEHLLIQHYCQSLNQDSPLSQPRSPAQILISLESEERGELERILADLEEENRNLQAEY
DRLKQQHEHKGLSPLPSPPEMMPTSPQSPRDAELIAEAKLLRQHKGRLEARMQILEDHNK
QLESQLHRLRQLLEQRSDSSQPMLLRVVGSQTSDSMGEEDLLSPPQDTSTGLEEVMEQLN
NSFPSSRDVIIHNEKIIHRLDRIRCFHVSPHYFVLLAFTSTYYVPVEVVGMERRNDKFLA
FKGRVVAAVLNSWSRDRGLAKPEIFTIWPFMGKICQSLLYAIISLSILTHYMYRVTGFCA
DQLSAEQDY
>gi568815575r:30971405_31171953|GENSCAN_predicted_CDS_9|930_bp
nnagatgatgaacatttgttaatccagcattactgccaaagtttgaaccaggactccccc
ctgagccagcctcgtagtcctgcccagatcttgatttccttagagagtgaggaaagaggg
gagctagagagaatcctagcagatcttgaggaagaaaacaggaatctgcaagcagaatat
gaccgtctaaagcagcagcacgaacataaaggcctgtccccactgccgtcccctcctgaa
atgatgcccacctctccccagagtccccgggatgctgagctcattgctgaggccaagcta
ctgcgtcaacacaaaggccgcctggaagccaggatgcaaatcctggaagaccacaataaa
cagctggagtcacagttacacaggctaaggcagctgctggagcaaaggtccgacagcagt
cagcctatgctgctccgagtggttggcagtcaaacttcggactccatgggtgaggaagat
cttctcagtcctccccaggacacaagcacagggttagaggaggtgatggagcaactcaac
aactccttccctagttcaagagatgtcataattcataatgagaagatcatacacaggctt
gacagaatcaggtgctttcacgtgtctcctcattactttgtcctgcttgcttttactagc
acctactacgtaccagtggaggtagtagggatggaaagaagaaatgacaaattccttgcc
ttcaagggcagagtagttgcagcagtgctgaatagctggtccagagaccgtgggcttgca
aagcctgaaatatttactatctggccctttatggggaaaatttgccaatccctgctttac
gctattatttcgttgtccattctgactcattacatgtatagagtcacaggcttctgtgct
gatcaactttcagctgaacaagattattga