GENSCAN 1.0    Date run: 2-Nov-116    Time: 22:23:50

Sequence gi568815583r:81943163_82145705 : 202543 bp : 43.87% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.00 Prom +    371    410   40                               0.04
 1.01 Init +  40015  40110   96  0  0   79  106    60 0.468   7.21
 1.02 Term +  49703  49846  144  1  0   42   47    68 0.044  -3.99
 1.03 PlyA +  50166  50171    6                               1.05

 2.00 Prom +  51007  51046   40                              -3.46
 2.01 Sngl +  52911  53927 1017  2  0   88   43   587 0.996  51.24
 2.02 PlyA +  54155  54160    6                               1.05

 3.00 Prom +  54324  54363   40                              -4.96
 3.01 Sngl +  55274  56767 1494  1  0   70   50   528 0.988  42.66
 3.02 PlyA +  56786  56791    6                               1.05

 4.00 Prom +  57550  57589   40                              -2.46
 4.01 Init +  73411  73965  555  0  0   32   98   125 0.326   2.87
 4.02 Intr +  79403  79465   63  1  0  115   48    29 0.223   0.51
 4.03 Intr +  83842  83937   96  0  0   27  116    99 0.392   6.81
 4.04 Intr +  84230  84418  189  1  0   -6   75   149 0.769   3.98
 4.05 Intr +  89748  89873  126  2  0   50   93    31 0.386   0.68
 4.06 Term +  97779  98069  291  2  0   37   47   195 0.379   5.64
 4.07 PlyA +  98184  98189    6                               1.05

 5.04 PlyA -  98208  98203    6                               1.05
 5.03 Term - 101451  99998 1454  1  2   91   43  1923 0.999 179.00
 5.02 Intr - 102596 102288  309  0  0   79  100   518 0.854  48.38
 5.01 Init - 103634 103556   79  2  1   70   43   153 0.893   8.22
 5.00 Prom - 119596 119557   40                              -5.36

 6.03 PlyA - 120186 120181    6                               1.05
 6.02 Term - 121029 120757  273  0  0   40   43   148 0.158   0.97
 6.01 Init - 130470 130432   39  0  0   72  105    45 0.109   4.79
 6.00 Prom - 131631 131592   40                              -2.26

 7.06 PlyA - 132208 132203    6                               1.05
 7.05 Term - 136967 136954   14  0  2  113   54    14 0.209  -1.14
 7.04 Intr - 142610 142478  133  2  1   94  108    -9 0.082   1.92
 7.03 Intr - 146264 146236   29  0  2  108   94     4 0.101   0.83
 7.02 Intr - 170100 169978  123  1  0   89   27   119 0.710   6.46
 7.01 Init - 172484 172445   40  2  1   85   94    31 0.688   3.77
 7.00 Prom - 177981 177942   40                              -4.86

 8.04 PlyA - 178046 178041    6                               1.05
 8.03 Term - 187399 187211  189  1  0   53   37   251 0.494  13.95
 8.02 Intr - 195680 195496  185  0  2   70   83   130 0.919  10.21
 8.01 Intr - 200457 200362   96  1  0   62   88    61 0.769   3.48

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815583r:81943163_82145705|GENSCAN_predicted_peptide_1|79_aa
MGTQKQQRSLVARRTGGLHGKVGILKLHLEGWGVGKDKGVEILFKEIIPENFSNLEKDMN
IQGHKGQTPPIRFNPNKTI

>gi568815583r:81943163_82145705|GENSCAN_predicted_CDS_1|240_bp
atgggaacacagaagcagcaaagatcccttgtggctcgaaggaccgggggacttcacgga
aaagttggcattttgaaattgcaccttgaaggatggggagtaggaaaggataaaggagta
gaaatcttatttaaagaaataataccagaaaacttttcaaatctggagaaagatatgaat
atccagggacataaaggccaaacacctcccatcagatttaatccaaataagacgatctaa

>gi568815583r:81943163_82145705|GENSCAN_predicted_peptide_2|338_aa
MGKKQNRKTGNSKTQSASPPPKERSSSPATEQSWMENDFDELREEGFRRSNYSELREDIQ
SKGKEVENFEKNLEECITRITNTEKCLKELMELKTKARELREECRSLRSRCDQLEERVSA
MEDEMNEMKREGKFREKRIKRNEQSLQEIWDYVKRPNLRLIGVPESDGENGTKLENTLQD
IIQENFPNLARQANVQIQEIQRTPQRYSLRRATPRHIIVRFTKVEMNEKMLRAAREKGQV
TLKGKPIRLTADLSAETLQARREWGPIFNILKEKNFQPRISYPAKLSFISEGEIKYFTDK
QMLRDFVTTRPALKELLKEALNMERNNRYQLLQNHAKM

>gi568815583r:81943163_82145705|GENSCAN_predicted_CDS_2|1017_bp
atggggaaaaaacagaacagaaaaacaggaaactctaaaacccagagcgcctctcctcct
ccaaaggaacgcagttcctcaccagcaacggaacaaagctggatggagaatgactttgac
gagctgagagaagaaggcttcagacgatcaaattactctgagctacgggaggacattcaa
agcaaaggcaaagaagttgaaaactttgaaaaaaatttagaagaatgtataactagaata
accaatacagagaagtgcttaaaggagctgatggagctgaaaaccaaggctcgagaacta
cgtgaagaatgcagaagcctcaggagccgatgcgatcaactggaagaaagggtatcagca
atggaagatgaaatgaatgaaatgaagcgagaagggaagtttagagaaaaaagaataaaa
agaaatgagcaaagcctccaagaaatatgggactatgtgaaaagaccaaatctacgtctg
attggtgtacctgaaagtgatggggagaatggaaccaagttggaaaacactctgcaggat
attatccaggagaacttccccaatctagcaaggcaggccaacgttcagattcaggaaata
cagagaacgccacaaagatactccttgagaagagcaactccaagacacataattgtcaga
ttcaccaaagttgaaatgaacgaaaaaatgttaagggcagccagagagaaaggtcaggtt
accctcaaagggaagcccatcagactaacagcggatctctcggcagaaactctacaagcc
agaagggagtgggggccaatattcaacattcttaaagaaaagaattttcaacccagaatt
tcatatccagccaaactaagcttcataagtgaaggagaaataaaatactttacagacaag
caaatgctgagagatttcgtcacgaccaggcctgccctaaaagagctcctgaaggaagcg
ctaaacatggaaaggaacaaccggtaccagctgctgcaaaatcatgccaaaatgtaa

>gi568815583r:81943163_82145705|GENSCAN_predicted_peptide_3|497_aa
MDKFLDTYTLPRLNQEEVESLNRPITGAEIVAIINSLPTKKSPGPDGFTAEFYQRYKEEL
VPFLLKLFQSIEKEGILPNSFYEASIILIPKPGRDTTKKGNFRPISLMNTDAKILNQILA
KRIQQHIKKLIHHDQVGFIPGMQGWFNICKSINVIQHINRAKDKNHMIISIDAEKAFDKI
QQPFMLKTLNKLGIDGTYFKIIGAIYDKPTANIILNGQKLEAFPLKTGTRQGCPLSPLLF
NIVLEVLARAIRQEKEIKGIQLGKEEVKLSLFADDMIVYLENPIVSAQNLLKLISNFSKV
SGYKINVQKSQAFLYTNNRQTESQIMRELPFTIASKRIKYLGIQLTRDVKDLFKENYKPL
LKEIKEDTNKWKNIPCSWVGRINIVKMAILPKVIYRFNAIPIKLPMTFFTELEKTTLKFI
WNQKRARIAKAILSQKNKAGGITLPDFKLYYKATVTKTAWYWYQNRDIDQWNRTEPSEIT
PHIYNYLIFDKPEKNKQ

>gi568815583r:81943163_82145705|GENSCAN_predicted_CDS_3|1494_bp
atggataaattcctcgacacatacactctcccaagactaaaccaggaagaagttgaatct
ctgaatagaccaataacaggagctgaaattgtggcaataatcaatagtttaccaaccaaa
aagagtccaggaccagatggattcacagccgaattctaccagaggtacaaggaggaactg
gtaccattccttctgaaactattccaatcaatagaaaaagagggaatcctccctaactca
ttttatgaggctagcatcattctgataccaaagccgggcagagacacaaccaaaaaaggg
aatttcagaccaatatccttgatgaacactgatgcaaaaatcctcaatcaaatactggca
aaacgaatccagcagcacatcaaaaagcttatccaccatgatcaagtgggcttcatccct
gggatgcaaggctggttcaatatatgcaaatcaataaatgtaatccagcatataaacaga
gccaaagacaaaaaccacatgattatctcaatagatgcagaaaaggcttttgacaaaatt
caacaacccttcatgctaaaaactctcaataaattaggtattgatgggacgtatttcaaa
ataataggagctatctatgacaaacccacagccaatatcatactgaatgggcaaaaactg
gaagcattccctttgaaaactggcacaagacagggatgccctctctcaccactcctattc
aacatagtgttggaagttctggccagggcaattaggcaggagaaggaaataaagggtatt
caattaggaaaagaggaagtcaaattgtccctgtttgcagacgacatgattgtatatcta
gaaaaccccattgtctcagcccaaaatctccttaagctgataagcaacttcagcaaagtc
tcaggatacaaaatcaatgtacaaaaatcacaagcattcttatacaccaacaacagacaa
acagagagccaaatcatgagagaactcccattcacaattgcttcaaagagaataaaatac
ctaggaatccaacttacaagggatgtgaaggacctcttcaaggagaactacaaaccactg
ctcaaggaaataaaagaggatacaaacaaatggaagaacattccatgctcatgggtagga
agaatcaatatcgtgaaaatggccatactgcccaaggtaatttacagattcaatgccatc
cccatcaagctaccaatgactttcttcacagaattggaaaaaactactttaaagttcata
tggaaccaaaaaagagcccgcattgccaaggcaatcctaagccaaaagaacaaagctgga
ggcatcacactacctgacttcaaactatactacaaggctacagtaaccaaaacagcatgg
tactggtaccaaaacagagatatagatcaatggaacagaacagagccctcagaaataacg
ccgcatatctacaactatctgatctttgacaaacctgagaaaaacaagcaatag

>gi568815583r:81943163_82145705|GENSCAN_predicted_peptide_4|439_aa
MNALLGKPEASPHHHKEDVHSAEGKNASVAWPGRQGSTKEEEQGRGKPGGQSKGPRENGN
NIRQRRESLVKVEDVAWKKRKGSKSQKQQKANQKPEREDMPVARDAGQYRMAVCPERRSG
LGKQPAVSATNHPSGHRIPFTPFSGHRKQGDVFWAKGMLRQESPFMPGSEHQEDGREGLW
GEKWTVQLVGFEGDPPTATYWAFLGLNSTLIHYTNGIMLIRHDQQELASTPELQRSYKSS
LADCMMLEVSGALEDAVVEFIARPIGKIASRSLGYWTNATLSTAKIYTPFNKQFLVHHWT
QCPTHQGSKSANLLVRLRSQIWPLPPNIVNLIGLNRMKLPLLKKTEFRLSDSTWLLPLTK
PHSQGAGGACQKQTKTSLKGQQTLDTGSLSTPISLPYKQKPIKTSAQIQEGSLQAETVIP
SSFFGFPKGVTFSGNSQQS

>gi568815583r:81943163_82145705|GENSCAN_predicted_CDS_4|1320_bp
atgaatgctttgttgggaaaaccagaagcaagtccacatcatcacaaggaggacgtccat
agtgcagagggtaagaatgcctctgtggcatggcctgggcggcagggaagcaccaaggag
gaggagcaaggaagagggaagccaggaggacagagtaaagggccaagggaaaatgggaac
aacataaggcaaaggagagagagccttgtgaaagtagaggatgtagcttggaaaaagaga
aaaggaagcaagagtcagaaacagcagaaagccaaccaaaaacctgagagagaagacatg
cctgtggcaagggatgctggtcaatatcgcatggctgtgtgtccagagagaagaagtggg
cttggtaaacaaccagcagtgtctgccacaaaccacccctctgggcacagaatacctttt
actcccttctcaggacacagaaaacaaggggatgttttctgggccaaggggatgctgagg
caggagtcaccctttatgcctgggtcagagcaccaggaggatggaagagaaggcttgtgg
ggagagaaatggacagtacagcttgttggctttgagggagacccccccacagctacctac
tgggctttcctgggcctgaacagtacactgattcattacaccaatggcatcatgctgatc
agacatgaccagcaagagctggctagtactccagagctccagaggtcttacaagtccagt
ctagcagactgcatgatgttggaggtatccggtgcattagaagatgccgtggtggagttt
attgcaaggcccattggaaaaattgcaagcaggtctctggggtactggaccaatgccaca
ctatctacagcaaagatttatacgcctttcaataaacagttcctggtacatcattggacc
cagtgcccaacccaccaaggttctaaatccgcaaatctccttgtccgtcttcgctcccag
atatggcccctgccacctaacattgtcaatttaattgggttgaaccgtatgaaattgcca
ttactgaagaagactgagttcaggttaagtgacagcacgtggctcctccccttgacaaag
ccccacagccagggagcaggtggagcctgccaaaagcaaacaaaaacttccctcaaagga
caacaaacacttgacaccggcagcctgtcaacaccaattagtctgccctacaagcagaag
cctattaagaccagtgctcaaatccaggaaggttctctgcaagcagaaactgtcatccct
tcttccttcttcggcttccccaagggagtgaccttcagtggaaactcgcaacagagctga

>gi568815583r:81943163_82145705|GENSCAN_predicted_peptide_5|613_aa
MQARARARAVASVGGRVLLCLVRSRAASRRRAEGRRGRGAHGRAMPSSLFADLERNGSGG
GGGGSSGGGETLDDQRALQLALDQLSLLGLDSDEGASLYDSEPRKKSVNMTECVPVPSSE
HVAEIVGRQGCKIKALRAKTNTYIKTPVRGEEPVFVVTGRKEDVAMARREIISAAEHFSM
IRASRNKNTALNGAVPGPPNLPGQTTIQVRVPYRVVGLVVGPKGATIKRIQQQTHTYIVT
PSRDKEPVFEVTGMPENVDRAREEIEAHIALRTGGIIELTDENDFHANGTDVGFDLHHGS
GGSGPGSLWSKPTPSITPTPGRKPFSSYRNDSSSSLGSASTDSYFGGGTSSSAAATQRLA
DYSPPSPALSFAHNGNNNNNGNGYTYTAGGEASVPSPDGCPELQPTFDPAPAPPPGAPLI
WAQFERSPGGGPAAPVSSSCSSSASSSASSSSVVFPGGGASAPSNANLGLLVHRRLHPGT
SCPRLSPPLHMAPGAGEHHLARRVRSDPGGGGLAYAAYANGLGAQLPGLQPSDTSGSSSS
SSSSSSSSSSSSGLRRKGSRDCSVCFESEVIAALVPCGHNLFCMECANRICEKSEPECPV
CHTAVTQAIRIFS

>gi568815583r:81943163_82145705|GENSCAN_predicted_CDS_5|1842_bp
atgcaggcgcgggcgcgcgcgcgcgcggttgccagtgtgggcgggcgtgtgcttttgtgt
ctcgtgcgctcccgcgcagcttctcgccgccgagccgagggccggcgggggcgcggcgcg
cacggccgagcgatgcccagctcgctgttcgcagacctggagcgcaacggcagcggcggc
ggcggcggcggcagcagcggagggggagagaccctggatgaccaaagagccctgcagctc
gcgctcgaccagctctccctgctggggctggacagtgacgagggcgcctctctgtacgac
agcgagccgcgcaagaagagcgtgaacatgaccgagtgcgtgccagtacccagttctgag
catgtcgccgagatcgtggggcggcaaggttgtaaaatcaaagcgctgcgggcgaagacc
aatacttacatcaagaccccagttcgcggggaggagcctgtctttgttgtgacgggcagg
aaggaggatgtggccatggctcggagggagatcatctctgctgccgagcacttctccatg
atccgcgcctcccggaataagaacacggcactcaacggcgcggtgcctgggccgcccaac
ctgcccgggcagaccaccatccaagtgcgggtaccctaccgcgtggtggggctcgtggtg
gggcccaaaggcgccacaatcaagcgcatccagcagcagacgcacacgtacatcgtgacg
cccagccgggataaggagccggtgttcgaggtgaccggcatgccagagaacgtggatcgc
gctcgagaggagattgaggcgcacattgctctgcgtaccggcggcatcattgagctcaca
gacgagaacgacttccacgccaacggcaccgatgtgggcttcgatctgcatcatgggtcc
ggcgggtccggcccaggcagcctctggagcaagcccacccccagcatcacgcccaccccc
ggccgcaagcctttctctagctaccgcaacgacagctccagctcgcttggcagtgcttcc
acagactcttatttcggcggcgggaccagcagcagcgcagcggctacccagcgcctggcg
gactacagcccccctagccccgccctgagctttgcgcacaacggaaacaataacaataac
ggcaatgggtacacctacacagcggggggagaagcctcagtgccatcccccgacggctgc
cccgagctgcagcccacttttgacccggctcccgctcccccacctggggcaccacttatc
tgggcccagttcgagcggtccccgggaggcggacctgcagctccggtatcttcttcctgc
tcttcttctgcatcttcgtctgcttcttcctcctccgtggtcttccccgggggtggcgcc
agtgcgccctccaacgccaacctggggctattggtgcaccgccggctgcaccctggcacc
agctgcccgcgcctgtctccacccttgcacatggccccgggggcgggagagcaccacctg
gctcgccgggtgcgcagcgacccgggtggaggaggcctggcctacgccgcttatgccaac
gggctgggggcacagctgcctggcttgcagccgtcggacacgtcgggctcctcctcttcg
tccagctcctcctccagctcttcatcctcttcctccgggcttcggcgtaaaggcagccgc
gactgctccgtgtgcttcgagagcgaagtgattgccgcgctggtgccctgtggccacaac
ctcttctgcatggagtgcgccaatcgcatctgtgagaagagcgagcccgagtgcccggtc
tgccacaccgcggtcactcaggccatccgcatcttttcttaa

>gi568815583r:81943163_82145705|GENSCAN_predicted_peptide_6|103_aa
MHQYRSVACYEQEFFTNIQRRDREACTWNDKAAEFKVWSKKLCIPAALMGTETLWAEICR
EMLITLYPLPQHRFCQEVDCASTNVESFGHPRHICSSCGYLNA

>gi568815583r:81943163_82145705|GENSCAN_predicted_CDS_6|312_bp
atgcaccagtacaggtctgtggcctgttacgaacaggagtttttcacaaacatccagaga
agggatagggaagcgtgcacgtggaacgacaaggcggctgagttcaaggtctggagcaag
aagctctgcatccctgcagctctgatggggacggagaccctctgggcagagatatgcagg
gaaatgctcattactctgtaccccctaccccaacaccgcttttgccaagaggtggactgt
gctagcaccaatgtggagagttttggacatccacgccatatctgtagcagctgcggctat
ctcaatgcataa

>gi568815583r:81943163_82145705|GENSCAN_predicted_peptide_7|112_aa
MAGGMDAGIQSGPGTFGNVWKYTGIQWVQSEELLNILQCTGQAPCDEEVSGSKWGSAFQS
EEWQLLLYLYPSRHQNKLFALFPFPTQQMGMPWAVMSAECVLMSCGPPRKSG

>gi568815583r:81943163_82145705|GENSCAN_predicted_CDS_7|339_bp
atggcaggtggcatggatgctggaatccagagtggtccagggacatttggcaatgtctgg
aaatacactggcatccagtgggtacagtctgaggagctactaaacatcctgcaatgcaca
ggacaggccccctgtgacgaggaagtatctggatctaaatggggatcggccttccaaagt
gaagaatggcagttgcttctttatctctaccccagtaggcaccaaaataaactctttgcc
ttgtttcccttcccaacccagcaaatggggatgccctgggctgtcatgtcagcagaatgt
gtactgatgagttgtgggccaccaaggaaatctggctga

>gi568815583r:81943163_82145705|GENSCAN_predicted_peptide_8|156_aa
XCFISYLIEAHLYPVNHVGLIQQSYSSVTKGRGRVYAVLSKREGRVLQEEMKEGTDMFII
KAVLPVAESFGFADEIRKRTSGLASPQLVFSHWEIIPSDPFWVPTTEEEYLHFGEKADSE
NQARKYMNAVRKRKGLYVEEKIVEHAEKQRTLSKNK

>gi568815583r:81943163_82145705|GENSCAN_predicted_CDS_8|471_bp
ncatgctttatctcctatttgattgaagcccatctgtatcctgtcaaccacgttggcctg
attcagcagtcttatagctctgtgacaaaaggaagaggtcgagtctatgctgtcttgtca
aagagagaaggtcgggtacttcaagaagaaatgaaagaagggacagacatgttcatcatc
aaggctgtgctgcctgttgctgaaagctttggttttgctgatgaaatcaggaagaggaca
agtggcctggccagcccacaactagtattcagccattgggagatcattcccagtgacccc
ttctgggtgccaactactgaggaggaatacttgcactttggggagaaggctgactctgag
aaccaagcccggaagtacatgaacgcagtacgaaagcggaaggggctttatgtggaagaa
aagattgtggagcatgcagaaaagcagaggacactcagcaaaaataagtag