GENSCAN 1.0 Date run: 5-Nov-116 Time: 17:38:00

Sequence gi568815596r:88023057_88228017 : 204961 bp : 43.85% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.05 PlyA -   1232   1227    6                               1.05
 1.04 Term -   5529   4728  802  2  1   87   48   325 0.945  21.12
 1.03 Intr -   9877   9852   26  1  2   96   77    15 0.016  -1.98
 1.02 Intr -  32506  32333  174  1  0   87   36    59 0.174   0.74
 1.01 Init -  33724  33671   54  1  0   79   88    58 0.612   4.18
 1.00 Prom -  36958  36919   40                              -3.76

 2.00 Prom +  40795  40834   40                              -4.86
 2.01 Init +  44824  44945  122  0  2   79   80   161 0.833  14.06
 2.02 Intr +  61260  61436  177  0  0   98   94   222 0.983  22.83
 2.03 Intr +  64806  65019  214  0  1   91   54   407 0.998  36.32
 2.04 Intr +  67956  68086  131  2  2   91   89   122 0.997  12.09
 2.05 Intr +  73539  73728  190  0  1   77   81   316 0.991  29.39
 2.06 Intr +  80002  80094   93  0  0   81   94    30 0.861   3.06
 2.07 Intr +  83269  83432  164  0  2   84   42   240 0.827  17.77
 2.08 Intr +  85315  85543  229  1  1  101   65   140 0.487  10.87
 2.09 Term +  87298  87456  159  0  0  109   55   256 0.995  22.34
 2.10 PlyA +  87707  87712    6                               1.05

 3.05 PlyA -  88411  88406    6                               1.05
 3.04 Term -  93645  93502  144  0  0   44   48    89 0.048  -1.59
 3.03 Intr - 101530 101438   93  1  0  106  101   145 0.959  17.76
 3.02 Intr - 103292 103120  173  0  2   24   97   221 0.897  16.26
 3.01 Init - 104961 104895   67  0  1  104  109    94 0.978  14.33
 3.00 Prom - 105352 105313   40                              -3.46

 4.04 PlyA - 105720 105715    6                               1.05
 4.03 Term - 108252 108079  174  0  0   75   33    81 0.005  -0.94
 4.02 Intr - 127320 126799  522  0  0   73  -43   222 0.210   0.75
 4.01 Init - 127576 127418  159  1  0   60   91    71 0.534   4.42
 4.00 Prom - 128324 128285   40                              -2.96

 5.00 Prom + 139635 139674   40                              -3.76
 5.01 Init + 141588 141659   72  2  0   70   46   101 0.483   3.27
 5.02 Intr + 145632 145695   64  2  1   76   52    66 0.592   0.09
 5.03 Intr + 147231 147400  170  1  2  107   54    58 0.587   3.97
 5.04 Intr + 150083 150317  235  1  1   97  101   197 0.908  19.16
 5.05 Intr + 151583 151777  195  0  0   86   75   213 0.987  19.19
 5.06 Intr + 152193 152345  153  1  0   70   85   170 0.998  14.94
 5.07 Intr + 155727 155957  231  1  0  108  110   239 0.972  25.84
 5.08 Intr + 159643 159791  149  2  2   73   96   140 0.975  13.25
 5.09 Intr + 159892 160017  126  0  0   64   86   134 0.988  11.68
 5.10 Intr + 162272 162423  152  1  2   60   92   260 0.986  22.56
 5.11 Term + 162842 163067  226  2  1   88   38   257 0.792  16.95
 5.12 PlyA + 163549 163554    6                               1.05

 6.03 PlyA - 164582 164577    6                               1.05
 6.02 Term - 171722 171645   78  2  0  111   44    64 0.278   2.06
 6.01 Init - 198066 197944  123  0  0   87   87    54 0.450   5.39
 6.00 Prom - 202784 202745   40                              -2.56

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init -   5563   5556    8  1  2   76   99    11 0.912   0.72

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815596r:88023057_88228017|GENSCAN_predicted_peptide_1|351_aa
MDLARWLTPVIPALWEAEDRPRAPSPPGAPRVCEMADQGAARVPRTPHRPPPPERDARAS
EWSAEGRGARLAEEAEKWVLAVLARKSRFVNLMKHSKKTYDSFQDELEDYIKVQKARGLE
PKTCFRKMKGDYLETCGYKGEVNSRPTYRMFDQRLPSETIQTYPRSCNIPQTVENRLPQW
LPAHDSRLRLDSLSYCQFTRDCFSEKPVPLNFNQQEYICGSHGVEHRVYKHFSSDNSTST
HQASHKQIHQKRKRHPEEGREKSEEERSKHKRKKSCEEIDLDKHKSIQRKKTEVEIETVH
VSTEKLKNRKEKKSRDVVSKKEERKRTKKKKEQGQERTEEEMLWDQSILGF

>gi568815596r:88023057_88228017|GENSCAN_predicted_CDS_1|1056_bp
atggacctggcacggtggctcacgcctgtaatcccagcactttgggaggccgaggacagg
cctcgcgcgcccagcccgccgggcgcgccccgagtgtgcgaaatggccgaccagggcgcg
gcccgagtcccgcgcacccctcaccggccgccgcccccggaaagagacgccagggcttcc
gagtggagtgcggagggaaggggcgctcggcttgcggaggaagccgagaaatgggttctt
gctgtgttggccagaaaatcccgttttgtcaacctaatgaagcattcaaagaagacatat
gactcttttcaagatgaacttgaagattatattaaagtacagaaagccagaggcttagag
ccaaagacttgtttcagaaagatgaaaggggactatttggaaacctgtgggtacaaagga
gaggttaattccagacccacgtatagaatgtttgaccagagactcccatctgaaaccatc
cagacctacccaagatcatgcaatattccacaaacagtggaaaatcggttgcctcagtgg
ttaccagcccatgacagcagattgagactagactctctgagctactgtcagttcacgagg
gactgtttctcagaaaaaccagtacccctgaactttaatcaacaagaatatatttgtggc
tcacatggtgtagaacatagagtttacaagcacttctcctcagataacagtaccagtact
catcaagccagtcacaaacagatacatcagaagaggaaaaggcacccagaggaaggcaga
gaaaaatcagaggaggagcggtctaagcataagagaaaaaaaagctgcgaggaaattgac
ttagacaaacacaagagcatccaaagaaagaaaacagaggtggaaatagaaaccgtacat
gtcagtacagaaaagcttaagaatcgaaaggagaaaaaaagccgagatgtagtctctaag
aaagaggaacgtaagcgtacaaaaaagaaaaaggaacaaggccaagaaaggacagaggag
gaaatgctttgggaccagtctattcttggattttga

>gi568815596r:88023057_88228017|GENSCAN_predicted_peptide_2|492_aa
MENVEVFTAEGKGRGLKATKEFWAADIIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRC
GQCKFAHYCDRTCQKDAWLNHKNECSAIKRYGKVPNENIRLAARIMWRVEREGTGLTEGC
LVSVDDLQNHVEHFGEEEQKDLRVDVDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLS
DQRGLQAVGVGIFPNLGLVNHDCWPNCTVIFNNGKIELRALGKISEGEELTVSYIDFLNV
SEERKRQLKKQYYFDCTCEHCQKKLKDDLFLGVKDNPKPSQEVVKEMIQFSKDTLEKIDK
ARSEGLYHEVVKLCRECLEKQEPVFADTNIYMLRMLSIVSEVLSYLQAFEEASFYARRMV
DGYMKLYHPNNAQLGMAVMRAGLTNWHAGNIEVGHGMICKAYAILLVTHGPSHPITKDLE
ASSVLRLVFPSWPELSEDGSAMRVQTEMELRMFRQNEFMYYKMREAALNNQPMQVMAEPS
NEPSPALFHKKQ

>gi568815596r:88023057_88228017|GENSCAN_predicted_CDS_2|1479_bp
atggagaacgtggaggtcttcaccgctgagggcaaaggaaggggtctgaaggccaccaag
gagttctgggctgcagatatcatctttgctgagcgggcttattccgcagtggtttttgac
agccttgttaattttgtgtgccacacctgcttcaagaggcaggagaagctccatcgctgt
gggcagtgcaagtttgcccattactgcgaccgcacctgccagaaggatgcttggctgaac
cacaagaatgaatgttcggccatcaagagatatgggaaggtgcccaatgagaacatcagg
ctggcggcgcgcatcatgtggcgggtggagagagaaggcaccgggctcacggagggctgc
ctggtgtccgtggacgacttgcagaaccacgtggagcactttggggaggaggagcagaag
gacctgcgggtggacgtggacacattcttgcagtactggccgccgcagagccagcagttc
agcatgcagtacatctcgcacatcttcggagtgattaactgcaacggttttactctcagt
gatcagagaggcctgcaggccgtgggcgtaggcatcttccccaacctgggcctggtgaac
catgactgttggcccaactgtactgtcatatttaacaatggcaaaattgagctccgggcc
ctaggcaagatctcagaaggagaggagctgactgtgtcctatattgacttcctcaacgtt
agtgaagaacgcaagaggcagctgaagaagcagtactactttgactgcacatgtgaacac
tgccagaaaaaactgaaggatgacctcttcctgggggtgaaagacaaccccaagccctct
caggaagtggtgaaggagatgatacaattctccaaggatacattggaaaagatagacaag
gctcgttccgagggtttgtatcatgaggttgtgaaattatgccgggagtgcctggagaag
caggagccagtgtttgctgacaccaacatctacatgctgcggatgctgagcattgtttcg
gaggtcctttcctacctccaggcctttgaggaggcctcgttctatgccaggaggatggtg
gacggctatatgaagctctaccaccccaacaatgcccaactgggcatggccgtgatgcgg
gcagggctgaccaactggcatgctggtaacattgaggtggggcacgggatgatctgcaaa
gcctatgccattctcctggtgacacacggaccctcccaccccatcactaaggacttagag
gcaagtagcgtcttgaggctggtgttcccttcctggcctgagctttctgaggatgggagt
gccatgcgggtgcagacggagatggagctacgcatgttccgccagaacgaattcatgtac
tacaagatgcgcgaggctgccctgaacaaccagcccatgcaggtcatggccgagcccagc
aatgagccatccccagctctgttccacaagaagcaatga

>gi568815596r:88023057_88228017|GENSCAN_predicted_peptide_3|158_aa
MSFSGKYQLQSQENFEAFMKAIGLPEELIQKGKDIKGVSEIVQNGKHFKFTITAGSKVIQ
NEFTVGEECELETMTGEKVKTVVQLEGDNKLVTTFKNIKSVTELNGDIITNSPAGASYWL
NEKEEVSLSDTKQGREEWRRPVGEMEKNQHNGNSFSCK

>gi568815596r:88023057_88228017|GENSCAN_predicted_CDS_3|477_bp
atgagtttctccggcaagtaccaactgcagagccaggaaaactttgaagccttcatgaag
gcaatcggtctgccggaagagctcatccagaaggggaaggatatcaagggggtgtcggaa
atcgtgcagaatgggaagcacttcaagttcaccatcaccgctgggtccaaagtgatccaa
aacgaattcacggtgggggaggaatgtgagctggagacaatgacaggggagaaagtcaag
acagtggttcagttggaaggtgacaataaactggtgacaactttcaaaaacatcaagtct
gtgaccgaactcaacggcgacataatcaccaattctcctgctggtgcttcttattggtta
aatgagaaagaagaagtcagcctttctgacacaaagcagggcagagaagagtggagaagg
cctgtgggagaaatggagaaaaaccagcacaatgggaactccttcagttgcaagtga

>gi568815596r:88023057_88228017|GENSCAN_predicted_peptide_4|284_aa
MSELPFTIASKRIKYLGIQLTTDVKDLFKENYKPLLNEIKEDTNKWKNILCSQNWEKTTL
KFIWNQKRARIAKSILSQKNKAGGISLPDFKLYYKATVTKTAWYWYQNRDIDQWNGTEPS
EIMPHIYNYLIFDKPDKNKKWGKDSLFNKWCWENWLAMCRKLKLDPFLTPYTKINSRWIK
DLNVRPKTIKTLEENLGNTIQDVGMVKDFMSKTPKAMATKAKIDKWDTPGKVHLVPTSPE
ASGVGADLLYGEAFMTLLPTPLRRSQKAAVWGMILDICDQGERA

>gi568815596r:88023057_88228017|GENSCAN_predicted_CDS_4|855_bp
atgagtgaactcccattcacaattgcttcaaagagaataaaatacctaggaatccaactt
acaacggatgtgaaggacctcttcaaggagaactacaaaccactgctcaatgaaataaaa
gaggatacaaacaaatggaagaacattctatgctcacagaattgggaaaaaactacttta
aagttcatatggaaccaaaaaagagcccgcattgccaagtcaatcctaagccaaaagaac
aaagctggaggcatctcgctacctgacttcaaactgtactacaaggctacagtaaccaaa
acagcatggtactggtaccaaaacagagatatagaccaatggaatggaacagagccctca
gaaataatgccacatatctacaactatctgatctttgacaaacctgacaaaaacaagaaa
tggggaaaggattccctatttaataaatggtgctgggaaaactggctagccatgtgtaga
aagctgaaactggatcccttccttacaccttatacaaaaattaattcaagatggattaaa
gacttaaatgttagacctaaaaccataaaaaccctagaagaaaacctaggcaataccatt
caggacgtgggcatggtcaaggacttcatgtctaaaacaccaaaagcaatggcaacaaaa
gccaaaattgacaaatgggatacaccaggaaaggtgcacctggtccctacttcacctgag
gcctccggggttggtgctgacttgctctatggagaagctttcatgacgctccttcccact
cccctccgtagatcccagaaagcagcagtgtggggcatgatacttgacatttgcgatcaa
ggggagagggcctag

>gi568815596r:88023057_88228017|GENSCAN_predicted_peptide_5|590_aa
MAMRIVEALITGVLGPVLLLGEWTNHKGLQARVSVAEGLWERSVEGPSHCEAENPSSGPD
RKPHRTVRDADSGVRTFAARAALLRTGPRAPRPAPRAPRPAPASRIMWYVSTRGVAPRVN
FEGALFSGYAPDGGLFMPEELPQLDRGTLCQWSTLSYPGLVKELCALFIGSELLPKDELN
DLIDRAFSRFRHREVVHLSRLRNGLNVLELWHGVTYAFKDLSLSCTTQFLQYFLEKREKH
VTVVVGTSGDTGSAAIESVQGAKNMDIIVLLPKGHCTKIQELQMTTVLKQNVHVFGVEGN
SDELDEPIKTVFADVAFVKKHNLMSLNSINWSRVLVQMAHHFFAYFQCTPSLDTHPLPLV
EVVVPTGAAGNLAAGYIAQKIGLPIRLVVAVNRNDIIHRTVQQGDFSLSEAVKSTLASAM
DIQVPYNMERVFWLLSGSDSQVTRALMEQFERTQSVNLPKELHSKLSEAVTSVSVSDEAI
TQTMGRCWDENQYLLCPHSAVAVNYHYQQIDRQQPSTPRCCLAPASAAKFPEAVLAAGLT
PETPAEIVALEHKETRCTLMRRGDNWMLMLRDTIEDLSRQWRSHALNTSQ

>gi568815596r:88023057_88228017|GENSCAN_predicted_CDS_5|1773_bp
atggcgatgcgaattgtggaggctttaatcactggtgtgctgggaccagttctgctgctg
ggagaatggactaatcacaaaggcctgcaggcccgtgtaagtgtggcagagggcttgtgg
gagaggtccgtggaaggtccttcccactgcgaagctgagaaccctagctccggccccgac
cggaagccccaccgcaccgtcagggacgcggactcgggagtgcgcacattcgcagcccgg
gcagccctgctgcgcaccgggcctcgcgccccgcgccccgcgccccgcgccccgcgcccc
gcaccggcctccaggatcatgtggtatgtcagcaccaggggcgtagccccacgggtcaac
tttgagggggccctcttctctggctatgcacctgacgggggcctctttatgcctgaagag
ctcccacagttggacagagggaccctgtgccagtggagcacactctcctatcctggcctg
gtgaaggagctgtgtgccctcttcattggctctgagctccttccaaaagatgaattaaat
gatctgatcgaccgagccttcagcagattccgtcacagagaagtggtccatctgtccagg
ttgaggaatgggctgaacgtgttggagctgtggcatggcgtcacatatgcatttaaggac
ctgtccctgtcctgcacaacacagttcctgcagtacttcctggagaagagggagaagcac
gtcactgtggttgtaggaacatctggggacacaggaagtgctgccattgagagtgttcaa
ggggcaaagaacatggacattatcgttctgctgcccaaaggtcactgcacaaagattcag
gagctccagatgacaacggtgctgaagcagaacgtacatgtgtttggagtggagggaaac
agcgatgagctcgatgagccgatcaagactgtgtttgccgatgtggcttttgtcaagaag
cacaatctgatgagcctgaattcgatcaactggtcccgggtcctggtgcagatggcccat
cacttctttgcttacttccagtgtacgccatccttggacacacatcccctacccctggtg
gaggtggttgtgccaacaggggctgccggtaaccttgcagctgggtacattgctcaaaag
ataggcctgcccatccgtctggtcgtggcagtgaaccgcaatgacatcatccacaggact
gtccagcagggagacttctctctctctgaggctgttaaatcaaccttggcatcagctatg
gacattcaggtgccctacaacatggagagggtgttctggctgctctctggctctgacagc
caggtgacaagagccctcatggagcagtttgaaaggacccaaagtgtgaatctgcccaag
gaactgcacagcaagctttcagaggcagtgacatccgtgtcagtgtcggatgaagccatc
acccagaccatgggccgctgctgggatgagaaccagtacttgctgtgcccccactcagcg
gtggccgtgaactaccattaccagcagatagacaggcagcagcccagcactccccggtgc
tgcctcgcccctgcctctgcagccaagttcccggaagctgtcctggctgctggcctgacc
cctgagactcccgcggagatcgtagccctggagcacaaggagacacgctgcaccctgatg
cggagaggtgacaactggatgctgatgcttcgggacaccattgaggaccttagccgacag
tggaggagtcatgccctcaacacctcccagtag

>gi568815596r:88023057_88228017|GENSCAN_predicted_peptide_6|66_aa
MHPVHWANASDPTCKLPGPVQDQNPAVSTTCAPSSPALLQQATRWLQGLQVALAFTPVAR
GTYGPE

>gi568815596r:88023057_88228017|GENSCAN_predicted_CDS_6|201_bp
atgcacccagtccactgggcaaatgccagcgaccccacctgtaagctgccaggccctgtg
caagatcaaaaccctgcagtctccaccacctgtgcaccaagcagcccagccctgctgcag
caggcgacaaggtggctgcaggggctacaagttgcactggctttcaccccagtggcaaga
ggcacctatggtcctgaatga