GENSCAN 1.0 Date run: 5-Nov-116 Time: 19:37:49

Sequence gi568815587r:44834887_45038335 : 203449 bp : 50.62% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------
 1.01 Init +   2381   2565  185  1  2   25   63   168 0.470   6.59
 1.02 Term +   4721   4829  109  2  1  117   53    -4 0.309  -3.02
 1.03 PlyA +   8337   8342    6                               1.05

 2.00 Prom +  12678  12717   40                              -3.06
 2.01 Init +  13309  13410  102  0  0   63   61   133 0.941   6.34
 2.02 Intr +  13581  13629   49  2  1  123   98   -14 0.799   1.35
 2.03 Intr +  18824  18981  158  0  2  111   28    82 0.637   4.23
 2.04 Intr +  25442  25474   33  1  0  124   99    41 0.581   7.22
 2.05 Intr +  25797  25921  125  2  2   21  115    26 0.286  -2.02
 2.06 Intr +  31228  31350  123  1  0   64   96   104 0.184   8.50
 2.07 Term +  40164  40371  208  0  1   77   46    72 0.011  -1.29
 2.08 PlyA +  41981  41986    6                               1.05

 3.04 PlyA -  43603  43598    6                               1.05
 3.03 Term -  49620  49483  138  0  0   20   49   144 0.303   1.66
 3.02 Intr -  61606  61438  169  0  1   72  111   146 0.833  15.25
 3.01 Init -  63778  63717   62  1  2   44   61    58 0.257  -0.58
 3.00 Prom -  67363  67324   40                              -5.46

 4.00 Prom +  68214  68253   40                              -6.66
 4.01 Init +  71531  71593   63  1  0   92   78     9 0.735   1.45
 4.02 Intr +  72746  72892  147  1  0   85   -4   102 0.443   1.13
 4.03 Intr +  74819  75013  195  1  0   28   59   363 0.866  27.01
 4.04 Intr +  83086  83223  138  0  0  103   41   162 0.709  13.66
 4.05 Intr +  84328  84426   99  0  0  109   79   197 0.999  21.21
 4.06 Intr +  84931  85113  183  0  0   67   84   121 0.953   9.58
 4.07 Intr +  91788  91871   84  2  0  119  107   179 0.996  22.92
 4.08 Term +  94245  94292   48  2  0  126   44    85 0.893   5.20
 4.09 PlyA +  97519  97524    6                               1.05

 5.14 PlyA -  98873  98868    6                              -3.24
 5.13 Term - 100131  99998  134  1  2   92   33   162 0.986   9.35
 5.12 Intr - 100776 100675  102  1  0   93   92   158 0.989  16.85
 5.11 Intr - 101231 101148   84  0  0   90   91    -1 0.532   0.19
 5.10 Intr - 102013 101917   97  1  1  108   70   133 0.987  13.08
 5.09 Intr - 102466 102418   49  0  1  100   89    91 0.994   9.08
 5.08 Intr - 102727 102669   59  1  2  121   99    34 0.984   5.48
 5.07 Intr - 103480 103321  160  0  1   69   81   258 0.651  23.19
 5.06 Intr - 108947 108720  228  1  0   51   40   144 0.228   2.88
 5.05 Intr - 115257 115087  171  2  0  100   92    33 0.330   3.96
 5.04 Intr - 119554 119250  305  1  2   78  116    48 0.338   1.99
 5.03 Intr - 132570 132413  158  1  2   78   18   104 0.043   2.13
 5.02 Intr - 135633 135533  101  2  2   52   37    78 0.328  -1.15
 5.01 Init - 138369 138185  185  0  2   55   58    97 0.301   1.99
 5.00 Prom - 145463 145424   40                              -6.16

 6.00 Prom + 153904 153943   40                              -6.66
 6.01 Init + 154045 154137   93  0  0   82   79    21 0.548   0.98
 6.02 Intr + 154933 154971   39  0  0  114   94    75 0.870   9.12
 6.03 Term + 155202 155384  183  2  0   85   54    73 0.832   1.14
 6.04 PlyA + 156711 156716    6                               1.05

 7.03 PlyA - 158637 158632    6                               1.05
 7.02 Term - 159804 159701  104  1  2   89   53   108 0.743   5.84
 7.01 Init - 166865 166823   43  2  1   72   59    58 0.679   1.88
 7.00 Prom - 174897 174858   40                              -2.46

 8.02 PlyA - 175181 175176    6                              -0.45
 8.01 Sngl - 176009 175269  741  2  0   86   42   263 0.983  17.71
 8.00 Prom - 177409 177370   40                              -2.46

 9.00 Prom + 182595 182634   40                              -4.06
 9.01 Init + 187688 187705   18  1  0   94   94    -5 0.059   0.71
 9.02 Intr + 190047 190142   96  2  0   49   70    87 0.071   3.21
 9.03 Term + 194879 195010  132  1  0   53   45   148 0.146   5.09
 9.04 PlyA + 196741 196746    6                               1.05

10.04 PlyA - 197393 197388    6                               1.05
10.03 Term - 199555 199503   53  2  2   96   46    42 0.745  -1.61
10.02 Intr - 199695 199633   63  1  0   82   64    75 0.560   3.19
10.01 Init - 201206 201104  103  2  1   69  106    31 0.489   3.41

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------
S.001 Init +  42212  42331  120  1  0   80   51   138 0.832   9.52

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815587r:44834887_45038335|GENSCAN_predicted_peptide_1|97_aa
MSDFGEFTSVEEVTADVVEMARDLELEMEAEAVTELPQSHDKALVDEELLPMDKQRDGNY
SWGSSTNCSNTTTTCLLSAPLILANCFLELRQAPSLN

>gi568815587r:44834887_45038335|GENSCAN_predicted_CDS_1|294_bp
atgagtgacttcggggagttcacttcagtggaggaagtaactgcagatgtggtagaaatg
gcaagggacctagaattagaaatggaggctgaggctgtgactgaattgccgcaatctcat
gataaagccttagtggatgaggagttacttcctatggataagcaaagagatggaaactac
tcctggggttcctctaccaactgctccaacaccaccaccacctgccttctctctgcccct
ctaatcttggctaattgcttcctggagttaagacaggcaccatcactgaactga

>gi568815587r:44834887_45038335|GENSCAN_predicted_peptide_2|265_aa
MRGRGLLCLADLAPALIHLALRGVQSFAHQRWGKSCSGLASSRGSETKLTGLQEPVFISG
VDMHFQEEALRGEDQGFTWKLTGVQKQHLGEEAAALWASRFCVVSISWTTLKLQISQEDN
PRRGQTAEKAEVQASPPGCWPMPLKLNWGRVELLPSCALCCHTRAQAQVQVNTIGIRFQQ
DWHGAQHIIGAKWTCSRAGAAAPCKSERPWTSPLQRYILPRGALPAHVGLEHQTAQLTAS
FSDATAKYQTIYQAFPESWCPQPDL

>gi568815587r:44834887_45038335|GENSCAN_predicted_CDS_2|798_bp
atgaggggcaggggtctcctgtgcctggctgaccttgcccctgccctgatccatttggct
cttcgaggcgttcagtcctttgcgcaccaaagatggggcaaaagctgttcaggcttggca
agcagcaggggctcagaaaccaaattaacaggcctgcaggagcctgtctttatttctgga
gtggacatgcatttccaagaggaagcactgagaggagaggaccagggcttcacttggaaa
ctgaccggagtgcagaagcagcacctcggggaggaggccgcagccctctgggcctcccgg
ttctgtgtggtgagcatctcctggaccaccctcaagcttcagattagccaagaagataac
cctcggaggggccagacagctgagaaggctgaggtgcaggctagcccaccaggttgctgg
cccatgcccctcaagctcaactggggaagggtggaactgttgcccagctgtgccctctgc
tgccacactcgggcacaggcccaggtccaagtgaacacaattggcattaggttccagcag
gactggcatggggcccagcacatcatcggtgctaaatggacttgcagcagggctggagca
gcagctccatgcaagagtgagaggccttggaccagccccttgcagaggtacattctcccc
cgtggggccctcccagcccatgtgggcctagaacaccagacagcccaactcacagccagc
ttctctgatgccactgccaaataccagaccatttaccaggcttttccagagagctggtgt
ccccaaccagacctctaa

>gi568815587r:44834887_45038335|GENSCAN_predicted_peptide_3|122_aa
MPCTALELCKESPGSKMAFISFRYSVHLGYSAGMREPMKCSEQPYEVGITPSSPQYNGET
ESNGKRERIDLSNITLLKRLKELEEVELIAHRGQGPDAAALFENSKPLYPHPCTTWEQSF
AA

>gi568815587r:44834887_45038335|GENSCAN_predicted_CDS_3|369_bp
atgccctgcaccgccttagaactctgcaaagagtcccccggcagcaagatggccttcatc
agcttcaggtactccgtgcacttggggtacagtgctggcatgcgtgagcccatgaaatgc
tcagaacaaccctatgaagtaggtatcacaccatcatccccacagtacaatggggaaact
gagtccaatgggaaaagagagagaattgacttgtccaacatcacactgttgaagaggcta
aaggaactggaggaagtcgagctgatcgcccatcgaggccaggggcctgacgctgctgcc
ctgtttgaaaacagcaagcccctctacccccacccctgtaccacatgggaacagagcttt
gctgcataa

>gi568815587r:44834887_45038335|GENSCAN_predicted_peptide_4|318_aa
MEGDCLSCMKYLMFVFNFFIFPQCSHPKWSWALLEVFSMQLLNPQISNSNKEAQIFLEVV
RETNERDTGWLGGACLLAIGIWVMVDPTGFREIVAANPLLLTGAYILLAMGGLLFLLGFL
GCCGAVRENKCLLLFFFLFILIIFLAELSAAILAFIFRENVRIRPQAFLPPAISKGLVAI
QLTREFFTKELTKHYQGNNDTDVFSATWNSVMITFGCCGVNGPEDFKFASVFRLLTLDSE
EVPEACCRREPQSRDGVLLSREECLLGRSLFLNKQGCYTVILNTFETYVYLAGALAIGVL
AIELFAMIFAMCLFRGIQ

>gi568815587r:44834887_45038335|GENSCAN_predicted_CDS_4|957_bp
atggaaggcgactgtctgagctgcatgaagtatctgatgtttgtattcaatttcttcata
tttccccagtgttcccacccgaaatggagctgggccctccttgaagtgtttagcatgcag
ttgctgaatccccaaatctccaacagcaataaggaggcacagattttcctggaggttgtg
cgggagaccaacgagagagacacaggctggctgggcggggcctgcctgctggccatcggc
atctgggtcatggtggaccccaccggcttccgggagatcgtggctgccaatcctctgctc
ctcacgggcgcctacatcctcctggccatggggggcctgctctttctgctcggcttcctg
ggctgctgcggggccgtccgtgagaacaagtgtctgctgctatttttcttcctgttcatc
ctgatcatcttcctggcagagctctcagcagccatcctggccttcatcttcagggaaaat
gtacgtatcaggccccaagctttcctgcctcctgctatcagcaaggggttggtggccatt
cagctcacccgagaattcttcaccaaggagctcaccaagcactaccagggcaataacgac
acagacgtcttctctgccacctggaactcggtcatgatcacatttggttgctgcggggtc
aacgggcctgaagactttaagtttgcatctgtgtttcgactcctgaccctggatagtgaa
gaggtgccggaggcctgctgccggagggaaccccaaagtcgggacggggtcctgctgagc
cgggaggagtgcctcctgggaaggagcctattcctaaacaagcagggctgttacacggtg
atcctcaacaccttcgagacctacgtctacttggccggagcccttgccatcggggtactg
gccatcgagcttttcgccatgatctttgccatgtgcctcttccggggcatccagtag

>gi568815587r:44834887_45038335|GENSCAN_predicted_peptide_5|610_aa
MHQIPSERRLPRARLQLYFQPCCGKQFRAGFSSLCTTSTSPPVSMRQFSFSAQGLPQRPN
TRPLQASNVDKGHSSSFPDEETKLPKQFGLKQVAQGSVGSWVPGEGGADGNSAVPAKDFE
NPKAKRKTFHQRGCWPNATLHGGSWPSHWFLAADTGNPCAHHTFLQGHKHRHTGALLSAH
SHLATKEVLLPLLVEGDQGRWSPCCSQVPVGVWRSDPPTPATAAGCFLAPSPTPLPSPPP
WGLERNTSPRAADGGGGGVPAAHKSGPTFWGGGLGAGALGRRTRGRLTARSSPPGDTGAE
AWVSESGLTGRYTEEHSTSHTFRLVGLGESNCREMTRSRGYKWPVAADTETRESPEEDED
PVPWEGPGKRGCVQDSLEEQGACPSAEAGLEEKMAAKQPPPLMKKHSQTDLVSRLKTRKI
LGVGGEDDDGEVHRSKISQVLGNEIKFTIREPLGLRVWQFVSAVLFSGIAIMALAFPDQL
YDAVFDGAQVTSKTPIRLYGGALLRDFLKASPYRAQVFCLGVRVKENPIIGPGISLIMWN
ALYTAEKVIIRWTLLTEACYFGVQFLVVTATLAETGLMSLGILLLLVSRLLFVVISIYYY
YQVGRRPKKA

>gi568815587r:44834887_45038335|GENSCAN_predicted_CDS_5|1833_bp
atgcaccagattccttctgaacgccgcctgccgcgtgcacggctacaactctatttccag
ccatgctgtggcaagcagttccgggcgggcttcagctccctttgcaccaccagcacctcg
cctcctgtcagcatgaggcagttttcattttctgcccaggggcttccgcagcgccccaac
acaagacctttacaagccagtaatgtcgacaagggtcactcttcctcatttccggatgaa
gaaaccaaattgcccaagcaatttggtttgaagcaggttgcccaagggtctgttggctcg
tgggttcctggtgagggcggagctgatggcaacagtgccgtacctgcaaaggactttgaa
aaccccaaagcaaagagaaagaccttccaccagaggggctgctggcccaatgccacgctc
catggagggagttggccatcacattggtttctggctgcagacactggcaatccctgtgcc
caccacaccttcctgcaggggcacaaacacagacacacaggcgcactcttaagtgctcac
tcacacctggccaccaaggaggtgttgctgcccctgcttgtggagggcgaccagggcaga
tggagcccctgctgctcccaggttcctgtgggagtctggaggagtgaccctcccacccct
gcaacagccgctggctgcttcctggctccatccccaaccccactcccatccccacccccc
tggggccttgagaggaacacttcaccaagggccgcggacggaggtggtggcggagttccc
gctgcccacaagtctggcccgaccttctggggtgggggcctgggggcaggggccctgggc
cggagaacccggggccgcctaacggctcggagctcaccgccgggggacaccggcgctgag
gcctgggtctcagaaagtggcctcacggggagatacacggaggagcacagtacaagccat
accttcagactggttggtcttggggagtcaaactgccgtgaaatgaccaggagtagaggc
tataagtggccagtggctgcagacactgagacccgtgagagtccagaggaggatgaggac
ccagtgccctgggagggccctgggaagaggggctgtgtgcaagatagcctggaggaacaa
ggagcatgcccttctgcagaggccgggctggaggagaagatggcggccaagcagcccccg
cctctgatgaagaagcacagccagacggacctcgtgagccgcctgaagacccgcaagatc
ctcggcgtgggcggggaggatgacgacggggaggtgcatcgctccaagatcagccaggtc
ttaggcaatgaaatcaagtttaccattcgggagcctttggggctcagggtctggcagttc
gtctctgctgtgctcttctccggcattgccatcatggcgcttgccttccctgaccagctc
tatgatgcggtctttgatggagcccaggtgaccagcaagacccccatccgcctctacggc
ggtgccctcctcagggacttccttaaagcgagtccttaccgggctcaggttttctgtctg
ggagtccgtgtaaaagaaaacccaatcataggaccaggcatctccctgatcatgtggaac
gctctctacacggctgagaaggtcatcattcgatggaccctgctcaccgaagcttgctat
ttcggggtccagttcttggtggtcactgccacgctagctgagacgggcctcatgtccctg
gggatcctgctgctcctggtcagccgcctcctttttgtcgtcatcagcatttactactat
taccaagtcggccgaagacccaagaaggcctag

>gi568815587r:44834887_45038335|GENSCAN_predicted_peptide_6|104_aa
MEMVVLTYRKVVRIKWKQEEELPVNSKNLPTVTLKPASLEQQTQRLAGILIAHWTNCQGS
GRQLACVNSLSPHSDPMGKAVIIPVIVRPEKQWHRGVKWPQDWH

>gi568815587r:44834887_45038335|GENSCAN_predicted_CDS_6|315_bp
atggagatggtagtactcacttataggaaggtagtgaggattaaatggaagcaggaggag
gagctccctgtaaacagcaagaacttgcccacagtaaccctcaagcccgcatccctggag
cagcagacccagcgtctggctggcatcctaatcgctcattggacaaattgccagggcagc
ggtagacagctggcatgtgttaactcgctcagtcctcacagcgatcccatggggaaggcg
gttatcatccccgttattgtacggccagagaaacagtggcacagaggagtgaagtggcct
caggattggcactga

>gi568815587r:44834887_45038335|GENSCAN_predicted_peptide_7|48_aa
MNKVMELTPQCDEAAHSLGAAVSPIGGDGCDDSPNSDLHVVPTVLTGH

>gi568815587r:44834887_45038335|GENSCAN_predicted_CDS_7|147_bp
atgaacaaagtaatggaacttacgccccagtgtgatgaggcagcccattctctaggggct
gccgtctctcccatcggaggggacggttgtgatgacagccccaacagcgacctgcatgtg
gtgcccactgtgctcacaggacactga

>gi568815587r:44834887_45038335|GENSCAN_predicted_peptide_8|246_aa
MAILPKVIYRFDAIPIKLPMTFFTELEKTTLKFIWNQKRAHIAKTILSQKNKAGGIRLPD
FKLYYKATVIKIAWYWCQNRDVDKWNRTEPSEIMPHIYNYLIFDKPDKNKQWGKDSLFNK
WCWENWLAICRKLKVDPFLTPYTKINSRWIKDLHVRPKTIKTLEENLGNAIQDIGMGKDF
MSKTPKAMATKAKIDKWDLIKVKSFCTAKETTIRVNRQPTEWEKIFAIYSSDKGLISRIY
NELKQI

>gi568815587r:44834887_45038335|GENSCAN_predicted_CDS_8|741_bp
atggccatactgcccaaggtaatttatagattcgatgccatccccatcaagctaccaatg
actttcttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaagagcc
cacattgccaagacaatcctaagccaaaagaacaaagctggaggcatcaggctacctgac
ttcaaactatactacaaggctacagtaatcaaaatagcatggtactggtgccaaaacaga
gatgtagacaaatggaacagaacagagccctcagaaataatgccgcatatctacaactat
ctgatctttgacaaacctgacaaaaacaagcaatggggaaaggattccctatttaataaa
tggtgctgggaaaactggctagccatatgtagaaagctgaaagtggatcccttccttaca
ccttatacaaaaattaattcaagatggattaaagacttacatgttagacctaaaaccata
aaaaccctagaagaaaacctaggcaatgccattcaggacataggcatgggcaaggacttc
atgtctaaaacaccaaaagcaatggcaacaaaagccaaaattgacaaatgggatctaatt
aaagtaaaaagcttctgcacagcaaaagaaaccaccatcagagtgaacaggcaacctaca
gaatgggagaaaatttttgcaatctactcatctgacaaagggctaatatccagaatctac
aatgaactcaaacaaatttaa

>gi568815587r:44834887_45038335|GENSCAN_predicted_peptide_9|81_aa
MSLMLQLPHKEAQANSLEKPPGGKLRHLIGSPIPIARLAQTSFDMVRLAAPTKHIGEMPA
FEPPSYISQEDVKMYPLNGQY

>gi568815587r:44834887_45038335|GENSCAN_predicted_CDS_9|246_bp
atgagcctgatgttacagctgccacataaggaagcccaagctaactccctggagaagcca
cctggaggtaaactgaggcacctgattggcagccccatccccattgctagacttgcccag
acctcatttgatatggtccgccttgctgcccccacaaagcacatcggggaaatgccagca
tttgagcctccaagctacatctcccaggaagatgtgaaaatgtatcccctgaatgggcag
tactag

>gi568815587r:44834887_45038335|GENSCAN_predicted_peptide_10|72_aa
MDSSLVDFADPFALCQPSGAAACSEHCVGLESGLGSLVPSVAIDCAVDSRCEQKRECGYG
VRLPGPHPEGNT

>gi568815587r:44834887_45038335|GENSCAN_predicted_CDS_10|219_bp
atggatagctccttggtggattttgcagacccctttgctctttgtcagccttctggggct
gcagcctgctcagagcactgtgttggactggagtcaggactgggttcccttgttccaagt
gtggccattgactgtgctgtggacagcaggtgtgagcagaagagagaatgtggatatgga
gtaaggcttcctggaccacatcctgagggcaatacctag