GENSCAN 1.0 Date run: 6-Nov-116 Time: 21:41:11 Sequence gi568815582r:11773122_12015546 : 242425 bp : 45.29% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.06 Intr - 1398 1115 284 1 2 89 115 164 0.944 16.34 1.05 Intr - 1892 1859 34 2 1 102 91 32 0.999 2.70 1.04 Intr - 3411 3331 81 0 0 97 99 -1 0.769 1.73 1.03 Intr - 3788 3630 159 2 0 80 80 118 0.999 10.38 1.02 Intr - 6242 6045 198 2 0 101 101 118 0.999 13.95 1.01 Init - 24209 24003 207 2 0 51 91 186 0.948 12.02 1.00 Prom - 25178 25139 40 -6.56 2.12 PlyA - 25363 25358 6 1.05 2.11 Term - 26677 26471 207 1 0 101 42 79 0.125 1.94 2.10 Intr - 30959 30862 98 0 2 29 105 48 0.046 0.33 2.09 Intr - 44876 44741 136 2 1 63 77 53 0.059 1.84 2.08 Intr - 64992 64732 261 0 0 89 28 122 0.463 3.98 2.07 Intr - 66864 66574 291 0 0 58 57 187 0.847 9.93 2.06 Intr - 68699 68574 126 2 0 75 30 74 0.688 1.18 2.05 Intr - 68879 68786 94 1 1 48 77 106 0.748 5.47 2.04 Intr - 73722 73574 149 0 2 85 100 17 0.604 1.63 2.03 Intr - 74685 74547 139 2 1 83 91 32 0.325 3.47 2.02 Intr - 77297 77158 140 2 2 98 87 -36 0.269 -3.44 2.01 Init - 78391 78287 105 1 0 84 109 110 0.549 12.51 2.00 Prom - 85002 84963 40 -4.16 3.14 PlyA - 86143 86138 6 1.05 3.13 Term - 91211 91140 72 2 0 98 37 39 0.137 -2.29 3.12 Intr - 103054 102965 90 1 0 50 79 70 0.816 2.49 3.11 Intr - 104459 104286 174 2 0 61 97 161 0.999 14.44 3.10 Intr - 110055 109894 162 0 0 8 116 118 0.658 6.77 3.09 Intr - 112153 112060 94 0 1 79 75 64 0.996 4.17 3.08 Intr - 113490 113350 141 2 0 58 103 64 0.883 4.37 3.07 Intr - 113810 113656 155 2 2 44 83 78 0.994 1.77 3.06 Intr - 114609 114449 161 1 2 43 99 116 0.268 7.91 3.05 Intr - 123664 123437 228 2 0 29 87 157 0.183 7.54 3.04 Intr - 124760 124719 42 0 0 77 91 37 0.208 1.11 3.03 Intr - 142585 142248 338 0 2 19 70 670 0.311 53.46 3.02 Intr - 143259 143017 243 2 0 77 26 197 0.572 9.01 3.01 Init - 148479 148391 89 0 2 72 90 40 0.785 2.74 3.00 Prom - 149684 149645 40 -5.06 4.00 Prom + 150764 150803 40 -2.66 4.01 Init + 152150 152190 41 1 2 80 81 16 0.949 -0.22 4.02 Intr + 153077 153183 107 2 2 93 101 162 0.984 17.86 4.03 Term + 158112 158272 161 1 2 40 43 131 0.235 1.90 4.04 PlyA + 158663 158668 6 -3.44 5.05 PlyA - 159126 159121 6 -0.45 5.04 Term - 160540 160392 149 2 2 47 44 184 0.965 7.96 5.03 Intr - 160803 160704 100 0 1 69 86 64 0.973 3.98 5.02 Intr - 164547 164419 129 0 0 82 92 32 0.931 3.89 5.01 Init - 168122 168048 75 2 0 70 56 113 0.985 5.39 5.00 Prom - 168561 168522 40 -3.56 6.00 Prom + 178177 178216 40 -4.66 6.01 Init + 194466 194722 257 2 2 75 59 214 0.816 13.90 6.02 Intr + 203475 203692 218 0 2 11 101 102 0.372 1.95 6.03 Intr + 204461 204564 104 0 2 83 90 49 0.356 4.49 6.04 Intr + 205002 205044 43 2 1 53 81 22 0.196 -4.19 6.05 Intr + 212434 212507 74 2 2 96 41 93 0.636 4.53 6.06 Term + 214603 214662 60 0 0 127 44 28 0.477 0.10 6.07 PlyA + 215682 215687 6 1.05 7.05 PlyA - 215695 215690 6 1.05 7.04 Term - 217608 217464 145 2 1 56 35 155 0.526 4.38 7.03 Intr - 219523 219414 110 1 2 75 99 9 0.481 -0.22 7.02 Intr - 229639 229370 270 1 0 32 39 188 0.368 6.04 7.01 Init - 238640 238635 6 2 0 84 115 10 0.507 3.61 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815582r:11773122_12015546|GENSCAN_predicted_peptide_1|321_aa MAASPGRQARPPAARGRRRGRNRAAAAAANRGRAASEHPGPAAAAATAARLPPPASACLL PPAGASSPQVYLRALVRNLFNEGNDVYREHDWNNSISQYTEALNIADYAKSEEILIPKEI IEKLYINRIACYSNMGFHDKVLEDCNIVLSLNASNCKALYRKSKALSDLGRYKKAYDAVA KCSLAVPQDEHVIKLTQELAQKLGFKIRKAYVRAEALNHSVEDIEPDLLTPRQEAVPVVS LPAPSFSHEVGSELASVPVMPLTSILPLQVEESALPSAVLANGGKMPFTMPEAFLDDGDM VLGDELDDLLDSAPETNETVM >gi568815582r:11773122_12015546|GENSCAN_predicted_CDS_1|963_bp atggcggcgtctcccgggcggcaagcgcggcctcccgccgctagagggcgccgccgcggg cgcaaccgagccgctgccgccgccgctaaccgagggagagctgcgagcgagcacccaggc cccgccgccgccgccgccaccgctgcccgcctgccgccgcccgcctccgcctgccttctg ccgccggccggcgcgtcgtccccacaggtatatttgcgtgctctcgtgagaaatcttttt aatgaaggaaatgacgtttatcgggaacatgattggaacaactcgataagccagtacacg gaagccttgaatatagctgattatgcaaaatctgaagaaattttaatccccaaagaaata attgaaaaactatatataaatcgtattgcctgctattctaatatgggtttccatgataaa gttttggaggactgcaatatagtcctcagtttaaatgccagtaactgcaaagctctgtat cggaaatctaaggctttaagtgatttaggaagatacaaaaaggcttacgatgctgtagca aagtgctccttagcagtgcctcaggatgagcatgtaataaaactaactcaagaactagct cagaaattgggatttaaaataagaaaagcgtatgtcagagctgaggctttgaaccattct gtggaagatattgagccagatttattaactccaaggcaagaagcagttcctgttgtctct ttaccggcacccagtttttctcatgaagttggaagtgagctggcctcagttcctgttatg cccttaacttctattttgccactacaagtggaagagagcgctctgccatctgcagtgctg gcaaatggaggaaagatgcccttcactatgccagaagcttttctagatgatggagatatg gtccttggagatgaactagatgacctgcttgattctgcacctgaaactaatgaaactgtt atg >gi568815582r:11773122_12015546|GENSCAN_predicted_peptide_2|581_aa MEDSASASLSSAAATGTSTSTPAAPTARKQLDKEQVRKAVDALLTHCKSRKNNYGLLLNE NESLFLMVVLWKIPSKELRVRLTLPHSIRSDSEDICLFTKDEPNSTPEKTEQFYRKLLNK HGIKTVSQIISLQTLKKEYKSYEAKLRLLSSFDFFLTDARIRRLLPSLIGRHFYQRKNAI RIGHVGMQIEHIIENIVAVTKGLSEKLPEKWESVKLLFVKTEKSAALPIFSSFVSNWDEA TKRSLLNKKKKEARRKRRERNFEKQKERKKKRQQARKTASVLSKDDVAPESGDTTVKKPE SKKEQTPEHGKKKRGRGKAQVKATNESEDEIPQLVPIGKKTPANEKVEIQKHATGKKSPA KSPNPSTPRGKKRKALPASETPKAAESETPGKSPEKKPKIKEEAVKEKSPSLGKKDARQT PKKPEAKFFTTPSKSIMAFVSSHVSVWLFVTTIQCLRAQGSGNFQRAEHGEVPGGWLTQE AERKTKATGNNPIILTPNTAENRYNMQRGGGFKVSRTWKLAGAGVSPVSRDLLSLLATGL DEVISDHDDCLEDQIGRCVSEQLLDGLSVKASPREATVVEA >gi568815582r:11773122_12015546|GENSCAN_predicted_CDS_2|1746_bp atggaggattcggcctcggcctcgctgtcttctgcagccgctactggaacctccacctcg actccagcggccccgacagcacggaagcagctggataaagaacaggttagaaaggcagtg gacgctctcttgacgcattgcaagtccaggaaaaacaattatgggttgcttttgaatgag aatgaaagtttatttttaatggtggtattatggaaaattccaagtaaagaactgagggtc agattgaccttgcctcatagtattcgatcagattcagaagatatctgtttatttacgaag gatgaacccaattcaactcctgaaaagacagaacagttttatagaaagcttttaaacaag catggaattaaaaccgtttctcagattatctccctccaaactctaaagaaggaatataaa tcctatgaagccaagctccgccttctgagcagttttgatttcttccttactgatgccaga attaggcggctcttaccctcactcattgggagacatttctatcaaagaaagaatgctata cgtattggtcacgttggaatgcaaattgagcacatcattgaaaacattgttgctgtcacc aaaggactttcagaaaaattgccagagaagtgggagagcgtgaaactcctgtttgtgaaa actgagaaatcggctgcacttcccatcttttcctcgtttgtcagcaattgggatgaagcc accaaaagatctttgcttaataagaagaaaaaagaggcaaggagaaaacgaagagaaaga aattttgaaaaacaaaaggagaggaagaagaagaggcagcaggctaggaagactgcatca gttcttagtaaagatgatgtggcacctgaaagtggtgatactacagtgaagaaacctgaa tcaaagaaggaacagaccccagagcatgggaagaaaaaacgtggcagaggaaaagcccaa gttaaagcaacaaatgaatccgaagacgaaatcccacagctggtaccaataggaaagaag actccagctaatgaaaaagtagagattcaaaaacatgccacaggaaagaagtctccagca aagagtcctaatcccagcacacctcgtgggaagaaaagaaaggctttgccagcatctgag accccaaaagctgcagagtctgagaccccagggaaaagcccagagaagaagccaaaaatc aaagaagaggcagtgaaggaaaaaagtccttcgctggggaaaaaagatgcgagacagact ccaaaaaagccagaggccaagtttttcaccactcctagtaaatctatcatggcctttgtg tccagtcatgtttctgtctggttgttcgtcacaactatccagtgcctaagagcacagggt tcagggaacttccagagagctgaacacggggaggttcctggagggtggctgacccaagaa gctgaaaggaaaaccaaggcaaccggaaataatcctattatactgactcctaatacagct gagaacaggtacaacatgcagaggggtggaggattcaaggtgtcaagaacctggaaactg gctggggctggggtgtcaccagtatccagagacctcctgagccttctggcaacaggacta gacgaggtcatttcagatcatgatgactgcttggaagatcagatagggcgctgtgttagt gagcaacttttagatgggctgtcagtgaaggcctctccaagagaggctactgtagttgag gcctga >gi568815582r:11773122_12015546|GENSCAN_predicted_peptide_3|662_aa MSHSAQLAIISLTHLQAFLIRILNREGFSSLCPAGQSGPLLWAKILSVFGPAQSCTNQGL PLPPHQVTGGAAERPPDLRLLALPAGSRSRFRLRPVPELRLPSAPGKTAAGGGGGGGGGG GGSSSGSSSSDSAPDCWDQADMEAPGPGPCGGGGSLAAAAEAQRENLSAAFSRQLNVNAK PFVPNVHAAEFVPSFLRGPAAPPPPVGGAANNHGAGSGAGGRAGSNSAVSMELSEPIVEN GETEMSPEESWEHKEEISEAEPGGGSLGDGRPPEESAHEMMEEEEEIPKPKSVVAPPGAP KKEHVNVVFIGHVDTNQEERDKGKTVEVGRAYFETEKKHFTILDAPGHKSFVPNMIGGAS QADLAVLVISARKGEFETGFEKGGQTREHAMLAKTAGVKHLIVLINKMDDPTVNWSNERY EECKEKLVPFLKKVGFNPKKDIHFMPCSGLTGANLKEQSDFCPWYIGLPFIPYLDNLPNF NRSVDGPIRLPIVDKYKCRISLWAIELLYSTKKSETGFYFTFDQDMGTVVLGKLESGSIC KGQQLVMMPNKHNVEVLGILSDDVETDTVAPGENLKIRLKGIEEEEILPGFILCDPNNLC HSGRTFDAQIVIIEHKSIICPGYNAVLHIHTCIEEVEITIYSILCMNNFSKSFYRSLIDD VA >gi568815582r:11773122_12015546|GENSCAN_predicted_CDS_3|1989_bp atgagccacagcgcccagctggctattatatctttaacacatctccaagctttcctcatc cgcattttgaacagggagggcttcagcagcctgtgtccagcaggccagagtggacccctc ctatgggctaaaatactctcggtcttcggcccagcacagagctgcaccaaccagggcctg ccgctgcccccccaccaggtgacaggtggggccgccgagcggccaccagacctccgcctc ctcgcgttgcctgccgggagccggagccgcttccggttgcgccctgtccctgaacttcga ctcccgtcggcccccgggaaaaccgcagcgggtggcggcggcggcggcggcggcggcggc ggcgggagcagcagcggcagcagcagcagcgactcggcgcctgactgctgggaccaggcg gacatggaagcccccgggccgggcccttgcggcggcggcggctccctggcggcggcggcc gaggcccagcgggagaacctcagcgcggccttcagccggcaactcaacgtcaacgccaag cccttcgtgcccaacgtccacgccgccgagttcgtgccgtccttcctgcggggcccggca gcgccgccacccccagttggcggcgccgccaataaccacggagccggcagcggcgcggga ggccgtgcgggttcaaattcagctgttagcatggaactttcagaacctattgtagaaaat ggagagacagaaatgtctccagaagaatcatgggagcacaaagaagaaataagtgaagca gagccagggggtggttccttgggagatggaaggccgccagaggaaagtgcccatgaaatg atggaggaggaagaggaaatcccaaaacctaagtctgtggttgcaccgccaggtgctcct aagaaagagcatgtaaatgtagtattcattgggcacgtagacacaaatcaggaagaacga gacaagggtaaaacagtagaagtgggtcgtgcctattttgaaaccgaaaagaagcatttc acaattctagatgcccctggccacaagagttttgtcccaaatatgattggtggtgcctct caagctgatttggctgtgctggtaatctcagccaggaaaggagagtttgaaactggattt gaaaaaggaggacagacaagagaacatgcaatgttggcaaagacagcaggtgtaaaacac ctaattgtgctaattaataagatggatgatccaacagtaaattggagcaatgagagatat gaagaatgtaaggagaaactagtgccatttttgaaaaaagttggcttcaatcccaaaaag gacattcactttatgccctgctcaggacttactggagcaaatctcaaagagcagtcggat ttctgtccttggtacattggattaccgtttattccatatctggataatttgccgaacttc aatagatcagttgatggaccaatcaggctgccaattgtggataagtacaagtgtagaatt tcactttgggctattgagctgttatacagcaccaagaagtctgaaactggattttatttt acatttgatcaggatatgggcactgtggtcctgggaaagctggaatcaggatctatttgt aaaggccagcagcttgtgatgatgccaaacaagcacaacgtggaagttcttggaatactt tccgatgatgtagagactgataccgtagccccaggtgaaaacctcaaaatcagactgaaa ggaattgaagaagaggagattcttccagggtttatactttgtgatcctaataatctttgt cattctggacgcacatttgatgcccagatagtgattatagagcacaaatccatcatctgc ccaggctataatgcggtgctgcatattcatacctgtattgaggaggtggaaataacaatt tattcaattctgtgcatgaataacttctctaagtccttttatagatccctgattgatgac gtagcataa >gi568815582r:11773122_12015546|GENSCAN_predicted_peptide_4|102_aa MKGGNPILREGVAWCCHGHDCCYTRAEEAGCSPKTERYSWQCVNQSVLCGDNGKPSTLVL TDFNKVQFTEIEGRKRKRPKKGKFAVLPSHHLIHGPDPKDLT >gi568815582r:11773122_12015546|GENSCAN_predicted_CDS_4|309_bp atgaagggagggaatccaattttacgagagggtgtagcctggtgctgccatggccacgac tgttgttacactcgagctgaggaggccggctgcagccccaagacagagcgctactcctgg cagtgcgtcaatcagagcgtcctgtgcggtgacaacggcaaaccatccaccctggtgttg actgactttaacaaggttcagttcacagagattgagggcagaaaaaggaaacggcctaaa aagggtaagtttgctgtgttgccctcacaccacttgattcatggtcctgatcctaaggat ctcacctga >gi568815582r:11773122_12015546|GENSCAN_predicted_peptide_5|150_aa MRVLWMLFWLLFWLLLEFISHQSTCVINTLADHHHRGTDFGGSPWLRIIIAFPRSYKVVL TLWTVYLWLSFLKTIFQSENGHDVSTDVQQRARRSNRRRQEGNKIGLKDVITLRRHVETK GRAKIRKMKVTTKINHHDKINGKRKTAKKQ >gi568815582r:11773122_12015546|GENSCAN_predicted_CDS_5|453_bp atgcgggtgctgtggatgctcttttggctcctcttttggctcctgctggaatttatcagc catcagtccacctgtgttatcaatactctggctgaccatcatcatcgtgggactgacttt ggtggaagtccttggttacgtatcattattgcatttccgagaagttataaagttgtcctt accctctggacagtttacctttggttgtctttcctgaagactatcttccagtctgaaaat ggacatgatgtatccacagatgtacagcagagagccaggaggtccaaccgccgtagacag gaaggaaataaaattggcctgaaagacgtcattactctacggagacatgtggaaacaaaa ggtagagctaaaatccgtaagatgaaggtgacaacgaaaatcaaccatcatgacaaaatc aatggaaagaggaagaccgccaaaaaacagtaa >gi568815582r:11773122_12015546|GENSCAN_predicted_peptide_6|251_aa MANIDLEKSRTGDEIILPRGLEYTVEECTCEDCIKSKPKVDSDHCFPLPAMEEGATILVT TKTNDYCKSLPAALSATEIEKSISASTALHTGSGDLDLRRGLRSGSAGGASSAARLYFRG GAAGAGLLSPGLSGARQPQKRQRRRRGAGTGPGRGTMSAHLGTVTSSFFFPVQTSSSHHR GGGVDGDAGSEREGKICEMDSPNNEGAGSRSSMERAGHVVILQPITVILVGQKMEIKASN ECIVRFSGPGT >gi568815582r:11773122_12015546|GENSCAN_predicted_CDS_6|756_bp atggctaacattgacctggaaaagagcaggactggtgatgaaattattcttccgagaggc ctcgagtacacggtggaagaatgcacctgtgaagactgcatcaagagcaaaccgaaggtc gactctgaccattgctttccactcccagctatggaggaaggcgcaaccattcttgtcacc acgaaaacgaatgactattgcaagagcctgccagctgctttgagtgctacggagatagag aaatcaatttctgctagcaccgctctccacaccgggtccggcgacttggatctgcgccgc ggtttacgttccgggtcggcgggcggggcctcgtcggcggccaggctctacttccggggc ggggcggccggggcggggctcctgtctcccggcctgtctggagctcggcagccgcagaag cggcagcggcggcggcgcggcgcaggcaccggcccggggagaggcaccatgagcgcccat ctggggaccgtcacttcgtccttcttcttcccagtccaaacttcttcctctcaccatcga gggggtggtgttgatggggatgctggaagtgagagggagggcaagatttgtgaaatggac agtcctaataatgagggggctggcagccgctcctccatggagcgggctggccatgtggtc atcctgcaacccatcactgtgattttggttggccagaaaatggaaattaaagcaagtaat gaatgtatagtgcgtttctcaggacctggaacatag >gi568815582r:11773122_12015546|GENSCAN_predicted_peptide_7|176_aa MKKQALWENIANRKTKAHSHLCIIQEAAPACRQLTVYHTQLFTGDNYNFYKPVLNGSLWT SPRVVMTGYDVTTAQIKPHMSHLVVRNDNLLRASVPSTHHTVILVGTRASSLNNWEPEPC LQNWEGQNRPHIQSLAGNKDIINLLSLQNSFSVTLAGIWDARRYQPNDDFSVAAIV >gi568815582r:11773122_12015546|GENSCAN_predicted_CDS_7|531_bp atgaagaagcaagccttgtgggaaaacattgctaacagaaaaacaaaagcacacagccat ctgtgcattatccaggaagctgctcctgcctgtcggcagctgactgtctaccacacacag ctcttcacgggcgataattacaacttctataagcctgtgctgaatggatccctttggacg agccccagagttgtcatgactggctatgacgttacaacagcacaaatcaagcctcatatg agtcatctggttgtgcggaatgacaatttactgagggcaagtgttccttccacacaccac actgtcatcctggtggggaccagggcaagctcactgaacaattgggaacctgagccgtgc ttgcaaaactgggaaggacagaacaggccccacatacagagtcttgcagggaacaaggac atcatcaacctgttgtcactgcagaattcattctcagttacactggcgggcatctgggat gcccggcgctaccagcccaatgatgactttagtgttgctgctattgtttaa