GENSCAN 1.0    Date run: 2-Nov-116    Time: 17:53:47

Sequence gi568815576r:21823095_22046048 : 222954 bp : 48.11% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.04 PlyA -   1585   1580    6                               1.05
 1.03 Term -  12496  12467   30  1  0  109   42    17 0.046  -2.95
 1.02 Intr -  20316  20287   30  0  0  111   81     8 0.060   0.73
 1.01 Init -  44346  44233  114  0  0  125   42   315 0.021  28.81
 1.00 Prom -  66486  66447   40                              -2.56

 2.09 PlyA -  68469  68464    6                               1.05
 2.08 Term - 100377  99998  380  1  2   95   43   459 0.946  37.05
 2.07 Intr - 102568 102475   94  1  1  116   89   102 0.973  12.64
 2.06 Intr - 108197 108054  144  2  0  129  110    29 0.854   9.68
 2.05 Intr - 110485 110297  189  1  0   76   78   287 0.987  26.28
 2.04 Intr - 111132 110771  362  1  2   66   65   261 0.678  16.54
 2.03 Intr - 116586 116438  149  2  2  140   76   209 0.999  24.78
 2.02 Intr - 123014 122749  266  2  2  100   96   230 0.632  21.31
 2.01 Init - 124017 123988   30  0  0   89   93     2 0.335   0.56
 2.00 Prom - 125661 125622   40                              -9.06

 3.00 Prom + 126757 126796   40                              -7.56
 3.01 Init + 128099 128122   24  1  0  105  103     2 0.797   3.03
 3.02 Intr + 129720 129899  180  2  0   74   54   252 0.858  20.46
 3.03 Intr + 129971 130052   82  1  1  112  119   -18 0.903   2.91
 3.04 Term + 130341 130366   26  1  2   70   52    14 0.395  -5.61
 3.05 PlyA + 131091 131096    6                              -0.45

 4.19 PlyA - 131231 131226    6                               1.05
 4.18 Term - 134501 134020  482  0  2   73   45   609 0.585  50.06
 4.17 Intr - 135599 135398  202  2  1   96  113   398 0.990  41.96
 4.16 Intr - 136138 136038  101  2  2  100   96   144 0.999  16.23
 4.15 Intr - 136642 136493  150  2  0   86   40   312 0.989  26.33
 4.14 Intr - 137355 137227  129  1  0   44   97   277 0.988  24.87
 4.13 Intr - 139508 139335  174  0  0   66   72   304 0.951  26.51
 4.12 Intr - 139799 139653  147  0  0   98   42   315 0.985  28.11
 4.11 Intr - 140934 140829  106  0  1  140   54   131 0.985  14.69
 4.10 Intr - 141221 141067  155  0  2  105  113   301 0.998  34.09
 4.09 Intr - 142281 142191   91  0  1  129   71   100 0.999  11.97
 4.08 Intr - 144622 144509  114  1  0  134   69    77 0.734  11.04
 4.07 Intr - 145681 145525  157  0  1  107   16   123 0.980   7.01
 4.06 Intr - 147312 147116  197  0  2   74   53   361 0.999  29.41
 4.05 Intr - 148208 148143   66  2  0   94   65    29 0.518   0.30
 4.04 Intr - 148857 148783   75  0  0   69  105   119 0.952  11.41
 4.03 Intr - 149624 149518  107  0  2   81   96   164 0.998  16.43
 4.02 Intr - 151394 151263  132  0  0  104   94   139 0.994  16.72
 4.01 Init - 152615 152546   70  2  1   80   75    84 0.752   7.51
 4.00 Prom - 152983 152944   40                              -7.46

 5.00 Prom + 159072 159111   40                              -4.16
 5.01 Init + 159576 159735  160  2  1   58   70   124 0.579   5.59
 5.02 Term + 159782 159960  179  0  2   53   38   115 0.562   0.85
 5.03 PlyA + 159964 159969    6                               1.05

 6.06 PlyA - 160088 160083    6                               1.05
 6.05 Term - 168660 168117  544  2  1  116   42   658 0.935  57.74
 6.04 Intr - 171437 171050  388  0  1  -20   68   257 0.193   6.75
 6.03 Intr - 171679 171621   59  0  2  129   48    43 0.364   2.93
 6.02 Intr - 172338 172039  300  2  0   86  110   127 0.944  10.55
 6.01 Init - 174883 174828   56  1  2   84   53    87 0.936   5.56
 6.00 Prom - 186562 186523   40                              -3.66

 7.00 Prom + 187399 187438   40                              -0.76
 7.01 Init + 202982 203027   46  1  1  109   62    81 0.974   6.45
 7.02 Term + 203156 203403  248  0  2  131   38   154 0.962  10.65
 7.03 PlyA + 204298 204303    6                               1.05

 8.00 Prom + 206678 206717   40                              -0.96
 8.01 Init + 207900 207948   49  2  1  100   71    71 0.999   5.72
 8.02 Term + 208069 208445  377  2  2   89   48   307 0.958  21.70
 8.03 PlyA + 212231 212236    6                               1.05

 9.03 PlyA - 213641 213636    6                               1.05
 9.02 Term - 214001 213928   74  0  2  110   44    22 0.019  -1.93
 9.01 Init - 220654 220588   67  1  1   94  109   117 0.867  15.65

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init -  44346  44228  119  0  2  125   89   317 0.977  33.17

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815576r:21823095_22046048|GENSCAN_predicted_peptide_1|57_aa
MAAAAAAGAGPEMVRGQVFDVGPRYTNLSYIGEGAYGMESSALPFAEERVAASLCVL

>gi568815576r:21823095_22046048|GENSCAN_predicted_CDS_1|174_bp
atggcggcggcggcggcggcgggcgcgggcccggagatggtccgcgggcaggtgttcgac
gtggggccgcgctacaccaacctctcgtacatcggcgagggcgcctacggcatggagagc
tcagccttaccctttgctgaagaacgtgtagcagccagcctctgtgtcctttaa

>gi568815576r:21823095_22046048|GENSCAN_predicted_peptide_2|537_aa
MCLFPVREKPGEAWRPSCPGLAAGPRDALGMSSGAPQKSSPMASGAEETPGFLDTLLQDF
PALLNPEDPLPWKAPGTVLSQEEVEGELAELAMGFLGSRKAPPPLAAALAHEAVSQLLQT
DLSEFRKLPREEEEEEEDDDEEEKAPVTLLDAQSLAQSFFNRLWEVAGQWQKQVPLAARA
SQRQWLVSIHAIRNTRRKMEDRHVSLPSFNQLFGLSVSAPVQTLGTASEDRGEGGGPPGT
EKSCPPRLRQERHPSAPVLAQQETKKGNSDPVNRAYFAVFDGHGGVDAARYAAVHVHTNA
ARQPELPTDPEGALREAFRRTDQMFLRKAKRERLQSGTTGVCALIAGATLHVAWLGDSQV
ILVQQGQVVKLMEPHRPERQDEKARIEALGGFVSHMDCWRVNGTLAVSRAIGDVFQKPYV
SGEADAASRALTGSEDYLLLACDGFFDVVPHQEVVGLVQSHLTRQQGSGLRVAEELVAAA
RERGSHDNITVMVVFLRDPQELLEGGNQGEGDPQAEGRRQDLPSSLPEPETQAPPRS

>gi568815576r:21823095_22046048|GENSCAN_predicted_CDS_2|1614_bp
atgtgcctctttccagtgagagagaagccgggtgaagcctggagaccctcttgccctggc
ctagctgcaggcccccgggatgctttgggcatgtcctctggagccccacagaagagcagc
ccaatggccagtggagctgaggagaccccaggcttcctggacacgctcctgcaagacttc
ccagccctgctgaacccagaggaccctctgccatggaaggccccagggacggtgctcagc
caggaggaggtggagggcgagctggctgagctggccatgggctttctgggcagcaggaag
gccccgccaccacttgctgctgctctggcccacgaagcagtttcacagctgctacagaca
gacctttccgaattcaggaagttgcccagggaggaagaagaagaggaggaggacgatgac
gaggaggaaaaggcccctgtgaccttgctggatgcccaaagcctggcacagagtttcttt
aaccgcctttgggaagtcgccggccagtggcagaagcaggtgccattggctgcccgggcc
tcacagcggcagtggctggtctccatccacgccatccggaacactcgccgcaagatggag
gaccggcacgtgtccctcccttccttcaaccagctcttcggcttgtctgtgagtgctccc
gtccagaccctggggacagcttcggaggaccggggcgagggtgggggccccccaggtacc
gagaagagctgcccaccaaggctgagacaggagagacacccgtcagcccctgtacttgcc
cagcaagagactaagaagggcaacagtgaccctgtgaaccgcgcctactttgctgtgttt
gatggtcacggaggcgtggatgctgcgaggtacgccgctgtccacgtgcacaccaacgct
gcccgccagccagagctgcccacagaccctgagggagccctcagagaagccttccggcgc
accgaccagatgtttctcaggaaagccaagcgagagcggctgcagagcggcaccacaggt
gtgtgtgcgctcattgcaggagcgaccctgcacgtcgcctggctcggggattcccaggtc
attttggtacagcagggacaggtggtgaagctgatggagccacacagaccagaacggcag
gatgagaaggcgcgcattgaagcattgggtggctttgtgtctcacatggactgctggaga
gtcaacgggaccctggccgtctccagagccatcggggatgtcttccagaagccctacgtg
tctggggaggccgatgcagcttcccgggcgctgacgggctccgaggactacctgctgctt
gcctgtgatggcttctttgacgtcgtaccccaccaggaagttgttggcctggtccagagc
cacctgaccaggcagcagggcagcgggctccgtgtcgccgaggagctggtggctgcggcc
cgggagcggggctcccacgacaacatcacggtcatggtggtcttcctcagggacccccaa
gagctgctggagggcgggaaccagggagaaggggacccccaggcagaagggaggaggcag
gacttgccctccagccttccagaacctgagacccaggctccaccaagaagctag

>gi568815576r:21823095_22046048|GENSCAN_predicted_peptide_3|103_aa
MALCVQQVLSPRGSVSRKLLPAGPAHFPSLAAAGAAGAAQAQSGLQAGGAGPLLPPGGYV
DGDCGPLVLWVGGWMGKAIPRSQRPEIKQLLPSIAASKPAMAS

>gi568815576r:21823095_22046048|GENSCAN_predicted_CDS_3|312_bp
atggccctttgtgtacagcaggtgctgtctcctcgcggctccgtgtcccgcaagctgctg
ccggccggccccgcccacttcccgtccctggccgccgcgggcgccgcgggcgccgcgcag
gcgcagtcgggcctccaggctggcggggccggacctctgctgccccctggcggctacgtg
gacggtgactgcggccctttagtgctttgggtgggcggctggatggggaaagcaatccct
aggtcacagcgcccagaaattaagcaacttctgccgtcaatagctgcctcgaagccagca
atggccagttga

>gi568815576r:21823095_22046048|GENSCAN_predicted_peptide_4|884_aa
MKTVLMVAEKPSLAQSIAKILSRGSLSSHKGLNGACSVHEYTGTFAGQPVRFKMTSVCGH
VMTLDFLGKYNKWDKVDPAELFSQAPTEKKEANPKLNMVKFLQVEGRGCDYIVLWLDCDK
EGENICFETVVLWVAPVLLYPQLSPVAQELVLDAVLPVMNKAHGGEKTVFRARFSSITDT
DICNAMACLGEPDHNEALSVDARQELDLRIGCAFTRFQTKYFQGKYGDLDSSLISFGPCQ
TPTLGFCVERHDKIQSFKPETYWVLQAKVNTDKDRSLLLDWDRVRVFDREIAQMFLNMTK
LEKEAQVEATSRKEKAKQRPLALNTVEMLRVASSSLGMGPQHAMQTAERLYTQGYISYPR
TETTHYPENFDLKGSLRQQANHPYWADTVKRLLAEGINRPRKGHDAGDHPPITPMKSATE
AELGGDAWRLYEYITRHFIATVSHDCKYLQSTISFRIGPELFTCSGKTVLSPGFTEVMPW
QSVPLEESLPTCQRGDAFPVGEVKMLEKQTNPPDYLTEAELITLMEKHGIGTDASIPVHI
NNICQRNYVTVESGRRLKPTNLGIVLVHGYYKIDAELVLPTIRSAVEKQLNLIAQGKADY
RQVLGHTLDVFKRKFHYFVDSIAGMDELMEVSFSPLAATGKPLSRCGKCHRFMKYIQAKP
SRLHCSHCDETYTLPQNGTIKLYKELRCPLDDFELVLWSSGSRGKSYPLCPYCYNHPPFR
DMKKGMGCNECTHPSCQHSLSMLGIGQCVECESGVLVLDPTSGPKWKVACNKCNVVAHCF
ENAHRVRVSADTCSVCEAALLDVDFNKAKSPLPGDETQHMGCVFCDPVFQELVELKHAAS
CHPMHRGGPGRRQGRGRGRARRPPGKPNPRRPKDKMSALAAYFV

>gi568815576r:21823095_22046048|GENSCAN_predicted_CDS_4|2655_bp
atgaagactgtgctcatggttgctgaaaagccgtccttggcacagtcaattgccaaaatc
ctctctagagggagcctgtcctcacacaaagggctgaacggggcctgctcagtccacgag
tacactgggacctttgctggccagccagtgcgcttcaagatgacgtctgtctgtggtcac
gtgatgaccctggatttcctgggaaaatacaacaaatgggacaaagtggaccccgcagaa
ctgttcagccaagctcccacggagaagaaagaagctaaccccaagctgaacatggtgaag
ttcctgcaggtggagggcagaggctgcgactacatcgtgctgtggctggactgcgacaag
gagggggagaacatctgctttgagactgttgttctatgggtcgcccccgtgctcctgtac
ccccagctctcacccgtggctcaagagctggttcttgatgctgttctgcccgtcatgaac
aaggcccatggtggcgagaagaccgtgttccgggccaggtttagctccatcacggacaca
gacatctgtaatgccatggcctgcctaggcgagcctgaccacaacgaggcgctctcagtg
gatgctcgccaggagctggacctgcgaatcggctgtgcattcaccaggtttcagactaaa
tatttccaggggaaatacggtgatttagacagctctctcatctcctttgggccgtgtcag
actccaaccctgggattctgtgtggagagacatgataaaatccagtccttcaaaccagag
acctactgggtgctgcaggccaaggttaacactgacaaagacagatctctccttttggac
tgggaccgagtaagagtgtttgaccgggagatcgcacagatgtttttaaacatgacaaag
ctggagaaggaagcccaggtggaggccacaagcaggaaagaaaaggccaagcagaggccc
ctggccctgaacactgtggagatgctgcgtgtggccagctcttctctgggcatggggccg
cagcacgccatgcagacggctgagcggctctacacgcaaggctacatcagctacccacgg
acagagaccacccactaccctgagaactttgacctgaagggctctctgcggcagcaggcc
aaccacccctactgggccgacacggtgaagcggttgttagcagaaggtatcaaccgcccg
cggaaaggccatgacgccggcgaccatccccccatcacccccatgaagtctgccacagag
gccgaattagggggtgacgcgtggcggctctatgagtacatcaccagacacttcatcgcc
acggtcagccatgactgcaagtacctgcagagcaccatctccttcagaattgggcccgag
ctcttcacctgctccgggaagaccgtcctctcaccaggcttcacggaggtcatgccctgg
cagagcgtgcccctggaggagagcctgcccacttgccagcggggtgatgccttccctgtg
ggcgaggtgaagatgctggagaagcagacgaacccacccgactacctgacggaggccgag
ctcatcacgctcatggagaagcatggcatcggcacggatgccagcatccctgtgcatatc
aacaacatctgccagcgcaactatgtcacggtggagagcgggcgccggctcaagcccacc
aacctcggcatcgtcctggtgcacggctactataagattgatgcagagctggtgctcccc
accatccgcagtgcagtggagaagcagctgaacctgatcgcccagggcaaggccgactac
cgccaggtcctgggccacaccctggacgtgttcaagaggaagttccactactttgtcgac
tccattgctggcatggatgagttgatggaggtgtctttctcgcccctggcggccacaggc
aagcccctctcacgctgtgggaagtgccaccgcttcatgaagtacatccaggccaagcca
agccgcctgcactgctcccactgcgatgagacctacacgctcccccagaacggcaccatc
aagctctacaaggagctccgctgccctctggatgacttcgagctggtcctgtggtcatca
ggctctcggggcaagagctacccgctgtgcccctactgctacaaccacccacccttccga
gacatgaagaaaggcatgggctgcaacgagtgtacgcacccctcctgccagcactcgctg
agcatgctgggcatcggccagtgcgtggaatgtgagagcggggtgctggtgctggacccc
acctcgggccccaagtggaaggtggcctgcaacaagtgcaacgtggtagcgcactgcttc
gagaacgcccaccgcgtgcgggtgtccgccgacacctgcagtgtctgtgaggccgccttg
cttgatgtggacttcaacaaggccaagtccccactcccgggcgatgagacgcagcacatg
ggctgcgtcttttgtgaccccgtcttccaggagctggtggagctgaagcatgcggcctcc
tgccaccccatgcaccgcggtggaccagggagaaggcagggtcgagggcggggccgggcc
aggaggccccctgggaagcccaaccccagacggcccaaggacaagatgtcagccctggcc
gcctactttgtatga

>gi568815576r:21823095_22046048|GENSCAN_predicted_peptide_5|112_aa
MRALPAFLSRRARVPQPAAHPQPHRGSSSGPCSRGGYRQPLFPGPAASGVPASGSVARGF
REHPREWHLHAADRRDPQALHPDSSPPRRDNHDNYHYNDYENWHENIYAVNN

>gi568815576r:21823095_22046048|GENSCAN_predicted_CDS_5|339_bp
atgcgtgcgcttcccgcctttctgagccgccgggcgagggtcccgcagcccgccgctcac
ccacagccgcaccgcggatccagctccggtccttgttcccggggcggctaccgacaaccc
ctatttccgggtccagccgcttctggcgtccccgcgtccggttccgtggctcgcggattc
cgggagcaccccagggagtggcacttgcacgccgcggaccgcagggatccccaagccctg
cacccagattcttccccgccacggagagacaatcatgacaattatcattacaatgattat
gaaaactggcatgaaaacatttatgctgttaataattaa

>gi568815576r:21823095_22046048|GENSCAN_predicted_peptide_6|448_aa
METPLTYIAVFGEAELEGRSMEVNHPAPLSLLELAAHSLLSNEASVPLVLEQLTVNLFPP
LLTAAFAKGHRKALKALVQAWPFRFLHLGSLIVQWPNQDSLQAVVDGLEAFSAHRACPRK
SELRMLDFTLDSEQVFYCDKFLSDLLTKVEQSHGALHLCCRKLHIEKMAVDSLLRILKTL
RLDFIQELEVFYWCRDFLVLAEPNLSAIQLGRIFNLRSLKLFYYKWAFSSWVRRPSSYFF
SQLTMLGHLRKLHLSHSYLVGKLHYILSCLWVPLHSLEICNCKLLDTDITYLSRSHHTTC
LKKLDLSVNDLSYMIPGPLGTLLRAVSGTLQHLDLKHCWLKDAHLSALLPALCRCSHLSS
LSLSDNPISSACLLSLLEHTMGLMELKQVLYPIPVDCCIYLHGVCRGPVNEDKLCQLQAE
IQKQLQAMQQADMQWSPSTVFAYAAGAV

>gi568815576r:21823095_22046048|GENSCAN_predicted_CDS_6|1347_bp
atggagaccccactgacctacatcgctgtgtttggtgaagcagaactggagggaaggtcc
atggaggtgaaccacccagcacccctcagcctcttggaacttgcagcccacagcctgctg
agcaatgaggcttcagttccactggtgttggaacagctcacagtgaaccttttcccgcca
ctactcactgctgcatttgccaaggggcacagaaaggctctgaaggccttggtgcaggcc
tggcccttccggtttctccatctgggctctctgatagtgcagtggcccaaccaagacagc
ctgcaagctgtggtggatgggctggaggccttttctgcccacagggcttgtcccaggaag
tcagaactgaggatgctggatttcaccctggactctgagcaagtcttttactgtgataag
ttcctctccgacctcctgacgaaagtggaacagagccacggggccctgcatctctgctgt
aggaagttgcacattgagaagatggccgtcgacagtctgctgaggatcctgaagacactc
aggctggatttcatccaggagctggaggtgttttactggtgtagagacttcttggtattg
gcagagccaaacctatctgccatacagctgggaaggatcttcaacctgcgcagcctcaag
ctcttctactacaagtgggccttctcctcatgggtcaggcggccctccagctacttcttc
tcccagctcaccatgctgggccacctccggaagctgcacctgtctcactcctacctcgta
ggcaaactacattacatactcagctgtctgtgggtcccactacattccctggagatctgc
aactgcaagcttctggacactgacatcacctacttgtcccggagccatcataccacctgc
ctgaagaagctggatctgagcgtcaacgacttgtcctacatgatccctgggcccttgggt
accctgctgagggcagtctcagggacactgcagcacctagacctgaaacactgctggctg
aaggatgcccacctcagtgccctcctgcccgccctgtgccgctgctcccacctcagttcc
ctgagcctctccgacaaccccatctccagtgcctgcctcctgagcctgctggagcacacc
atggggctgatggagctgaagcaggtactctatcccatcccagttgactgctgcatctac
ctgcatggcgtctgccggggtcctgtgaacgaggacaagctgtgccagttgcaggccgag
atacagaagcagctgcaagccatgcagcaggctgacatgcagtggagcccctccactgtc
tttgcttatgcagctggtgcagtttga

>gi568815576r:21823095_22046048|GENSCAN_predicted_peptide_7|97_aa
MAWTPLLLQLLTLCSGSWAQSALTQEASVSGTVGQKVTLSCTGNSNNVGSYAVGWYQQIS
HGAPKTVMFGNSLPSGIPDRFSGSKSGTTASLTISGL
>gi568815576r:21823095_22046048|GENSCAN_predicted_CDS_7|294_bp
atggcctggacccctctcctcctccagcttctcaccctctgctcagggtcctgggcacag
tctgcgctgacccaggaagcctcggtgtcagggaccgtgggacagaaggtcaccctctcc
tgtactggaaacagcaacaacgttggaagttatgctgtgggctggtaccaacagatttct
cacggtgctcccaaaactgtgatgtttggaaattctctgccctcagggatccctgaccgc
ttctctggctcaaagtctgggaccacagcctccctgactatctcgggcctctag

>gi568815576r:21823095_22046048|GENSCAN_predicted_peptide_8|141_aa
MAWTPLLFLTLLLHCTGSLSQLVLTQSPSASASLGASVKLTCTLSSGHSSYAIAWHQQQP
EKGPRYLMKLNSDGSHSKGDGIPDRFSGSSSGAERYLTISSLQSEDEADYYCQTWGTGIH
TVTQADEEVGQKPQPAQGLVI

>gi568815576r:21823095_22046048|GENSCAN_predicted_CDS_8|426_bp
atggcttggaccccactcctcttcctcaccctcctcctccactgcacagggtctctctcc
cagcttgtgctgactcaatcgccctctgcctctgcctccctgggagcctcggtcaagctc
acctgcactctgagcagtgggcacagcagctacgccatcgcatggcatcagcagcagcca
gagaagggccctcggtacttgatgaagcttaacagtgatggcagccacagcaagggggac
gggatccctgatcgcttctcaggctccagctctggggctgagcgctacctcaccatctcc
agcctccagtctgaggatgaggctgactattactgtcagacctggggcactggcattcac
acagtgacacaggcagatgaggaagtgggacagaaacctcagcctgctcagggtcttgtt
atatga

>gi568815576r:21823095_22046048|GENSCAN_predicted_peptide_9|46_aa
MALVLLRPDSSIFANIVAVPSAAQVHGDLGGAGQMRDQKATREGVF

>gi568815576r:21823095_22046048|GENSCAN_predicted_CDS_9|141_bp
atggccctggtgctgctgcggccagacagctccatctttgccaacattgttgctgtcccc
agtgcagctcaggttcatggagatctgggtggggctgggcagatgagggaccagaaggcc
actcgagagggggtgttctag