GENSCAN 1.0 Date run: 8-Nov-116 Time: 11:36:40 Sequence gi568815584f:69299370_69558590 : 259221 bp : 45.39% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 8717 8863 147 0 0 118 48 101 0.404 9.31 1.02 Intr + 13793 13848 56 0 2 37 91 43 0.054 -1.80 1.03 Intr + 21342 21499 158 2 2 97 96 284 0.992 28.91 1.04 Intr + 25323 25421 99 0 0 113 107 199 0.999 23.53 1.05 Intr + 25968 26035 68 0 2 58 78 60 0.970 0.55 1.06 Intr + 26593 26658 66 2 0 89 113 51 0.964 6.58 1.07 Intr + 29081 29202 122 0 2 89 83 161 0.899 15.91 1.08 Term + 29618 29671 54 1 0 60 42 53 0.761 -4.54 1.09 PlyA + 30260 30265 6 1.05 2.00 Prom + 31157 31196 40 -3.06 2.01 Init + 31385 31513 129 1 0 60 49 141 0.820 7.55 2.02 Intr + 32095 32182 88 0 1 103 105 79 0.998 10.64 2.03 Intr + 33716 33800 85 0 1 108 77 67 0.954 6.48 2.04 Intr + 34128 34231 104 0 2 109 97 106 0.998 13.42 2.05 Intr + 39282 39408 127 1 1 90 96 110 0.999 11.74 2.06 Intr + 40158 40250 93 0 0 139 80 149 0.999 18.28 2.07 Intr + 40297 40321 25 1 1 67 68 19 0.871 -4.17 2.08 Intr + 41774 41858 85 1 1 55 100 105 0.926 7.79 2.09 Intr + 42281 42395 115 0 1 61 94 188 0.926 16.21 2.10 Intr + 47671 47812 142 1 1 88 110 105 0.999 13.06 2.11 Intr + 48508 48633 126 0 0 121 109 95 0.999 15.78 2.12 Term + 52662 52799 138 2 0 117 49 41 0.316 1.06 2.13 PlyA + 55075 55080 6 1.05 3.04 PlyA - 56226 56221 6 -0.45 3.03 Term - 59302 59111 192 1 0 31 34 145 0.075 0.82 3.02 Intr - 63906 63748 159 0 0 90 56 93 0.155 6.48 3.01 Init - 69618 69604 15 0 0 73 109 7 0.122 0.49 3.00 Prom - 78854 78815 40 -0.96 4.05 PlyA - 79225 79220 6 1.05 4.04 Term - 81271 81169 103 0 1 112 32 142 0.999 8.75 4.03 Intr - 87714 87594 121 1 1 94 78 87 0.913 7.85 4.02 Intr - 95543 95456 88 2 1 74 113 61 0.614 6.74 4.01 Init - 95897 95895 3 2 0 63 46 0 0.291 -6.70 4.00 Prom - 97721 97682 40 -1.16 5.00 Prom + 97840 97879 40 -7.36 5.01 Init + 98770 99081 312 0 0 57 116 357 0.797 30.73 5.02 Intr + 124725 124833 109 2 1 106 76 145 0.694 14.96 5.03 Intr + 142700 142897 198 0 0 74 113 172 0.983 17.62 5.04 Intr + 153872 153940 69 0 0 94 91 34 0.900 3.45 5.05 Intr + 155413 155528 116 2 2 60 121 49 0.916 5.57 5.06 Intr + 156363 156497 135 2 0 63 121 57 0.984 7.26 5.07 Term + 158994 159224 231 2 0 99 36 77 0.523 0.07 5.08 PlyA + 160918 160923 6 1.05 6.00 Prom + 180505 180544 40 -5.26 6.01 Init + 185597 185745 149 1 2 61 101 275 0.995 25.66 6.02 Intr + 200746 200839 94 1 1 77 100 57 0.980 5.77 6.03 Intr + 201208 201297 90 0 0 93 84 137 0.970 13.99 6.04 Intr + 201502 201578 77 0 2 90 78 100 0.994 7.51 6.05 Intr + 202365 202456 92 0 2 135 24 80 0.452 5.94 6.06 Intr + 203458 203510 53 2 2 68 92 35 0.482 0.53 6.07 Intr + 222914 223008 95 1 2 131 26 173 0.871 14.16 6.08 Intr + 224860 224953 94 1 1 102 78 82 0.905 8.57 6.09 Intr + 226575 226753 179 2 2 81 77 271 0.999 24.02 6.10 Intr + 227328 227460 133 0 1 80 86 241 0.936 23.75 6.11 Intr + 227819 227963 145 1 1 114 80 169 0.999 18.56 6.12 Intr + 228414 228563 150 1 0 94 79 298 0.990 29.63 6.13 Intr + 228881 229046 166 0 1 73 -2 207 0.032 9.22 6.14 Intr + 238362 238407 46 0 1 106 81 72 0.389 6.71 6.15 Intr + 248442 248580 139 2 1 40 93 104 0.957 6.14 6.16 Intr + 248613 248722 110 1 2 56 105 135 0.629 12.00 6.17 Term + 253170 253226 57 2 0 102 43 54 0.526 -0.01 6.18 PlyA + 254493 254498 6 1.05 7.02 PlyA - 254610 254605 6 -0.45 7.01 Term - 255466 255272 195 1 0 96 44 108 0.750 4.61 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 54674 54477 198 2 0 79 53 211 0.840 9.88 S.002 Init - 122420 122321 100 2 1 75 65 34 0.852 0.26 S.003 Init + 124345 124347 3 0 0 108 22 0 0.882 -4.60 S.004 Term + 228881 229050 170 0 2 73 45 212 0.885 13.44 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815584f:69299370_69558590|GENSCAN_predicted_peptide_1|256_aa XLSYSGRYLGPQMLQLRKDLLFINVFIPEFHSEDDDDNDDSNELNPFILDAQSSPHTLGY SSCGWKDQVTGTPSKGFDEKAYLSAKQLKAGEDPYRQHAFNQLESDKLSPDRPIRDTRHY SCPSVSYSSDLPATSVIITFHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFSSDPEDC LLLTRIPKVKCLRNDRREGLIRSRVRGADVAAATVLTFLDSHCEVNTEWLPPMLQRVKEG GFGQMAEKDEVITTRP >gi568815584f:69299370_69558590|GENSCAN_predicted_CDS_1|771_bp ngactaagctactccggacggtacctcggccctcagatgttacagctgaggaaggactta cttttcatcaatgtatttatccctgagtttcactctgaggatgatgatgataatgatgac agtaatgagctcaatccttttatcttagatgcccagtcttccccccatactctgggctac agcagctgtggctggaaggatcaggtgacaggaactccctcgaaaggctttgatgagaag gcctacctgtcggccaagcagctgaaggctggagaggacccctacagacagcacgccttc aaccagctggagagtgacaagctgagcccagaccggcccatccgggacacccgccattac agctgcccatctgtgtcctactcctcggacctgccagccaccagcgtcatcatcaccttc cacaatgaggcccgttccaccctgctgcgcacagtgaagagtgtcctgaaccgaactcct gccaacttgatccaggagatcattttagtggatgacttcagctcagatccggaagactgt ctactcctgaccaggatccccaaggtcaagtgcctgcgcaatgatcggcgggaagggctg atccggtcccgagtgcgtggggcggacgtggctgcagctaccgttctcacctttctggat agccactgcgaagtgaacaccgagtggctgccgcccatgctgcagcgggtgaaggaggga ggatttggacagatggcagaaaaggatgaggtgataacaacccgaccataa >gi568815584f:69299370_69558590|GENSCAN_predicted_peptide_2|418_aa MRRLMKGDEVLHVVYDIIYHPKKEASKINIIHQGFGFGFVFWEDHTRVVSPIIDVISLDN FAYLAASADLRGGFDWSLHFKWEQIPLEQKMTRTDPTRPIRTPVIAGGIFVIDKSWFNHL GKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYNFPEGNALTYIRNT KRTAEVWMDEYKQYYYEARPSAIGKAFGSTTPAKYEVLNVEETEAQIAGDEGRQPVAIVV GVPEEGQSPSPASSYSVATRIEQRKKMNCKSFRWYLENVYPELTVPVKEALPGIIKQGVN CLESQGQNTAGDFLLGMGICRGSAKNPQPAQAWLFSDHLIQQQGKCLAATSTLMSSPGSP VILQMCNPREGKQKWRRKGSFIQHSVSGLCLETKPAQLVTSKCQADAQAQQWQLLPHT >gi568815584f:69299370_69558590|GENSCAN_predicted_CDS_2|1257_bp atgagaaggctcatgaagggtgatgaagttttacatgtggtatatgacataatctaccac cccaaaaaagaggccagcaaaatcaacattattcatcaaggttttggatttggatttgtg ttctgggaggaccacacccgcgtggtgagtcccatcattgatgtcatcagtctggataat tttgcctaccttgcagcatctgctgaccttcgtggagggttcgactggagcctgcatttc aagtgggagcagatccctcttgagcagaagatgacccggacagaccccaccaggcccata aggacgcctgtcatagctggaggaatcttcgtgatcgacaagtcctggtttaaccacttg ggaaagtatgatgcccagatggacatctgggggggagagaattttgagctctccttcagg gtgtggatgtgtggtggcagtctggagatcgtcccctgcagccgggtgggccatgtcttc aggaaacggcacccctacaacttccctgagggtaatgccctcacctacatcaggaatact aagcgcactgcagaagtgtggatggatgaatacaagcaatactactatgaggcccggccc tcggccatcgggaaggccttcggcagcacaaccccagcaaaatacgaagtgttgaatgta gaggaaactgaggctcagatagctggtgacgagggccgtcaaccagtggccattgtggtg ggtgtacctgaggaaggccaaagcccaagccctgcctcctcctacagtgtggctacgcgg atagagcagaggaagaagatgaactgcaagtccttccgctggtacctggagaacgtctac ccagagctcacggtccccgtgaaggaagcactccccggcatcattaagcagggggtgaac tgcttagaatctcagggccagaacacagctggtgacttcctgcttggaatggggatctgc agagggtctgccaagaacccgcagcccgcccaggcatggctgttcagtgaccacctcatc cagcagcaggggaagtgcctggctgccacctccaccttaatgtcctcccctggatcccca gtcatactgcagatgtgcaaccctagagaaggcaagcagaaatggaggagaaaaggatct ttcatccagcattcagtcagtggcctctgcctggagacaaagcctgcccagctggtgacc agcaagtgtcaggctgacgcccaggcccagcagtggcagctgttgccacacacatga >gi568815584f:69299370_69558590|GENSCAN_predicted_peptide_3|121_aa MARAQRDCHPPSGLVMAARPCVLYGPHKQRALAEDCLRSLHLPCSREPKSCQTGMQTTEG EPPGALSTAKASSPSQHPSEGEVSPKPPPDGCRTKATPHTQEKAPSATLLLHLLPPQLTV T >gi568815584f:69299370_69558590|GENSCAN_predicted_CDS_3|366_bp atggccagagcacagcgtgactgccatcctccctcaggcctggtgatggcagcaaggcca tgcgtgctctacggcccccacaagcagagggcactggccgaggactgtctccgcagcctg catcttccctgctccagggaacctaagtcatgtcagacagggatgcagacaacggagggg gaaccccccggagctctaagcactgccaaagccagctcaccttcccagcatccttcagaa ggagaagtcagtcccaagcccccacctgatggctgcagaaccaaggccacaccgcacacc caagagaaagctcccagtgccactctgctgctccacctgctgcccccacaacttactgtg acctga >gi568815584f:69299370_69558590|GENSCAN_predicted_peptide_4|104_aa MSHTILLVQPTKRPEGRTYADYESVNECMEGVCKMYEEHLKRMNPNSPSITYDISQLFDF IDDLADLSCLVYRADTQTYQPYNKDWIKEKIYVLLRRQAQQAGK >gi568815584f:69299370_69558590|GENSCAN_predicted_CDS_4|315_bp atgtctcacaccattttgctggtacagcctaccaagaggccagaaggcagaacttatgct gactacgaatctgtgaatgaatgcatggaaggtgtttgtaaaatgtatgaagaacatctg aaaagaatgaatcccaacagtccctctatcacatatgacatcagtcagttgtttgatttc atcgatgatctggcagacctcagctgcctggtttaccgagctgatacccagacataccag ccttataacaaagactggattaaagagaagatctacgtgctccttcgtcggcaggcccaa caggctgggaaataa >gi568815584f:69299370_69558590|GENSCAN_predicted_peptide_5|389_aa MGLVGRGKRMGLGSLWRPGTRTRVAAEAFLTIAPNSLRYSSCRHRRRYTSLTTTPLTANP RERSRRKSSKVQRKRAFWIIEAQSLPAARAATDSVSAIFDKGKKERLKLVTVLGAGLLCG TALAVIVPEGVHALYEDILEGKHHQASETHNVIASDKAAEKSVVHEHEHSHDHTQLHAYI GVSLVLGFVFMLLVDQIGNSHVHSTDDPEAARSSNSKITTTLGLVVHAAVFYVSLFPNIA DGVALGAAASTSQTSVQLIVFVAIMLHKAPAAFGLVSFLMHAGLERNRIRKHLLVFALAA PVMSMVTYLGLSKSSKEALSEVNATGVAMLFSAGTFLYVATVHVLPEVGGIGHSHKPDAT GGRGLSRLEVAALVLGCLIPLILSVGHQH >gi568815584f:69299370_69558590|GENSCAN_predicted_CDS_5|1170_bp atggggctcgtggggaggggaaaacgtatggggctggggtcgctctggaggccggggact cggactcgggtagccgcggaggcctttctcaccatcgcgccaaactctcttcgctacagc agctgccgacaccgccgccgttacacgagcttaactacaacgccgctaacagccaatcct cgcgagaggagccgccggaagtcgtcgaaagtgcaacgcaaaagagccttttggatcata gaggcacagtcacttccggcagctagagcagctactgactctgtttcagccatcttcgat aaaggcaaaaaggaacgactgaagctggtgactgttttgggtgctggccttctctgtgga actgctctggcagtcatcgtgcctgaaggagtacatgccctttatgaagatattcttgag ggaaaacaccaccaagcaagtgaaacacataatgtgattgcatcagacaaagcagcagaa aaatcagttgtccatgaacatgagcacagccacgaccacacacagctgcatgcctatatt ggtgtttccctcgttctgggcttcgttttcatgttgctggtggaccagattggtaactcc catgtgcattctactgacgatccagaagcagcaaggtctagcaattccaaaatcaccacc acgctgggtctggttgtccatgctgcagtattttatgtttccttgtttccaaatatagct gatggtgttgctttgggagcagcagcatctacttcacagaccagtgtccagttaattgtg tttgtggcaatcatgctacataaggcaccagctgcttttggactggtttccttcttgatg catgctggcttagagcggaatcgaatcagaaagcacttgctggtctttgcattggcagca ccagttatgtccatggtgacatacttaggactgagtaagagcagtaaagaagccctttca gaggtgaacgccacgggagtggccatgcttttctctgccgggacatttctttatgttgcc acagtacatgtcctccctgaggtgggcggaatagggcacagccacaagcccgatgccacg ggagggagaggcctcagccgcctggaagtggcagccctggttctgggttgcctcatccct ctcatcctgtcagtaggacaccagcattaa >gi568815584f:69299370_69558590|GENSCAN_predicted_peptide_6|622_aa MFTSKSNSVSPSPSLEQADSDALDISTKVQLYGVLWKRPFGRPSAKWSRRFFIIKESFLL YYSESEKKSFETNKYFNIHPKGVIPLGGCLVEPKEEPSMPYAMKISHQDFHGNILLAAES EFEQTQWLEMLQESGKVTWKNAQLGEAMIKSLEAQGLQLAKEKQEYLDKLMEETEELCLQ REQREELERLNQVLEAEKQQFEEVVQELRMEQEQIKRELELTARCLKGVEQEKKELRHLT ESLQQTLEELSIEKKKTLEMLEENENHLQTLANQSEQPPPSGGLHSNLRQIEEKMQQLLE EKLLAEKRMKENEERSRALEEEREFYSSQSQALQNSLQELTAEKQQAERELKAEVKVRMD LERRLREAEGALRSLEQGLNSKVRNKEKEERMRADVSHLKRFFEECIRNAELEAKMPVIM KNSVYIHKAATRRIKSCRFHRRRSSTSWNDMKPSQSFMTSQLDANNMEELKEVAKRLSRD QRFRESIYHIMATQPGAPSALSRGGNCKMFSHHIGTDGRKKWLLLSAPEDLISSQRLAPR APHPTLGHACGLARVGETRPPSFTSNAAQTKPCGLHGIINSENAPRDLRKETPLVAGLQV LVCLCPTDDTILPVTPGSPMAS >gi568815584f:69299370_69558590|GENSCAN_predicted_CDS_6|1869_bp atgttcacgtccaagtccaactcggtgtcgccctcgccgtccctggagcaggctgactcg gacgccctggatatcagcaccaaagtgcagctctacggcgtgctgtggaagaggcctttc ggcaggccgtcggccaagtggtcccggcggtttttcatcatcaaagagagctttctgctt tactactctgagagcgaaaaaaagagctttgaaaccaataaatacttcaatatacatcct aagggcgtcatccctctggggggctgcctggtggagcccaaggaagagcctagcatgccc tatgccatgaagatctcccaccaggacttccatgggaacatcttgcttgctgctgagtcg gagtttgagcagacccagtggctggagatgctgcaggagtctgggaaggtgacctggaag aatgcccagctgggagaagccatgatcaaaagcctggaggcccaggggctgcagttggct aaggaaaagcaggagtatttagacaaactgatggaagagaccgaagaactctgccttcag agggagcagagagaggagcttgagcgccttaaccaggtgctggaggccgagaagcagcag ttcgaggaggtggtgcaggagctgagaatggagcaggagcagatcaagagggagctggaa ctgactgcaagatgccttaagggtgtagaacaagagaaaaaggaactgaggcacctcacg gagtccttgcagcagacactggaggaactctccatagagaagaagaaaaccctggaaatg ctggaggagaacgagaaccacctgcagacactggccaatcagagtgagcagccccctccc agtgggggcctccatagcaacctccggcagatcgaggagaagatgcagcagctcttagag gagaagctcctggcagagaagcggatgaaggagaacgaggagcgctcacgggccctggag gaggagcgtgagttctactccagccagtcccaggcactgcagaactcgctgcaggagctg acggcagagaagcagcaggctgagcgggagctcaaggctgaggtgaaggtccgcatggac ctggagaggcgtctccgggaggcagaaggggccttgcgaagcctggaacaggggctgaat tccaaggtgcggaataaggagaaggaggagaggatgcgggctgatgtgagccatctgaaa aggttctttgaggagtgcatccggaatgccgagctggaggccaagatgcctgtgatcatg aagaactccgtgtacatccataaggcagccactcgccgcatcaagagctgccgcttccac cgacgccggtccagcacctcctggaatgacatgaagccgtcccagtccttcatgacctcc cagctggatgccaacaacatggaggagctaaaggaggtggccaagcggctcagcagggac cagcgcttccgggaatccatctaccacatcatggccacccagcctggagccccctcggca ctctcccggggtggaaattgcaagatgttcagccatcacatcggcactgatggcaggaag aagtggctgctgctgtctgcgcccgaggacctcatctcctcgcagaggctggctccccgc gccccgcatcccacgctgggacacgcgtgtggccttgccagggtgggcgagacacggccg ccttccttcacatccaacgccgcacagacaaagccttgcgggctccacgggataatcaac agcgagaacgcccccagggacctccggaaagagacaccattggttgcaggcctccaggtt ctcgtctgcctgtgccccactgatgacaccattctgccagtgacgcctggatctcccatg gcatcgtag >gi568815584f:69299370_69558590|GENSCAN_predicted_peptide_7|64_aa VKTKATVCRKNPKRGPGKTEGSTNKKQQHLTAAKGNALKKCLQGKAGWEPEPWLSELAAR PWSL >gi568815584f:69299370_69558590|GENSCAN_predicted_CDS_7|195_bp gttaaaacgaaagccactgtctgcagaaagaaccccaaaagagggcctggaaaaacagaa ggaagcactaataaaaagcagcagcatctaactgctgccaagggcaatgcgctcaagaaa tgtcttcaaggaaaggctggctgggagccagagccctggctttcagagttggctgcaaga ccttggtccttgtag