GENSCAN 1.0    Date run: 4-Nov-116    Time: 16:19:38

Sequence gi568815592f:133789398_133991799 : 202402 bp : 38.65% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.05 PlyA -    761    756    6                               1.05
 1.04 Term -   1339   1001  339  1  0   45   54   276 0.416  13.45
 1.03 Intr -  13671  13582   90  0  0   63   73    83 0.266   3.57
 1.02 Intr -  25258  25205   54  1  0   73   90    86 0.723   5.56
 1.01 Init -  30946  30872   75  1  0   78   66    46 0.463   2.54
 1.00 Prom -  32354  32315   40                              -4.15

 2.04 PlyA -  35128  35123    6                               1.05
 2.03 Term -  36182  35954  229  1  1   41   36   178 0.515   3.02
 2.02 Intr -  49830  49718  113  0  2   52   81   114 0.261   5.36
 2.01 Init -  53224  53141   84  1  0   47   77    73 0.243   2.97
 2.00 Prom -  58523  58484   40                              -5.35

 3.05 PlyA -  59354  59349    6                               1.05
 3.04 Term -  65182  64985  198  1  0   86   42   115 0.980   3.12
 3.03 Intr -  65472  65290  183  0  0   72   65    96 0.948   4.76
 3.02 Intr -  65887  65674  214  0  1    7   75   210 0.920   9.40
 3.01 Init -  71108  70984  125  2  2   66   68    69 0.743   2.39
 3.00 Prom -  72350  72311   40                              -4.95

 4.00 Prom +  81091  81130   40                              -3.65
 4.01 Init + 100001 100450  450  1  0   80  105   658 0.936  62.66
 4.02 Intr + 102250 102366  117  0  0  118    5   110 0.644   5.44
 4.03 Term + 103578 103700  123  2  0   60   47   170 0.752   7.50
 4.04 PlyA + 106121 106126    6                               1.05

 5.03 PlyA - 106135 106130    6                              -3.64
 5.02 Term - 106960 106259  702  1  0   -7   44   330 0.572  11.63
 5.01 Init - 131649 131584   66  0  0   72   64    81 0.118   3.66
 5.00 Prom - 138923 138884   40                              -7.75

 6.00 Prom + 140559 140598   40                              -5.35
 6.01 Init + 142329 142474  146  2  2   68   98   139 0.468  12.54
 6.02 Term + 142837 143356  520  1  1    7   43   225 0.518   2.68
 6.03 PlyA + 143688 143693    6                              -0.45

 7.00 Prom + 143754 143793   40                              -8.25
 7.01 Sngl + 144693 145643  951  2  0   68   38   395 0.990  28.83
 7.02 PlyA + 145701 145706    6                               1.05

 8.00 Prom + 146983 147022   40                              -3.25
 8.01 Init + 147678 147778  101  2  2   76  103    37 0.631   2.25
 8.02 Intr + 157569 157697  129  0  0   84  110    30 0.738   3.69
 8.03 Intr + 160811 160901   91  2  1  119   37    52 0.397   2.28
 8.04 Intr + 162663 163063  401  2  2   54   32   176 0.050   0.98
 8.05 Intr + 163955 164089  135  2  0   90   49    82 0.077   3.26
 8.06 Intr + 164732 164850  119  2  2   -3   92   116 0.103   2.09
 8.07 Intr + 190685 190863  179  0  2  118   83    63 0.921   7.52
 8.08 Intr + 193171 193253   83  0  2   95   94    -2 0.892  -1.38
 8.09 Intr + 193420 193483   64  1  1   72   97    94 0.942   6.50
 8.10 Intr + 194979 195082  104  2  2   83  111    35 0.958   3.35
 8.11 Intr + 195180 195274   95  0  2   34   66   103 0.561   1.39
 8.12 Term + 197564 197643   80  0  2   98   31    72 0.491  -0.55
 8.13 PlyA + 198075 198080    6                               1.05

 9.02 PlyA - 198119 198114    6                               1.05
 9.01 Term - 200833 200644  190  0  1   59   34   155 0.396   3.14

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815592f:133789398_133991799|GENSCAN_predicted_peptide_1|185_aa
MVIEETKTGLYIPDIFDEEVVCSGQVHGIYVTSAEAAAEEGRQLKWFVPKREAITDAGEK
VEKKEPSYTVDKNSRDLVSCMPATPDVAKRGQGTAQAIASEGASPKPWQLPHGIEPVGAQ
KSGIEVWEPPRRFQGMYGSAWMSRKKFAAGVELSWKTSVRAVQMANVGLKPPHRVPTGAL
SLVEL
>gi568815592f:133789398_133991799|GENSCAN_predicted_CDS_1|558_bp
atggtgatagaagagacaaagactggattatatatacctgatatattcgatgaagaagta
gtgtgttcaggacaggtacatggcatttatgtgacatcagcagaggcagctgctgaagaa
ggtagacagctaaaatggtttgtgccaaaaagagaggcaataacagatgctggtgagaaa
gtggagaaaaaagaaccctcatacacagttgataaaaattctagagacttggtgtcctgt
atgccagccactccagatgtggctaaaaggggccaaggtacagctcaggccatagcttca
gagggtgcaagccctaagccttggcagcttccacatggcattgagcctgtgggtgcacag
aagtcaggaattgaggtttgggaacctccacgcagatttcaggggatgtatggaagtgcc
tggatgtccaggaagaagtttgctgcaggagtggagctctcatggaaaacttctgttagg
gcagtacagatggcaaatgtgggattgaagcccccacacagagtccccactggggcactg
tctttagtggagctgtga
>gi568815592f:133789398_133991799|GENSCAN_predicted_peptide_2|141_aa
MDSQKRESSRGDGSGPLGFPETNLRQQELRFLSPFLADLKQSKKGADQSYLGDKSGLGEI
AARRARFYRNVHPYVHCSSIHNRKDMESTQVAINGGLNKENVVHIHHGILCSHKELNHIL
CGNTDAAGNHYPKQTNTETEN
>gi568815592f:133789398_133991799|GENSCAN_predicted_CDS_2|426_bp
atggatagtcagaaaagagaaagcagcagaggagatggctctggtccactgggctttcct
gagaccaatctgagacagcaggagctacgctttctctccccgttccttgccgatcttaaa
caatccaagaaaggggcagaccaaagctacttaggagacaagagcggtcttggtgagatt
gcagcaagaagggcaaggttctaccgaaacgtgcacccatatgttcattgcagcagtatt
cacaatagaaaagacatggaatcaacccaggtggccatcaatggcggactgaataaagaa
aatgtggtacatatacaccatggaatactatgcagccataaagaattaaatcatatcctt
tgtggcaacacagatgcagctggaaaccattatcctaagcaaactaacacagaaacagaa
aactaa
>gi568815592f:133789398_133991799|GENSCAN_predicted_peptide_3|239_aa
MGKDFMTKTPKATATKAKIDKWDLIKLKSFCTAKETINRINRISDLVTVPEAGEKVPPES
ERKSPLEAWRQQYEPLKPGEGTLAGWGRQRCSLTSAGEDALPLGMRAPWGQEQTSGGDSP
RLLRDRQLPWLASEPQPPLWVSWGHWERSQPIPCTLALSADWSCPLGFPGLKARKFELSS
PFHLNPFGQSIRMLPISSGIFKALIRRLTAPSYALPSFPSGIPLDACSKYLLAPEEIHM
>gi568815592f:133789398_133991799|GENSCAN_predicted_CDS_3|720_bp
atgggcaaggattttatgacaaagacaccaaaagcaactgcaacaaaagcaaaaattgac
aaatgggatctaattaaacttaaaagcttctgcacagcaaaagaaactatcaacagaata
aacaggatctcagatttggtgaccgtcccagaagctggggagaaggtgccgccagagagt
gagcggaaatctcccttggaggcctggaggcagcaatacgaacccctcaagccgggagag
ggcaccctggctggctggggccggcagcgctgcagcctcacgtccgcaggtgaagatgcg
ctcccgctgggcatgcgcgccccctggggccaggagcagacctccgggggagactcgccg
cggcttctgcgagatcgtcagctcccctggctggcctcagagcctcagcctcccctctgg
gtttcgtggggtcactgggagcggtcgcagcccatcccctgcacgctggcactatctgct
gactggagttgccctctggggttcccagggctaaaagcgaggaaattcgaattatcttcc
ccattccacttaaaccccttcggccaatcaatccggatgttaccgatttcctctgggata
tttaaagctttgatccgcaggcttacagctccatcttacgcactcccatctttcccctct
gggatccctctagatgcctgctcaaaatacctgctggcgccagaggaaattcacatgtaa
>gi568815592f:133789398_133991799|GENSCAN_predicted_peptide_4|229_aa
MSTGSLSDVEDLQEVEMLECDGLKMDSNKEFVTSNESTEESSNCENGSPQKGRGGLGKRR
KAPTKKSPLSGVSQEGKQVQRNAANARERARMRVLSKAFSRLKTTLPWVPPDTKLSKLDT
LRLASSYIAHLRQILANDKYENGYIHPVNLAPSPPHPLRHGHLPPPPSSFPQTWPFMVAG
KPESDLKEVRSPRRPMRQMPPGLFNVRGYDRSVYKTSSVIYTVIPRREP
>gi568815592f:133789398_133991799|GENSCAN_predicted_CDS_4|690_bp
atgtccaccggctccctcagcgatgtggaggaccttcaagaggtggagatgttggaatgt
gacgggttgaaaatggattcgaacaaggaatttgtgacttccaacgagagcaccgaggag
agctccaactgcgagaatgggtctccccagaagggccgcggcggcctgggcaagaggagg
aaggcgcccaccaagaagagccccctgagcggggtcagccaggaggggaagcaggtccag
cgcaacgccgccaacgcgcgagagcgggcccgcatgcgagtgctgagcaaggccttctcc
agactcaagaccaccctgccctgggtgccccccgacaccaagctctccaagctggacacg
ctcaggctggcgtccagctacatcgcccacttgaggcagatcctggctaacgacaaatac
gagaacgggtacattcacccggtcaacctggccccgagtccaccccaccccctccgccac
ggccacttacctcctccaccctcttctttcccgcagacgtggccctttatggtggccggg
aaacccgagagtgacctgaaagaagtgcgcagcccgcggcggccgatgcgccagatgcca
ccggggctgtttaatgttcggggttatgaccgcagtgtttacaagacgtcttcggttatt
tatactgttattcctcgcagagagccttga
>gi568815592f:133789398_133991799|GENSCAN_predicted_peptide_5|255_aa
MSHRLTHCAPPSRLLQTMLLQQSLEVVNLSVNWKEGGPAQPAQTQRSKACSSKRLKEARR
TLKNAAGNQRVSKGRASIWSEGCKAKKGCWQASCAPLAPGFPAVWGRNPEPALRLIPEPA
DLQRDTHPLARRLHPERDGKEAVRTEAQNSFPVSPTSKNSSPRRGSLRGCFPPPPLRSAP
RWPRALQALLLRSPPAVASLQPQLQADFPTHTVPDQSKANHSHWSDGEIFFWPENGPRDS
LAGDVIPGSPDFSLP
>gi568815592f:133789398_133991799|GENSCAN_predicted_CDS_5|768_bp
atgagccaccgcctcacccactgtgctcctcctagcaggctcctgcagaccatgctcctg
cagcagtccttggaggtggttaatctttcggtgaactggaaggagggtgggcccgcgcag
cctgcccagacccaaagatcaaaggcgtgttcctccaaacgcctgaaggaagcccggcgg
actctgaagaatgctgctggcaaccagcgagtttccaagggcagagccagcatttggagc
gaaggatgcaaagctaagaaaggatgttggcaggccagctgcgccccgctcgcacctggg
ttccccgcggtgtggggacgtaacccagaacctgccctgcgcctcatccctgagccagcg
gatctgcagcgggacacgcacccgctggcccggagattgcacccggaaagggacggaaag
gaggctgtgcggactgaggctcagaactccttccccgtctcgcctacttccaaaaactct
tctcccaggcggggaagcctcagaggctgcttccctcctccgcctttgcgctcagctccc
cgctggccgcgagctctccaggcactcctgctgcgctcgccaccagccgtggcgtccctg
caaccccaactccaggccgacttcccgacgcacacagtgccggaccagagcaaggccaac
cactcgcactggagtgatggggagatttttttctggcccgagaatggaccaagggactct
cttgcaggggacgtgattccagggagcccagatttctctctcccttag
>gi568815592f:133789398_133991799|GENSCAN_predicted_peptide_6|221_aa
MRRNQCKKAENSKNQKASSPPKEHNSWRAREQNWTENEFDKLTEVGFRSDEEHGTKLENT
LQAIIQENFPNLAGQTNIQIQEIQRTPQRYSLRRAIPRHIIIRFTKVEMKEKMLRAAREK
GRVTHKGKPIRLTADLAGEALQARREWEPKFNILKENNFQSRISYPGKLSFISEGEIKSF
TDKQMLRDFVTTRPALQELLKEALNMERNNRYQPLQKLTKL
>gi568815592f:133789398_133991799|GENSCAN_predicted_CDS_6|666_bp
atgaggagaaaccagtgcaaaaaggctgaaaattccaaaaaccagaaagcctcttctcct
ccaaaggaacacaactcctggagagcaagggaacaaaactggacagagaatgagtttgac
aaattgacggaagtaggcttcagaagtgatgaggagcatggaaccaagttggaaaacaca
cttcaggctattatccaagagaacttccccaacctagcaggacagaccaacattcaaatt
caggaaatacagagaacaccacaaagatactccttgagaagagcaatcccaagacacata
atcatcagattcaccaaagttgaaatgaaggaaaaaatgttaagggcagccagagagaaa
ggtagggttacccacaaagggaagcccatcagactaacagctgatctcgctggagaagct
ctacaagccagaagagagtgggagccaaaattcaacattcttaaagaaaataattttcaa
tccagaatttcatatccaggcaaactaagcttcataagtgaaggagaaataaaatccttc
acagacaagcaaatgctgagagactttgtcaccaccaggcctgccttacaagagctcctg
aaggaagcactaaacatggaaaggaacaaccggtaccagccactgcaaaaacttaccaaa
ttgtaa
>gi568815592f:133789398_133991799|GENSCAN_predicted_peptide_7|316_aa
MGKSLDTYILPRLNQEEAESLNRPITSSEIEAVINSLPTKKSPSPDRVTAEFYQRYKEVL
VPFRLKLYQTIEKEELLPNSFYEAGIILIPKPGRNTTRKENFRPIFLMNIDVKIFNKILE
NRIQQHIKKLINHDQVSFIPRMQGWFNICKSINVIHINRTNDKNHMIISIDAEKAFNKIQ
HPFMIKTLNKLGTDGTYLKIIRAIYDKLAANIILNGQNLEAFPLKTSTRQGCHLSPLLFK
IILEVLARAIRKEKEIKGIQIGGEEVKLSLFAEDMIVYLEIPIISTPNLLNLISNFSKVS
GYKNQCTQITSVPIHQ
>gi568815592f:133789398_133991799|GENSCAN_predicted_CDS_7|951_bp
atgggtaaatccctggacacatatatcctcccaagactaaatcaggaagaagctgaatcc
ctgaatagaccaataacaagttctgaaattgaggcagtaattaatagcctaccaaccaaa
aaaagcccaagtccagacagagtcacagctgaattctaccagaggtacaaggaggtgctg
gtaccattccgtctgaaactataccaaacaatagaaaaagaggaactcctccctaactca
ttttatgaggccggcatcatcctgataccaaaacctggcagaaacacaacaagaaaagaa
aatttcaggccaatattcctgatgaacatcgatgtgaaaatcttcaataaaatactggaa
aaccgaatccagcagcacatcaaaaagcttatcaaccacgatcaagtcagcttcatccct
cggatgcaaggctggttcaacatatgcaaatcaataaatgtaattcacataaacagaacc
aatgacaaaaaccacatgattatctcaatagatgcagaaaaggccttcaataaaattcaa
caccccttcatgataaaaactctcaataaactaggtactgatggaacatatctcaaaata
ataagagctatttatgacaaactggcggccaatatcatactaaatgggcaaaacctggaa
gcattccctttgaaaaccagcacaagacaaggatgccatctctcaccactcctattcaaa
ataatattggaagttctggccagggcaatcaggaaagaaaaagaaataaagggtattcaa
ataggaggagaggaagtcaaattgtctctgtttgcagaggacatgattgtctatttagaa
atccccatcatctcaaccccaaatctccttaacctgataagcaacttcagcaaagtctca
ggatacaaaaatcaatgcacacaaatcacaagcgttcctatacaccaataa
>gi568815592f:133789398_133991799|GENSCAN_predicted_peptide_8|526_aa
MGCLGTQIAPRAFLLLPLPLYFTPLSSVTCISSSGVRKWPSGSLIVYYGKTIVASPDGST
LPLEAEVSNPAEDRVSRPVLHFRVVMVNTECQLDWIEGCKVLIPDVSPSSGGKADAGLRV
GNVGPGAGGFRAAWRGVAWRGALRAVTRFSLPRAAPLRPLPRDPEDALGPQGDGSNGPLC
APGVPRPLQGYLGAEPRRVSVPRGPPIPGCLSPRQHPPSPTPILPGSASRFPRDSPDRKP
TFSFASDATGSSAAAASSATAPSSHGNPLGDPGHRERGRLSGLSTGCASAGSVNVLIAIR
LKLQSIHICELSDCICHLQFQSDTGRCDLRGGKLNFKTTPMDADSDVALDILITNVVCVF
RTRCHLNLRKIALEGANVIYKRDVGKVLMKLRKPRITATIWSSGKIICTGATSEEEAKFG
ARRLARSLQKLGFQVIFTDFKVVNVLAVCNMPFEIRLPEFTKNNRPHASYEPELHPAVCY
RIKSLRATLQIFSTGSITVTGPNVKAVATAVEQIYPFVFESRKEIL
>gi568815592f:133789398_133991799|GENSCAN_predicted_CDS_8|1581_bp
atgggctgcttggggacccagattgctcccagggcctttctgctgctacctctacccctg
tatttcactccactcagctctgtaacttgcatcagctccagtggtgtaaggaaatggcca
agtggtagtctcatagtctattacggcaaaaccattgttgcatctcctgatggaagcact
cttcccttggaggctgaggtctctaacccagcagaagacagagtttccaggccagttcta
cattttagggttgtgatggttaatactgagtgtcaacttgattggattgaaggatgcaaa
gtattgattccggatgtgtctccgagctctggtggaaaggccgatgccggcctgagagtg
ggaaacgtggggccgggagccgggggcttccgcgcggcgtggcgtggcgtggcgtggcgt
ggcgccctcagggccgttacgcgattttcgctcccgcgggcagcgcctctcaggccactc
ccgagggaccccgaggacgcactcgggccccagggcgacggttcaaacgggccattgtgc
gcccccggggtcccccggcccctgcagggctacttgggcgcagagccgcggagggtctcc
gttcctagaggtcctcctatcccgggctgcctgagtcctcgccagcatccgccctctccc
actcccatccttcctggatccgcctctcggttcccgagggacagtcccgaccgcaaaccc
accttctccttcgcatccgacgcaaccggcagcagcgctgccgccgcgtcctcagccacc
gctccctcttcccacggtaaccccctaggcgaccctgggcacagggagcgcgggcggctg
agcggcctgagcaccggatgtgcatctgcaggctctgtcaatgtgctcattgcaataaga
ctcaaactgcagtcaatccacatctgtgagctctctgactgcatctgtcacctgcagttc
caaagcgacactggcagatgtgatcttcgtggtggaaagctaaattttaaaaccacccca
atggatgcagacagtgatgttgcattggacattctaattacaaatgtagtctgtgttttt
agaacaagatgtcatttaaacttaaggaagattgctttggaaggagcaaatgtaatttat
aaacgtgatgttggaaaagtattaatgaagcttagaaaacctagaattacagctacaatt
tggtcctcaggaaaaattatttgcactggagcaacaagtgaagaagaagctaaatttggt
gccagacgcttagcccgtagtctgcagaaactaggttttcaggtaatatttacagatttt
aaggttgttaacgttctggcagtgtgtaacatgccatttgaaatccgtttgccagaattc
acaaagaacaatagacctcatgccagttacgaacctgaacttcatcctgctgtgtgctat
cggataaaatctctaagagctacattacagattttttcaacaggaagtatcacagtaaca
gggcccaatgtaaaggctgttgctactgctgtggaacagatttacccatttgtgtttgaa
agcaggaaagaaattttataa
>gi568815592f:133789398_133991799|GENSCAN_predicted_peptide_9|63_aa
XNITTSGYGTQAIIDDDVQITEDTINDIQTSGQFPHVLFYDGHLYQQRGFHPHNDELFSY
FSS
>gi568815592f:133789398_133991799|GENSCAN_predicted_CDS_9|192_bp
nngaatatcactacctcaggttacggtacacaggctataattgatgatgatgttcagata
actgaagacacaataaatgacattcagacatcaggacaattccctcatgttcttttctat
gatggccacctgtaccagcaacgtgggtttcacccacacaacgatgaactgttctcttac
ttctccagttga