GENSCAN 1.0 Date run: 5-Nov-116 Time: 16:55:32 Sequence gi568815597f:205423168_205630706 : 207539 bp : 49.49% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.06 PlyA - 623 618 6 1.05 1.05 Term - 5163 5056 108 0 0 103 55 44 0.230 1.11 1.04 Intr - 9518 9424 95 0 2 70 115 27 0.389 3.28 1.03 Intr - 13370 13181 190 2 1 38 84 69 0.245 0.66 1.02 Intr - 13753 13571 183 1 0 92 34 100 0.278 4.98 1.01 Init - 17859 17764 96 0 0 58 46 95 0.489 2.61 1.00 Prom - 22724 22685 40 -6.66 2.00 Prom + 26607 26646 40 -6.56 2.01 Init + 27818 27875 58 1 1 97 69 45 0.789 3.74 2.02 Intr + 32474 32581 108 0 0 75 109 36 0.795 4.66 2.03 Term + 33725 33837 113 0 2 127 47 86 0.867 7.12 2.04 PlyA + 34051 34056 6 1.05 3.00 Prom + 36409 36448 40 -5.46 3.01 Init + 42091 42149 59 0 2 74 106 38 0.677 4.98 3.02 Intr + 61861 62158 298 1 1 -85 21 580 0.145 30.88 3.03 Intr + 68509 68590 82 0 1 113 95 9 0.485 3.31 3.04 Intr + 70634 70773 140 0 2 93 75 130 0.558 12.38 3.05 Term + 76232 76345 114 1 0 20 36 170 0.528 3.57 3.06 PlyA + 79135 79140 6 1.05 4.00 Prom + 87447 87486 40 -5.66 4.01 Init + 97738 97882 145 0 1 71 56 106 0.848 5.98 4.02 Intr + 99294 99478 185 1 2 104 57 -4 0.793 -2.39 4.03 Intr + 99980 100130 151 1 1 90 42 232 0.875 18.54 4.04 Intr + 100316 100458 143 0 2 40 94 142 0.755 10.07 4.05 Intr + 101065 101190 126 0 0 81 105 136 0.900 15.48 4.06 Intr + 101972 102028 57 1 0 78 94 58 0.956 4.48 4.07 Intr + 102898 103012 115 0 1 70 54 224 0.999 17.22 4.08 Intr + 103200 103294 95 1 2 73 85 180 0.586 15.88 4.09 Intr + 103608 103670 63 2 0 70 100 125 0.532 10.81 4.10 Intr + 104627 104750 124 1 1 117 73 299 0.986 31.46 4.11 Intr + 104881 105001 121 2 1 59 89 111 0.885 7.85 4.12 Intr + 105832 105929 98 1 2 141 69 133 0.995 16.35 4.13 Intr + 106157 106262 106 0 1 118 68 156 0.996 15.87 4.14 Intr + 106354 106396 43 1 1 119 105 40 0.988 7.04 4.15 Intr + 107092 107182 91 0 1 83 79 90 0.907 7.17 4.16 Intr + 107461 107538 78 2 0 93 91 127 0.914 13.02 4.17 Intr + 110226 110379 154 1 1 40 72 70 0.232 -0.27 4.18 Intr + 113568 113664 97 0 1 90 61 30 0.124 0.51 4.19 Term + 123512 123688 177 1 0 98 35 145 0.977 7.89 4.20 PlyA + 126219 126224 6 1.05 5.00 Prom + 138972 139011 40 -2.96 5.01 Init + 145903 146132 230 0 2 93 86 478 0.853 45.94 5.02 Intr + 156584 156782 199 2 1 122 115 394 0.983 44.85 5.03 Intr + 157494 157778 285 2 0 138 91 269 0.648 29.74 5.04 Intr + 160788 160956 169 2 1 95 63 172 0.989 14.92 5.05 Intr + 161709 161824 116 1 2 124 101 128 0.997 17.87 5.06 Intr + 162867 163040 174 2 0 101 91 197 0.994 21.44 5.07 Intr + 168973 169120 148 0 1 78 100 285 0.999 28.51 5.08 Intr + 174613 174683 71 2 2 69 88 54 0.433 2.40 5.09 Intr + 175964 176084 121 1 1 109 110 0 0.845 4.37 5.10 Intr + 177127 177159 33 2 0 101 94 14 0.617 1.49 5.11 Intr + 177239 177290 52 0 1 96 83 12 0.261 -0.53 5.12 Term + 180088 180190 103 1 1 96 42 27 0.160 -3.35 5.13 PlyA + 182817 182822 6 1.05 6.05 PlyA - 184467 184462 6 1.05 6.04 Term - 190175 190170 6 2 0 123 50 0 0.311 -2.43 6.03 Intr - 197671 196943 729 1 0 80 33 413 0.448 26.63 6.02 Intr - 200724 200509 216 0 0 91 62 182 0.984 14.60 6.01 Intr - 203002 202819 184 0 1 22 -1 182 0.412 2.49 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597f:205423168_205630706|GENSCAN_predicted_peptide_1|223_aa MPQFTFACFCGLHGFCKMKRKKEEVHRERETAEGTHYKGHTYQMCPDVAGTFGDRYLVHN CKGGLSPEKKISGLLSSVGSLQAGPGSLVFLLKGESSGPQAIPGVVSAHCAGVIVASPPG FASLNFHGIGTIDHEEETGTLTLHGGAQEVRTIHQEASTDFCPWLYVRTDRASNDPAGRY DRKPLLSQHLKGSFSKITVMASLPCSKLFGNRIGKMLTTVEAG >gi568815597f:205423168_205630706|GENSCAN_predicted_CDS_1|672_bp atgccccagttcactttcgcctgcttctgtggtctccacggcttctgcaaaatgaagagg aagaaggaggaagttcacagagagcgggagacggcggaaggcacgcattataaagggcac acttatcagatgtgcccagatgtggctggaacttttggagacagatacttggttcacaac tgcaaaggtgggctttcaccagagaagaaaatctctgggctgctgagttcagtgggctcc ttgcaggctggtccaggaagtctggtgttcctcctgaagggtgaatcctctggaccccag gccatccctggagtggtcagtgcccactgtgccggggttattgtggccagtccgcctggg ttcgcttctctaaatttccatgggataggaactatagatcatgaagaagaaacaggaacc ttgactcttcacggaggagcccaggaggtcagaaccatccaccaggaggcgtctacagac ttctgcccatggctgtatgtacgcacggacagagctagcaatgaccctgctggcaggtat gataggaagcctcttctgtcacagcacctcaagggctctttctcaaagataactgtcatg gcatcactcccctgctccaagctttttggtaacaggattggcaaaatgttgacaactgtt gaagcagggtga >gi568815597f:205423168_205630706|GENSCAN_predicted_peptide_2|92_aa MPTIPVSSGATTTFLQLAGAAKSFKRNKGNGGGEGASSLVDLGRRLTQPAAGQLSAPQPR GRRARPTPPSAPPDGRGQPAFRSSASELQLRA >gi568815597f:205423168_205630706|GENSCAN_predicted_CDS_2|279_bp atgcccacgattcctgtctcctctggggccaccacaacctttctgcagcttgctggtgcg gcaaaatcctttaaaaggaacaaaggcaatgggggcggggagggggcttcctctttggtc gacctagggcggcggctgacgcagcccgccgccggtcagctttcagccccccagccccgg ggccggcgtgcccggcccaccccgccctccgcgccccccgacggccgcggtcagccggca ttccggagcagcgcgtcggagctgcaacttcgcgcctaa >gi568815597f:205423168_205630706|GENSCAN_predicted_peptide_3|230_aa MSQRKPGAEQLVSNIISEDGKKKEKKKEEEEEKEKEQEREEKKEKKKKEEEEKEKKKEEE KEKEREKKEKKEKKEEEEEEEEEEEKTPATIPATLLTRRGILEKSLSLSELWFPHLDEGQ APPHLGLLLLKHPKFTVPRASPRGRTVNSPACTIFSQALSFPIKIQRLELEPGVILAQAA QAGRPENSEVGMQASGAFLKVFTKMFNWLLKGYLATAEISTCAWPMADIE >gi568815597f:205423168_205630706|GENSCAN_predicted_CDS_3|693_bp atgtcccagagaaaaccaggtgctgaacaacttgttagcaacattatctctgaggatggg aagaagaaggagaagaagaaggaggaagaggaggagaaggagaaggagcaggagagggag gagaagaaggagaagaagaagaaggaggaggaggagaaggagaagaagaaggaggaggag aaggagaaggagagggagaagaaggagaagaaggagaagaaggaggaggaggaggaggag gaggaggaggaggagaagacaccagctacgatcccagccaccctgctcacccgccgtggg atcttggagaagtcacttagcctctctgagctttggtttcctcacctcgatgaagggcaa gccccaccccacctgggcctgctgcttttgaagcaccccaaattcactgtccctagggct tctccccgaggccgcactgtaaactcgcctgcttgcacgattttctctcaggctctgtcc ttccctatcaagatccagcggctggagctggagcctggcgtcatcctggcccaagctgcc caggcgggacgaccagagaactcagaggtggggatgcaggcctctggagcatttctaaaa gtcttcaccaagatgttcaactggctcctcaagggctacctggctacagctgaaatcagc acctgtgcttggcccatggcagatattgagtaa >gi568815597f:205423168_205630706|GENSCAN_predicted_peptide_4|722_aa MVAALWKLPVCKSPADSRPETLHRATEERLKMTDRKEFRDGESGKSKAAGEKVQLASSDA LFPSSSFSPLASWGMPEQRPSGVSIRGEMRFGPNHVPRVLAKLTFSLRALDPAAQSLMIM NKMKNFKRRFSLSVPRTETIEESLAEFTEQFNQLHNRRNENLQLGPLGRDPPQECSTFSP TDSGEEPGQLSPGVQFQRRQNQRRFSMEDVSKRLSLPMDIRLPQEFLQKLQMESPDLPKP LSRMSRRASLSDIGFGKLETYVKLDKLGEGTYATVFKGRSKLTENLVALKEIRLEHEEGA PCTAIREVSLLKNLKHANIVTLHDLIHTDRSLTLVFEYLDSDLKQYLDHCGNLMSMHNVK IFMFQLLRGLAYCHHRKILHRDLKPQNLLINERGELKLADFGLARAKSVPTKTYSNEVVT LWYRPPDVLLGSTEYSTPIDMWGVGCIHYEMATGRPLFPGSTVKEELHLIFRLLGTPTEE TWPGVTAFSEFRTYSFPCYLPQPLINHAPRLDTDGIHLLSSLLLYESKSRMSAEAALSHS YFRSLGERVHQLEDTASIFSLKEIQLQKDPGYRGLAFQQPGSARIAADLEYTSCCCLDRL PTFDLDTTHPRNTSYTAPKVLRVLLLSSSVSRTSHHLPTKSIIGHQGPAPALAVEARWLA SLLMGCVPSTSKFLPGCPNLEIMLKMEPDVQDFQLGFILLVNLLGYKDKDESNDSQIRKY LY >gi568815597f:205423168_205630706|GENSCAN_predicted_CDS_4|2169_bp atggtggctgccctctggaagcttcctgtctgcaagagtcctgcagacagcaggcctgag acattgcacagagccacggaggagcggctgaagatgactgacaggaaggagtttagggat ggggagtcaggaaagtccaaggctgctggagagaaggtgcagctggccagctctgatgca ctcttcccttcctcctctttttcccctcttgcctcctggggcatgccagagcagagacct tcaggtgtcagtatcagaggtgaaatgaggtttggacccaatcatgtcccaagggtgttg gccaagctgacattcagtttacgggccttggacccggctgcccagtccctcatgatcatg aacaagatgaagaactttaagcgccgtttctccctgtcagtgccccgcactgagaccatt gaagaatccttggctgaattcacggagcaattcaaccagctccacaaccggcggaatgag aacttgcagctcggtcctcttggcagagaccccccgcaggagtgcagcaccttctcccca acagacagcggggaggagccggggcagctctcccctggcgtgcagttccagcggcggcag aaccagcgccgcttctccatggaggacgtcagcaagaggctctctctgcccatggatatc cgcctgccccaggaattcctacagaagctacagatggagagcccagatctgcccaagccg ctcagccgcatgtcccgccgggcctccctgtcagacattggctttgggaaactggaaaca tacgtgaaactggacaaactgggagagggcacctatgccacagtcttcaaagggcgcagc aaactgacggagaaccttgtggccctgaaagagatccggctggagcacgaggagggagcg ccctgcactgccatccgagaggtgtctctgctgaagaacctgaagcacgccaatattgtg accctgcatgacctcatccacacagatcggtccctcaccctggtgtttgagtacctggac agtgacctgaagcagtatctggaccactgtgggaacctcatgagcatgcacaacgtcaag attttcatgttccagctgctccggggcctcgcctactgtcaccaccgcaagatcctgcac cgggacctgaagccccagaacctgctcatcaacgagaggggggagctgaagctggccgac tttggactggccagggccaagtcagtgcccacaaagacttactccaatgaggtggtgacc ctgtggtacaggccccccgatgtgctgctgggatccacagagtactccacccccattgat atgtggggcgtgggctgcatccactacgagatggccacagggaggcccctcttcccgggc tccacagtcaaggaggagctgcacctcatctttcgcctcctcgggacccccacagaagag acgtggcccggcgtgaccgccttctctgagttccgcacctacagcttcccctgctacctc ccgcagccgctcatcaaccacgcgcccaggttggatacggatggcatccacctcctgagc agcctgctcctgtatgaatccaagagtcgcatgtcagcagaggctgccctgagtcactcc tacttccggtctctgggagagcgtgtgcaccagcttgaagacactgcctccatcttctcc ctgaaggagatccagctccagaaggacccaggctaccgaggcttggccttccagcagcca ggctctgcaagaattgctgcagacctcgaatacacctcctgctgctgcctagaccgcctc cccacctttgatctggacacaacccacccccgcaacacatcatacacagctcccaaagtg ctcagagtcctgctgttgtccagttcggtgtccagaacctcccaccatctgccaactaaa agcatcattgggcaccagggccctgctcccgccctagctgtggaggcccggtggctggca tcattgctgatgggctgtgtgccctccacatccaagtttctccctggctgccctaatctg gagataatgctgaaaatggagccagatgtccaagacttccagctgggatttatcctcttg gtaaatctgctagggtataaggataaggatgaaagcaacgacagccagattaggaagtat ctctattag >gi568815597f:205423168_205630706|GENSCAN_predicted_peptide_5|566_aa MGCDGRVSGLLRRNLQPTLTYWSVFFSFGLCIAFLGPTLLDLRCQTHSSLPQISWVFFSQ QLCLLLGSALGGVFKRTLAQSLWALFTSSLAISLVFAVIPFCRDVKVLASVMALAGLAMG CIDTVANMQLVRMYQKDSAVFLQVLHFFVGFGALLSPLIADPFLSEANCLPANSTANTTS RGHLFHVSRVLGQHHVDAKPWSNQTFPGLTPKDGAGTRVSYAFWIMALINVSVPGVGQLP VPMAVLMLLSKERLLTCCPQRRPLLLSADELALETQPPEKEDASSLPPKFQSHLGHEDLF SCCQRKNLRGAPYSFFAIHITGALVLFMTDGLTGAYSAFVYSYAVEKPLSVGHKVAGYLP SLFWGFITLGRLLSIPISSRMKPATMVFINVVGVVVTFLVLLIFSYNVVFLFVGTASLGL FLSSTFPSMLAYTEDSLQYKGCATTVLVTGAGVGEMVLQMLVGSIFQAQGSYSFLVCGVI FGCLAFTFYILLLFFHRMHPGLPSEPQNGRETNSLVPTQDRSIGMENSECYQRNLDISES CGITLFPDFPLLLVYGNLTHQAHRIH >gi568815597f:205423168_205630706|GENSCAN_predicted_CDS_5|1701_bp atgggctgcgacggccgcgtgtcggggctgctccgccgcaacctgcagcccacgctcacc tactggagcgtcttcttcagcttcggcctgtgcatcgccttcctggggcccacgctgctg gacctgcgctgtcagacgcacagctcgctgccccagatctcctgggtcttcttctcgcag cagctctgcctcctgctgggcagcgccctcgggggcgtcttcaaaaggaccctggcccag tcactatgggccctgttcacctcctctctggccatctccctggtgtttgccgtcatcccc ttctgccgcgacgtgaaggtgctggcctcagtcatggcgctggcgggcttggccatgggc tgcatcgacaccgtggccaacatgcagctggtaaggatgtaccagaaggactcggccgtc ttcctccaggtgctccatttcttcgtgggctttggtgctctgctgagcccccttattgct gaccctttcctgtctgaggccaactgcttgcctgccaatagcacggccaacaccacctcc cgaggccacctgttccatgtctccagggtgctgggccagcaccacgtagatgccaagcct tggtccaaccagacgttcccagggctgactccaaaggacggggcagggacccgagtgtcc tatgccttctggatcatggccctcatcaatgtgagtgtccctggggtgggccagcttcca gtgcccatggctgtgctgatgctgctgtccaaggagcggctgctgacctgctgtccccag aggaggcccctgcttctgtctgctgatgagcttgccttggagacacagcctcctgagaag gaagatgcctcctcactgcccccaaagtttcagtcacacctagggcatgaggacctgttc agctgctgccaaaggaagaacctcagaggagccccttattccttctttgccatccacatc acgggcgccctggtactgttcatgacggatgggttgacgggtgcctattccgccttcgtg tacagctatgctgtggagaagcccctgtctgtgggacacaaggtggctggctacctcccc agcctcttctggggcttcatcacactgggccggctcctctccattcccatatcctcaaga atgaagccggccaccatggttttcatcaacgtggttggcgtggtggtgacgttcctggtg ctgcttattttctcctacaacgtcgtcttcctgttcgtggggacggcaagcctgggcctg tttctcagcagcaccttccccagcatgctggcctacacggaggactcgctgcagtacaaa ggctgtgcaaccacagtgctggtgacaggggcaggagttggcgagatggtgctgcagatg ctggttggttcgatattccaggctcagggcagctatagtttcctggtctgtggcgtgatc tttggttgtctggcttttaccttctatatcttgctcctgtttttccacaggatgcaccct ggactcccatcagagccacagaatggcagagaaacaaacagcctggttcctacccaagac agatcaattggaatggaaaactctgagtgctaccagaggaatttggacatttcagaatcc tgtggaatcaccctgttcccagattttcctctgctacttgtatatggaaatctaacccat caggcacatcggattcactaa >gi568815597f:205423168_205630706|GENSCAN_predicted_peptide_6|378_aa XSGVAASQTVDTLVTVGNVEKEVFMVFLVELTHCGTGGQNDIVDKEKQGILSSEMNSLLD QELIAMDSAITLWQFLLQLLQKPQNKHMICWTSNDGQFKLLQAEEVARLWGIRKNKPNMN YDKLSRALRYYYVKNIIKKVNGQKFVYKFVSYPEILNMDPMTVGRIEGDCESLNFSEVSS SSKDVENGGKDKPPQPGAKTSSRNDYIHSGLYSSFTLNSLNSSNVKLFKLIKTENPAEKL AEKKSPQEPTPSVIKFVTTPSKKPPVEPVAATISIGPSISPSSEETIQALETLVSPKLPS LEAPTSASNVMTAFATTPPISSIPPLQEPPRTPSPPLSSHPDIDTDIDSVASQPMELPEN LSLEPKDQDSVLLEKDKK >gi568815597f:205423168_205630706|GENSCAN_predicted_CDS_6|1137_bp nnctcaggggtagcagcatcacagactgtagacactttggtcactgtaggcaacgtagag aaagaagtcttcatggtgttcctggtagagctgacccattgtggcactggtgggcagaat gacattgttgacaaagaaaaacaaggcatcctcagctcggagatgaattcgcttctggat caagaactcattgctatggacagtgctatcaccctgtggcagttccttcttcagctcctg cagaagcctcagaacaagcacatgatctgttggacctctaatgatgggcagtttaagctt ttgcaggcagaagaggtggctcgtctctgggggattcgcaagaacaagcctaacatgaat tatgacaaactcagccgagccctcagatactattatgtaaagaatatcatcaaaaaagtg aatggtcagaagtttgtgtacaagtttgtctcttatccagagattttgaacatggatcca atgacagtgggcaggattgagggtgactgtgaaagtttaaacttcagtgaagtcagcagc agttccaaagatgtggagaatggagggaaagataaaccacctcagcctggtgccaagacc tctagccgcaatgactacatacactctggcttatattcttcatttactctcaactctttg aactcctccaatgtaaagcttttcaaattgataaagactgagaatccagccgagaaactg gcagagaaaaaatctcctcaggagcccacaccatctgtcatcaaatttgtcacgacacct tccaaaaagccaccggttgaacctgttgctgccaccatttcaattggcccaagtatttct ccatcttcagaagaaactatccaagctttggagacattggtttccccaaaactgccttcc ctggaagccccaacctctgcctctaacgtaatgactgcttttgccaccacaccacccatt tcgtccataccccctttgcaggaacctcccagaacaccttcaccaccactgagttctcac ccagacatcgacacagacattgattcagtggcttctcagccaatggaacttccagagaat ttgtcactggagcctaaagaccaggattcagtcttgctagaaaaggacaaaaaatga