GENSCAN 1.0 Date run: 7-Nov-116 Time: 01:04:15 Sequence gi568815593f:171209366_171411596 : 202231 bp : 43.48% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.02 PlyA - 143 138 6 1.05 1.01 Sngl - 6617 6279 339 2 0 71 44 222 0.195 12.13 1.00 Prom - 6671 6632 40 -5.86 2.00 Prom + 13083 13122 40 -7.46 2.01 Init + 15147 15366 220 2 1 51 72 103 0.497 3.79 2.02 Intr + 18628 18881 254 2 2 60 52 158 0.146 6.55 2.03 Intr + 25657 25738 82 0 1 7 51 83 0.058 -4.29 2.04 Intr + 31563 31777 215 1 2 81 68 174 0.915 13.13 2.05 Intr + 33317 33455 139 1 1 65 87 20 0.529 -0.36 2.06 Intr + 56316 56482 167 1 2 51 93 139 0.397 10.38 2.07 Intr + 84518 84616 99 1 0 77 113 129 0.804 14.61 2.08 Intr + 86522 86649 128 1 2 75 99 75 0.976 6.78 2.09 Term + 89397 89493 97 0 1 152 46 25 0.772 2.14 2.10 PlyA + 89793 89798 6 1.05 3.00 Prom + 90811 90850 40 -6.36 3.01 Init + 100001 100421 421 1 1 89 99 483 0.958 46.05 3.02 Intr + 100785 101028 244 1 1 84 81 460 0.998 41.46 3.03 Intr + 102024 102228 205 0 1 94 50 250 0.773 21.00 3.04 Intr + 104066 104184 119 1 2 151 14 -3 0.701 -2.14 3.05 Intr + 105390 105560 171 0 0 64 43 123 0.867 4.46 3.06 Intr + 107012 107127 116 2 2 34 53 127 0.870 3.89 3.07 Term + 107429 107676 248 0 2 46 43 166 0.678 3.85 3.08 PlyA + 107837 107842 6 -0.45 4.09 PlyA - 107884 107879 6 -0.45 4.08 Term - 109045 109007 39 1 0 109 42 36 0.038 -1.71 4.07 Intr - 110165 110063 103 1 1 128 53 53 0.040 5.98 4.06 Intr - 120934 120844 91 2 1 83 84 90 0.116 7.15 4.05 Intr - 122176 121990 187 1 1 57 40 54 0.007 -3.24 4.04 Intr - 127222 127031 192 1 0 42 -5 235 0.010 9.29 4.03 Intr - 128501 128374 128 0 2 29 50 89 0.009 -0.40 4.02 Intr - 135946 135663 284 0 2 6 -49 373 0.002 12.46 4.01 Init - 136265 135985 281 2 2 97 -7 390 0.002 26.88 4.00 Prom - 139393 139354 40 -3.06 5.00 Prom + 140927 140966 40 -8.06 5.01 Init + 143765 143890 126 1 0 50 89 48 0.020 1.31 5.02 Intr + 156931 157194 264 0 0 64 55 345 0.171 26.51 5.03 Intr + 157372 157485 114 0 0 39 4 146 0.368 1.94 5.04 Term + 161945 162112 168 1 0 58 52 161 0.975 7.38 5.05 PlyA + 164088 164093 6 1.05 6.00 Prom + 166488 166527 40 -5.86 6.01 Init + 176408 176420 13 1 1 113 103 8 0.504 3.82 6.02 Intr + 178250 178325 76 0 1 10 59 72 0.289 -4.93 6.03 Intr + 178568 178689 122 2 2 53 77 116 0.593 7.14 6.04 Intr + 180686 180765 80 0 2 97 32 116 0.978 6.17 6.05 Intr + 181940 182059 120 1 0 86 103 79 0.997 9.89 6.06 Intr + 182341 182434 94 0 1 99 77 -4 0.991 -0.86 6.07 Intr + 183345 183451 107 1 2 47 80 139 0.991 8.93 6.08 Intr + 183549 183613 65 2 2 60 89 223 0.996 17.12 6.09 Intr + 190788 190845 58 0 1 51 87 135 0.999 8.59 6.10 Intr + 191474 191560 87 1 0 49 106 69 0.964 4.97 6.11 Intr + 195937 196038 102 0 0 103 87 98 0.998 11.57 6.12 Intr + 198335 198409 75 1 0 91 78 55 0.602 4.51 6.13 Term + 201162 201200 39 2 0 138 48 4 0.328 -1.41 6.14 PlyA + 201493 201498 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 126220 126410 191 0 2 83 29 83 0.906 0.38 S.002 Intr + 127006 127120 115 1 1 69 103 87 0.935 8.75 S.003 Term - 135946 135631 316 0 1 6 36 354 0.841 16.41 S.004 Init - 136265 135982 284 2 2 97 77 392 0.956 35.51 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815593f:171209366_171411596|GENSCAN_predicted_peptide_1|112_aa MGKDFMMKTPKAIATKANIDKRDLIKLKSFCTAKETIIRVNRQPTEWEKIFASYSSDNGL ISRIYKELKQIYKKKTNNSIKKWAQDINRHFSKEDIYVANKYVKKKLIITSH >gi568815593f:171209366_171411596|GENSCAN_predicted_CDS_1|339_bp atgggcaaagacttcatgatgaaaacaccaaaagcaattgcaacaaaagccaacattgac aaacgggatctaatcaaactaaagagcttctgcacagcaaaagaaactatcatcagagtg aacaggcaacctacagaatgggagaaaatttttgcaagctactcatctgacaatggtcta atatccagaatctacaaggaacttaaacaaatttacaagaaaaaaacaaacaactccatc aaaaagtgggcacaggatataaacagacacttctcaaaagaagatatttatgtggccaac aagtatgtgaaaaaaaagctcatcatcactagtcattag >gi568815593f:171209366_171411596|GENSCAN_predicted_peptide_2|466_aa MPNLWASKVVLFSVGLSDIEKGIAAKVEKNNYVRIMSEEEEEETEEKGNEGIGEVKWSGG RDKTSRAFLQRKARALMEMYKEINVVFMSANTTCILQPMDQGVILTFKSYYIRNTFCKAV AAIDSDSSDGSGQSKLKAFQKGFTILDAIKNICDSQEEVRKKELAKKSEKEQTLKKIERM CLGRRGNQILSLGSLSKDQIYPMKLKGISICYSALKSALCGNYVSFGVFKLYGDNHFDNV LQAFVKMLLSVSHSDLLQYRKLSQSYYPLLECLTQDHMSFIINLEPPVLMYVLTSISEGL TTLDTVVSSSCCTSLDYIVTYLFKHIAKEGKKPLRCREATQAGQRLLHFMQQNPDVLQQM MSVLMNTIVFEDCRNQWSVSRPLLGLILLNEKYFSELRASLINSQPLPKQEVLAQCFRNL MEGVEQNLSVKNRDRFTQNLSVFRRDVAEALRSDGNTEPCSLDMMS >gi568815593f:171209366_171411596|GENSCAN_predicted_CDS_2|1401_bp atgccaaatctatgggcttcaaaggttgttctgttctctgttgggttgtctgacatagaa aaaggcattgcagcaaaagttgagaaaaataactatgtgaggattatgtcggaggaggag gaagaggagacagaagagaaaggaaatgaagggataggagaagtgaagtggagtggggga agagacaaaacgtccagagcattcttgcaaagaaaagccagagctctgatggagatgtac aaggagattaatgttgttttcatgtctgctaacacaacatgcattctgcagcccatggat caaggagtaattttgactttcaagtcgtattatatcagaaatacattctgtaaggctgta gctgccatagatagtgattcctctgatggatctgggcaaagtaaattgaaagccttccag aaaggattcactattctagatgccatcaagaacatttgtgattcacaggaggaggtaaga aagaaggaactggcaaagaagagtgagaaggagcaaacactgaagaaaatcgagagaatg tgtcttggaagacgaggtaatcagatcctgtcccttgggagcctctcaaaagatcagatt tatccaatgaaactcaagggcatctccatctgctattcagctctcaagtctgccttgtgt ggaaattatgtcagctttggcgtcttcaagttgtatggggacaaccattttgacaatgta ctccaggcttttgtcaaaatgctgctgtcagtgtcccacagtgacttgctacaataccgg aaactgagccagtcttattatccactcctggaatgtctcactcaggaccatatgagcttc atcatcaacttagagcctcctgtactcatgtatgttctcacatctatctcagagggactc actactcttgatacagttgtctcctccagctgctgtaccagtttagactacatcgtcacc tacctcttcaagcacatagcaaaagagggcaagaagccacttcgatgcagagaggctacc caggctggtcagagactattacattttatgcagcaaaacccagatgtcctgcagcagatg atgtctgtcctcatgaacaccattgtctttgaagactgtcggaaccagtggtcagtatcc aggcctctcctggggctcatcctgctcaatgagaagtatttcagtgaactgagagcaagt ttgataaacagccagcccctccccaagcaggaggtccttgcccagtgcttcagaaaccta atggaaggagtggagcagaacctgtccgtcaagaacagagacaggttcacccaaaatctg tctgtattcagaagagatgtggcagaggcgttgcgcagtgatggcaacactgaaccatgc agtctcgacatgatgagctga >gi568815593f:171209366_171411596|GENSCAN_predicted_peptide_3|507_aa MEAPASAQTPHPHEPISFGIDQILNSPDQDSAPAPRGPDGASYLGGPPGGRPGATYPSLP ASFAGLGAPFEDAGSYSVNLSLAPAGVIRVPAHRPLPGAVPPPLPSALPAMPSVPTVSSL GGLNFPWMESSRRFVKDRFTAAAALTPFTVTRRIGHPYQNRTPPKRKKPRTSFSRVQICE LEKRFHRQKYLASAERAALAKSLKMTDAQVKTWFQNRRTKWRRQTAEEREAERQQASRLM LQLQHDAFQKSLNDSIQPDPLCLHNSSLFALQNLQPWEEDSSKVPAVTSLAQLPVAPKAK HFRGRVPEHPSHSTSTPTRLIPLPPSEPEPAFAAFGVFQRFALTQPCPARDGLRVFSCYR AEPTHVVPSSDYLSATSTQSNVGAPSGLPGLRAGAFAIRAEVGRPSRDHHERSGDTQRSH RAPGQAKPCVNKCTKAAERLVWCRRLRSGVVKVPPQGRRFSFREIRNSAPQFQAPAPLIA STLSDAQTQAVHCLGSDTGKVLEKLPW >gi568815593f:171209366_171411596|GENSCAN_predicted_CDS_3|1524_bp atggaggcgcccgccagcgcgcagaccccgcacccgcacgagcccatcagcttcggcatc gaccagatccttaacagcccggaccaggacagcgcacccgccccgcggggccccgacggc gccagctacctgggagggccccccgggggccgtccgggcgccacatacccgtctctgccc gcctcctttgcgggcctcggcgcgcccttcgaggacgcgggatcttacagtgtgaacctg agcctagcgcccgcaggcgtgatccgggtgccggcgcacaggccgctgcccggggccgtg ccaccgcctctgccaagcgcgctacccgccatgccctccgtgcccacggtctccagcctt ggcggtctcaatttcccctggatggagagcagccgccgcttcgtgaaagaccgcttcaca gcggcggccgcactcacgcccttcaccgtgacccggcgcatcggccacccctaccagaac cggacgccgcccaagcgtaagaagccgcgcacgtccttttcccgggtgcagatctgcgag ctggaaaagcgcttccatcgccagaagtacctggcctctgccgagagggcggcgctcgcc aagtccctcaaaatgacggacgcgcaggtcaagacctggttccaaaaccggaggaccaag tggcggcggcagacggcggaggagcgggaggcggagcggcagcaggcgagccggctcatg ctgcagctgcaacacgacgccttccaaaagagcctcaacgactccatccagcctgacccg ctctgtctgcacaactcgtcactctttgctctgcagaatctgcagccctgggaggaggat agttccaaggttcccgctgtcacctccctggcccagctcccagtagctcctaaggcaaaa cactttcggggccgcgtcccagagcacccctcccactccacgagcacccccacccgcctc atcccactgcctccctcggagccggagcctgccttcgcggcgtttggcgtcttccagcgc tttgccctcacccagccttgcccagccagagacgggctacgggtcttttcctgttacaga gctgagcccactcatgtggtgccaagtagcgactatctctcggccacctccacccagagc aatgtgggcgcccccagcggcctcccaggcttgcgcgctggcgcctttgccatccgtgcc gaagtggggagacctagccgcgaccaccacgagcgcagcggtgacacccagaggtcccac cgggcccctgggcaggctaaaccgtgtgtaaacaagtgcaccaaggccgcagagcggctg gtgtggtgcaggcgcctgcgttctggagtagtgaaggttcctccacagggaaggcggttt tccttccgtgaaattcgaaattcagccccccagttccaagcgcctgctccgttaatagca tctactctcagcgacgcccagacccaggctgtgcactgtcttggctcagacactggcaaa gtcctcgagaaattgccttggtga >gi568815593f:171209366_171411596|GENSCAN_predicted_peptide_4|434_aa MSMLGLQKRPAASVLRYGKKKVWLDPNEANEIASANSRQQIRKLIKDGLIICKPVTVHSQ AQCRKNTLAHRKGRHMGTGKRKGTANAPMPEKVTRYRDSKKINHHMYHSLYLKVKESVFK DKQILMEHIHKLKADKAGKKLLADQAEACRPKTKEARKQREECLQAKKEGIINTLSKDEE MKKQKLPLLTVPPSRYRAQRPVNINIPVKCALPLIPFPPQIVFLLLRVSSKTALEADIDY EGQKPWLKCRFFPKAPGKLPDHPPYRAQRQIQLIQKAFDGGRSTKRGKGFLALQNLSFVV NVAGTPPKNGGLMECAPAGLTADTHARGACIQEGWSGLRTWEIQESLPDFSYNILQAGFL IQEGFRDSLPRATAVVSSQVQASVSVPLAESPQPQQLWATPGRLTATGVHQSLFFCLSWE SESHYVPGTKVPQL >gi568815593f:171209366_171411596|GENSCAN_predicted_CDS_4|1305_bp atgagcatgctcgggcttcagaagaggcctgccgctagtgtcctccgctatggcaagaag aaggtctggttggaccccaatgaggccaatgaaatcgccagtgccaactcccgacagcag atccggaagctgatcaaagatgggctgattatctgtaagcctgtgactgtccattcccag gctcaatgccggaaaaacaccttggcccaccggaagggcaggcacatgggcacgggtaag cgaaagggtacagccaatgccccaatgccagagaaggtcacaagataccgtgactctaag aagatcaatcaccacatgtatcatagcctgtacctgaaggtgaaggagagtgtgttcaaa gacaagcagattctcatggaacatatccacaagctgaaggcagacaaggccggcaagaag ctcctggctgaccaggctgaggcctgcaggcctaagaccaaggaagcacgaaagcaacgt gaagagtgcctccaggccaagaaggaggggatcatcaacactttgtcgaaggacgaagag atgaagaaacaaaagctcccccttttgactgtgccgccttccagatatcgagcacaacga ccagtaaatattaacattcctgttaaatgtgcgctgcctcttattcctttccccccacaa attgtcttcctgctcctccgagtgtcatccaaaacagcattagaggctgacattgattat gaaggccaaaaaccgtggctgaaatgccgcttctttcctaaagcacctggaaagcttcca gatcatccgccttaccgtgctcagcggcagatccagttgattcagaaagcatttgatgga ggaagaagcaccaagcgtggcaaaggcttcctggcccttcagaatctgagttttgtggta aatgtggcaggcaccccacctaagaatggaggactcatggaatgtgcacctgctgggctg acagcagacacccatgccaggggagcctgcattcaggagggatggagtggcttgaggact tgggagatccaggagtccttgccagatttctcctacaatattttgcaagcaggtttcctc attcaggaaggcttccgtgattcgctcccccgggccacagctgtcgtcagctcccaagtg caggcctccgtgtcagttcctctggctgaaagtcctcagccccagcagctgtgggctaca ccagggcgcctcaccgccacaggggttcaccagagcctgttcttctgcctgagctgggag tcagagtctcactacgtcccaggaacaaaagtgccccagctgtag >gi568815593f:171209366_171411596|GENSCAN_predicted_peptide_5|223_aa MPWVTAEKASALCHRAAHTPGPGPRGPEREHSERRSQQEVAQCRLRTGMRGAFGKPQGTV ARVHIGQVIMSICTKLQNKEHVIEALHRAKFKFPGRQKIHISKKWGFIKFNANEFEDMVA EKRLILDGCGRAELSFAAATGATPIAGHFTPGTFINQVQAAFWELRLLLQIQQSKDEWIV LTMTRIPLGSVLGRGAISRTQTGVDDPGKVLPGPEEGSWTTSP >gi568815593f:171209366_171411596|GENSCAN_predicted_CDS_5|672_bp atgccgtgggttactgcagaaaaagccagtgccctctgccacagagctgcacatacacct ggcccaggccccagagggcctgaaagagaacattcggagaggcgaagccagcaagaggtg gcacaatgtaggctccggacaggcatgcgaggtgcctttggaaagccccagggcactgtg gccagggttcacattggccaagttatcatgtccatctgtaccaagctgcagaacaaggag catgtgatagaagccctgcacagggccaagttcaagtttcctggccgccagaagatccac atctcaaagaagtggggcttcatcaagttcaatgccaatgaatttgaggacatggtggct gagaagcggctcatcctagatggctgtgggcgggctgagctgagctttgctgctgccaca ggagccactcctattgctggccacttcacccctggaaccttcattaaccaggtccaggca gccttctgggagctgcgtctgttgctgcaaattcagcaatccaaggatgagtggattgtt ctcaccatgaccagaattcccctgggatctgtgctcggcagaggtgccatctcccgaacc cagactggagttgatgatccagggaaggtgcttccgggtcctgaagagggctcctggacc accagcccctga >gi568815593f:171209366_171411596|GENSCAN_predicted_peptide_6|345_aa MAHACDSSGGVGPVYECACSVGARGVPGRSACRHPMEDSMDMDMSPLRPQNYLFGNCWGE LERGRAGPGGGCELKADKDYHFKVDNDENEHQLSLRTVSLGAGAKDELHIVEAEAMNYEG SPIKVTLATLKMSVQPTVSLGGFEITPPVVLRLKCGSGPVHISGQHLVAVEEDAESEDEE EEDVKLLSISGKRSAPGGGSKVPQKKVKLAADEDDDDDDEEDDDEDDDDDDFDDEEAEEK APVKKSIRDTPAKNAQKSNQNGKDSKPSSTPRSKGQESFKKQEKTPKTPKGPSSVEDIKA KMQASIEKGGSLPKVEAKFINYVKNCFRMTDQEAIQDLWQWRKSL >gi568815593f:171209366_171411596|GENSCAN_predicted_CDS_6|1038_bp atggctcacgcatgcgacagcagcggaggggtggggccagtgtacgagtgcgcgtgctcg gtgggagcccgcggagtacctggaaggagtgcgtgccgccacccgatggaagattcgatg gacatggacatgagccccctgaggccccagaactatcttttcggtaactgctggggggag ctggagcgaggccgagcggggcctggtggcggttgtgaactaaaggccgacaaagattat cactttaaggtggataatgatgaaaatgagcaccagttatctttaagaacggtcagttta ggggctggtgcaaaggatgagttgcacattgttgaagcagaggcaatgaattacgaaggc agtccaattaaagtaacactggcaactttgaaaatgtctgtacagccaacggtttccctt gggggctttgaaataacaccaccagtggtcttaaggttgaagtgtggttcagggccagtg catattagtggacagcacttagtagctgtggaggaagatgcagagtcagaagatgaagag gaggaggatgtgaaactcttaagtatatctggaaagcggtctgcccctggaggtggtagc aaggttccacagaaaaaagtaaaacttgctgctgatgaagatgatgacgatgatgatgaa gaggatgatgatgaagatgatgatgatgatgattttgatgatgaggaagctgaagaaaaa gcgccagtgaagaaatctatacgagatactccagccaaaaatgcacaaaagtcaaatcag aatggaaaagactcaaaaccatcatcaacaccaagatcaaaaggacaagaatccttcaag aaacaggaaaaaactcctaaaacaccaaaaggacctagttctgtagaagacattaaagca aaaatgcaagcaagtatagaaaaaggtggttctcttcccaaagtggaagccaaattcatc aattatgtgaagaattgcttccggatgactgaccaagaggctattcaagatctctggcag tggaggaagtctctttaa