GENSCAN 1.0 Date run: 4-Nov-116 Time: 08:34:12 Sequence gi568815590f:117420877_117639975 : 219099 bp : 38.55% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 1105 1336 232 2 1 61 83 103 0.186 3.01 1.02 Intr + 2958 3114 157 0 1 77 63 68 0.530 2.29 1.03 Intr + 3194 3328 135 1 0 37 49 94 0.227 0.24 1.04 Intr + 14535 14865 331 2 1 89 41 364 0.399 26.08 1.05 Intr + 17609 17716 108 0 0 67 87 29 0.139 0.04 1.06 Term + 18351 18499 149 1 2 58 54 149 0.531 5.58 1.07 PlyA + 21393 21398 6 1.05 2.00 Prom + 24498 24537 40 -3.25 2.01 Sngl + 34960 35562 603 0 0 86 43 136 0.748 4.85 2.02 PlyA + 36316 36321 6 1.05 3.03 PlyA - 36542 36537 6 1.05 3.02 Term - 43783 43669 115 0 1 92 41 113 0.925 4.06 3.01 Init - 47762 47731 32 2 2 113 72 17 0.381 1.86 3.00 Prom - 51404 51365 40 -5.65 4.00 Prom + 53979 54018 40 -5.25 4.01 Sngl + 57819 58748 930 2 0 77 49 184 0.738 9.83 4.02 PlyA + 59410 59415 6 1.05 5.00 Prom + 63845 63884 40 -3.75 5.01 Init + 66827 66971 145 1 1 50 80 110 0.439 6.73 5.02 Term + 68366 69024 659 0 2 25 37 377 0.824 19.42 5.03 PlyA + 69044 69049 6 1.05 6.02 PlyA - 71454 71449 6 1.05 6.01 Sngl - 75246 75055 192 0 0 94 49 113 0.650 2.89 6.00 Prom - 79356 79317 40 -3.45 7.00 Prom + 93172 93211 40 -5.65 7.01 Init + 100001 100177 177 1 0 84 91 233 0.922 20.51 7.02 Intr + 107775 107933 159 2 0 41 76 114 0.938 4.76 7.03 Intr + 109847 109951 105 1 0 61 87 205 0.970 17.19 7.04 Term + 127204 127299 96 0 0 107 37 72 0.174 0.99 7.05 PlyA + 127776 127781 6 1.05 8.06 PlyA - 128026 128021 6 1.05 8.05 Term - 128457 128377 81 0 0 89 38 91 0.037 0.91 8.04 Intr - 132634 132512 123 1 0 47 59 77 0.013 0.56 8.03 Intr - 141176 141129 48 2 0 88 85 51 0.041 2.76 8.02 Intr - 146153 146092 62 0 2 48 78 65 0.040 -0.97 8.01 Init - 151064 150926 139 2 1 71 64 107 0.472 6.95 8.00 Prom - 154642 154603 40 -1.85 9.00 Prom + 156726 156765 40 -6.85 9.01 Init + 163370 163489 120 1 0 73 94 41 0.258 3.45 9.02 Intr + 168088 168161 74 0 2 56 79 98 0.789 2.99 9.03 Intr + 174688 174842 155 1 2 45 74 98 0.578 2.89 9.04 Term + 174902 175140 239 0 2 74 44 168 0.471 6.35 9.05 PlyA + 178710 178715 6 1.05 10.00 Prom + 178877 178916 40 -6.55 10.01 Init + 183463 183504 42 0 0 76 97 39 0.808 4.07 10.02 Intr + 192122 192427 306 1 0 93 17 130 0.022 2.12 10.03 Intr + 196260 196392 133 2 1 71 57 65 0.557 1.00 10.04 Term + 203409 203674 266 1 2 76 33 160 0.465 3.99 10.05 PlyA + 203768 203773 6 1.05 11.00 Prom + 210426 210465 40 -7.05 11.01 Sngl + 216066 216788 723 2 0 66 43 256 0.911 14.87 11.02 PlyA + 217145 217150 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815590f:117420877_117639975|GENSCAN_predicted_peptide_1|370_aa XIQTTIREYYKHLCANKLENLEEMDKFLDTYTLPRLNQEEIESLNRPITSSEIEALINTL PTKKSPGPDGFTAKFCQSFSRCTVQAVSRSTILGSGGQWPSSHSSTRWCSSRDWCGGSNS TFPFHTALAEAEVPKPQFLTSVYLQAQLYVETAKAWGLHPLKPWAELYVGLFQSQECSSS PAMEQSWTENDFDELREEGFRRSIYSELKEEVQTNGKEVKNLEKRLDEWLTRITNAEKSL KDMMELKTTAQELRDECTSLSSRLDQLEEWNQVHSQILPEVQGGADVVKGGWTRKNFQRK KRRVGGGGSKMKVMQVESLASEEGETLGLCAEEKPHEDTASRWPSASQGEASEETTTADT GIVDFQLPEL >gi568815590f:117420877_117639975|GENSCAN_predicted_CDS_1|1113_bp naaatacaaacaaccatcagagaatactataaacacctctgtgcaaacaaactagaaaat ctagaagaaatggataaattcctggacacatacaccctcccaagactaaatcaggaagaa attgaatccctgaatagaccaataacaagttctgaaattgaagcactaattaatacccta ccaaccaaaaaaagcccaggaccagatggattcacagccaaattctgccagagcttttcc aggtgcacagtgcaagctgtcagtagatctaccattctgggatctggaggacagtggcct tcttctcacagctccactaggtggtgttccagtagggactggtgtgggggctccaactcc acatttcccttccacactgccctagcagaggcagaggttcctaaaccccagttcttgact tcagtgtacttgcaggctcaactctatgtggaaactgccaaggcttggggcttgcaccct ctgaagccatgggctgagctctatgttggtctctttcagtcacaggaatgcagctcctca ccagcaatggaacaaagctggacggagaatgactttgacgagttgagagaagaaggcttc agacgatcaatctactcggagctaaaggaggaagttcaaaccaatggcaaagaagttaaa aaccttgaaaaaagattagatgaatggctaactagaataaccaatgcagagaagtcctta aaggacatgatggagctgaaaaccacagcacaagaactacgtgacgaatgcacaagcctc agtagccgactcgatcaactggaagaatggaaccaagttcacagccaaattctaccagag gtacaaggaggagctgatgtagtgaaggggggatggacaagaaaaaatttccagaggaaa aagcgaagggttggaggaggagggagcaagatgaaagtaatgcaagttgagtccctggcc tcagaagagggagagacattagggctgtgcgcagaggaaaaaccacatgaggacacagcg agcaggtggccatctgcaagccaaggagaggcctcagaggaaaccacaactgctgacacc gggattgtagacttccagcttccagaactgtga >gi568815590f:117420877_117639975|GENSCAN_predicted_peptide_2|200_aa MAILPKVIYRFNAIPIKLPMTFFTELEKTTLNFIWNQKRACIAKTILSQKNKAGGITLPD FKLYYKAIVTKTAWYWYQNRDTDQWNRTEPSEIIPHIYNYLIFDKPEKNKKWGKDSLFNK WCWENWLAICRKLKLDPFLTPYTKINSRWNKDLNVRPKTIKTLEENLGNTIQDIGMGKDF MSKTPKAMATKAKTDKWDLN >gi568815590f:117420877_117639975|GENSCAN_predicted_CDS_2|603_bp atggccatactgcccaaggtaatttatagattcaatgccatccccatcaagctaccaatg actttcttcacagaattggaaaaaactactttaaacttcatatggaaccaaaaaagggcc tgcattgctaagacaatcctaagccaaaagaacaaagctggaggcatcacactacctgac ttcaaactatactacaaggctatagtaaccaaaacagcatggtactggtaccaaaacaga gatacagaccaatggaacagaacagaaccctcagaaataataccacacatctacaactat ctgatctttgacaaacctgagaaaaacaagaaatggggaaaggattccctatttaataaa tggtgctgggaaaactggctagccatatgtagaaagctgaaactggatcccttccttaca ccttatacaaaaattaattcaagatggaataaagacttaaatgttagacctaaaaccata aaaaccctagaagaaaacctaggcaataccattcaggacataggcatgggcaaggatttc atgtctaaaacaccaaaagcaatggcaacaaaagccaaaactgacaaatgggatctaaac taa >gi568815590f:117420877_117639975|GENSCAN_predicted_peptide_3|48_aa MAYPGTLDFFGYCSFCLSYNIAQDSVPVFMGHSVDDKDKQPDNYKTVK >gi568815590f:117420877_117639975|GENSCAN_predicted_CDS_3|147_bp atggcctatccaggtactttggacttttttggctactgcagtttttgcttgagttacaac attgcacaagatagtgttcctgtcttcatgggccatagtgtagatgacaaagataaacaa ccagacaattataaaacagtgaaataa >gi568815590f:117420877_117639975|GENSCAN_predicted_peptide_4|309_aa MKQTVEDCCLSAPFVAGQGILLERVSETTLKFIWNQKRAHIAKSILSQKNKAGGITLPDF KLYYKAIVTKTAWYWYQNRDIDQWNRTEPSEIMPHIYNYLIFDKPEKNKKWGKDSLFNKW CWENWLAICRKLKLDPFLTPYTKINSRWNKDLNVRPKTIKTLEENLGNTIQDIGMGKDFM SKTPKAMATKAKIDKWDLIKLKSFCTAKETTIRVNRQPTEWEKISATYSYDKGLISRIYN ELKQIYRKKTNNPINKWVKDMNRHFSKEDIYAAKRHMKRCSSSLAIREMQIKTTMRYHLT PVRMTIIKK >gi568815590f:117420877_117639975|GENSCAN_predicted_CDS_4|930_bp atgaagcaaacagttgaagactgttgtctaagcgcaccctttgtagctgggcagggaatc cttcttgaaagggtatctgaaactactttaaagttcatatggaaccaaaaaagagcccac attgccaagtcaatcctaagccaaaagaacaaagctggaggcatcacactacctgacttc aaactatactacaaggctatagtaaccaaaacagcatggtactggtaccaaaacagagat atagaccaatggaacagaacagagccctcagaaataatgccgcatatctacaactatctg atctttgacaaacctgagaaaaacaagaaatggggaaaggattccctatttaataaatgg tgctgggaaaactggctagccatatgtagaaagctgaaactggatcccttccttacacct tatacaaaaattaattcaagatggaataaagacttaaatgttagacctaaaaccataaaa accctagaagaaaacctaggcaataccattcaggacataggcatgggcaaggacttcatg tctaaaacaccaaaagcaatggcaacaaaagccaaaattgacaaatgggatctaattaaa ctaaagagcttctgcacagcaaaagaaactaccatcagagtaaacaggcaacctacagaa tgggagaaaatttctgcaacctactcatatgacaaagggctaatatccagaatctacaat gaactcaaacaaatttacaggaaaaaaacaaacaaccccatcaacaagtgggtgaaggat atgaatagacacttctcaaaagaagacatttatgcagccaaaagacacatgaaaagatgc tcatcatcactggccatcagagaaatgcaaatcaaaaccacaatgagataccatctcaca ccagttagaatgacgatcattaaaaagtaa >gi568815590f:117420877_117639975|GENSCAN_predicted_peptide_5|267_aa MRKIQQKNTENSKAQSGSSPPNDCKPSPTRVQNWMEDEMDELTDVASVDRSMRQNINRDI QDLSSPLDQVDLIDIYRTLHPKTAEYTFFSVLHGTYSKINHIIKSKTLLSKLKRTKIITN SLSDHSAIKLELRIKKLTENHTTTWKVNNLLLKDSWVNNEIKSEIEKFFETNENKETMYQ NLWDAAKAVLRGKFIALNAHIKKLKRPQIDTLTSQLKELENQGQTNPKASRRQEITNIRV ECKEINTGKTLQKISESRSWLFEKLIK >gi568815590f:117420877_117639975|GENSCAN_predicted_CDS_5|804_bp atgagaaagattcaacaaaaaaatactgaaaactcaaaagcccagagtggctcttctcca ccaaatgattgcaaaccctctccaacaagagtgcagaactggatggaggatgagatggac gaattgacagacgtagcttcagtagacagatcaatgagacagaacattaacagggatatt caggacctgagctcacctctggatcaagtggacctgatagatatctacagaactctccac ccaaaaacagcagaatatacattcttctcagtgctacatggcacttactctaaaatcaac cacataattaaaagtaaaacactcctcagcaaattgaaaagaactaaaatcataacaaac agtctctcagaccacagtgcaatcaaattagaactcaggattaagaaacttactgaaaac cacactactacatggaaagtgaacaacctgctcctgaaagactcctgggtaaataatgaa attaagtcagaaatcgagaagttctttgaaaccaatgagaacaaggagacaatgtaccag aatctttgggatgcagctaaagcagtgttaagagggaaatttatagcactaaatgcccac atcaaaaagctaaaaagacctcaaattgacaccctaacatcacaactaaaagaactagag aaccaagggcaaacaaaccccaaagccagcagaagacaagagataaccaacatcagagtg gaatgcaaggagataaacacaggaaaaacccttcaaaaaatcagtgaatccaggagctgg ctgtttgaaaaattaataaagtag >gi568815590f:117420877_117639975|GENSCAN_predicted_peptide_6|63_aa MKNLNAGTSPPKDCTNSPAMVPNQNRSSEITGKQDKHGLKECSTRSRKMLKIHTKKLLKQ SRK >gi568815590f:117420877_117639975|GENSCAN_predicted_CDS_6|192_bp atgaaaaatctgaatgcagggacatcaccaccaaaggattgcactaactctccagcaatg gttcctaaccaaaatagaagctcagaaataacaggtaaacaagacaagcatggattgaaa gaatgctcaacaagatcaaggaaaatgttgaaaatccacacaaagaagcttctaaaacaa tccaggaaatga >gi568815590f:117420877_117639975|GENSCAN_predicted_peptide_7|178_aa MSTPPLAASGMAPGPFAGPQAQQAAREVNTASLCRIGQETVQDIVYRTMEIFQLLRNMQL PNGVTYHTGTYQDRLTKLQDNLRQLSVLFRKLRLVYDKCNENCGGMDPIPVEQLIPYVEE DGSKNDDRAGPPRFASEERREIAEVNKGCKGLSEQVVEKKKSISENGANIISLYISHP >gi568815590f:117420877_117639975|GENSCAN_predicted_CDS_7|537_bp atgtccacccctccgttggccgcgtcggggatggcgcccgggcccttcgccgggccccag gctcagcaggccgcccgggaagtcaacacggcgtcgctgtgccgcatcgggcaggagaca gtgcaggacatcgtgtaccgcaccatggagatcttccagctcctgaggaacatgcagctg ccaaatggtgtcacttaccacactggaacatatcaagaccggttaacaaagctacaggat aatcttcgccaactttcagttctcttcaggaagctgagattggtatatgacaaatgcaat gaaaactgtggtgggatggatcccattccagtcgagcaacttattccatatgtggaagaa gatggctcaaagaatgatgatcgggctggcccacctcgttttgctagtgaagagaggcga gaaattgctgaagtaaataaaggttgtaaaggtctttctgaacaggtcgtggaaaagaaa aagagcatatctgagaatggcgctaacatcatctctctttatatttcccacccctag >gi568815590f:117420877_117639975|GENSCAN_predicted_peptide_8|150_aa MGKDFMAKTPKAMATKAKMDKWDLIKLKSFCTAKETSISVNRQPIESALGSRYIRNPGIQ PEKESEEVQEVLVPPPPEKLKLQMYHFRAPKTFLKSFEELRAKKSKCVTDIINLSSICLN DTVFYTTLQDAEIHFYGKSGCGSWFGISWC >gi568815590f:117420877_117639975|GENSCAN_predicted_CDS_8|453_bp atgggcaaagacttcatggctaaaacaccaaaagcaatggcaacaaaagccaaaatggac aaatgggatctaattaaactaaagagcttctgcacagcaaaagaaactagcatcagtgtg aacaggcaacctatagaatctgcccttggctccaggtatatccgtaatcctggaatacag ccagagaaggaatcagaagaggttcaagaggttcttgtgcctccacctcctgagaagctg aaattacagatgtaccactttagagctcctaaaacatttctgaagagttttgaagagttg agagccaagaaatccaaatgtgttactgacataataaatctctcctctatttgtctgaac gatacagtgttttacactacacttcaagatgcagagatacacttctatggtaaaagtggc tgtggcagctggtttggcatcagctggtgctga >gi568815590f:117420877_117639975|GENSCAN_predicted_peptide_9|195_aa MGPSYMEPERLSPTRGQLDFLCTSGSLEGRRRGWTGTQKQPIFTCTSATEFRGAYRIEKK TDNKQTLRLGKGPRTVPDTLLGWFSKTGKSNGHVKCSRVSRILGTEYGEKNQKAQRKGRR NNLFFQGSGVSDSSIVEKFICVYPLQIRTDNGQCCYKTGLSASNVGDDGPEWQKPGGITQ SPGRRWAQLPYWHPG >gi568815590f:117420877_117639975|GENSCAN_predicted_CDS_9|588_bp atgggcccttcctacatggagcctgaaaggctatcgcccacaagaggccaactggacttt ctctgcacctctggttccctagaaggaagaaggagaggctggacagggactcagaagcag ccaatattcacttgtactagtgccacggaatttcgtggagcctacagaatagagaagaag acagacaataaacagactttacgtcttggtaagggcccaagaactgtacctgataccctg ctggggtggttctcaaagactggaaaaagtaatggtcatgttaaatgcagtagagtttcc agaattttaggcacagagtatggagagaagaatcaaaaggctcagagaaagggccgaagg aacaacttgttttttcagggttctggtgttagtgacagcagcattgttgaaaagttcatt tgtgtctatcctctgcaaatcaggactgacaatgggcaatgctgttataaaactggcctt tctgccagcaatgtgggtgatgatggtccagagtggcagaagccaggtggcatcactcaa tcaccaggaagaagatgggcacaactaccttactggcatccaggttga >gi568815590f:117420877_117639975|GENSCAN_predicted_peptide_10|248_aa MPKLDGEREERTEKLCLRLETSLIQRDLCHLLWSSISKSFQVLNPFENPMESMDMLQKNK CTVLAHFHTADKDIPETGQFTKERSSIGLAVPRGWGGLTIMVEGKEEQVTSYMEAAARMQ WDNHGLLQPRTPQLKQSNNIVMVNTDCQLDWIEGYKVLILDYQVLEKEQGRMLKEVLRQS VTQAKLQEIGDESQPCLHLAASLCQLQQTVSACARLCSHHLPPPTGKLIMQIKAATVVQL KEPGRRNQ >gi568815590f:117420877_117639975|GENSCAN_predicted_CDS_10|747_bp atgcctaagctagatggagagagagaagagaggacagagaagctttgcctaaggctagag acatccttgattcagagagatttgtgccatttactatggagcagtatttcaaaaagtttc caggtcctgaatccctttgagaatccaatggaaagtatggacatgcttcaaaaaaataaa tgcactgtattagcccattttcacactgctgataaagacatacctgagactggacaattt acaaaagaaaggagttcaattggcctcgcagttccacgtggatggggaggcctcacaatc atggtggaaggcaaggaggagcaagtcacatcttacatggaggcagcggctagaatgcag tgggacaaccatggcttgctgcaacctcgaactcctcagctcaaacaatccaacaatatt gtgatggttaatactgactgtcaacttgattggattgaaggatacaaagtattgatcctg gactatcaggtccttgaaaaagaacaaggaagaatgcttaaagaggtgttaaggcagtca gttactcaagccaagctgcaggagataggggatgaatcacaaccctgcttacaccttgca gcttcactatgtcagttgcaacaaacagtgagtgcctgtgctcgcctctgctcacatcat ctccctccaccaacaggcaagttaattatgcaaataaaggcagcaactgttgtgcagttg aaagagcctggtcgcagaaaccagtaa >gi568815590f:117420877_117639975|GENSCAN_predicted_peptide_11|240_aa MALRPDGFTAKFYQRYNEELVPFLLKLFQSIEKEGILSNSFYEASIILILKLGRDTTKKE NFRPISLMNIDAKILNKILANRIQQHIKKLIYHDQVGFIPGMQGWFNIHKSINVMQHINR TKDKNHVIISIDAENAFDKIQQPFMLKTLNKLGTDGTYLKIIRAICDKLTANIILNGQKL EAFPLKTGTRQGCPLSPLLVNIVLEVLARAIRQEKEIKGIQLGKEEDKLSLFADDMIVYI >gi568815590f:117420877_117639975|GENSCAN_predicted_CDS_11|723_bp atggccctcagaccagatggattcacagccaaattctaccagaggtacaatgaggagctg gtaccattccttctgaaactattccaatcaatagaaaaagagggaatcctctctaactca ttttatgaggccagcatcatcctgatactaaagctgggcagagacacaaccaaaaaagag aattttagaccaatatccttgatgaacattgatgcaaaaatcctcaataaaatactggca aaccgaatccagcagcacatcaaaaagcttatctaccatgatcaagtgggcttcatccct gggatgcaaggctggttcaacatacacaaatcaataaatgtaatgcagcatataaacaga accaaagacaaaaaccacgtgattatttcaatagatgcagaaaatgcctttgacaaaatt caacaacccttcatgctaaaaactctcaataaattaggtactgatgggacgtatctcaaa ataataagagctatctgtgacaaactcacagccaatatcatactgaatgggcaaaaactg gaagcattccctttgaaaactggcacaagacagggatgccccctctcaccactcctagtc aacatagtgttggaagttctggccagggcaatcaggcaggagaaggaaataaagggtatt caattaggaaaagaggaagacaaattgtccctgtttgcagatgacatgattgtgtatatc tag