GENSCAN 1.0 Date run: 5-Nov-116 Time: 05:26:55 Sequence gi568815593f:110639120_110861779 : 222660 bp : 36.07% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Term + 381 598 218 1 2 92 48 100 0.810 2.72 1.02 PlyA + 1541 1546 6 1.05 2.05 PlyA - 1940 1935 6 1.05 2.04 Term - 11640 11524 117 0 0 137 38 2 0.512 -2.34 2.03 Intr - 15813 15613 201 0 0 71 86 149 0.727 11.56 2.02 Intr - 42321 42299 23 1 2 100 91 25 0.066 0.34 2.01 Init - 43693 43681 13 1 1 88 110 -11 0.088 1.60 2.00 Prom - 49957 49918 40 -5.55 3.00 Prom + 52745 52784 40 -3.85 3.01 Sngl + 54228 54764 537 2 0 88 45 491 0.740 40.63 3.02 PlyA + 55106 55111 6 1.05 4.00 Prom + 55638 55677 40 -6.15 4.01 Init + 55731 56498 768 2 0 49 44 272 0.002 13.63 4.02 Term + 57758 58513 756 1 0 34 48 256 0.012 8.64 4.03 PlyA + 58572 58577 6 -0.45 5.00 Prom + 58846 58885 40 -3.65 5.01 Init + 68196 68331 136 2 1 109 3 71 0.270 1.05 5.02 Term + 71777 71964 188 0 2 45 36 168 0.886 3.97 5.03 PlyA + 72053 72058 6 1.05 6.00 Prom + 77506 77545 40 -2.55 6.01 Init + 100001 100283 283 1 1 70 69 375 0.930 31.05 6.02 Intr + 102928 102970 43 2 1 92 90 46 0.445 1.58 6.03 Intr + 117583 117640 58 1 1 65 99 26 0.222 -0.63 6.04 Term + 122085 122663 579 2 0 71 50 337 0.992 21.70 6.05 PlyA + 122868 122873 6 1.05 7.00 Prom + 133872 133911 40 -3.95 7.01 Sngl + 135857 136828 972 1 0 44 37 323 0.943 19.28 7.02 PlyA + 137138 137143 6 -0.45 8.00 Prom + 137187 137226 40 -11.44 8.01 Init + 137346 138949 1604 2 2 42 53 438 0.306 27.60 8.02 Intr + 174468 174644 177 0 0 90 78 50 0.029 2.31 8.03 Term + 184539 184773 235 0 1 76 51 154 0.076 5.31 8.04 PlyA + 186190 186195 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 55308 56582 1275 2 0 71 43 347 0.861 24.16 S.002 Term - 165269 165046 224 0 2 75 47 160 0.856 6.70 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815593f:110639120_110861779|GENSCAN_predicted_peptide_1|72_aa XPSHVHEDCDDYTSSGIMFLPSKLDRNKEEYRQERHVFFKSKSTKSTGNVAVFTSLCSGP GHVTTPNQITSK >gi568815593f:110639120_110861779|GENSCAN_predicted_CDS_1|219_bp nttccctctcatgtgcatgaggactgtgatgactacacaagctcaggcatcatgttctta ccaagcaaactagacaggaacaaggaagagtataggcaggaaaggcatgtctttttcaaa agcaagtcaacaaaaagtacaggaaatgtagctgtctttacatcactgtgctcaggacca ggtcatgtgactacccctaatcaaatcactagcaagtga >gi568815593f:110639120_110861779|GENSCAN_predicted_peptide_2|117_aa MEGSGEKLKLEQVSELPFTISSKRIKYLGIQLTRDVKGLFKENYKPLLNEMKEDTNKWKN IPCSWVGRINIVKMAILPKDPIQDTTLHLVIMSPLALLGWDSFSDFVFNDHDNFEEY >gi568815593f:110639120_110861779|GENSCAN_predicted_CDS_2|354_bp atggagggatcaggtgagaaactgaaacttgaacaggtgagtgaactcccattcacaatt tcttcaaagagaataaaatacttaggaatccaacttacaagggacgtgaagggcctcttc aaggagaactacaaaccgctgctcaatgaaatgaaagaggacacaaacaaatggaagaac attccatgctcatgggtaggaagaatcaatatcgtgaaaatggccatactgcccaaggat cccatacaggacactacattacatttagttataatgtctcctttggctcttctaggctgg gacagtttctcagactttgtttttaatgaccatgacaattttgaagagtactag >gi568815593f:110639120_110861779|GENSCAN_predicted_peptide_3|178_aa MGKKQSRKTGNSKNQSASPPPKECSSSPATEESWIENDFDKLREEGFRRSNYSELKEEVP TNGKEVKNPEKRLDEWLTRITNAEKSLKDLMELKTTAQQLRDESTSLSSRCDQLEERVSM MEDEMNEMKQEGKFREKRIKRNKQSLQEIWDYVKRPNLRLIGVPESDGENGTKLENTL >gi568815593f:110639120_110861779|GENSCAN_predicted_CDS_3|537_bp atggggaaaaaacagagcagaaaaactggaaactctaaaaatcagagtgcctctcctcct ccaaaggaatgcagctcctcaccagcaacagaagaaagctggatagagaatgactttgac aagttgagagaagaaggcttcagaagatcaaactactctgagctaaaggaggaagttcca accaatggcaaagaagttaaaaaccctgaaaaacgattagatgaatggctaactagaata accaatgcagagaagtccttaaaggacctgatggagctgaaaaccacggcacaacaacta cgtgacgaatccacaagcctcagtagccgatgtgatcaactggaagaaagggtatcaatg atggaagatgaaatgaatgaaatgaagcaagaagggaagtttagagaaaaaagaataaaa agaaacaaacaaagcctccaagaaatatgggactatgtgaaaagaccaaatctacgtctg attggtgtacctgaaagtgacggggagaatggaaccaagttggaaaacactctgtag >gi568815593f:110639120_110861779|GENSCAN_predicted_peptide_4|507_aa MGDFNTPLSTLDRSKRQKVHKDILELNTALHQEDLIDIYRTLHPKSTEYTFFSAPHHTYS KIEHIVGSKALLSKCKRTEIITNCLSDHSAIKLELSIKKLTQNHSTTWKLNNLLLNDYWV HKEMKAEIKMFFETNENKDTTYQNLWDAFKAVCRGKFIALNAHKRKQERSKIDTVTSQLK ELEEQEQTHSKASRRQAITKIRAELKGIETQKTPQKINESRSWFFEKINKIDRLLARLIK KKREKNQIDTIKNDKGVIYRFNAIPIKLPMSFFTELEKTTLKFIWNQKRARIAKSILSQK NKAGGITLPDFKLYYKPTVTKTAWYWYQNRDTDQRNTTEPSEIMPHICNHLIFDKPDKNK KWGKDSLFKKWCWENWLATHRKMKPDPFLTPYTKINSRWIKDLHVRPKTMKTLEENLGNI IQDIGMGKDFMSKTPKAMATKAKIDKWDLIKLKSFCTAEETTIRVNRQPTEWEKIFTIYS SDKGLISRIYNELKQIYKKKTNNPINK >gi568815593f:110639120_110861779|GENSCAN_predicted_CDS_4|1524_bp atgggagactttaacaccccactgtcaacattagacagatcaaagagacagaaagttcac aaggatatcctggaattgaacacagctctgcaccaagaggacttaatagacatctacaga actctccaccccaaatcaacagaatatacattcttttcagcaccacaccacacctattcc aaaattgaacacatagttggaagtaaagcactcctcagcaaatgtaaaagaacagaaatt ataacaaactgtctctcagaccacagtgcaatcaaactagaactcagcattaagaaactc actcaaaaccactcaactacatggaaactgaacaacctgctcctgaatgactactgggta cataaagaaatgaaggcagaaataaagatgttcttcgaaaccaatgagaacaaagacaca acataccagaatctctgggacgcattcaaagcagtgtgtagagggaaatttatagcacta aatgcccacaagagaaagcaggaaagatctaaaattgacacggtaacatcacaattaaaa gaactagaggagcaagagcaaacacattcaaaagctagcagaaggcaagcaataactaaa atcagagcagaactgaagggaatagagacacaaaaaacccctcaaaaaatcaatgaatcc aggagctggttttttgaaaagatcaacaaaattgatagactgctagcaagactaataaag aagaaaagagagaagaatcaaatagacacaataaaaaatgataaaggggtaatttataga ttcaatgccatccccatcaagctaccaatgagtttcttcacggaattggaaaaaactact ttaaagttcatatggaaccagaaaagagcccgcattgccaagtcaattctgagccaaaag aacaaagctggaggcatcacgctacctgacttcaaactatactacaagcctacagtaacc aaaacagcatggtactggtaccaaaacagagatacagaccaacggaacacaacagagccc tcagaaataatgccgcatatctgcaaccatctgatctttgacaaacctgacaaaaacaag aaatggggaaaggattccctatttaagaaatggtgctgggaaaactggctagccacacat agaaagatgaaaccagatcccttccttacaccttatacaaaaattaattcaagatggatt aaagacttacatgtcagacctaaaaccatgaaaactctagaagaaaacctaggcaatatc attcaggacataggcatgggcaaggacttcatgtctaaaacaccaaaagcaatggcaaca aaagccaaaattgacaaatgggatctaattaaactgaagagcttctgcacagcagaagaa actaccatcagagtgaacaggcaacctacagaatgggagaaaatttttacaatctactca tctgacaaagggctaatatccagaatctacaatgaactcaaacaaatttacaagaaaaaa acaaacaaccccatcaacaagtag >gi568815593f:110639120_110861779|GENSCAN_predicted_peptide_5|107_aa MAQKKNVCNWRRESAVTAGLNVEINAALSQWITKLCWAQPEPVHEILEVLARAIRQEKET KGIQIGKKEIKLSLFANDMTLYLEESKDFTKKRLELTNKFSKVAEYK >gi568815593f:110639120_110861779|GENSCAN_predicted_CDS_5|324_bp atggcacagaaaaagaatgtgtgtaattggaggagggagagtgcagtaactgcaggactt aacgttgaaatcaatgctgccctgtcacagtggataacaaagctctgctgggcacagcca gaacctgtgcatgaaatactggaagtcttagctagagcaatcagacaagagaaagaaaca aagggcatccaaattggaaagaaagaaatcaaattatccttgtttgcaaatgatatgacc ttgtatttggaagaatctaaagacttcaccaaaaaacgattagaactgacgaacaaattc agtaaagttgcagaatacaaataa >gi568815593f:110639120_110861779|GENSCAN_predicted_peptide_6|320_aa MHPRRPDGFDGLGYRGGARDEQGFGGAFPARSFSTGSDLGHWVTTPPDIPGSRNLHWGEK SPPYGVPTTSTPYEGPTEEPFSSGGGGSVQGQSSEQLNRFAGFGIGLASLTYVVAMPFYS ASLIETVQSEIIRDNTGILECVKEGIGRVIGMGVPHSKRLLPLLSLIFPTVLHGVLHYII SSVIQKFVLLILKRKTYNSHLAESTSPVQSMLDAYFPELIANFAASLCSDVILYPLETVL HRLHIQGTRTIIDNTDLGYEVLPINTQYEGMRDCINTIRQEEGVFGFYKGFGAVIIQYTL HAAVLQITKIIYSTLLQNNI >gi568815593f:110639120_110861779|GENSCAN_predicted_CDS_6|963_bp atgcatccgcggcgcccggacggatttgatggcttgggctaccggggtggtgcccgggac gagcagggctttggcggcgccttccctgcaaggtccttcagcaccgggtcggacctgggc cactgggtgacgactcccccagatatccccggcagccgcaacctgcactggggcgagaag agcccgccctacggcgtgcccaccacctccaccccgtacgaaggccccacggaggaaccc ttttccagtggcggcggcggcagtgtgcaggggcagagcagtgaacagctgaatagattt gctggatttggtattggacttgcaagcctaacttacgtggtggcaatgcctttttattca gcaagtctgattgaaacagtgcagagtgagataattcgagataatactggcattttggag tgtgttaaagaaggaattggaagagtgataggcatgggagtgcctcatagcaaacgactt cttccgcttctttccttgatcttccctacggtgcttcatggagttcttcattacatcatc agctcagttattcagaagtttgtcctactaattctaaagagaaagacttacaatagccac ctagctgagagcactagccctgtgcagagtatgttggatgcttattttccagaacttatt gctaactttgctgccagtctttgttctgacgttatactttacccattggaaacagttttg caccgccttcacattcaaggaacacgcacaataattgacaatacagaccttggctatgaa gtgcttccaattaatacacaatatgagggaatgagagactgtatcaataccataaggcag gaggaaggagtgtttggtttttataaagggtttggtgctgttataatacagtacacactg catgcagctgttttacagattaccaaaattatttactctacacttcttcaaaataacatt tga >gi568815593f:110639120_110861779|GENSCAN_predicted_peptide_7|323_aa MVKGSIQQEELTIVNIYAPNTGAPRFIKQVLSDLQRDLDSHTLIMGDFNTPLSTLDRSMR QRVNKDTQELNSALHQVDLIDIYRTLHPKSTEYTFFSAPHHTYSKIDHILGSKALLSKCK RTEIVTNYLSDHSAIKLELRIKNLTQNRSTTWRLNNLLLNDYWVHNEMKAEIKMLFETNE NKDTTYQNLWDTFKAVCRGKFITLNAHKRKQERSKIDTLTSQLKELEKQEQTYPKASRRQ ETTKIRAELKEIETQKTLQKINESRSWFFERINKIDRPQARLIKKKREKNQIDTIKNDKG DITTESHRNTNYHQRILQTPLRK >gi568815593f:110639120_110861779|GENSCAN_predicted_CDS_7|972_bp atggtaaagggatcaattcaacaagaagagctaactatcgtaaatatatatgcacccaat acaggagcaccaagattcataaagcaagtcctgagtgacctacaaagagacttagactct cacacattaataatgggagactttaacaccccactgtcaacattagacagatcaatgaga cagagagtcaacaaggatacccaggaattgaactcagctctgcaccaagtggacctaata gacatctacagaactctccaccccaaatcaacagaatatacatttttttcagcaccacac cacacctattccaaaattgaccacatacttggaagtaaagctctcctcagcaaatgtaaa agaacagaaattgtaacaaactatctctcagaccacagtgcaatcaaactagaactcagg attaagaatctcacccaaaaccgctcaactacatggagactgaacaacctgctcctgaat gactactgggtacataacgaaatgaaggcagaaataaagatgctctttgaaaccaatgag aacaaagacacaacataccagaatctctgggacacattcaaagcagtgtgtagagggaaa tttataacactaaatgcccacaagagaaagcaggaaagatccaaaattgacaccctaaca tcacaattaaaagaactggaaaagcaagagcaaacatatccaaaagctagcagaaggcaa gaaacaactaaaatcagagcagaactgaaggaaatagagacacaaaaaacccttcaaaaa attaatgaatccaggagctggttttttgaaaggatcaacaaaattgatagaccacaagca agactaataaagaaaaaaagagagaagaatcaaatagacacaataaaaaatgataaaggg gatatcaccaccgaatcccacagaaatacaaactaccatcagagaatactacaaacacct ctacgcaaataa >gi568815593f:110639120_110861779|GENSCAN_predicted_peptide_8|671_aa MIISIDAEKAFDKIQQTFMLKTLNKLGIDGTYLKIIRAIYDKPTANIILNGQKLEAFPLK TGTRQGCPLSPLLFNIVLEVLARAIRQEKEIKGIQLGKEEIKLSLFADNMIVYLENPIVS AQNLLKLISNFSKVSGYKINVQKSQAFLYTNNRQTESQIMSELPFTIASNRIKYPGIQLT RDVKDLFKENNKPLLKEIKEDTNKWKNIPCSWVGRINIVKMAILPKVIYRLNAIPIKLPM PFFTELEKTTLKFIWNQKRAHITKSILSQKNKAGGITLPDFKLYYKVTVTKTAWYWYQNR DIDQWNRTEPSEIMPHIYNYLIFDKPEKNKQWGKDSLFNKWCWENWLAICRKLKLDPFLT PYTKINSRWNKDLNIRPKTIKTLEENLGITIQDLGMGKDFMSKTPKAMATKDKTDKWDLI KLKSFCTAKETTIRVNRQPTKWEKIFTTYSSDKGLISRIYNELQQIYKKKTNNPIKKWAM DMNRHFSKEDIYAAKKHMKKCSSSLAIREMQIKTTMRYHLTPVRMVIIEKSGNNRLECLQ ALLQVKSISLVQDYELFYDLILHPGKFLRQISKTRDGDNGTLSLRDTFILGAEWTRVQDL LNMGTENAITSWPSSLHWQTAAAPCDRSGCKAKPAAKLQAKAGQGHCQLGVSSWQVIEKN PVSFSVGSSGI >gi568815593f:110639120_110861779|GENSCAN_predicted_CDS_8|2016_bp atgattatctcaatagatgcagaaaaagcctttgacaaaattcaacaaaccttcatgcta aaaactctcaataaattaggtattgatgggacatatctcaaaataataagagctatctat gacaaacccacagccaatatcatactgaatgggcaaaaactggaagcattccctttgaaa actggcacaagacagggatgccctctctcaccactcctattcaacatagtgttggaagtt ctggccagggcaattaggcaggagaaggaaataaagggtattcaattaggaaaagaggaa atcaaattgtccctgtttgcagacaacatgattgtatatctagaaaaccccattgtctca gcccaaaatctccttaagttgataagcaacttcagcaaagtctcaggatacaaaatcaat gtacaaaaatcacaagcattcttatacaccaacaacagacaaacagagagccaaatcatg agtgaacttccattcacaattgcttcaaacagaataaaatacccaggaatccaacttaca agggatgtgaaggacctcttcaaggagaacaacaaaccactgctcaaggaaataaaagag gatacaaacaaatggaagaacattccatgctcatgggtaggaagaatcaatatcgtgaaa atggccatactgcccaaggtaatttatagactcaatgccatccccatcaagctaccaatg cctttcttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaagagcc cacatcaccaagtcaatcctaagccaaaagaacaaagctggaggcatcacactacctgac ttcaaactatactacaaggttacagtaaccaaaacagcatggtactggtaccaaaacaga gatatagatcaatggaacagaacagagccctcagaaataatgccgcatatctacaactat ctgatctttgacaaacctgagaaaaacaagcaatggggaaaggattccctatttaataaa tggtgctgggaaaactggctagccatatgtagaaagctgaaactggatccattccttaca ccttatacaaaaattaattcaagatggaataaagacttaaacattagacctaaaaccata aaaaccctagaagaaaacctaggcattaccattcaggacttaggcatgggcaaggacttc atgtctaaaacaccaaaagcaatggcaacaaaagacaaaactgacaaatgggatctaatt aaactaaagagcttctgcacagcaaaagaaactaccatcagagtgaacaggcaacctaca aaatgggagaaaattttcacaacctactcatctgacaaagggctaatatccagaatctac aatgaactccaacaaatttacaagaaaaaaacaaacaaccccatcaaaaagtgggcgatg gacatgaacagacacttctcaaaagaagacatttatgcagccaaaaaacacatgaaaaaa tgctcatcatcactggccatcagagaaatgcaaatcaaaaccacaatgagatatcatctc acaccagttagaatggtgatcattgaaaagtcaggaaacaacaggctcgaatgtctccag gctctgttgcaagtcaagtcaatttctttagtgcaagattatgaactgttttatgacctg attctccacccagggaaatttctgagacagatatctaagactagggatggagacaatggc accctctctctgagggacactttcattctaggagctgaatggacaagagttcaggaccta ctgaacatgggtacagagaacgctataacatcatggccatcttccctccactggcagaca gcagctgctccatgtgacagaagtggctgtaaagccaagccagccgcgaagctgcaggcc aaggcagggcaagggcattgccagctgggtgtctccagctggcaagtaatcgagaaaaat cctgtgtcattttctgtgggctcatctgggatctga