GENSCAN 1.0 Date run: 5-Nov-116 Time: 00:26:16 Sequence gi568815590r:81421683_81622226 : 200544 bp : 37.71% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Sngl + 6532 7446 915 0 0 42 43 667 0.998 53.66 1.02 PlyA + 7674 7679 6 1.05 2.00 Prom + 7843 7882 40 -6.15 2.01 Init + 7936 8973 1038 0 0 44 41 511 0.894 36.03 2.02 Term + 9261 10127 867 2 0 -14 38 392 0.557 15.68 2.03 PlyA + 10303 10308 6 1.05 3.12 PlyA - 11100 11095 6 1.05 3.11 Term - 17006 16605 402 2 0 40 36 163 0.210 0.47 3.10 Intr - 22919 22818 102 2 0 70 95 75 0.876 5.85 3.09 Intr - 23307 23135 173 1 2 95 84 84 0.979 7.44 3.08 Intr - 25731 25632 100 0 1 52 86 66 0.137 1.56 3.07 Intr - 36751 36707 45 1 0 99 70 31 0.147 0.09 3.06 Intr - 37021 36920 102 1 0 73 103 50 0.940 4.45 3.05 Intr - 37655 37483 173 0 2 71 77 162 0.458 12.14 3.04 Intr - 47881 47824 58 1 1 105 68 -1 0.003 -2.86 3.03 Intr - 57833 57732 102 2 0 89 89 90 0.976 8.65 3.02 Intr - 58916 58744 173 0 2 75 100 144 0.917 13.04 3.01 Init - 61485 61413 73 0 1 46 82 64 0.522 2.88 3.00 Prom - 72542 72503 40 -5.05 4.08 PlyA - 72859 72854 6 1.05 4.07 Term - 100669 99998 672 1 0 32 41 575 0.494 39.96 4.06 Intr - 105439 105338 102 1 0 95 89 97 0.960 9.95 4.05 Intr - 107928 107756 173 1 2 84 100 116 0.966 11.14 4.04 Intr - 118061 117936 126 0 0 122 61 81 0.128 8.53 4.03 Intr - 141142 140960 183 2 0 78 79 77 0.032 4.64 4.02 Intr - 147834 147731 104 2 2 77 34 52 0.045 -2.40 4.01 Init - 149058 148910 149 0 2 59 65 222 0.986 16.61 4.00 Prom - 164512 164473 40 -4.35 5.03 PlyA - 164550 164545 6 1.05 5.02 Term - 183999 183899 101 1 2 96 50 50 0.554 -0.69 5.01 Init - 185009 184889 121 2 1 31 58 146 0.679 6.30 5.00 Prom - 192331 192292 40 -2.45 6.06 PlyA - 193932 193927 6 1.05 6.05 Term - 195741 195652 90 0 0 40 48 102 0.655 -1.86 6.04 Intr - 196437 196275 163 2 1 -9 94 161 0.480 6.06 6.03 Intr - 197134 197014 121 2 1 10 53 119 0.463 -0.77 6.02 Intr - 198431 198326 106 2 1 86 100 36 0.967 3.47 6.01 Init - 199772 199635 138 2 0 57 88 100 0.781 7.09 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 109633 109561 73 1 1 56 80 74 0.975 4.68 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815590r:81421683_81622226|GENSCAN_predicted_peptide_1|304_aa MENDFDELREEGFRRSNYSELQEDVQTKGKEVENFEKNLEECITRITNTEKCLKELMELK TKARELREECRSLRSRCNQLEERVSAMEDEMNEMKREGKFREKRIKRNEQSLQEIWDYVK RPNLHLIGVPESDGENGTKLENTLQDIIQENFPNLARQAKIHIQEIQRTPQRYSSRRATP RHIIVRFTKVEMKEKMLRAAREKGRVTLKGKPIRLTADLSAETLQARREWGPIFNILKEK NFQPRISYPAKLSFISEGEIKYFTDKQMLRDFVTTRPALKELLKEALNMERNNRYQPLQN HAKM >gi568815590r:81421683_81622226|GENSCAN_predicted_CDS_1|915_bp atggagaatgactttgacgagctgagagaagaaggcttcagacgatcaaattactctgag ctacaggaggacgttcaaaccaaaggcaaagaagttgaaaactttgaaaaaaatttagaa gaatgtataactagaataaccaatacagagaagtgcttaaaggagctgatggagctgaag accaaggctcgagaactacgtgaagaatgcagaagcctcaggagccgatgcaatcaactg gaagaaagggtatcagcaatggaagatgaaatgaatgaaatgaagcgagaagggaagttt agagaaaaaagaataaaaagaaatgagcaaagcctccaagaaatatgggactatgtgaaa agaccaaatctacatctgattggtgtacctgaaagtgatggggagaatggaaccaagttg gaaaacactctgcaggatattatccaggagaacttccccaatctagcaaggcaggccaag attcacattcaggaaatacagagaacgccacaaagatactcctcgagaagagcaactcca agacacataattgtcagattcaccaaagttgaaatgaaggaaaaaatgttaagggcagcc agggagaaaggtcgggttaccctcaaagggaagcccatcagactaacagcggatctctcg gcagaaaccctacaagccagaagagagtgggggccaatattcaacattcttaaagaaaag aattttcaacccagaatttcatatccagccaaactaagcttcataagtgaaggagaaata aaatactttacagacaagcaaatgctgagagattttgtcaccaccaggcctgccctaaaa gagctcctgaaggaagcgctaaacatggaaaggaacaaccggtaccagccgctgcaaaat catgccaaaatgtaa >gi568815590r:81421683_81622226|GENSCAN_predicted_peptide_2|634_aa MGDFNTPLSTLDRSMRQKVNKDTEELNSALHQADLIDIYRTLHPKSTEYTFFSAPHHTYS KIDHILGSKALLSKCKRTEIITNYLSDHSAIKLELRIKNLTQNRSTTWKLNNLLLNDYWV HNKMKAEIKMFFETNENKDTTYQNLWDAFKAVCRGKFIALNAHKRKQERSKIDTLTSQLK ELEKQEQTHSKVSRSQEITKTRAELKEIETQKTLQKINESRSWFFERINKIDRLLAILIK KKREKNQIDVIKNDKGDITTDPTEIETTIREYYKHLHANKLENLEEMDKFLNTYTLPRLN QEEIESLNRPITGSEIVAIINSLPTKKSPGPDGFTAEFYQRYKEELHINRAKDKNHMIIS IDAEKAFDKIQQPFMLKTLNKLGIDGMYLKIIRAIYDKPTANIILNGQKLEAFPLKTGTR QGCPLSPLLFNIVLEVLARAIRQEKEIKGIQLGKEEVKLSLFADDMIVYLENPIVSAQNL LKLISNFSKVSGYKINVQKSQAFLYTNNRQTESQIMSELPFTIASKRIKPRNPTYKGREG PLQEELQTTAQGNKRGYKQMEEHSMLMGRKNQYRENGHTAQGNLQIQCHPHQGTNAFLHR IGKNYLKVDMEPKRSPHRQINPKPKEQSWRHHTT >gi568815590r:81421683_81622226|GENSCAN_predicted_CDS_2|1905_bp atgggagactttaacaccccactgtcaacattagacagatcaatgagacagaaagtcaac aaggataccgaggaattgaactcagctctgcaccaagcggacctaatagacatctacaga actctccaccccaaatcaacagaatatacatttttttcagcaccacaccacacctattcc aaaattgaccacatacttggaagtaaagctctcctcagcaaatgtaaaagaacagaaatt ataacaaactatctctcagaccacagtgcaatcaaactagaactcaggattaagaatctc actcaaaaccgctcaactacatggaaactgaacaacctgctcctgaatgactactgggta cataacaaaatgaaggcagaaataaagatgttctttgaaaccaacgagaacaaagacaca acataccagaatctctgggacgcattcaaagcagtgtgtagagggaaatttatagcacta aatgcccacaagagaaagcaggaaagatccaaaattgacaccctaacatcacaattaaaa gaactagaaaagcaagagcaaacacattcaaaagttagcagaagccaagaaataactaaa accagagcagaactgaaggaaatagagacacaaaaaacccttcaaaaaattaatgaatcc aggagctggttttttgaaaggatcaacaaaattgatagactgctagcaatactaataaag aaaaaaagagagaagaatcaaatagacgtaataaaaaatgataagggggatatcaccacc gatcccacagaaatagaaactaccatcagagaatactacaaacacctccacgcaaataaa ctagaaaatctagaagaaatggataaattcctcaacacatacactctcccaagactaaac caggaagaaattgaatctctgaatagaccaataacaggatctgaaattgtggcaataatc aatagcttaccaaccaaaaagagtccaggaccagatggattcacagccgaattctaccag aggtacaaggaggaactgcatataaacagagccaaagacaaaaaccacatgattatctca atagatgcagaaaaagcctttgacaaaattcaacaacccttcatgctaaaaactctcaat aaattaggtattgatgggatgtatctcaaaataataagagctatctatgacaaacccaca gccaatatcatactgaatgggcaaaaactggaagcattccctttgaaaactggtacaaga cagggatgcccgctctcaccactcctattcaacatagtattggaagttctggccagggca attaggcaggagaaggaaataaagggtattcaattaggaaaagaggaagtcaaattgtcc ttgtttgcagacgacatgattgtatatctagaaaaccccattgtctcagcccaaaatctc cttaagctgataagcaactttagcaaagtctcaggatacaaaatcaatgtacaaaaatca caagcattcttatacaccaacaacagacaaacagagagccaaatcatgagtgaactccca ttcacaattgcttcaaagagaataaaacctaggaatccaacttacaagggacgtgaagga cctcttcaagaagaactacaaaccactgctcaaggaaataaaagaggatacaaacaaatg gaagaacattccatgctcatgggtaggaagaatcaatatcgtgaaaatggccatactgcc caaggtaatttacagattcaatgccatccccatcaaggtaccaatgcctttcttcacaga attggaaaaaactacttgaaagttgatatggaaccaaaaaggagcccgcatcgccaaatc aatcctaagccaaaagaacaaagctggaggcatcacactacctga >gi568815590r:81421683_81622226|GENSCAN_predicted_peptide_3|500_aa MCDAFVGTWKLVSSENFDDYMKEVGVGFATRKVAGMAKPNMIISVNGDVITIKSESTFKN TEISFILGQEFDEVTADDRKVKSTITLDGGVLVHVQKWDGKSTTIKRKREDDKLVVVIII KNLTTKIISGPLGVTGVNFAARNMAGLVKPTVTISVDGKMMTIRTESSFQDTKISFKLGE EFDETTADNRKVKSTITLENGSMIHVQKWLGKETTIKRKIVDEKMVVECKMNNIVSTRIY EKNCVELSPITMSNKFLGTWKLVSSENFDDYMKALGVGLATRKLGNLAKPTVIISKKGDI ITIRTESTFKNTEISFKLGQEFEETTADNRKTKSIVTLQRGSLNQVQRWDGKETTIKRKL VNGKMVASTQMRGSQKNNSSNITKLVSLTLPKDHTSSPAMDSSQDEISELPEKEFRTLIL KLIREAPEKSELQLKEIKDVIQDMKGKFCEIGSINKKQSHLLEIKDTLREMQNALETLSN RIKQAEERSSELEDKAFELI >gi568815590r:81421683_81622226|GENSCAN_predicted_CDS_3|1503_bp atgtgtgatgcttttgtaggtacctggaaacttgtctccagtgaaaactttgatgattat atgaaagaagtaggagtgggctttgccaccaggaaagtggctggcatggccaaacctaac atgatcatcagtgtgaatggggatgtgatcaccattaaatctgaaagtacctttaaaaat actgagatttccttcatactgggccaggaatttgacgaagtcactgcagatgacaggaaa gtcaagagcaccataaccttagatgggggtgtcctggtacatgtgcagaaatgggatgga aaatcaaccaccataaagagaaaacgagaggatgataaactggtggtggtcatcataata aaaaatctcacaacaaagataatctctggacctcttggcgtcacaggagtgaatttcgca gcccggaacatggcagggttagtgaaaccgacagtaactattagtgttgatgggaaaatg atgaccataagaacagaaagttctttccaggacactaagatctccttcaagctgggggaa gaatttgatgaaactacagcagacaaccggaaagtaaagagcaccataacattagagaat ggctcaatgattcacgtccaaaaatggcttggcaaagagacaacaatcaaaagaaaaatt gtggatgaaaaaatggtagtggaatgtaaaatgaataatattgtcagcaccagaatctac gaaaagaactgtgttgagctctcacccatcacgatgagcaacaaattcctgggcacctgg aaacttgtctctagtgagaactttgacgattacatgaaagctctgggtgtggggttagcc accagaaaactgggaaatttggccaaacccactgtgatcatcagcaagaaaggagatatt ataactatacgaactgaaagtacctttaaaaatacagaaatctccttcaagctaggccag gaatttgaagaaaccacagctgacaatagaaagaccaagagcatcgtaaccctgcagaga ggatcactgaatcaagtgcagagatgggatggcaaagagacaaccataaagagaaagcta gtgaatgggaaaatggtagcgtccacccaaatgagagggagccagaaaaacaattctagt aatataacaaaacttgtttctttaacactcccaaaagatcataccagctcaccagcaatg gattcaagccaagatgaaatctctgaattgccagaaaaagaattcagaacgttgattctt aagctaatcagggaggcaccagagaaaagtgaactccaacttaaagaaatcaaagacgtg atacaagatatgaaaggaaaattctgtgaaataggtagcataaataaaaaacaatcacat cttctggaaataaaggacacacttagagaaatgcaaaatgcactggaaactctcagcaat agaatcaaacaagcagaagaaagaagttcagagcttgaagacaaggcttttgaattaatc taa >gi568815590r:81421683_81622226|GENSCAN_predicted_peptide_4|502_aa MTGRLAAERNYPLCQELQTMGQAACRKEPPTLGPPLCGELETTGQPAAERETEDRIAKSP APFRTLQNQSVSSRCHLLEPVLKGCPEMPSRIQGLESETLGIYVVLYSTAAKLAPKPQDE VLLTLSLSTSRGVSSLLWPPLPQAHRIPCIILICLNHMIAQKQTAPASQHSTLRAAFSEQ SAEILKQSIGRASRKLGRLAKPTVTISTDGDVITIKTKSIFKNNEISFKLGEEFEEITPG GHKTKSKVTLDKESLIQVQDWDGKETTITRKLVDGKMVVPGRPPIASRPSPLHRPLRPPQ GPPAAAPAPPLQPPLFSRRHGDPVHLAGAPGLQPGLRGRHQPPHQPGALRLLLLPVHVLY FDRDDVALKNFAKYFLYQSHEEREHAEKLMKLQNQRGGRIFLQDIKKPDCDDWESGLNAM ECALYLEKNVNQSLLELYKLATDKNDPHLCDFIETHYLNEQVKAIKELSDHVTNLRKMGA PESGSAEYLFDKHTLGDSDNES >gi568815590r:81421683_81622226|GENSCAN_predicted_CDS_4|1509_bp atgacaggacgactagctgcagagagaaactaccctctctgccaagagctgcagacgatg ggacaagctgcctgcagaaaggagccacccactctagggcctcctctctgcggagagctg gagacaacgggacaaccagctgcagagagagaaactgaagacagaattgctaagtcacct gctcccttcagaactctgcagaatcagtctgtttcttctagatgccatctcttggagcca gtgttgaaaggctgtccagagatgccatctagaatccagggcctagagtcagaaacctta ggaatctatgtggtcctctattctactgcagctaagctggcacccaagccacaagatgaa gttcttctcactctttccctttccacaagcaggggagtctcttctctcctgtggccacca ctgccccaggcccacagaattccgtgtattattttgatttgtctgaatcacatgattgct cagaagcagacagcaccagcaagccagcactcaaccctaagagcagccttcagtgagcag tctgctgaaattctaaaacagagtataggaagagccagcaggaaactgggccgtttggca aaacccactgtgaccatcagtacagatggagatgtcatcacaataaaaaccaaaagcatc tttaaaaataatgagatctcctttaagctgggagaagagtttgaggaaatcacgccaggt ggccacaaaacaaagagtaaagtaaccttagataaggagtccctgattcaagttcaggac tgggatggcaaagaaaccaccataacgagaaagctggtggatgggaaaatggtggtgccc ggccggccgcccatagccagccgtccgtcacctcttcaccgccccctcagaccgccccaa ggtccccccgccgccgctccagcgccgccgctgcagccgcctctctttagtcgccgccat ggcgaccccgtccacctcgcaggagcgccaggactacaaccaggactcagaggccgccat caaccgccacatcaacctggagctctgcgcctcctacttttacctgtccatgtcttatac tttgaccgcgatgatgtggctttgaagaactttgccaaatactttctttaccaatctcat gaggagagggaacatgctgagaaactgatgaagctgcagaaccaacgaggtggccgaatc ttccttcaggatatcaagaaaccagactgtgatgactgggagagcgggctgaatgcgatg gagtgtgcattatatttggaaaaaaatgtgaatcagtcactactggaactgtacaaactg gccactgacaaaaatgacccccatttgtgtgacttcattgagacacattacctgaatgag caggtgaaagccatcaaagaattgagtgaccacgtgaccaacttgcgcaagatgggagcg cccgaatctggctcggcagaatatctcttcgacaagcacaccctgggagacagtgataat gaaagctaa >gi568815590r:81421683_81622226|GENSCAN_predicted_peptide_5|73_aa MCIAASGGAEAFYEMGIHCWDIAVAAIIVTEAGGVLMDVTGGPFHLMSRRIIAANCTALA ERIAKEIQVAPFQ >gi568815590r:81421683_81622226|GENSCAN_predicted_CDS_5|222_bp atgtgcattgcggcaagtggaggagcagaggcattttatgaaatgggaattcactgctgg gatattgcagtagctgccattattgttactgaagctggtggcgtgctaatggatgttact ggtggaccattccatttaatgtcacggagaataattgctgcaaattgtacagcattagca gaaaggatagccaaagaaattcaggtagcaccttttcaatga >gi568815590r:81421683_81622226|GENSCAN_predicted_peptide_6|205_aa MGGKKEKKKEKRFNPKKSLEKSKTMMCAWNSEMVMEREALEVAGAEIEFGVVYSCVEDKR YTVRKGKGAFYNGQKLQVSQEELKEAGLLGASEGRKLHKDAARNARTSYRDGLRKTGRSL CGKGPLRAGQGGHAYPKEWLWWARLLSMAAGEVANSAAYGFAPATLVTPLGTLSVLEWQD MPIDDVTGTFDWLYNNRGDILVACF >gi568815590r:81421683_81622226|GENSCAN_predicted_CDS_6|618_bp atggggggaaaaaaggaaaagaaaaaagaaaagcgttttaatccaaagaagagcttggag aagtcgaagaccatgatgtgtgcatggaacagtgagatggtcatggagagggaagcactg gaagtggccggggcagagatagaatttggagttgtgtacagttgtgtggaagacaagagg tacactgtcaggaaaggaaaaggtgccttttataatggtcaaaaactacaggtttcacaa gaagaactgaaggaagcggggcttcttggcgcttctgaagggagaaagctgcacaaggat gccgccaggaatgctcgcacctcctaccgggatggcctgcgtaagacggggcggtctctg tgcggaaaaggtcctctgagagcaggtcaaggtggccacgcatatcctaaagaatggttg tggtgggctagactgctgtcaatggcagctggcgaggtggccaactcagctgcatatggg tttgcaccggccacactggtgactccactaggaactctcagcgtcctagagtggcaagat atgcccattgatgatgtcactggtacttttgactggctttacaataatcgtggggatatt cttgttgcatgcttttaa