GENSCAN 1.0 Date run: 6-Nov-116 Time: 16:06:52 Sequence gi568815590r:81378868_81583167 : 204300 bp : 37.54% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Term + 7737 8120 384 2 0 2 34 260 0.546 5.90 1.02 PlyA + 8561 8566 6 1.05 2.00 Prom + 12899 12938 40 -3.65 2.01 Init + 15672 15731 60 2 0 66 94 43 0.313 4.00 2.02 Term + 30807 30935 129 2 0 109 47 88 0.223 4.00 2.03 PlyA + 31054 31059 6 1.05 3.00 Prom + 42721 42760 40 -2.65 3.01 Sngl + 49347 50261 915 2 0 42 43 667 0.998 53.66 3.02 PlyA + 50489 50494 6 1.05 4.00 Prom + 50658 50697 40 -6.15 4.01 Init + 50751 51788 1038 2 0 44 41 511 0.894 36.03 4.02 Term + 52076 52942 867 1 0 -14 38 392 0.557 15.68 4.03 PlyA + 53118 53123 6 1.05 5.12 PlyA - 53915 53910 6 1.05 5.11 Term - 59821 59420 402 1 0 40 36 163 0.210 0.47 5.10 Intr - 65734 65633 102 1 0 70 95 75 0.876 5.85 5.09 Intr - 66122 65950 173 0 2 95 84 84 0.979 7.44 5.08 Intr - 68546 68447 100 2 1 52 86 66 0.137 1.56 5.07 Intr - 79566 79522 45 0 0 99 70 31 0.147 0.09 5.06 Intr - 79836 79735 102 0 0 73 103 50 0.940 4.45 5.05 Intr - 80470 80298 173 2 2 71 77 162 0.458 12.14 5.04 Intr - 90696 90639 58 0 1 105 68 -1 0.003 -2.86 5.03 Intr - 100648 100547 102 1 0 89 89 90 0.976 8.65 5.02 Intr - 101731 101559 173 2 2 75 100 144 0.917 13.04 5.01 Init - 104300 104228 73 2 1 46 82 64 0.522 2.88 5.00 Prom - 115357 115318 40 -5.05 6.08 PlyA - 115674 115669 6 1.05 6.07 Term - 143484 142813 672 0 0 32 41 575 0.494 39.96 6.06 Intr - 148254 148153 102 0 0 95 89 97 0.960 9.95 6.05 Intr - 150743 150571 173 0 2 84 100 116 0.966 11.14 6.04 Intr - 160876 160751 126 2 0 122 61 81 0.128 8.53 6.03 Intr - 183957 183775 183 1 0 78 79 77 0.032 4.64 6.02 Intr - 190649 190546 104 1 2 77 34 52 0.045 -2.40 6.01 Init - 191873 191725 149 2 2 59 65 222 0.988 16.61 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 152448 152376 73 0 1 56 80 74 0.975 4.68 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815590r:81378868_81583167|GENSCAN_predicted_peptide_1|127_aa AEVRLLAESSDPGTGGCTAVLPPRRTVPLSAETVRVGSYGECGLLMPRSHSSPQQQQWDL SLRYVAIPDLSSPFLAWQQQWQCQPGSRVGFRPSELRSSEGHQAAAAVGLDACGTPYKFL LWSNVSV >gi568815590r:81378868_81583167|GENSCAN_predicted_CDS_1|384_bp gcagaagtgaggctgctggcagaatcctcagatccaggcactggtggctgcactgcagtt ctgccaccaaggaggacagtgcccctctcagctgaaacagtgcgggtaggtagctatggg gagtgtggtttactcatgcctcgatcccacagcagtccacagcagcagcagtgggatttg tccttaaggtatgtggcaatacctgacctctcctctccctttttggcttggcagcaacag tggcaatgtcagcctggctccagggtaggattcagaccttcagagcttagatcctcagaa gggcaccaagctgcagctgctgtgggcttggatgcttgtgggactccatataagtttctt ctctggagcaatgtctccgtgtag >gi568815590r:81378868_81583167|GENSCAN_predicted_peptide_2|62_aa MEPRHKTPSGGINVVESAAQQWQHGAERESECLGERKDSNCETLHRTQCCPVIAESKRDE LS >gi568815590r:81378868_81583167|GENSCAN_predicted_CDS_2|189_bp atggagccaagacataaaactccctctggtggaattaatgtagtggagtcagctgcacag cagtggcagcatggtgcagagagagaatccgagtgcttgggagagagaaaggacagcaat tgtgaaacattgcatcgaactcagtgctgccctgttatagcagaaagcaaaagggatgaa ctcagctga >gi568815590r:81378868_81583167|GENSCAN_predicted_peptide_3|304_aa MENDFDELREEGFRRSNYSELQEDVQTKGKEVENFEKNLEECITRITNTEKCLKELMELK TKARELREECRSLRSRCNQLEERVSAMEDEMNEMKREGKFREKRIKRNEQSLQEIWDYVK RPNLHLIGVPESDGENGTKLENTLQDIIQENFPNLARQAKIHIQEIQRTPQRYSSRRATP RHIIVRFTKVEMKEKMLRAAREKGRVTLKGKPIRLTADLSAETLQARREWGPIFNILKEK NFQPRISYPAKLSFISEGEIKYFTDKQMLRDFVTTRPALKELLKEALNMERNNRYQPLQN HAKM >gi568815590r:81378868_81583167|GENSCAN_predicted_CDS_3|915_bp atggagaatgactttgacgagctgagagaagaaggcttcagacgatcaaattactctgag ctacaggaggacgttcaaaccaaaggcaaagaagttgaaaactttgaaaaaaatttagaa gaatgtataactagaataaccaatacagagaagtgcttaaaggagctgatggagctgaag accaaggctcgagaactacgtgaagaatgcagaagcctcaggagccgatgcaatcaactg gaagaaagggtatcagcaatggaagatgaaatgaatgaaatgaagcgagaagggaagttt agagaaaaaagaataaaaagaaatgagcaaagcctccaagaaatatgggactatgtgaaa agaccaaatctacatctgattggtgtacctgaaagtgatggggagaatggaaccaagttg gaaaacactctgcaggatattatccaggagaacttccccaatctagcaaggcaggccaag attcacattcaggaaatacagagaacgccacaaagatactcctcgagaagagcaactcca agacacataattgtcagattcaccaaagttgaaatgaaggaaaaaatgttaagggcagcc agggagaaaggtcgggttaccctcaaagggaagcccatcagactaacagcggatctctcg gcagaaaccctacaagccagaagagagtgggggccaatattcaacattcttaaagaaaag aattttcaacccagaatttcatatccagccaaactaagcttcataagtgaaggagaaata aaatactttacagacaagcaaatgctgagagattttgtcaccaccaggcctgccctaaaa gagctcctgaaggaagcgctaaacatggaaaggaacaaccggtaccagccgctgcaaaat catgccaaaatgtaa >gi568815590r:81378868_81583167|GENSCAN_predicted_peptide_4|634_aa MGDFNTPLSTLDRSMRQKVNKDTEELNSALHQADLIDIYRTLHPKSTEYTFFSAPHHTYS KIDHILGSKALLSKCKRTEIITNYLSDHSAIKLELRIKNLTQNRSTTWKLNNLLLNDYWV HNKMKAEIKMFFETNENKDTTYQNLWDAFKAVCRGKFIALNAHKRKQERSKIDTLTSQLK ELEKQEQTHSKVSRSQEITKTRAELKEIETQKTLQKINESRSWFFERINKIDRLLAILIK KKREKNQIDVIKNDKGDITTDPTEIETTIREYYKHLHANKLENLEEMDKFLNTYTLPRLN QEEIESLNRPITGSEIVAIINSLPTKKSPGPDGFTAEFYQRYKEELHINRAKDKNHMIIS IDAEKAFDKIQQPFMLKTLNKLGIDGMYLKIIRAIYDKPTANIILNGQKLEAFPLKTGTR QGCPLSPLLFNIVLEVLARAIRQEKEIKGIQLGKEEVKLSLFADDMIVYLENPIVSAQNL LKLISNFSKVSGYKINVQKSQAFLYTNNRQTESQIMSELPFTIASKRIKPRNPTYKGREG PLQEELQTTAQGNKRGYKQMEEHSMLMGRKNQYRENGHTAQGNLQIQCHPHQGTNAFLHR IGKNYLKVDMEPKRSPHRQINPKPKEQSWRHHTT >gi568815590r:81378868_81583167|GENSCAN_predicted_CDS_4|1905_bp atgggagactttaacaccccactgtcaacattagacagatcaatgagacagaaagtcaac aaggataccgaggaattgaactcagctctgcaccaagcggacctaatagacatctacaga actctccaccccaaatcaacagaatatacatttttttcagcaccacaccacacctattcc aaaattgaccacatacttggaagtaaagctctcctcagcaaatgtaaaagaacagaaatt ataacaaactatctctcagaccacagtgcaatcaaactagaactcaggattaagaatctc actcaaaaccgctcaactacatggaaactgaacaacctgctcctgaatgactactgggta cataacaaaatgaaggcagaaataaagatgttctttgaaaccaacgagaacaaagacaca acataccagaatctctgggacgcattcaaagcagtgtgtagagggaaatttatagcacta aatgcccacaagagaaagcaggaaagatccaaaattgacaccctaacatcacaattaaaa gaactagaaaagcaagagcaaacacattcaaaagttagcagaagccaagaaataactaaa accagagcagaactgaaggaaatagagacacaaaaaacccttcaaaaaattaatgaatcc aggagctggttttttgaaaggatcaacaaaattgatagactgctagcaatactaataaag aaaaaaagagagaagaatcaaatagacgtaataaaaaatgataagggggatatcaccacc gatcccacagaaatagaaactaccatcagagaatactacaaacacctccacgcaaataaa ctagaaaatctagaagaaatggataaattcctcaacacatacactctcccaagactaaac caggaagaaattgaatctctgaatagaccaataacaggatctgaaattgtggcaataatc aatagcttaccaaccaaaaagagtccaggaccagatggattcacagccgaattctaccag aggtacaaggaggaactgcatataaacagagccaaagacaaaaaccacatgattatctca atagatgcagaaaaagcctttgacaaaattcaacaacccttcatgctaaaaactctcaat aaattaggtattgatgggatgtatctcaaaataataagagctatctatgacaaacccaca gccaatatcatactgaatgggcaaaaactggaagcattccctttgaaaactggtacaaga cagggatgcccgctctcaccactcctattcaacatagtattggaagttctggccagggca attaggcaggagaaggaaataaagggtattcaattaggaaaagaggaagtcaaattgtcc ttgtttgcagacgacatgattgtatatctagaaaaccccattgtctcagcccaaaatctc cttaagctgataagcaactttagcaaagtctcaggatacaaaatcaatgtacaaaaatca caagcattcttatacaccaacaacagacaaacagagagccaaatcatgagtgaactccca ttcacaattgcttcaaagagaataaaacctaggaatccaacttacaagggacgtgaagga cctcttcaagaagaactacaaaccactgctcaaggaaataaaagaggatacaaacaaatg gaagaacattccatgctcatgggtaggaagaatcaatatcgtgaaaatggccatactgcc caaggtaatttacagattcaatgccatccccatcaaggtaccaatgcctttcttcacaga attggaaaaaactacttgaaagttgatatggaaccaaaaaggagcccgcatcgccaaatc aatcctaagccaaaagaacaaagctggaggcatcacactacctga >gi568815590r:81378868_81583167|GENSCAN_predicted_peptide_5|500_aa MCDAFVGTWKLVSSENFDDYMKEVGVGFATRKVAGMAKPNMIISVNGDVITIKSESTFKN TEISFILGQEFDEVTADDRKVKSTITLDGGVLVHVQKWDGKSTTIKRKREDDKLVVVIII KNLTTKIISGPLGVTGVNFAARNMAGLVKPTVTISVDGKMMTIRTESSFQDTKISFKLGE EFDETTADNRKVKSTITLENGSMIHVQKWLGKETTIKRKIVDEKMVVECKMNNIVSTRIY EKNCVELSPITMSNKFLGTWKLVSSENFDDYMKALGVGLATRKLGNLAKPTVIISKKGDI ITIRTESTFKNTEISFKLGQEFEETTADNRKTKSIVTLQRGSLNQVQRWDGKETTIKRKL VNGKMVASTQMRGSQKNNSSNITKLVSLTLPKDHTSSPAMDSSQDEISELPEKEFRTLIL KLIREAPEKSELQLKEIKDVIQDMKGKFCEIGSINKKQSHLLEIKDTLREMQNALETLSN RIKQAEERSSELEDKAFELI >gi568815590r:81378868_81583167|GENSCAN_predicted_CDS_5|1503_bp atgtgtgatgcttttgtaggtacctggaaacttgtctccagtgaaaactttgatgattat atgaaagaagtaggagtgggctttgccaccaggaaagtggctggcatggccaaacctaac atgatcatcagtgtgaatggggatgtgatcaccattaaatctgaaagtacctttaaaaat actgagatttccttcatactgggccaggaatttgacgaagtcactgcagatgacaggaaa gtcaagagcaccataaccttagatgggggtgtcctggtacatgtgcagaaatgggatgga aaatcaaccaccataaagagaaaacgagaggatgataaactggtggtggtcatcataata aaaaatctcacaacaaagataatctctggacctcttggcgtcacaggagtgaatttcgca gcccggaacatggcagggttagtgaaaccgacagtaactattagtgttgatgggaaaatg atgaccataagaacagaaagttctttccaggacactaagatctccttcaagctgggggaa gaatttgatgaaactacagcagacaaccggaaagtaaagagcaccataacattagagaat ggctcaatgattcacgtccaaaaatggcttggcaaagagacaacaatcaaaagaaaaatt gtggatgaaaaaatggtagtggaatgtaaaatgaataatattgtcagcaccagaatctac gaaaagaactgtgttgagctctcacccatcacgatgagcaacaaattcctgggcacctgg aaacttgtctctagtgagaactttgacgattacatgaaagctctgggtgtggggttagcc accagaaaactgggaaatttggccaaacccactgtgatcatcagcaagaaaggagatatt ataactatacgaactgaaagtacctttaaaaatacagaaatctccttcaagctaggccag gaatttgaagaaaccacagctgacaatagaaagaccaagagcatcgtaaccctgcagaga ggatcactgaatcaagtgcagagatgggatggcaaagagacaaccataaagagaaagcta gtgaatgggaaaatggtagcgtccacccaaatgagagggagccagaaaaacaattctagt aatataacaaaacttgtttctttaacactcccaaaagatcataccagctcaccagcaatg gattcaagccaagatgaaatctctgaattgccagaaaaagaattcagaacgttgattctt aagctaatcagggaggcaccagagaaaagtgaactccaacttaaagaaatcaaagacgtg atacaagatatgaaaggaaaattctgtgaaataggtagcataaataaaaaacaatcacat cttctggaaataaaggacacacttagagaaatgcaaaatgcactggaaactctcagcaat agaatcaaacaagcagaagaaagaagttcagagcttgaagacaaggcttttgaattaatc taa >gi568815590r:81378868_81583167|GENSCAN_predicted_peptide_6|502_aa MTGRLAAERNYPLCQELQTMGQAACRKEPPTLGPPLCGELETTGQPAAERETEDRIAKSP APFRTLQNQSVSSRCHLLEPVLKGCPEMPSRIQGLESETLGIYVVLYSTAAKLAPKPQDE VLLTLSLSTSRGVSSLLWPPLPQAHRIPCIILICLNHMIAQKQTAPASQHSTLRAAFSEQ SAEILKQSIGRASRKLGRLAKPTVTISTDGDVITIKTKSIFKNNEISFKLGEEFEEITPG GHKTKSKVTLDKESLIQVQDWDGKETTITRKLVDGKMVVPGRPPIASRPSPLHRPLRPPQ GPPAAAPAPPLQPPLFSRRHGDPVHLAGAPGLQPGLRGRHQPPHQPGALRLLLLPVHVLY FDRDDVALKNFAKYFLYQSHEEREHAEKLMKLQNQRGGRIFLQDIKKPDCDDWESGLNAM ECALYLEKNVNQSLLELYKLATDKNDPHLCDFIETHYLNEQVKAIKELSDHVTNLRKMGA PESGSAEYLFDKHTLGDSDNES >gi568815590r:81378868_81583167|GENSCAN_predicted_CDS_6|1509_bp atgacaggacgactagctgcagagagaaactaccctctctgccaagagctgcagacgatg ggacaagctgcctgcagaaaggagccacccactctagggcctcctctctgcggagagctg gagacaacgggacaaccagctgcagagagagaaactgaagacagaattgctaagtcacct gctcccttcagaactctgcagaatcagtctgtttcttctagatgccatctcttggagcca gtgttgaaaggctgtccagagatgccatctagaatccagggcctagagtcagaaacctta ggaatctatgtggtcctctattctactgcagctaagctggcacccaagccacaagatgaa gttcttctcactctttccctttccacaagcaggggagtctcttctctcctgtggccacca ctgccccaggcccacagaattccgtgtattattttgatttgtctgaatcacatgattgct cagaagcagacagcaccagcaagccagcactcaaccctaagagcagccttcagtgagcag tctgctgaaattctaaaacagagtataggaagagccagcaggaaactgggccgtttggca aaacccactgtgaccatcagtacagatggagatgtcatcacaataaaaaccaaaagcatc tttaaaaataatgagatctcctttaagctgggagaagagtttgaggaaatcacgccaggt ggccacaaaacaaagagtaaagtaaccttagataaggagtccctgattcaagttcaggac tgggatggcaaagaaaccaccataacgagaaagctggtggatgggaaaatggtggtgccc ggccggccgcccatagccagccgtccgtcacctcttcaccgccccctcagaccgccccaa ggtccccccgccgccgctccagcgccgccgctgcagccgcctctctttagtcgccgccat ggcgaccccgtccacctcgcaggagcgccaggactacaaccaggactcagaggccgccat caaccgccacatcaacctggagctctgcgcctcctacttttacctgtccatgtcttatac tttgaccgcgatgatgtggctttgaagaactttgccaaatactttctttaccaatctcat gaggagagggaacatgctgagaaactgatgaagctgcagaaccaacgaggtggccgaatc ttccttcaggatatcaagaaaccagactgtgatgactgggagagcgggctgaatgcgatg gagtgtgcattatatttggaaaaaaatgtgaatcagtcactactggaactgtacaaactg gccactgacaaaaatgacccccatttgtgtgacttcattgagacacattacctgaatgag caggtgaaagccatcaaagaattgagtgaccacgtgaccaacttgcgcaagatgggagcg cccgaatctggctcggcagaatatctcttcgacaagcacaccctgggagacagtgataat gaaagctaa