GENSCAN 1.0 Date run: 4-Nov-116 Time: 16:54:12 Sequence gi568815596r:234396011_234596586 : 200576 bp : 45.58% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 8954 9045 92 1 2 54 47 81 0.600 0.62 1.02 Term + 10196 10415 220 2 1 88 45 183 0.880 10.51 1.03 PlyA + 10961 10966 6 1.05 2.03 PlyA - 11785 11780 6 1.05 2.02 Term - 14798 14650 149 0 2 47 42 123 0.574 1.66 2.01 Init - 20948 20753 196 2 1 66 99 72 0.596 5.28 2.00 Prom - 22273 22234 40 -4.56 3.00 Prom + 26142 26181 40 -5.46 3.01 Init + 32614 32771 158 0 2 63 94 52 0.138 2.70 3.02 Intr + 51657 51819 163 0 1 93 49 116 0.629 8.18 3.03 Intr + 66500 66709 210 1 0 66 81 61 0.052 2.31 3.04 Intr + 67030 67118 89 0 2 93 48 10 0.013 -3.73 3.05 Intr + 77327 77538 212 2 2 121 54 136 0.156 12.06 3.06 Intr + 88775 88898 124 0 1 96 59 68 0.107 4.34 3.07 Term + 92109 92187 79 0 1 78 34 44 0.016 -4.76 3.08 PlyA + 92546 92551 6 1.05 4.09 PlyA - 93512 93507 6 1.05 4.08 Term - 100866 99998 869 1 2 -27 38 1666 0.047 142.93 4.07 Intr - 129233 129063 171 0 0 60 81 60 0.065 2.41 4.06 Intr - 135851 135696 156 0 0 79 103 36 0.255 4.18 4.05 Intr - 152300 152260 41 1 2 84 96 8 0.004 -0.93 4.04 Intr - 161218 161025 194 1 2 54 105 59 0.034 2.49 4.03 Intr - 165230 165125 106 1 1 82 107 61 0.046 7.62 4.02 Intr - 172207 172081 127 2 1 74 66 17 0.048 -2.16 4.01 Init - 174998 174944 55 2 1 75 87 28 0.077 2.85 4.00 Prom - 175156 175117 40 -8.96 5.06 PlyA - 175186 175181 6 1.05 5.05 Term - 175716 175613 104 1 2 92 44 108 0.923 5.24 5.04 Intr - 183735 183512 224 2 2 99 53 102 0.402 5.57 5.03 Intr - 184853 184629 225 1 0 21 34 194 0.612 4.40 5.02 Intr - 187655 187362 294 1 0 58 77 132 0.145 5.22 5.01 Init - 195986 195904 83 2 2 82 44 49 0.570 0.34 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 141384 141229 156 0 0 103 60 107 0.864 9.31 S.002 Term - 195290 195071 220 1 1 108 55 102 0.884 5.41 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596r:234396011_234596586|GENSCAN_predicted_peptide_1|103_aa MVDCKRWHPDGVFAISLRMPFSSTCPIARLPHIPHRNACTDVTRDGHRHACSSIIHDRHQ LATTQMSTNSTMDEYIAAYPYNGRQHAQSLLPKLYEKSFYPVT >gi568815596r:234396011_234596586|GENSCAN_predicted_CDS_1|312_bp atggtagattgcaagagatggcatccggatggggtctttgctatcagcctgcggatgccc ttcagcagcacctgtcccatcgcaagacttccgcatataccccacagaaatgcttgcaca gatgttacaagagatggacacaggcatgcatgtagcagcattattcatgaccgccaccag ctggcaacaacacaaatgtccaccaactccacgatggatgaatacatcgcagcgtatccc tacaatggaagacagcacgcgcagtctttattgccaaagctctatgagaagtctttctat ccagtaacatga >gi568815596r:234396011_234596586|GENSCAN_predicted_peptide_2|114_aa MVDERAGWGRSYGAASPLAAHRGGWPYYYHSPSDVNIVMMMCLMQKLEKFLTELTCCLKA KTGQTGASEKECELCREANEECVHLLASTHGSEKDIPGTGALKAEDKHDGYQRT >gi568815596r:234396011_234596586|GENSCAN_predicted_CDS_2|345_bp atggtggacgagagggcaggctggggcagaagttatggcgctgccagcccattggcagcc cacaggggtggttggccatactactatcactctccgtctgatgttaatatcgtgatgatg atgtgcttgatgcagaaactagagaaattcctcacggagcttacctgttgtctaaaagcg aaaactggtcaaactggggcttcagagaaagaatgtgagctatgcagagaagcgaatgag gaatgtgttcacctcttagcctcaacccatgggtcagaaaaagacatccctggaactgga gctttgaaagcagaagataaacatgatggatatcagagaacttaa >gi568815596r:234396011_234596586|GENSCAN_predicted_peptide_3|344_aa MFSFAHSLTQQISQHIPRVPGIQGKPVVRLKSIKRLNSHQSTSSWSIAFEHGSKAHSQGE GVKLQQLSLLSIITKAKDSAIYLNRTAYRIFQSEIRNVKRFSHGNYRIIGACFSHFSQQL SPGQPSRLPLLPPYPTAQGSIVAISNWSFWCALESIPWAYAIFQLKNALQTSLWLLKRLH TLQGFGIDLFFCPQQPPGSQGIGACSWLRRTLWQHAAYLPAFMHGCTPQTPWSPAIIQLI PSWYSPGPPCCLLLLKSIHAKLANSTCLMPPLLLRDLGGCGGQNNGPPNAHVLIAETCEY VTPHGIKNAVDVIKLRTLRKSLKELEKDFEGGRILHMAGTRTTE >gi568815596r:234396011_234596586|GENSCAN_predicted_CDS_3|1035_bp atgttttcatttgcccactcactcactcagcaaatttcacagcatattccccgtgtgcca ggtattcagggaaagcccgttgtccgtctgaagtctattaagaggttaaatagtcatcag tccacttcaagctggtcgatcgcctttgagcatgggagcaaagcccacagccagggagag ggggtcaaactgcagcagctctcacttctgagcatcataacgaaagccaaagacagcgcc atttacctaaatcggacagcctacagaatcttccagtctgaaataagaaatgtcaagcgt tttagccatgggaattataggatcataggagcatgtttctcccacttcagtcagcagctt tcccctggacagccttccaggcttccactactgcccccctaccccactgcccagggttca atagttgccatttccaactggtccttctggtgtgccctggagagcattccctgggcctac gcaatcttccaacttaagaatgctctgcagacatcactctggttgctcaagaggctgcac acattacaaggttttggaattgatttgtttttctgcccacagcagcctccaggctctcag ggcattggagcctgttcctggctccggcgcaccctctggcagcacgctgcctatctgcca gccttcatgcatggctgcactccacagacgccctggtcaccagccatcatccagctgatc cccagctggtactcaccgggaccaccatgctgcttgttactgttgaaatccatccatgcc aagttggccaatagcacctgcctgatgcccccgctcctactcagggaccttggagggtgt ggtgggcagaataatggtcccccaaatgctcatgtcctaattgctgaaacctgtgaatat gttaccccacacggcataaagaacgctgtggatgtgattaagctgaggaccttgaggaaa tctctgaaagagttggagaaggattttgaggggggcagaattcttcacatggcaggtacc aggaccacagagtaa >gi568815596r:234396011_234596586|GENSCAN_predicted_peptide_4|572_aa MGLTCFYEVPVVFSGFSVGPHPEACMLGGCIPSLDEGAATERLHRKHILPLKSVGNGGMI PYPQSDLQETKTQRKSVISSSGFSPQEFMAASILELPTFSSWKCELGRWTRPQGAGCLVG PSEKGTSCQGGPGHLPLCRCDVHNCWDTKHFPSELATEQTCGKPRGHELWTLWSDIKQCD VGVWVEACRTEACLVQQFQERRGEDQAKPTPGKSTGVHTCKNGRVHDSFWLISTPMSEPS SGELPSVLAQATITKYYKLGGLNNRHLFSRSSGDWKSWMKIRQERAQAWGGRRTSEAASR ETKPRVETRCPPIASPAGEGAGTAASERAKPCARGDAEPRGRCGRRRAGPRCARPRRARD CSGGGALPPQPQWLAPQPGAMGNISSNISAFQSLHIVMLGLDSAGKTTVLYRLKFNEFVN TVPTIGFNTEKIKLSNGTAKGISCHFWDVGGQEKLRPLWKSYSRCTDGIIYVVDSVDVDR LEEAKTELHKVTKFAENQGTPLLVIANKQDLPKSLPVAEIEKQLALHELIPATTYHVQPA CAIIGEGLTEGMDKLYEMILKRRKSLKQKKKR >gi568815596r:234396011_234596586|GENSCAN_predicted_CDS_4|1719_bp atgggactcacctgcttctatgaggttcctgtggttttttctgggttctctgttgggccc cacccagaggcatgtatgctgggtggctgcattccatcactcgacgaaggagctgctaca gagcgtctccatcggaagcacatcctccctctgaaatcagtgggtaatggtgggatgata ccgtatcctcagtcagatctccaagagaccaagactcagaggaagagcgttattagcagt tcagggttctcaccccaagagttcatggcagcttcgatcctggaactgcccaccttctct tcttggaagtgtgaacttgggagatggaccaggcctcagggtgcaggatgcctcgtgggg ccatctgaaaaaggcacatcctgccagggaggaccaggccacctgcccctctgccggtgt gatgtccataactgctgggacaccaagcattttccttccgagctggccactgagcaaacc tgtggaaagcccagaggccacgagctttggacgctgtggtcagatataaaacaatgtgat gtaggagtttgggttgaggcctgcaggactgaagcctgtctggttcagcagttccaggag aggaggggagaggatcaagccaaacccactcctggaaaaagcacaggagtccatacgtgc aagaatgggagggtgcacgacagcttctggctgatttccacaccaatgtctgaaccctct tctggggagctgccatctgtcctagctcaggctaccataacaaaatactacaagctgggt ggcttaaacaatagacacttattttctcgcagttctggggactggaagtcctggatgaag atccggcaagagcgagcccaggcctggggagggcgccgaacatctgaggcggcttcgcgg gagacaaagccgcgcgtagagacgcgatgcccgccgatcgcgagcccggccggcgagggc gcggggactgcggcgtctgagcgcgccaagccgtgcgcccgcggggacgccgagccccgg ggccggtgcgggcggcggcgggcggggcccaggtgcgcccggccgcgtcgggcccgtgac tgctcggggggcggcgccctcccgccgcagccgcagtggctggcgccgcagccaggagcc atgggcaacatctcctctaacatctcggccttccagtccctgcatatcgtcatgttgggc ttggactcggccggcaagaccacggtgctctaccggctcaagttcaacgagttcgtgaac acggtgcccaccatcggcttcaacaccgagaagatcaagctgagcaacggcacggccaag ggcatcagctgccacttctgggacgtgggcggccaggagaagctgcggccgctgtggaag tcctacagccgctgcacggacggcatcatctacgtggtggactcggtggacgtggaccgg ctggaggaggccaagacggagctgcacaaggtgaccaagttcgccgagaaccagggcacg ccgctgctggtcatcgccaacaagcaggacctgcccaagtcgctgccggtggcagagatt gagaagcagctggcgctgcacgagcttatcccggccaccacctatcacgtccagccggcg tgcgccatcatcggcgagggcctcaccgagggcatggacaagctctatgagatgatcctg aaacgcaggaagtccctcaagcagaagaagaagcggtaa >gi568815596r:234396011_234596586|GENSCAN_predicted_peptide_5|309_aa MAATSTVGSLHGYHLQEVVSFATSKVNRTQCKILGTFIQIPVNCKLNAYEIRVGPIRSLG RHEVPEYVSEGSQRNGQRAHGAYNNSESLLVVGMLPEEADLLAHSPGAGKETLDLPSNLL WKQEVWPGNQSAARSIQQGESLKCDRGAHGYQRGSTYLCLGTFTDGNKVIASFKSCKALL DIKKASSKFLVLYQAWPTVTIDSGPQAKSKPSQMLLVDVQRGSESWRILNQSQEHGLEVT LIPSAHNSGARTGPLTLLNTRGPGQTVLNILPLSEELEAVNIATNPADYIGGGFAYIADG GGVLGKHKA >gi568815596r:234396011_234596586|GENSCAN_predicted_CDS_5|930_bp atggctgctacatccactgttggttctcttcacgggtaccatttgcaggaagtagtcagt tttgccacaagcaaagtaaacagaacccaatgcaagatacttggcacttttatacagatt cctgttaactgcaagctaaatgcttatgaaattcgggttggaccaatcagaagcttggga aggcacgaggtccctgagtatgttagtgaaggttcccagagaaatggccaaagagcacat ggtgcctacaataactctgaaagtttgctggtggtagggatgcttccagaagaagcagat cttctagcacattcacctggtgctgggaaagagaccttggaccttcccagcaacctactg tggaagcaagaggtgtggcctgggaaccagtcagcagcaaggtccatacagcagggtgag tctctgaagtgtgaccgaggtgctcatggctatcagaggggcagcacctatctgtgtctt ggaaccttcactgatgggaacaaagtcattgcttccttcaagagctgcaaagccttgttg gacatcaagaaagcatcctcaaagttcctggtactttaccaagcatggcccacagtgacc attgactcagggcctcaggctaagtcaaaaccctcccagatgctgctggtggacgtgcag aggggaagtgagtcctggaggatcttaaaccagagccaagagcatggcctagaggtgaca ctcatcccttctgctcacaactcaggggccagaacgggacctctgaccctactcaacacc agggggccaggacaaacagtcctgaacatcctgccattatcggaagaattagaagccgtc aacatagctacgaacccagcagattatattggtggcggctttgcatacattgctgatggg ggaggagttctaggaaagcataaagcatag