GENSCAN 1.0 Date run: 3-Nov-116 Time: 00:20:33 Sequence gi568815588f:124701830_124934968 : 233139 bp : 45.28% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 PlyA - 25 20 6 -0.45 1.02 Term - 3872 3762 111 2 0 86 44 108 0.149 4.76 1.01 Init - 4884 4807 78 0 0 64 81 66 0.115 4.15 1.00 Prom - 12375 12336 40 -4.36 2.00 Prom + 24391 24430 40 -4.96 2.01 Init + 26199 26332 134 2 2 43 109 103 0.523 7.51 2.02 Intr + 30484 30598 115 1 1 101 99 -15 0.509 1.35 2.03 Intr + 31451 31540 90 1 0 82 62 72 0.462 4.19 2.04 Intr + 34187 34369 183 1 0 85 100 15 0.707 2.38 2.05 Term + 36236 36349 114 1 0 50 49 121 0.921 2.97 2.06 PlyA + 37029 37034 6 1.05 3.09 PlyA - 38049 38044 6 1.05 3.08 Term - 41065 40895 171 1 0 95 49 139 0.100 8.53 3.07 Intr - 41309 41122 188 0 2 88 21 61 0.036 -1.19 3.06 Intr - 63779 63563 217 2 1 45 115 101 0.409 6.58 3.05 Intr - 78989 78915 75 2 0 101 58 24 0.094 0.41 3.04 Intr - 85903 85890 14 2 2 134 91 10 0.313 0.40 3.03 Intr - 87929 87843 87 0 0 109 58 57 0.560 4.74 3.02 Intr - 88509 88454 56 2 2 34 67 65 0.224 -2.38 3.01 Init - 90004 89895 110 1 2 69 73 185 0.448 14.06 3.00 Prom - 90538 90499 40 -8.26 4.00 Prom + 95186 95225 40 -4.76 4.01 Init + 100001 100223 223 1 1 101 105 137 0.889 15.42 4.02 Intr + 101126 101210 85 0 1 88 41 42 0.008 -1.62 4.03 Intr + 117555 117621 67 0 1 27 95 74 0.085 0.91 4.04 Intr + 124766 124956 191 1 2 73 98 206 0.943 18.48 4.05 Intr + 126927 127046 120 0 0 100 106 68 0.998 9.41 4.06 Intr + 127564 127648 85 1 1 85 93 30 0.944 3.02 4.07 Intr + 129520 129634 115 0 1 106 100 61 0.998 9.12 4.08 Term + 132673 133142 470 2 2 29 36 245 0.967 8.54 4.09 PlyA + 133443 133448 6 1.05 5.00 Prom + 135093 135132 40 -4.56 5.01 Init + 143999 144160 162 1 0 56 78 226 0.526 18.13 5.02 Intr + 144473 144560 88 1 1 55 37 85 0.130 -0.36 5.03 Intr + 147556 147661 106 2 1 96 49 28 0.040 -1.03 5.04 Intr + 164666 164944 279 2 0 100 68 126 0.549 8.39 5.05 Intr + 165666 165819 154 0 1 32 24 144 0.642 2.47 5.06 Term + 165989 166435 447 1 0 12 38 699 0.632 52.52 5.07 PlyA + 166824 166829 6 1.05 6.02 PlyA - 171734 171729 6 1.05 6.01 Sngl - 215614 215063 552 1 0 84 40 305 0.719 19.42 6.00 Prom - 232519 232480 40 -2.86 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 42097 42441 345 0 0 28 43 266 0.887 10.59 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815588f:124701830_124934968|GENSCAN_predicted_peptide_1|62_aa MVMVLSESLSTRGADSIACGTFSRELASQSAFRVLGTRLEEFREQDELEPGPVLLTLEKL VV >gi568815588f:124701830_124934968|GENSCAN_predicted_CDS_1|189_bp atggtgatggtcctaagtgaaagcctcagcacccggggagctgactccattgcatgtggg accttcagccgtgaactggcctcacagagcgccttccgtgtgttgggcaccaggctggag gagttcagggaacaggatgagctggagccaggccctgtactcctcacccttgagaagctc gtggtctag >gi568815588f:124701830_124934968|GENSCAN_predicted_peptide_2|211_aa MMLTIQTPENTQTTKFHLGLKISMDSLGFAIQSHILIGTTPWLTRFHYPPRNRAGSATLS RRLTSAGEGLGQAGSWQTQGAPQVTCHTVNGAWNLGWHLRAVEASGALAAKSQPQEAPRW HAADHGLCSRTPACPVTLISYLALLCKALRPLQGASQTSSKLIFLKTLGGRTSKGWVPFR CHFTKTEQQQQQQQMPIGLLRSRIRKPPISC >gi568815588f:124701830_124934968|GENSCAN_predicted_CDS_2|636_bp atgatgctgacgattcagacccctgaaaacactcaaacgactaagtttcatcttggtttg aagatctccatggactcgcttggcttcgcgatccagtcacacattctcatcggcaccact ccctggctcactcggttccactatccacccagaaaccgggcaggatcagccacgctcagt cgtcgtcttacaagtgcaggggaagggcttggccaggcaggaagctggcagacccaggga gctcctcaggtcacatgccacacagtcaacggggcctggaatctgggctggcatctgagg gcggtggaggcctcaggcgcactggccgccaagagtcagccacaagaagcacctaggtgg catgcagccgaccatgggctttgcagcagaacccctgcctgcccagtcacactcatctcc tacctggcactgctctgcaaggcactacgcccattgcaaggggcttctcaaacaagctct aaactgatcttcctaaaaacgctgggtggcaggaccagcaagggatgggtgcccttcaga tgccacttcaccaagacagagcagcagcagcagcagcagcaaatgccaataggtttgctc cgctcccgaatacgaaagcccccaatctcctgctaa >gi568815588f:124701830_124934968|GENSCAN_predicted_peptide_3|305_aa MSSGADGGGGAAVAARSDKGSPGEDGFVPSALGTREHWDAVYERELQTFREYGDTERWGV ILSSRLEYSGMIIITRYILDLLGPGFVTQFAYDVCCGLDFFGLSYLVFTELFTSVEDFLN LSTQLSGFHICIDKGTFDAISLNPDNAIEKRKQYVKSLSRVLKVKGFFLITSCNWTKEEL LNEFSEAPSSPPPFSDLGNNACSTGRGAAAATNRADRFARVPRVRTPPEAELLPSDRPGS LTRTVLRTMATLRVFHTVQSPDSVVLGKWPPEWLAFRVWDPGVGICAVAFAAALALQERV QMEPG >gi568815588f:124701830_124934968|GENSCAN_predicted_CDS_3|918_bp atgagctcgggcgctgacggcggcggtggcgctgcggtggcggcgcggtcggacaagggc agtcccggggaggacggtttcgtcccgtcggcgctggggacccgcgagcattgggatgct gtctatgagagagaactgcaaactttccgagaatatggagatacagagaggtggggtgtc attctgtcatccaggctggagtacagtggcatgatcatcataactcgttacatcctggac ctcctgggcccaggctttgtcacccagtttgcttatgatgtgtgctgtggcttggatttc tttggcttatcctatttggtgttcactgaactttttacatctgtagaagactttttgaat ctctccacacagctgtctggatttcatatttgtattgacaaagggacttttgatgccata agccttaatcctgacaatgcaattgagaagaggaagcaatatgtgaaatctctctccagg gtgttgaaagtaaaaggcttttttctaataacgtcatgtaattggaccaaggaagagttg ctaaatgaattcagtgaagcccccagctctccacccccattttcggacttggggaacaat gcttgtagtacgggccggggcgccgcagcagcgacgaaccgcgcggacaggtttgcccgg gtccctcgcgttcggacgccccctgaagccgagctccttccatctgaccgcccgggctca ttaacacgtactgtacttaggacgatggccacactccgagtcttccatacggtacaatct ccggactcggtggtgttaggaaagtggcccccggaatggcttgcattccgggtctgggat cctggggtgggcatttgtgccgtcgccttcgccgccgcgctcgccttgcaggagcgggtg cagatggagccaggctga >gi568815588f:124701830_124934968|GENSCAN_predicted_peptide_4|451_aa MAASISGYTFSAVCFHSANSNADHVGAGPPAAPAGGFQPLSQAGARGQAGTSCGSPPGHQ GRPEELVTRAAPAPVLMVELNDRNRNIYIMGFSRSSFEDYGLGFYDYASKVNEESLDRIL KDRRKKVIGWYRFRRNTQQQMSYREQVLHKQLTRILGVPDLVFLLFSFISTANNSTHALE YVLFRPNRRYNQRISLAIPNLGNTSQQEYKVSSVPNTSQSYAKVIKEHGTDFFDKDGVMK DIRAIYQVYNALQEKVQAVCADVEKSERVVESCQAEVNKLRRQITQRKNEKEQERRLQQA VLSRQMPSESLDPAFSPRMPSSGFAAEGRSTLGDAEASDPPPPYSDFHPNNQESTLSHSR MERSVFMPRPQAVGSSNYASTSAGLKYPGSGADLPPPQRAAGDSGEDSDDSDYENLIDPT EPSNSEYSHSKDSRPMAHPDEDPRNTQTSQI >gi568815588f:124701830_124934968|GENSCAN_predicted_CDS_4|1356_bp atggcggcgtccatttcgggctacaccttcagtgctgtgtgtttccacagcgccaacagc aacgcggaccacgtaggtgccgggccccctgccgcgcccgctgggggctttcagcctctg tctcaggccggcgctcgcggccaagccgggacctcatgcggctcgccccctgggcaccag ggccggccggaggagctggtgacccgggcggctcccgcccccgtgttgatggtagaactt aatgataggaataggaatatatacatcatgggtttttccagaagcagctttgaagactat ggcctaggtttttatgactacgcaagcaaagtgaatgaggagagtttggacaggattctt aaagatcggagaaagaaagtcattgggtggtacagattccggcgcaatacgcagcagcag atgtcctacagagagcaggttcttcacaagcagctcacccgcatcctcggcgtgcccgac ctcgtctttcttctcttcagcttcatctccactgccaacaattccactcacgctttagaa tatgtgctcttcagaccaaatagaaggtataatcagaggatatcactcgctattcccaat ctaggaaatactagccagcaagagtacaaagtgtcttcagtgccaaatacttctcagagt tatgccaaagtgattaaagaacatggtactgacttttttgacaaggatggagtgatgaaa gacatcagggcgatttatcaggtttataatgcacttcaggagaaagttcaggcagtgtgt gcagatgttgaaaagagtgagcgagttgttgaatcttgtcaggcagaagtgaacaaatta agaagacaaatcactcagaggaaaaatgaaaaggaacaagaaagaagattgcagcaggca gtgttaagcagacagatgccgtctgaaagcttggacccagcgttcagtcctcggatgccg tcctctgggtttgcagctgaaggcagaagtacacttggagatgcagaggcctcggatcct cctcccccttactctgattttcacccaaacaatcaagaaagtactttgagccactctcgc atggaaaggagtgtctttatgcctcgacctcaagctgtgggctcttccaattatgcttcc accagtgccggactgaagtatcctggaagtggggctgaccttcctcctccccaaagagca gctggagacagtggtgaggattcagacgacagtgattatgaaaatttgattgaccctaca gagccttctaatagtgaatactcacattcaaaggattctcgacccatggcacatcccgac gaggaccccaggaacactcagacctcccagatttaa >gi568815588f:124701830_124934968|GENSCAN_predicted_peptide_5|411_aa MDMFVCIDDLGDSVVLPGRQQHGRMAAILILILQLTFLKMTSGDEVHGTSSILKALQTRR TEHSWMNSDCGLKKNHRGEIHWEGRDREGCIKQKPTSFLQLIHTFHLLSPRPACSEVPRY PPIGTFYCVLKGVQGPGSVFDIVANSHKYPEHLIPSKMGLSPACLPSSYDPGKDCSGRCP LCGWEASEARLQAHQRVCGRGHVAAIFCLLVSVCRHPMEDSMDMHMSPLRPQNYLFSCEL KANKDDHFKVDNDENEHQLSLRTCGSGPVHISGQHLVAVEEDAESEDEEEESAKLLSISG KQSVPGGGSKVPQKQVKLAADEDDDDHDEDDGDDEDEDDEEDDDEDEDGGDDEEDDDEDD EETEEKVPVKKSIRDTSAKNAQKSNQNGKDSKPSTPRSKRQESFKKTGKNS >gi568815588f:124701830_124934968|GENSCAN_predicted_CDS_5|1236_bp atggacatgttcgtgtgcatcgatgacctgggcgactctgtggtcctgcctggccgtcag cagcatggacgcatggccgccatcctcatcctcattctccagttgacatttctaaaaatg acctccggtgatgaagtccatgggaccagcagcattttgaaagctttacagacgagaaga actgaacactcctggatgaacagcgactgtggcttaaagaagaaccacaggggagaaatc cactgggagggacgtgacagggaaggctgcatcaagcagaagccaacctcatttctgcag ctcatccacaccttccacttgctcagtcctcggcctgcctgctcagaggtacccaggtac cccccgatagggacgttttactgcgttctgaaaggtgtgcagggacctggttcagtcttt gacatcgtggccaacagtcacaagtaccccgaacatctcattccttccaaaatggggttg tcgcctgcgtgccttccttcttcctatgaccctggaaaagattgctccggaaggtgccca ctatgtggatgggaagcatcagaggcaagactccaggcacatcagcgtgtctgtggacgt ggacatgtggctgccatcttctgcctcttggtcagtgtatgccgccacccgatggaagat tcgatggacatgcacatgagtcctctgaggccccagaactatcttttcagttgtgaactt aaggccaacaaagatgatcactttaaggtggataatgatgaaaatgagcaccagttatct ttaagaacatgtggctcagggccagtgcatattagtggacagcacttagtagctgtggag gaagatgcagagtcagaagatgaagaggaggagagtgcgaaactcttaagtatatctgga aagcaatctgtccctggaggtggtagcaaggttccacagaaacaagtaaaacttgctgct gatgaagatgatgatgatcatgatgaagatgatggtgatgatgaagatgaagatgatgaa gaagatgatgatgaagatgaagatggtggtgatgatgaagaagatgatgatgaagatgac gaggaaactgaagaaaaagtgccagtgaagaaatctatacgagatacttcagccaaaaat gcacaaaagtcaaatcagaatggaaaagactcaaaaccgtcaacaccaagatcaaaaaga caagaatccttcaaaaaaacaggaaaaaactcctaa >gi568815588f:124701830_124934968|GENSCAN_predicted_peptide_6|183_aa MEAPTAAARAPALPAPATRSPLGRRAGGQPVRPGNLGAAWERGARQAAVGRPPGPGYLAA AAPPPAPAPPPRPTGAWRRRRRRRRRRQRRLPAQPAATSRARPLSASLRAAARPAAAASA LSPQSGHGLRGSARGSRLPEAGGEEQRGGAETGEPRGAGALRCGRRPRALRTLMAAAEGP GAK >gi568815588f:124701830_124934968|GENSCAN_predicted_CDS_6|552_bp atggaagcgccgacagcggcggcccgggcccctgccctccccgcgcctgcgacgcggtcg cccctggggcgccgggcgggcggacagcccgtgaggcccggcaacctgggcgccgcctgg gagcgaggggcccggcaggccgcggtggggcgccccccggggcccggttacctggcggcg gcggctcctcctccagctcctgctcctcctccccgtccgacgggcgcttggcggcggcgg cggcggcggcggcggcggcggcagcggcgactccccgcgcagcccgcggctacgagtcga gcccggcccctctccgcctcccttcgcgccgccgcccgccccgctgctgccgcctccgcc ctctcccctcagtcaggacatggtctccgagggtccgcgcgcggctcccggctgccggag gccgggggagaggagcagagaggaggcgcggagactggggagccccggggggcgggggca ctgaggtgtgggaggcggccgcgggcgctgcggaccctaatggcggcggccgagggaccc ggagccaaataa