GENSCAN 1.0 Date run: 7-Nov-116 Time: 15:50:32 Sequence gi568815578r:49407923_49668116 : 260194 bp : 47.24% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 16697 16767 71 2 2 56 113 31 0.094 1.23 1.02 Intr + 19078 19309 232 2 1 94 64 71 0.321 2.13 1.03 Intr + 19951 20151 201 1 0 64 78 135 0.718 8.50 1.04 Intr + 36304 36462 159 1 0 108 74 53 0.461 5.00 1.05 Intr + 37497 37688 192 0 0 52 45 102 0.266 0.91 1.06 Intr + 41181 41340 160 0 1 51 110 55 0.404 3.99 1.07 Intr + 42536 42728 193 1 1 72 15 81 0.024 -1.73 1.08 Intr + 47326 47400 75 2 0 85 94 42 0.063 3.99 1.09 Intr + 53112 53275 164 1 2 62 36 86 0.066 0.49 1.10 Term + 55297 55347 51 0 0 148 49 6 0.092 0.13 1.11 PlyA + 55673 55678 6 1.05 2.06 PlyA - 58655 58650 6 1.05 2.05 Term - 60708 60550 159 0 0 85 42 107 0.546 3.74 2.04 Intr - 74584 73992 593 2 2 -2 113 1089 0.001 94.72 2.03 Intr - 76408 76020 389 0 2 85 84 131 0.004 6.83 2.02 Intr - 93937 93772 166 2 1 94 100 49 0.522 5.72 2.01 Init - 95987 95903 85 2 1 89 77 54 0.909 5.42 2.00 Prom - 97585 97546 40 -6.16 3.11 PlyA - 98452 98447 6 1.05 3.10 Term - 100142 99998 145 1 1 106 47 344 0.999 29.48 3.09 Intr - 103257 103106 152 0 2 71 79 113 0.999 7.66 3.08 Intr - 105339 105158 182 1 2 89 84 163 0.999 15.59 3.07 Intr - 106473 106305 169 0 1 105 77 107 0.899 10.82 3.06 Intr - 116317 116136 182 2 2 123 94 141 0.953 17.79 3.05 Intr - 131799 131648 152 2 2 117 94 222 0.986 25.51 3.04 Intr - 136526 136383 144 1 0 93 94 101 0.722 10.60 3.03 Intr - 140097 139919 179 0 2 92 74 196 0.996 17.32 3.02 Intr - 142267 142144 124 0 1 53 93 119 0.855 9.49 3.01 Init - 160194 160121 74 0 2 98 101 238 0.991 24.64 3.00 Prom - 167582 167543 40 -4.96 4.15 PlyA - 170967 170962 6 1.05 4.14 Term - 180958 180854 105 1 0 90 45 59 0.117 0.21 4.13 Intr - 195486 195347 140 1 2 43 58 74 0.073 0.08 4.12 Intr - 199692 199554 139 0 1 107 70 97 0.966 9.84 4.11 Intr - 203399 203290 110 0 2 45 73 63 0.015 0.50 4.10 Intr - 206784 206725 60 1 0 73 90 54 0.030 2.81 4.09 Intr - 213071 212954 118 2 1 59 42 81 0.039 0.64 4.08 Intr - 228537 228405 133 2 1 115 66 172 0.226 18.35 4.07 Intr - 229520 229419 102 1 0 70 81 89 0.975 5.69 4.06 Intr - 231878 231756 123 1 0 87 82 83 0.980 7.30 4.05 Intr - 232743 232556 188 0 2 59 65 27 0.921 -4.01 4.04 Intr - 234662 234546 117 2 0 95 89 107 0.999 12.16 4.03 Intr - 235728 235604 125 1 2 95 97 50 0.928 6.90 4.02 Intr - 239156 239043 114 0 0 109 61 25 0.763 2.32 4.01 Intr - 248780 248646 135 0 0 107 76 188 0.980 20.04 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 203387 203290 98 2 2 74 73 149 0.891 9.78 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815578r:49407923_49668116|GENSCAN_predicted_peptide_1|499_aa XCCLSLHPAAGAEQPIWDDRDIFNGNGSRYGHVVHDHSGWLRDGHMMQSRPMHVNHRALR NYWYPLELLGWLDTSLGLLRAIFAMTWGAACEDANTKEKRSRLLPTSLFFVRNTHSRGAL FSAVAQQSGLDHQAGFRYGKEPADVMESCQLGQELRGDDVIIKFQKTWRHEDDPLWPQLQ QNMQGLEDTLQSSPVPGAPKVSWFCPWSPSRRGDEGEQEPARAVEFLTGGGGGRLLMLTR EPVLPPVAGCPRPGPCSGNLPQCLSNTGVKSLKWFPKVLSRIAGPGSGQSERCIPLATTI GRVWACDPNQANQSEPWDLSWDYRKEKLQHSNGAPPSAQTRAPACMLLASNYQPARRMKH GSLMLLRGANRCWQKSIPVDKSVDTCTLIRAHLAPKYRNESVSRSLWLSGKRVSVDAGEQ RESNCNSPVGTNSTLVVTAVTNKLLHFPTFCCQPISRAGKPQALGTFFCPLPDSVCLSLD DGAVGASEARNAPPTFPAT >gi568815578r:49407923_49668116|GENSCAN_predicted_CDS_1|1500_bp nnatgctgcctgtccctacatccggctgctggggctgagcagccaatttgggacgaccgt gatatatttaacggaaatggctccaggtatgggcatgtggttcatgaccacagtggttgg ctcagagatggccatatgatgcaatccaggccaatgcatgtcaaccacagagctcttagg aactactggtacccactggagctgctgggctggttagatacaagcctggggctgctgaga gccatttttgctatgacttggggagctgcctgcgaggatgccaacacaaaggaaaaaagg tccaggctgctgcctacatctttgttcttcgtgaggaacacacactcccggggggccctt ttctcagctgtggcccagcagtcgggtctggaccaccaggctgggttccgttatgggaaa gaaccagcagatgtaatggaatcatgccaattaggccaggagctgcgtggggatgatgtt atcattaagttccagaagacgtggagacatgaggatgaccctctctggccccagctgcag cagaacatgcaaggattggaggacactctccaatcttcccctgtcccaggggctcctaaa gtgtcctggttttgcccttggagcccaagtagaagaggggacgagggggagcaggagcca gccagggctgtggagttcctgacaggtggaggtggtggacgtctgctgatgttgaccagg gagcccgtgctcccaccggtggctggctgccccagacctggcccctgttctgggaatctg ccccagtgtctgagcaacacaggggtgaagagtttgaaatggttccccaaggtgctttcc aggattgcagggcctgggtctggccaatcagaacggtgcattcccctggcaacaacgatt ggcagagtctgggcatgtgaccccaatcaagccaatcagagtgaaccctgggacttgtca tgggactaccggaaagagaagctccagcattctaatggagctccgccttcagcacagacc agggcccctgcatgcatgctgttagcatctaactaccagccagcacggcgcatgaagcac ggttcattaatgctcttgaggggagcaaacaggtgctggcagaaatctattcccgtggac aaatctgtggacacgtgcaccctcatacgtgcccacttagcccccaagtataggaatgag tcagtgtcacgatccctctggctttcagggaagcgtgtgagtgtggatgcaggtgagcag cgagaaagtaactgtaatagtccagttggcaccaacagtacacttgtggtgaccgcagtg accaacaaactgttgcacttccccacgttctgctgccagcccatctcccgggctggaaag ccgcaagcccttggcactttcttctgccccttgcctgacagtgtttgcctttctttggat gatggagcagtgggggcctcagaggcaagaaatgcacccccaacatttccagccacgtga >gi568815578r:49407923_49668116|GENSCAN_predicted_peptide_2|463_aa MLKSIKPSLFVELPLSVILMEGIGRAWNGPSDLQLQDFGLSYPCTIKVNRKTQCFILPGK DIKKYIYSYCQLMKGYFNTPKAERWRRPALVPGSGRSEVRDRIAASAEPKLSPDLLISPR QPPPPPPREQEEEEVEEVPRGRGPAPAPPAPAGAPRLGERAPAPRRLQPGRVACPAGVGP AQPGPGTAAAHGRGRPGGRDDARSLAPGGGRPGGGAGAAGSAMPAGMTKHGSRSTSSLPP EPMEIVRSKACSRRVRLNVGGLAHEVLWRTLDRLPRTRLGKLRDCNTHDSLLEVCDDYSL DDNEYFFDRHPGAFTSILNFYRTGRLHMMEEMCALSFSQELDYWGIDEIYLESCCQARYH QKKEQMNEELKREAETLREREGEEFDNTCCAEKRKKLWDLLEKPNSSVAAKAALTAPSSV HRKPDVHTFLAQNIVKIHLWVVLWMPGLAYRETGKLKRAGCAA >gi568815578r:49407923_49668116|GENSCAN_predicted_CDS_2|1392_bp atgctgaaatctattaaaccatctttgtttgtggaactgcctctgagtgtgattctgatg gagggcattggccgtgcctggaatggtcctagtgatctgcagctccaagactttggcctt tcatacccttgtaccattaaagtcaacagaaagacccagtgttttattttgccaggtaag gacattaagaaatacatttactcctactgtcagcttatgaaaggatattttaatacccct aaagcagaaaggtggcgccgtcctgccttggtgcccggctccgggcgctccgaagtgcgg gacagaatagcagcgagcgcggagccgaagctcagccccgacttgttgatctcgcccagg cagccgccgccgccgccgccgcgggagcaggaggaggaggaggtggaggaggtgccgcgc ggccgcggtcctgcccccgcgcccccggcgcccgccggcgccccccgcctcggagagcgc gccccggccccgcggcgcctgcagcccgggcgagttgcctgcccggccggggtaggcccg gctcagcccggccccgggaccgccgctgcccacgggcgggggcgcccgggcggccgggac gatgcgcggtccctggccccaggaggcggccgccctggggggggggcgggggcggccgga tcagcgatgccggcgggcatgacgaagcatggctcccgctccaccagctcgctgccgccc gagcccatggagatcgtgcgcagcaaggcgtgctctcggcgggtccgcctcaacgtcggg gggctggcgcacgaggtactctggcgtaccctggaccgcctgccccgcacgcggctgggc aagctccgcgactgcaacacgcacgactcgctgctcgaggtgtgcgatgactacagcctc gacgacaacgagtacttctttgaccgccacccgggcgccttcacctccatcctcaacttc taccgcactgggcgactgcacatgatggaggagatgtgcgcgctcagcttcagccaagag ctcgactactggggcatcgacgagatctacctggagtcctgctgccaggcccgctaccac cagaagaaagagcagatgaacgaggagctcaagcgtgaggccgagaccctacgggagcgg gaaggcgaggagttcgataacacgtgctgcgcagagaagaggaaaaaactctgggaccta ctggagaagcccaattcctctgtggctgccaaggcagcactgactgctccctcctctgtg caccggaaacctgacgttcataccttcctcgcacaaaacatcgtgaaaattcacttgtgg gttgttctgtggatgccaggcctggcctacagagaaacaggaaagcttaagcgtgcaggc tgtgcagcctga >gi568815578r:49407923_49668116|GENSCAN_predicted_peptide_3|500_aa MAWAALLGLLAALLLLLLLSRRRTRRPGEPPLDLGSIPWLGYALDFGKDAASFLTRMKEK HGDIFTILVGGRYVTVLLDPHSYDAVVWEPRTRLDFHAYAIFLMERIFDVQLPHYSPSDE KARMKLTLLHRELQALTEAMYTNLHAVLLGDATEAGSGWHEMGLLDFSYSFLLRAGYLTL YGIEALPRTHESQAQDRVHSADVFHTFRQLDRLLPKLARGSLSVGDKDHMCSVKSRLWKL LSPARLARRAHRSKWLESYLLHLEEMGVSEEMQARALVLQLWATQGNMGPAAFWLLLFLL KNPEALAAVRGELESILWQAEQPVSQTTTLPQKVLDSTPVLDSVLSESLRLTAAPFITRE VVVDLAMPMADGREFNLRRGDRLLLFPFLSPQRDPEIYTDPEVFKYNRFLNPDGSEKKDF YKDGKRLKNYNMPWGAGHNHCLGRSYAVNSIKQFVFLVLVHLDLELINADVEIPEFDLSR YGFGLMQPEHDVPVRYRIRP >gi568815578r:49407923_49668116|GENSCAN_predicted_CDS_3|1503_bp atggcttgggccgcgctcctcggcctcctggccgcactgttgctgctgctgctactgagc cgccgccgcacgcggcgacctggtgagcctcccctggacctgggcagcatcccctggttg gggtatgccttggactttggaaaagatgctgccagcttcctcacgaggatgaaggagaag cacggtgacatctttactatactggttgggggcaggtatgtcaccgttctcctggaccca cactcctacgacgcggtggtgtgggagcctcgcaccaggctcgacttccatgcctatgcc atcttcctcatggagaggatttttgatgtgcagcttccacattacagccccagtgatgaa aaggccaggatgaaactgactcttctccacagagagctccaggcactcacagaagccatg tataccaacctccatgcagtgctgttgggcgatgctacagaagcaggcagtggctggcac gagatgggtctcctcgacttctcctacagcttcctgctcagagccggctacctgactctt tacggaattgaggcgctgccacgcacccatgaaagccaggcccaggaccgcgtccactca gctgatgtcttccacacctttcgccagctcgaccggctgctccccaaactggcccgtggc tccctgtcagtgggggacaaggaccacatgtgcagtgtcaaaagtcgcctgtggaagctg ctatccccagccaggctggccaggcgggcccaccggagcaaatggctggagagttacctg ctgcacctggaggagatgggtgtgtcagaggagatgcaggcacgggccctggtgctgcag ctgtgggccacacaggggaatatgggtcccgctgccttctggctcctgctcttccttctc aagaatcctgaagccctggctgctgtccgcggagagctcgagagtatcctttggcaagcg gagcagcctgtctcgcagacgaccactctcccacagaaggttctagacagcacacctgtg cttgatagcgtgctgagtgagagcctcaggcttacagctgcccccttcatcacccgcgag gttgtggtggacctggccatgcccatggcagacgggcgagaattcaacctgcgacgtggt gaccgcctcctcctcttccccttcctgagcccccagagagacccagaaatctacacagac ccagaggtatttaaatacaaccgattcctgaaccctgacggatcagagaagaaagacttt tacaaggatgggaaacggctgaagaattacaacatgccctggggggcggggcacaatcac tgcctggggaggagttatgcggtcaacagcatcaaacaatttgtgttccttgtgctggtg cacttggacttggagctgatcaacgcagatgtggagatccctgagtttgacctcagcagg tacggcttcggtctgatgcagccggaacacgacgtgcccgtccgctaccgcatccgccca tga >gi568815578r:49407923_49668116|GENSCAN_predicted_peptide_4|569_aa XNTYLFMMQAQGILIRDNVRTIGAQVYEQVLRSAYAKRNSSVNDSDYPLDLNHSETFLQT TTFLPEDFTYFANHTCPERLPSMKGPIDINMSEIGMDYIHELFSKDPTIKLGGHWKPSDC MPRWKVAILIPFRNRHEHLPVLFRHLLPMLQRQRLQFAFYVVEQVGTQPFNRAMLFNVGF QEAMKDLDWDCLIFHDVDHIPESDRNYYGCGQMPRHFATKLDKYMYLLPYTEFFGGVSGL TVEQFRKINGFPNAFWGWGGEDDDLWNRVQNAGYSVSRPEGDTGKYKSIPHHHRGEVQFL GRYALLRKSKERQGLDGLNNLNYFANITYDALYKNITVNLTPELAQGTQSKTLIDQKSGT SRTHHSVASPFSTDDDSGKVPEPLAGPEVGAGDEAVEYPSYINSQGSPRDDPVPARPGCG SGAPWRLHGALQQSPKSLAMGKVKHHDSEKASDLPKITGPTESQPECKPGPSDLEYALFT MAPGTKTPVLEAGSPSSRCRLSGLISVKASLFGLQMATLLLPLHTVIPLCMCAPEEVTHL TSLEFALLHLIPMVLNQGSFASQDTFSNV >gi568815578r:49407923_49668116|GENSCAN_predicted_CDS_4|1710_bp ntgaacacctacctcttcatgatgcaagcccaaggcattctgatccgggacaacgtgaga acaatcggtgctcaggtttatgagcaggtgcttcggagtgcttatgccaagaggaacagc agtgtaaatgactcagattatcctcttgacttgaaccacagtgaaaccttcctgcaaact acaacatttcttcctgaagacttcacctactttgcaaaccatacctgccctgaaagactc ccttccatgaagggcccaatagacataaacatgagtgaaattggaatggattacattcat gaactcttctccaaagacccaaccatcaagctcggaggtcactggaagccttctgattgc atgcctcggtggaaggtggcgatccttatccccttccggaaccgccacgagcacctccca gtcctgttcagacacctgcttcccatgctccagcgccagcgcttgcagtttgcattttat gtggttgaacaagttggtacccaaccctttaatcgagccatgcttttcaacgttggcttt caagaggcaatgaaagacttggattgggactgtttgatttttcatgatgtagatcacata ccggaaagtgatcgcaactattatggatgtggacagatgccgaggcattttgcaaccaaa ttggataagtatatgtatctgcttccttataccgagttctttggcggagtgagtggctta acagtggaacaatttcggaaaatcaatggctttcctaatgctttctggggttggggtgga gaagatgacgacctctggaacagagtacagaatgcaggctattctgtgagccggccagag ggtgacacaggaaagtacaagtccattcctcatcaccatcgaggagaagtccagtttctt ggaaggtatgctctgctgaggaagtcaaaagaacggcaagggctggatggcctcaacaac ctgaactactttgcaaacatcacatacgacgccttgtataaaaacataactgtcaacctg acacccgagctggctcagggaacccaatcaaagacgctgatcgatcagaagagcggtaca agtagaacacaccactcagtggcaagtcctttctccacagatgatgactcagggaaggtc ccagagcctctggcaggccccgaggtaggtgctggggatgaagctgtggagtatcctagt tatatcaactctcagggtagcccgcgggatgacccggtccccgcgcgtcctgggtgcggc tcgggcgctccctggcggctgcacggggcattgcagcagtccccgaagagcctggcgatg ggaaaggtaaagcaccatgactcagagaaggcaagtgacttgcctaaaatcacagggccc acagagtcacagccagaatgcaagcctgggccctcagatctggagtatgccctcttcacc atggctccaggcaccaaaactccagtgctggaggctggaagtccaagctcgagatgccgg ctgtcagggttgatttccgtcaaggcctctctctttggcctgcagatggcgaccctcttg ctgcctctgcacacggtcatccctctgtgcatgtgcgctcctgaggaagtaactcatctt acctctttggaatttgctcttctacatcttatccccatggttctcaaccagggcagcttt gcctctcaggacacatttagcaatgtctga