GENSCAN 1.0 Date run: 7-Nov-116 Time: 15:30:54 Sequence gi568815581f:57987091_58189276 : 202186 bp : 43.57% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Sngl + 1020 1694 675 2 0 86 46 240 0.226 13.85 1.02 PlyA + 1931 1936 6 1.05 2.00 Prom + 14652 14691 40 -0.96 2.01 Sngl + 18285 18488 204 2 0 79 40 320 0.978 21.49 2.02 PlyA + 18578 18583 6 -3.74 3.04 PlyA - 18616 18611 6 -5.80 3.03 Term - 18883 18657 227 2 2 26 42 132 0.234 -0.66 3.02 Intr - 19437 19253 185 2 2 86 105 188 0.710 19.73 3.01 Init - 20047 19854 194 1 2 92 77 408 0.975 38.54 3.00 Prom - 51082 51043 40 -0.56 4.08 PlyA - 56349 56344 6 1.05 4.07 Term - 60898 60788 111 1 0 108 50 59 0.749 2.66 4.06 Intr - 66295 66119 177 1 0 22 81 95 0.368 2.32 4.05 Intr - 70674 70590 85 2 1 57 70 28 0.075 -2.28 4.04 Intr - 72468 72370 99 2 0 119 99 14 0.409 4.83 4.03 Intr - 73714 73533 182 1 2 113 81 39 0.420 4.37 4.02 Intr - 80797 80678 120 1 0 128 90 6 0.729 5.49 4.01 Init - 81387 81337 51 0 0 65 52 30 0.283 -2.58 4.00 Prom - 82886 82847 40 -5.26 5.00 Prom + 89434 89473 40 -2.36 5.01 Init + 91907 92028 122 1 2 54 93 40 0.204 0.76 5.02 Intr + 93964 94050 87 1 0 103 52 43 0.202 1.29 5.03 Intr + 96566 96600 35 2 2 110 64 67 0.252 4.37 5.04 Intr + 96795 97226 432 1 0 70 28 247 0.504 10.52 5.05 Intr + 97582 97664 83 2 2 31 99 48 0.855 -0.44 5.06 Intr + 99992 100132 141 1 0 81 75 278 0.991 26.35 5.07 Term + 102052 102189 138 0 0 121 43 78 0.885 4.56 5.08 PlyA + 102492 102497 6 1.05 6.03 PlyA - 103778 103773 6 1.05 6.02 Term - 130685 130293 393 2 0 -20 38 801 0.919 59.33 6.01 Init - 136029 135970 60 0 0 60 90 67 0.777 3.35 6.00 Prom - 139199 139160 40 -5.76 7.00 Prom + 141855 141894 40 -1.36 7.01 Sngl + 144418 144864 447 0 0 82 39 256 0.979 16.33 7.02 PlyA + 144886 144891 6 1.05 8.00 Prom + 145647 145686 40 -2.46 8.01 Init + 165102 165150 49 2 1 86 58 59 0.304 1.81 8.02 Intr + 168110 168978 869 0 2 51 -10 683 0.283 45.86 8.03 Intr + 169954 170294 341 0 2 -42 68 489 0.287 28.17 8.04 Term + 170321 170714 394 2 1 -41 49 275 0.860 5.21 8.05 PlyA + 170986 170991 6 1.05 9.00 Prom + 177410 177449 40 -4.06 9.01 Sngl + 182566 183489 924 0 0 83 54 876 0.764 80.09 9.02 PlyA + 184039 184044 6 1.05 10.03 PlyA - 184124 184119 6 1.05 10.02 Term - 198541 198435 107 2 2 69 53 123 0.926 5.47 10.01 Intr - 199982 199939 44 1 2 110 94 26 0.885 3.28 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581f:57987091_58189276|GENSCAN_predicted_peptide_1|224_aa MAAAADPPPPHSPRSGSLLSRRRRQQSGGGGGGNGSGGSSTWQRRARPLPVTSGTHHRGG ERGRGGSPGPAAAAPSRPDPLWASPLPARRLGAQPVPAGGGVRAPAGGLGWGPGRGWGAA SPPAGVCGAAVRGSGSWSRGSSGEKMEDARVWRWPAGHSWAVSRDSDTVSSTVASLLDPQ RRAQATADTRLFSRSCTAWSPPPNWGKPFRLRASLPFGISSPAG >gi568815581f:57987091_58189276|GENSCAN_predicted_CDS_1|675_bp atggctgcggcggccgacccccctcctccccactccccccgctcggggagcctcctcagc cggaggaggcgacaacaaagcggcggcggcggcggcggcaacggcagcggcggctcctca acatggcagcgccgagcgcggccacttccggtaacctccgggacgcaccaccgcggcggc gagcgcgggcggggggggagccccggcccggccgccgccgcccccagtcgcccggacccc ctctgggcgtccccgctgccggctcgccgcctcggagctcagccagtgcccgctggcggt ggggtcagggcccccgcgggggggctggggtggggcccggggcggggatggggcgcggcg tccccccccgcgggggtgtgtggtgctgcggtccgtggctcaggcagctggagcaggggg agctcaggagagaagatggaggacgccagggtctggcgctggcccgctgggcacagctgg gccgtctcccgtgactcagacacggtgtccagcaccgtcgcgtccctgttagaccctcag cggagggcacaggccacggcggacactcggctcttcagtcgctcatgcacagcatggtcc ccacctccaaactgggggaagccctttcgacttcgagcctctcttccttttggcatttcc tccccggccggctaa >gi568815581f:57987091_58189276|GENSCAN_predicted_peptide_2|67_aa MGSTKSVTNHLMYESEICYDGENSVVILCFSLGSNCDSCCCFCYGFCYDYGFEIEIFHNL DFWAHQL >gi568815581f:57987091_58189276|GENSCAN_predicted_CDS_2|204_bp atgggttctacaaaaagtgtcaccaatcatcttatgtacgagagcgagatctgctatgac ggggagaatagcgtggtgatcctctgcttctccttggggagtaactgcgactcctgctgt tgcttctgctacggcttctgctacgactacggcttcgagatcgagatcttccataacttg gacttctgggcccatcaactttaa >gi568815581f:57987091_58189276|GENSCAN_predicted_peptide_3|201_aa MSGGGVIRGPAGNNDCRIYVGNLPPDIRTKDIEDVFYKYGAIRDIDLKNRRGGPPFAFVE FEDPRDAEDAVYGRDGYDYDGYRLRVEFPRSGRGTGRGGGGGGGGGAPRGRYGPPSRRSE NRVVVSGLPPSGSWQDLKDHMREAGDVCYADVYRDGTGVVEFVRKEDMTYAVRKLDNTKF RSHEVGYTRILFFDQNWIQWS >gi568815581f:57987091_58189276|GENSCAN_predicted_CDS_3|606_bp atgtcgggaggtggtgtgattcgtggccccgcagggaacaacgattgccgcatctacgtg ggtaacttacctccagacatccgaaccaaggacattgaggacgtgttctacaaatacggc gctatccgcgacatcgacctcaagaatcgccgcgggggaccgcccttcgccttcgttgag ttcgaggacccgcgagacgcggaagacgcggtgtatggtcgcgacggctatgattacgat gggtaccgtctgcgggtggagtttcctcgaagcggccgtggaacaggccgaggcggcggc gggggtggaggtggcggagctccccgaggtcgctatggccccccatccaggcggtctgaa aacagagtggttgtctctggactgcctccaagtggaagttggcaggatttaaaggatcac atgcgtgaagcaggtgatgtatgttatgctgatgtttaccgagatggcactggtgtcgtg gagtttgtacggaaagaagatatgacctatgcagttcgaaaactggataacactaagttt agatctcatgaggtaggttatacacgtattcttttctttgaccagaattggatacagtgg tcttaa >gi568815581f:57987091_58189276|GENSCAN_predicted_peptide_4|274_aa MLFTVPGTLLPLFACLKEGQTALALLPGKHPGVYPTFSARAILRILAGDEHLKSGVECTV ALFSCQLGSSSLLLLSTPPREKGEESALIAWRTVQQIHVRASLGMSPKLPRLLVRRKRSL SPRCETDPKTPCNFLLKISTKKHSHSHSSSWSSRTSGKEHLPTWIWQPKPSKPPEHHVQY ADGNDPVEMENLMLQMREWIIAGVMLFNKQEKKGSSVREEGLASDRHMDDSFIMRGGKQN EDQMFSGVAPSSSFIRSTNMHHAIMLYCSGQSFC >gi568815581f:57987091_58189276|GENSCAN_predicted_CDS_4|825_bp atgctgttcactgtgcctggaacactcttgcccctcttcgcctgcttaaaggaaggacaa acagcactggctctcctcccaggaaagcaccctggcgtttaccccacattttcagcaaga gccatattgaggattcttgctggagatgagcatcttaagagtggggtggagtgcacagtc gctctgttctcctgccaacttggcagctcatcgctgctgttactatcaacacctccaagg gaaaagggagaggagtcagctctgatcgcctggcgcactgtgcagcaaattcatgtcaga gccagccttggaatgtctcccaagctcccaaggctcctggtgaggaggaaaaggagcctt tctcccagatgtgaaactgatcctaagaccccctgtaatttcctgctgaaaatctccacc aagaaacacagccacagccacagttcttcctggagttcaagaacctcaggcaaagaacac ctgcccacatggatctggcagcccaagccctctaagcccccagagcatcatgttcaatat gctgatgggaatgatccagttgagatggaaaatttgatgctgcagatgagagagtggata attgctggagtgatgctttttaacaagcaagagaagaagggatccagtgtgcgagaggag gggctggcctcagacagacacatggacgattcattcatcatgagaggaggaaagcaaaat gaggaccagatgttctctggtgttgctcccagctcttcattcattcgctccacaaatatg caccatgctatcatgctctactgctctgggcagtcattctgctag >gi568815581f:57987091_58189276|GENSCAN_predicted_peptide_5|345_aa MSYKVTQEISKELRWISAVLVTWDLRGETGTRVCIAMGVSRMQYSIMAKIVGSGNNLELN PAADKLDNLGAQCGLASVKVSARAAPPFAPATLVALLEARAAELPLPPPPGVSLRCVEAP DVRGRGRGGGEGSGVTRLLLPPPPPSGASFLQRGRPVTPALFQAGWGRAADRPGPADLAG GAGGRSGGRRMLPRSPGAALRDRSALEDGIVEGYGPLGEPEEGSRSSFLESKVAFNYCNK VAPLDFGNEAVEQCHTMSDRKAVIKNADMSEDMQQDAVDCATQAMEKYNIEKDIAAYIKK EFDKKYNPTWHCIVGRNFGSYVTHETKHFIYFYLGQVAILLFKSG >gi568815581f:57987091_58189276|GENSCAN_predicted_CDS_5|1038_bp atgtcttacaaggtcacccaagaaatcagcaaagaattgaggtggatttcagcagttcta gtcacatgggacttgagaggggagactggcaccagggtctgcattgcaatgggagtgtca aggatgcagtatagtataatggctaagattgtaggcagtggaaacaatctggagttgaat cctgccgctgacaaactggataatcttggggcccagtgcggactcgcctccgtgaaggtg agcgcccgggctgcgccaccctttgccccggccaccctcgtggcgctgctggaggcccgg gctgcggagctgccgctgcccccgcccccgggagtctctctgcgctgcgtggaggcccct gatgtcaggggccggggaagagggggtggagagggctcgggcgtgacgcggctcctcctg ccgccaccgccaccctctggggcgtccttcctgcagcgaggacgcccagtcactccggcg ctgttccaggccgggtgggggagggcggccgaccggccggggcctgcggacctggccggc ggcgcgggcgggaggtcggggggaaggaggatgcttcctcgttccccaggtgctgccctt cgggaccgcagcgctctggaggatgggatagtggagggatacggcccacttggagagccc gaagagggaagtagaagctctttcctggagagtaaggtggcttttaattactgtaacaaa gttgctcctttggactttgggaatgaggcagtggagcagtgtcacaccatgtctgaccgg aaggcagtgatcaagaacgcagacatgtctgaggacatgcaacaggatgccgttgactgc gccacgcaggccatggagaagtacaatatagagaaggacattgctgcctatatcaagaag gaatttgacaagaaatataaccctacctggcattgtatcgtgggccgaaattttggcagc tacgtcacacacgagacaaagcacttcatctatttttacttgggtcaagttgcaatcctc ctcttcaagtcaggctag >gi568815581f:57987091_58189276|GENSCAN_predicted_peptide_6|150_aa MKKSRMRWLTPVIPALWEAERPALNHYIHLLELNKCSYHHPHHHHHCNHHHHHHHHCHIT ITTIIITITVTVTITVITIIIAVTITIITITITSITIIGSITIITIITVTITIITITDTI TITIITIIITITITITIIIIAITCFRPIDL >gi568815581f:57987091_58189276|GENSCAN_predicted_CDS_6|453_bp atgaagaagagccggatgcggtggctcacgcctgtaatcccagcactttgggaggccgag agaccggctctgaaccattacatccatctgctggagctcaacaaatgtagttaccatcat cctcaccatcatcatcactgtaaccatcaccatcatcatcaccatcactgtcacatcacc atcaccaccatcatcatcaccatcactgtcactgtcaccatcaccgtcatcactatcatc atcgccgtcactatcaccatcatcaccatcactatcaccagcatcaccatcatcggcagc atcaccatcatcaccatcatcactgtcaccatcaccatcatcaccatcactgacaccatc actatcaccattatcaccatcattatcaccatcaccatcactatcaccatcatcatcatt gccattacttgctttaggcccattgatctctga >gi568815581f:57987091_58189276|GENSCAN_predicted_peptide_7|148_aa MKDLFKEKYKPLLNEIKEDTNKWKNIPCSWAGRINIMKMAILPKVIYTFSAIPIKLPTTF FIELEKTTLKFIWNQKRARIAKPILSQKNKAGGITLPDFKLYYKATVTKTAWYWYQNRDI DQWNRTEPSEIMPPIYNHLTFDKPDKKK >gi568815581f:57987091_58189276|GENSCAN_predicted_CDS_7|447_bp atgaaggacctcttcaaggagaagtacaaaccactgctcaatgaaataaaagaggataca aacaaatggaagaacattccatgctcatgggcaggaagaatcaatatcatgaaaatggcc atactgcccaaggtaatttatacattcagtgccatccccatcaagctaccaacgactttc ttcatagaattggaaaaaactactttaaagttcatatggaaccaaaaaagagcccgcatt gccaagccaatcctaagccaaaagaacaaagctggaggcatcacgctacctgacttcaaa ctatactacaaggctacagtaaccaaaacagcatggtactggtaccaaaacagagatata gaccaatggaacagaacagagccctcagaaataatgccacctatctacaaccatctgacc tttgacaaacctgacaaaaagaagtaa >gi568815581f:57987091_58189276|GENSCAN_predicted_peptide_8|550_aa MGFHHVGQAALELLTSGFSQTQELQKFLFLLFLLVYVTTIVGNLLIMVTVTFDCRLHTPM YFLLRNLALIDLCYSTVTSPKMLVDFLHETKTISYQGCMAQIFFFHLLGGGTVFFLSVMA YDRYIAISQPLRYVTIMNTQLCVGLVVAAWVGGFVHSIVQLALILPLPFCGPNILDNFYC DVPQVLRLACTDTSLLEFLMISNSGLLVIIWFLLLLISYTVILVMLRSHSGKARRKAAST CTTHIIVVSMIFIPCIYIYTWPFTPFLMDKAVSISYTVMTPMLNPMIYTLRNQDMKAAMR RLGKCLKGPAVAVGPGPGPGDAEAAAEERRVKVSSLPYSVDALVSDKKPPKEASPVPAKS ASSGATLRLLLLPGHGAREAHSPGPLIKPFETASVKWENSLRRSGVDAGTLLIIRRRQDT KHKTNPKPRTAFTTSQLLALEGKLLQKQYLSIAEGADFSSSPNLTETQVKILFQNRRAKT KRLQESELEKLKMAAKPMLPSSFSLPFPISSPLQAASIYAASYPFHRPVLPIPPVGLYAT PVGYGMYHLS >gi568815581f:57987091_58189276|GENSCAN_predicted_CDS_8|1653_bp atggggtttcaccacgttggccaggctgctctcgaactcctgacctcagggttttcacag acccaagagctccagaaattcctgttccttctgttcctgttagtctatgttaccaccatt gtgggaaacctccttatcatggtcacagtgacttttgactgccggctccacacacccatg tattttctgctccgaaatctagctctcatagacctctgctattccacagtcacctctcca aagatgctggtggacttcctccatgagaccaagacgatctcctaccagggctgcatggcc cagatcttcttcttccaccttttgggaggtgggactgtcttttttctctcagtcatggcc tatgaccgctacatagccatctcccagcccctccggtatgtcaccatcatgaacactcaa ttgtgtgtgggcctggtagtagccgcctgggtggggggctttgtccactccattgtccaa ctggctctgatacttccactgcccttctgtggccccaatatcctagataacttctactgt gatgttccccaagtactgagacttgcctgcactgatacctccctcctggagttcctcatg atctccaacagtgggctgctagttatcatctggttcctcctccttctgatctcttatact gtcatcctggtgatgctgaggtcccactcgggaaaggcaaggaggaaggcagcttccacc tgcaccacccacatcatcgtggtgtccatgatcttcattccctgtatctatatctatacc tggcccttcaccccattcctcatggacaaggctgtgtccatcagctacacagtcatgacc cccatgctcaaccccatgatctacaccctgagaaaccaggacatgaaagcagccatgagg agattaggcaagtgcctaaaggggccagcggtggcggtcgggccaggtccggggcctggg gacgccgaggcggccgcggaggagcgccgcgtcaaggtctccagcctgccctacagtgtg gatgcgctcgtgtcggacaagaagccgcccaaggaggcatccccagtgccggccaaaagc gcctcttccggggccaccctgcggctactgctgctgccggggcacggcgctcgggaagcg cacagccccgggccgctgatcaagcccttcgagaccgcctcggtcaagtgggaaaactcc ctaagacggagcggcgtggatgcaggaaccctgctgataattcgccgccgccaagacaca aaacacaagaccaatccgaagccgcgcacagcctttaccacgtcccagctcctcgctctg gagggcaagttgctccagaaacagtacctctccattgcagagggtgcggacttctccagc tctccgaacctcacggagactcaggtcaaaatcttgttccagaaccgaagggccaagacg aaaagactgcaggagtcagaactggaaaagctgaaaatggctgcaaaacctatgctaccc tccagcttcagtctccctttccccatcagctcgcccctgcaggcagcgtccatatacgca gcatcctacccgttccatagacctgtgcttcccatcccgcccgtgggactctatgccacg ccagtgggatatggcatgtaccacctgtcctaa >gi568815581f:57987091_58189276|GENSCAN_predicted_peptide_9|307_aa METGNLTWVSDFVFLGLSQTRELQRFLFLMFLFVYITTVMGNILIIITVTSDSQLHTPMY FLLRNLAVLDLCFSSVTAPKMLVDLLSEKKTISYQGCMGQIFFFHFLGGAMVFFLSVMAF DRLIAISRPLRYVTVMNTQLWVGLVVATWVGGFVHSIVQLALMLPLPFCGPNILDNFYCD VPQVLRLACTDTSLLEFLKISNSGLLDVVWFFLLLMSYLFILVMLRSHPGEARRKAASTC TTHIIVVSMIFVPSIYLYARPFTPFPMDKLVSIGHTVMTPMLNPMIYTLRNQDMQAAVRR LGRHRLV >gi568815581f:57987091_58189276|GENSCAN_predicted_CDS_9|924_bp atggaaacagggaacctcacgtgggtatcagactttgtcttcctggggctctcgcagact cgggagctccagcgtttcctgtttctaatgttcctgtttgtctacatcaccactgttatg ggaaacatccttatcatcatcacagtgacctctgattcccagctccacacacccatgtac tttctgctccgaaacctggctgtcctagacctctgtttctcttcagtcactgctcccaaa atgctagtggacctcctctctgagaagaaaaccatctcttaccagggctgcatgggtcag atcttcttcttccactttttgggaggtgccatggtcttcttcctctcagtgatggccttt gaccgcctcattgccatctcccggcccctccgctatgtcaccgtcatgaacactcagctc tgggtggggctggtggtagccacctgggtgggaggctttgtccactctattgtccagctg gctctgatgctcccactgcccttctgtggccccaacattttggataacttctactgtgat gttccccaagtactgagacttgcctgcactgacacctcactgctggagttcctcaagatc tccaacagtgggctgctggatgtcgtctggttcttcctcctcctgatgtcctacttattc atcctggtgatgctgaggtcacatccaggggaggcaagaaggaaggcagcttccacctgc accacccacatcatcgtggtttccatgatcttcgttccaagcatttacctctatgcccgg cccttcactccattccctatggacaagcttgtgtccatcggccacacagtcatgaccccc atgctcaaccccatgatctataccctgaggaaccaggacatgcaggcagcagtgagaaga ttagggagacaccggctggtttga >gi568815581f:57987091_58189276|GENSCAN_predicted_peptide_10|50_aa XWQFSHSQGFDLDKPAQNFKLLLSQEPSMVELLATGLIRSLYLLSSNGPN >gi568815581f:57987091_58189276|GENSCAN_predicted_CDS_10|153_bp nnatggcagttcagccactcccagggttttgaccttgataagccagctcaaaacttcaag cttctgctttcacaagaaccgtccatggtggagctgctggccacaggcctgattagatct ctttatctgctcagctccaacggccccaactga