GENSCAN 1.0 Date run: 6-Nov-116 Time: 15:26:07 Sequence gi568815588f:72123915_72333113 : 209199 bp : 46.06% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.09 Intr - 9267 9143 125 2 2 73 115 47 0.080 6.13 1.08 Intr - 29074 28955 120 0 0 86 76 102 0.657 8.41 1.07 Intr - 37760 37624 137 2 2 72 78 99 0.643 6.67 1.06 Intr - 73075 72897 179 2 2 106 115 171 0.999 21.24 1.05 Intr - 79610 79513 98 1 2 90 77 8 0.421 -0.45 1.04 Intr - 86917 86818 100 2 1 55 78 123 0.555 7.17 1.03 Intr - 89417 89273 145 2 1 34 84 152 0.043 9.26 1.02 Intr - 100110 99934 177 0 0 32 44 117 0.006 1.82 1.01 Init - 102717 102676 42 0 0 62 66 38 0.008 -2.58 1.00 Prom - 103205 103166 40 -4.76 2.00 Prom + 103256 103295 40 -5.16 2.01 Init + 104199 104247 49 2 1 67 70 -3 0.180 -3.09 2.02 Intr + 106452 106526 75 1 0 101 103 33 0.972 5.59 2.03 Term + 109087 109202 116 2 2 96 43 175 0.966 12.53 2.04 PlyA + 109810 109815 6 1.05 3.00 Prom + 113479 113518 40 -2.96 3.01 Init + 119826 120005 180 2 0 84 32 75 0.037 0.68 3.02 Intr + 136893 137027 135 2 0 157 47 13 0.060 4.86 3.03 Intr + 150243 150507 265 2 1 112 105 298 0.994 30.99 3.04 Term + 150781 151274 494 2 2 112 43 799 0.999 72.57 3.05 PlyA + 152101 152106 6 1.05 4.00 Prom + 154922 154961 40 -4.76 4.01 Init + 156769 156846 78 0 0 96 73 107 0.563 11.07 4.02 Intr + 161820 161874 55 2 1 48 103 31 0.353 -0.85 4.03 Intr + 167977 168045 69 2 0 101 94 60 0.707 7.05 4.04 Intr + 170473 170510 38 2 2 118 53 3 0.421 -2.22 4.05 Term + 171986 172276 291 1 0 115 47 156 0.891 9.54 4.06 PlyA + 173084 173089 6 -0.45 5.02 PlyA - 173179 173174 6 1.05 5.01 Sngl - 176957 176442 516 2 0 88 43 235 0.934 13.11 5.00 Prom - 179305 179266 40 -6.76 6.00 Prom + 183324 183363 40 -4.16 6.01 Init + 185361 185452 92 2 2 65 105 22 0.514 1.57 6.02 Intr + 185930 186144 215 2 2 81 63 43 0.215 -0.54 6.03 Intr + 195625 195677 53 2 2 98 47 42 0.126 -0.27 6.04 Intr + 197857 197879 23 0 2 138 98 -2 0.538 2.14 6.05 Intr + 198280 198380 101 1 2 110 83 26 0.557 4.05 6.06 Intr + 200011 200122 112 2 1 85 69 4 0.161 -2.36 6.07 Term + 202029 202539 511 0 1 6 33 338 0.481 13.95 6.08 PlyA + 202842 202847 6 1.05 7.00 Prom + 203647 203686 40 -7.86 7.01 Init + 204165 204312 148 2 1 94 45 114 0.891 6.41 7.02 Term + 204522 204670 149 1 2 91 45 131 0.992 7.16 7.03 PlyA + 208927 208932 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 89384 89273 112 2 1 56 84 115 0.902 8.27 S.002 Sngl + 91728 91973 246 2 0 53 48 197 0.829 7.28 S.003 Term + 136893 136960 68 1 2 157 49 29 0.811 4.00 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815588f:72123915_72333113|GENSCAN_predicted_peptide_1|375_aa MRSQLTAASTSQAQVKRAFRGGARSETLKPDPVTELPLTPPAEEDDDEAAISSLAEFLLH YRERKTAIGKFMEGIIGISFGESVMEVLRPQLIRIDGRNYRKNPVQEQTYQHEEDEEDFY QGSMECADEPCDAYEVEQTPQGFRSTLRAPSLLYKHIVGKRGDTRKKIEMETKTSISIPK PGQDGEIVITGQHRNGVISARTRIDVLLDTFRRKQPFTHFLAFFLNEVEVQEGFLRFQEE VLAKCSMDHGVDSSIFQNPKKLHLTIGMLVLLSEEEIQQTCEMLQQCKEEFINDISGGKP LEVEMAGIEYMNDDPGMVDVLYAKVHMKDGSNRLQELVDRVLERFQASGLIVKEWNSVKL HATVMNTLFRKDPNX >gi568815588f:72123915_72333113|GENSCAN_predicted_CDS_1|1125_bp atgcgatcacagctcactgcagcctccacctcccaggctcaggtgaaaagggctttccgt ggtggggcaaggtctgagacactgaaaccagatccagtgacagaacttccactgacccca ccagctgaggaggatgatgatgaagcagccatttcctctctagcagagttcttacttcac tacagagaaagaaaaacagcaattggcaagtttatggaaggcataattggaatatcattt ggagaaagtgtcatggaagttctgcgtccacagcttataagaattgatggccggaattac aggaagaatccagtccaagaacagacctatcaacatgaagaagatgaagaggacttctat caaggctccatggagtgtgctgatgagccctgtgatgcctacgaggtggagcagacccca caaggattccggtctactttgagggcccccagcttgctctataagcatatagttggaaag agaggggacactaggaagaaaatagaaatggagaccaaaacttctattagcattcctaaa cctggacaagacggggaaattgtaatcactggccagcatcgaaatggtgtaatttcagcc cgaacacggattgatgttcttttggacacttttcgaagaaagcagcccttcactcacttc cttgcctttttcctcaatgaagttgaggttcaggaaggattcctgagattccaggaggaa gtactggcgaagtgctccatggatcatggggttgacagcagcattttccagaatcctaaa aagcttcatctaactattgggatgttggtgcttttgagtgaggaagagatccagcagaca tgtgagatgctacagcagtgtaaagaggaattcattaatgatatttctgggggtaaaccc ctagaagtggagatggcagggatagaatacatgaatgatgatcctggcatggtggatgtt ctttacgccaaagtccatatgaaagatggctccaacaggctacaagaattagttgatcga gtgctggaacgttttcaggcatctggactaatagtgaaagagtggaatagtgtgaaactg catgctacagttatgaatacactattcaggaaagaccccaatgnn >gi568815588f:72123915_72333113|GENSCAN_predicted_peptide_2|79_aa MDAVFNLKRFNLMLHPYGSERFLCESVFSYQVASTLKQVKHDQQVARMEKLAGLVEELEA DEWRFKPIEQLLGFTPSSG >gi568815588f:72123915_72333113|GENSCAN_predicted_CDS_2|240_bp atggatgctgtttttaatctcaaaaggttcaacttgatgttacatccttatggctctgag agattcctctgcgaatctgtttttagctatcaagtggcatccacgcttaaacaggtgaaa catgatcagcaagttgctcggatggaaaaactagctggtttggtagaagagctggaggct gacgagtggcggtttaagcccatcgagcagctgctgggattcaccccctcttcaggttga >gi568815588f:72123915_72333113|GENSCAN_predicted_peptide_3|357_aa MEAKRTWEWECRGDDKEPFLEKASLLLNMMMRDSAPHTKGKMPKKRSTHREQHVQSQELQ GSHQKGANPAPCPHPAAGAPNAELSISVAWLPSSRAVGGASFLPRRLLRSGTLSSSANAL ASVLTMPSLWDRFSSSSTSSSPSSLPRTPTPDRPPRSAWGSATREEGFDRSTSLESSDCE SLDSSNSGFGPEEDTAYLDGVSLPDFELLSDPEDEHLCANLMQLLQESLAQARLGSRRPA RLLMPSQLVSQVGKELLRLAYSEPCGLRGALLDVCVEQGKSCHSVGQLALDPSLVPTFQL TLVLRLDSRLWPKIQGLFSSANSPFLPGFSQSLTLSTGFRVIKKKLYSSEQLLIEEC >gi568815588f:72123915_72333113|GENSCAN_predicted_CDS_3|1074_bp atggaagccaagaggacttgggagtgggaatgccgaggagatgacaaggagcctttccta gagaaggcgagcctgctcttgaacatgatgatgagagacagcgcgccacacacaaaggga aagatgccaaagaagcgttccacgcacagggaacagcatgtgcaaagtcaggaactccag ggctctcaccagaaaggcgccaaccctgcgccctgcccccacccagccgccggggcccca aatgctgagctcagcatctctgtggcctggctgccctcctcccgggcagtgggaggagcc tccttcctcccgaggcggcttctacgctccggcactctgagttcatcagcaaacgccctg gcgtctgtcctcaccatgcctagcctttgggaccgcttctcgtcgtcgtccacctcctct tcgccctcgtccttgccccgaactcccaccccagatcggccgccgcgctcagcctggggg tcggcgacccgggaggaggggtttgaccgctccacgagcctggagagctcggactgcgag tccctggacagcagcaacagtggcttcgggccggaggaagacacggcttacctggatggg gtgtcgttgcccgacttcgagctgctcagtgaccctgaggatgaacacttgtgtgccaac ctgatgcagctgctgcaggagagcctggcccaggcgcggctgggctctcgacgccctgcg cgcctgctgatgcctagccagttggtaagccaggtgggcaaagaactactgcgcctggcc tacagcgagccgtgcggcctgcggggggcgctgctggacgtctgcgtggagcagggcaag agctgccacagcgtgggccagctggcactcgaccccagcctggtgcccaccttccagctg accctcgtgctgcgcctggactcacgactctggcccaagatccaggggctgtttagctcc gccaactctcccttcctccctggcttcagccagtccctgacgctgagcactggcttccga gtcatcaagaagaagctgtacagctcggaacagctgctcattgaggagtgttga >gi568815588f:72123915_72333113|GENSCAN_predicted_peptide_4|176_aa MGSFMETSGAADAACGYVELAALNVESHSKYLNRQVLLHPLLQGDLVNQQVGEGDLLLPE AVGAGARGHQGTDSRLSSYQPIGAGEAAGVGPVNRFANQHRVIPALPLVSVWSHDPNCPV TVKPGTCVTIPKDRASFYSTGTWDDVPLQLQQVLMEPKHFPRQQRQSTNKYKDSGK >gi568815588f:72123915_72333113|GENSCAN_predicted_CDS_4|531_bp atgggcagcttcatggaaacttctggggcagcagatgcagcctgtggctatgtggagctg gcggctctgaatgtagagtcccactccaaatacctcaacaggcaagtcctgctgcaccca ctcctacaaggtgacttggttaatcagcaggttggggagggggacctgctgctcccagag gccgtaggtgctggagcacgaggacaccagggaactgactcaaggctctcctcttaccag cccattggagcaggtgaggctgcaggtgtggggcctgtgaacagatttgccaatcaacac agggttattcctgccttgcccctggtatctgtgtggtcacatgaccccaactgtccagtt acagtgaaacccgggacatgtgtcacaattcccaaggacagagcctctttctactcaact ggaacttgggatgatgtgcccctgcagctgcagcaggtgctcatggagcccaaacacttc ccaagacagcaaaggcaaagtacaaacaagtacaaagactctgggaaatga >gi568815588f:72123915_72333113|GENSCAN_predicted_peptide_5|171_aa MAGCRSRALPRGEAAKARQEIERSGPALLGGPAVLEDRVHPPQPLARVLSPPLPEAAPNA GPAEPTPTGNSRWPTSTARRPGCHRRLSLHTSLQAEGAGSGLGQPRKGLPQCSGRLKGSS AAKVGAQAEEVPRASEGCEDCQHAVTSQNDSQCRASFSQRYTCGNNPCSYM >gi568815588f:72123915_72333113|GENSCAN_predicted_CDS_5|516_bp atggcgggatgcaggtcccgagccctgccccgcggggaggcagctaaggcccggcaagaa atcgagcgcagtgggccggcactgctgggtgggccggcagtgctggaggaccgagtacac cctccgcagccgctggcccgggtgctaagccccccattgcccgaggccgctcctaatgcg gggcccgccgagcccacgcccaccgggaactcgcgctggcccacaagcaccgcgcgcaga cccggttgccaccggcgcctctccctccacacctccctgcaagctgagggagccggctct ggccttggccagcccagaaaggggctcccacagtgcagcggcaggctgaagggctcaagt gccgccaaagtgggagcccaggcagaggaggtgccgagagcgagcgagggctgtgaggac tgccagcatgctgtcacctctcagaatgactctcagtgtcgggcctcattctctcaacgt tacacctgtgggaataatccatgcagttatatgtag >gi568815588f:72123915_72333113|GENSCAN_predicted_peptide_6|368_aa MRFGDRVPGNLIWGDSQVQMPLQACTVSARRPEHFAASRNTHGDGTATHTLPSTHSHVLF AAVKALLTHTFSPAPASTAVPCCLEPGPSSRTTHMFSVGFSQYQTGQRAAIKGVWDEPGG VGPGGMSQVWLKHTVGPQSTLLTREMSPGMREMTHEAGLGEAQGWGGGMVCVPGSSSDTA ELGDLGKSLLFSGARFPQWEVWRERREREPGLRAALAGQLEFRVGVGLAAPHSEQPAGPA APGNEGLSTRASGCGGCTGSPSSASPPALLSISHRVLAAFLRGRARDLQPAMPEPPTPSM GSCAARASPMSAAPCSTAPSPIDHPRAEECGPTARDWQAAPPAAPVHDPLSEASWAPESG GDVENLYV >gi568815588f:72123915_72333113|GENSCAN_predicted_CDS_6|1107_bp atgaggtttggggaccgagtcccaggaaatctcatatggggtgattcccaggtccagatg cccttgcaggcttgcactgtgtctgccaggagaccagaacattttgctgccagcagaaac acacatggtgatggcactgctacccacaccctgccgtcaacacactcccacgtgctgttt gccgctgtcaaagccctgctcacacacacattctcacccgctcctgccagcacagctgtt ccctgctgcctggaaccaggcccttcatcccgcacgacacacatgttctctgtgggtttt tctcaataccagacggggcaaagggcagccatcaaaggtgtctgggacgagcccggcggg gtggggccaggtggaatgtcacaggtgtggctcaagcacacagtgggtcctcagtccacc ctgctgaccagggagatgagtcctggcatgagggagatgactcatgaagccgggctgggc gaggcccagggttggggcggggggatggtctgtgtcccaggcagcagctccgacaccgct gagctgggcgaccttggcaagtcacttctcttttctggggctcgatttcctcagtgggag gtgtggagggagaggcgcgagcgggaacccgggctgcgcgcggcgcttgcgggccagcta gagttccgggtgggcgtgggcttggcggccccgcactcggagcagcccgccggccctgcc gccccggggaatgaggggcttagcacccgggccagcggctgcggagggtgtactgggtcc cccagcagtgccagcccaccggcgctgctctcgatttctcaccgggtcttagctgccttc ctgaggggcagggctcgggacctgcagcccgccatgcctgagcctcccaccccctccatg ggctcctgtgcggcccgagcctccccgatgagcgccgccccctgctccacagcgcccagt cccatcgaccacccaagggctgaggagtgcgggcccacagcgcgggactggcaggcagct ccacctgcagccccggtgcacgatccactgagtgaagccagctgggctcctgagtctggt ggggacgtggagaacctttatgtctag >gi568815588f:72123915_72333113|GENSCAN_predicted_peptide_7|98_aa MDTLAAHVAPAVVVAGLFDLWLEAKLGPTMPPSARRPWAGSQQMAEPARGRGDLGPFQQD QQDSASPRRQEKEKAPAQCPSHERLLPPATVQLDAHRG >gi568815588f:72123915_72333113|GENSCAN_predicted_CDS_7|297_bp atggacacacttgctgcccatgtggcccctgcagtggtggtagctggcttgtttgacttg tggctggaagcaaaattagggcccacgatgccaccttctgcacgcaggccatgggctggc agccagcagatggcagagcctgccagagggagaggagacctggggccttttcagcaggat cagcaggactctgcttccccccggaggcaggaaaaggaaaaagcccccgcccagtgcccg tcccatgagcgactgctgcctccagccaccgtccagctggatgcacaccgtggctga