GENSCAN 1.0	Date run: 6-Nov-116	Time: 15:06:15

Sequence gi568815587r:76087068_76306407 : 219340 bp : 50.59% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.02 PlyA -     62     57    6                               1.05
 1.01 Sngl -   6392   6132  261  2  0   86   44   153 0.396   5.86
 1.00 Prom -  14643  14604   40                               0.24

 2.00 Prom +  20195  20234   40                              -4.26
 2.01 Init +  41445  41468   24  2  0   66  119    11 0.432   1.60
 2.02 Intr +  43901  43961   61  1  1   90   69    25 0.244  -0.89
 2.03 Intr +  51272  51372  101  0  2  127   81    97 0.931  12.73
 2.04 Term +  53684  54346  663  1  0   50   45   510 0.863  36.77
 2.05 PlyA +  55563  55568    6                               1.05

 3.00 Prom +  62281  62320   40                              -5.36
 3.01 Init +  65309  65367   59  1  2  101  114    13 0.600   6.09
 3.02 Intr +  75620  75636   17  2  2  110  105    -3 0.027  -1.21
 3.03 Term +  85581  86152  572  1  2   93   39   182 0.608   8.40
 3.04 PlyA +  86232  86237    6                              -0.45

 4.13 PlyA -  86324  86319    6                               1.05
 4.12 Term -  91826  91693  134  0  2   41   53    86 0.122  -1.35
 4.11 Intr -  92124  91919  206  2  2   63   47   124 0.171   4.64
 4.10 Intr - 100172 100002  171  1  0   72   61   255 0.414  20.26
 4.09 Intr - 104789 104497  293  2  2  131   94   460 0.998  46.83
 4.08 Intr - 107777 107500  278  0  2  100   91   349 0.971  33.64
 4.07 Intr - 109651 109416  236  0  2   64   75   361 0.762  29.73
 4.06 Intr - 119431 119258  174  0  0   99   68   183 0.287  16.45
 4.05 Intr - 121300 121125  176  1  2   84   -2    92 0.362  -1.46
 4.04 Intr - 122211 122074  138  0  0   77   71   107 0.650   8.56
 4.03 Intr - 122507 122361  147  2  0   83   37    77 0.676   2.53
 4.02 Intr - 127998 127921   78  0  0   95   19    68 0.132   0.35
 4.01 Init - 131735 131688   48  2  0   41  111    50 0.559   1.89
 4.00 Prom - 138576 138537   40                              -1.56

 5.03 PlyA - 139030 139025    6                              -0.45
 5.02 Term - 141443 141272  172  1  1   90   43   115 0.694   4.50
 5.01 Init - 143153 143032  122  2  2   91   94    42 0.942   4.77
 5.00 Prom - 144176 144137   40                              -8.16

 6.00 Prom + 145472 145511   40                              -4.36
 6.01 Init + 159331 159388   58  0  1   67   60   121 0.723   6.76
 6.02 Intr + 159522 159650  129  1  0   83   27    89 0.365   2.97
 6.03 Intr + 166834 166915   82  2  1  123   68    51 0.252   5.30
 6.04 Intr + 181801 181856   56  1  2   53  111    49 0.250   2.32
 6.05 Intr + 184177 184351  175  2  1   75  107   124 0.386  12.00
 6.06 Intr + 184407 184558  152  0  2   21   67   141 0.666   5.11
 6.07 Intr + 196210 196231   22  2  1  101  100    16 0.125   0.70
 6.08 Intr + 200192 200243   52  2  1  103   62    67 0.428   4.51
 6.09 Intr + 201322 201739  418  0  1   82   72    86 0.278  -0.00
 6.10 Term + 202361 202407   47  0  2  137   48    62 0.727   4.47
 6.11 PlyA + 202576 202581    6                               1.05

 7.04 PlyA - 203854 203849    6                               1.05
 7.03 Term - 210717 210554  164  1  2  100   42   105 0.517   5.20
 7.02 Intr - 218468 218341  128  1  2   94   87   -25 0.239  -1.68
 7.01 Init - 219237 219014  224  0  2   86   94    60 0.361   2.77

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815587r:76087068_76306407|GENSCAN_predicted_peptide_1|86_aa
MAILPKVIYRFNAIPIKLPMTFFTELDKTTLKFIWNQKRARIVKTILSEKNKAGGITLPD
SKLHYKATVTKTAWYWYQNSMVLVPK

>gi568815587r:76087068_76306407|GENSCAN_predicted_CDS_1|261_bp
atggccatactgcccaaggtaatttatagattcaatgccatccccatcaagctaccaatg
actttcttcacagaattggacaaaactactttaaagttcatatggaaccaaaaaagagcc
cgcattgtcaagacaatcctgagcgaaaagaacaaagctggaggcatcacgctacctgac
tccaaactacactacaaggctacagtaaccaaaacagcatggtactggtaccaaaacagc
atggtactggtaccaaaatag

>gi568815587r:76087068_76306407|GENSCAN_predicted_peptide_2|282_aa
MVFAHQKLHKQFIIPKYQVLSRYSEGSPGHSRALCGQLIFHGAQNIIDSRENGRSLHIVV
GKRQSSIFGGADVGFSGGIPSPDKGHRKRASSENERLQYKTPPPSYNSALAQPVTTVPSM
GETERKITSLSSSLDTSLDFSKENKKKGEDLVGSLNGGHANVHPSQEQGEALSGHRATVN
GTLLPSEQAGSASVQLPGEFHPVSEAELCCTVEQAEEIIGLEATGFASGDQLEAFNCIPV
DSAVAVECDEQVLGEFEEFSRRIYALNENVSSFRRPRRSSDK

>gi568815587r:76087068_76306407|GENSCAN_predicted_CDS_2|849_bp
atggtctttgcccatcaaaagctgcataaacagtttattattcccaagtatcaagtgttg
tccagatactccgaggggtccccaggacactcgagggctctgtgtgggcagctcatcttc
cacggggctcagaacattattgacagtcgggaaaatggtcgttcccttcacattgtagtt
ggcaagagacaaagctccatatttgggggtgcagatgtaggcttctctggggggatccct
tcaccagacaaaggacatcgaaaacgggccagctctgagaatgagagacttcagtacaaa
acccctcctcccagttacaactcagcattagcccagcctgtgaccaccgtcccctccatg
ggagagaccgagagaaagataacatctctatcctcctccttggatacctccttggacttc
tccaaagaaaacaagaaaaaaggagaggatctagttggcagcttaaacggaggccacgcg
aatgtgcaccctagccaagaacaaggagaagccctctccgggcaccgggccacagtcaat
ggcactctcctacccagcgagcaggccgggtccgccagtgtccagcttccaggcgagttc
cacccagtctcagaagctgagctctgctgtactgtggagcaagcagaagaaatcatcggg
ctggaagccacaggtttcgcctcaggtgatcagctagaagcatttaactgcatcccagtg
gacagtgctgtggcagtagagtgtgacgaacaagttctgggagaatttgaagagttctcc
cgaaggatctatgcactgaatgaaaacgtatccagcttccgccggccgcgcaggagttcc
gataagtga

>gi568815587r:76087068_76306407|GENSCAN_predicted_peptide_3|215_aa
MEWRFGETTGDQERSQASLGPSTRPGKARLPWCLSPAFGALGWAASIQSQEPQAHRSAGP
GNRSPTEVMWNPSVPPSLCVRVIYGQMLREEAEENKPCQLPTCHRSGERERAELLHDLQQ
APSFSGPLLPLKSGSESQPQITDLPEGSSEKLAVKPLAQSLAQNRLHPCSKPAVAPTLPQ
SDPRPSLTSSPAPMALGFTRAAICIARSSWTTEID

>gi568815587r:76087068_76306407|GENSCAN_predicted_CDS_3|648_bp
atggagtggcgctttggggaaaccacgggcgaccaggaaagatctcaggcaagcctcggc
ccctccaccaggcctgggaaagcacgactcccctggtgtctgagcccggcatttggagcc
ctgggctgggcagccagcatccagagccaagaaccacaggctcaccggagtgcagggcct
gggaaccgctcacccacggaggtgatgtggaatccgtccgtgcccccctcgttgtgtgtc
agagtgatttatgggcagatgctgagagaagaggctgaggaaaataagccttgccagctg
ccaacctgtcacagatcgggagagagggaaagggccgagctgcttcatgacctccagcag
gctccttccttctctgggcctcttctgcctctgaaatcggggtctgaaagccagccccaa
atcacagacttgccagaaggatcaagtgagaagttggccgtcaagcccctagcgcagagc
ctggcacaaaacaggttgcacccctgctccaaacctgcagtggctcccacgctccctcag
tctgaccctcgtccctcactgacatcatctccggcccccatggctctggggtttacaaga
gccgccatctgcattgcgaggtcttcctggaccactgaaatagactag

>gi568815587r:76087068_76306407|GENSCAN_predicted_peptide_4|692_aa
MAVNLSLALLTLHPEQKGLSKSTLVVEAVRFSLIVTNEKWEMVTLKPSLGVPGWRGQLRE
LRGGRLRDPPVPTLRAARTIGPGARCPWGVGPQAAASRARCRQGSSDQLRGGDRSLSKGA
TPGAAHYAPGPKQGAGQAPLCWALAAGLCGGKKLYDPGFSSPSCGFCRQAPENSGASHLC
VLTTHAQERATQVASRRRAGPAAAVQAEDFGAAPPGCDPGRARRATMRARPQVCEALLFA
LALQTGVCYGIKWLALSKTPSALALNQTQHCKQLEGLVSAQVQLCRSNLELMHTVVHAAR
EVMKACRRAFADMRWNCSSIELAPNYLLDLERGTRESAFVYALSAAAISHAIARACTSGD
LPGCSCGPVPGEPPGPGNRWGGCADNLSYGLLMGAKFSDAPMKVKKTGSQANKLMRLHNS
EVGRQALRASLEMKCKCHGVSGSCSIRTCWKGLQELQDVAADLKTRYLSATKVVHRPMGT
RKHLVPKDLDIRPVKDSELVYLQSSPDFCMKNEKVGSHGTQDRQCNKTSNGSDSCDLMCC
GRGYNPYTDRVVERCHCKYHWCCYVTCRRCERTVERYVCNTLPVPRVVLGLQMLVDQGLR
GHHLFLLSPKNPTQAWSTDTAEAQGSKGPYQRGWWTPRPEAGAQVQALDKGTGTRSLLLM
GAGSQWFRLQRPAEMPGPVLKAFLPSHDIHVS

>gi568815587r:76087068_76306407|GENSCAN_predicted_CDS_4|2079_bp
atggctgtcaacctctctctggctctcctcactcttcacccagagcagaaaggcctttct
aaaagcaccctggtggtggaagcagtacgctttagtctcattgtaaccaatgagaaatgg
gaaatggtaaccttgaagccgtcgctcggcgtccctggttggcggggtcagctccgtgag
ctgcgtggcgggaggcttcgggacccgcccgtgccgaccttgcgcgccgcgcggacaatc
ggccccggtgcacggtgcccgtggggcgtcgggccccaggcggccgcctcccgggcccgc
tgcaggcagggtagctcggaccaacttcgaggaggtgaccgctcgctctccaaaggtgct
acgcctggcgcggcccactacgcgcccggccccaagcaaggggccgggcaggcgcccctg
tgctgggcgctagctgcggggctctgcggaggaaaaaaactgtacgatccaggcttctcc
tcgccttcctgtgggttttgccgacaggctccagaaaacagtggggcttcccatctctgc
gtgctgactacccacgcgcaggaacgcgccacgcaggtggcgtcgcggcggcgtgcagga
ccagcggcggccgtgcaggcggaggacttcggcgcggctcctcctgggtgtgaccccggg
cgcgcccgccgcgcgacgatgagggcgcggccgcaggtctgcgaggcgctgctcttcgcc
ctggcgctccagaccggcgtgtgctatggcatcaagtggctggcgctgtccaagacacca
tcggccctggcactgaaccagacgcaacactgcaagcagctggagggtctggtgtctgca
caggtgcagctgtgccgcagcaacctggagctcatgcacacggtggtgcacgccgcccgc
gaggtcatgaaggcctgtcgccgggcctttgccgacatgcgctggaactgctcctccatt
gagctcgcccccaactatttgcttgacctggagagagggacccgggagtcggccttcgtg
tatgcgctgtcggccgccgccatcagccacgccatcgcccgggcctgcacctccggcgac
ctgcccggctgctcctgcggccccgtcccaggtgagccacccgggcccgggaaccgctgg
ggaggatgtgcggacaacctcagctacgggctcctcatgggggccaagttttccgatgct
cctatgaaggtgaaaaaaacaggatcccaagccaataaactgatgcgtctacacaacagt
gaagtggggagacaggctctgcgcgcctctctggaaatgaagtgtaagtgccatggggtg
tctggctcctgctccatccgcacctgctggaaggggctgcaggagctgcaggatgtggct
gctgacctcaagacccgatacctgtcggccaccaaggtagtgcaccgacccatgggcacc
cgcaagcacctggtgcccaaggacctggatatccggcctgtgaaggactcggaactcgtc
tatctgcagagctcacctgacttctgcatgaagaatgagaaggtgggctcccacgggaca
caagacaggcagtgcaacaagacatccaacggaagcgacagctgcgaccttatgtgctgc
gggcgtggctacaacccctacacagaccgcgtggtcgagcggtgccactgtaagtaccac
tggtgctgctacgtcacctgccgcaggtgtgagcgtaccgtggagcgctatgtctgcaac
accctacccgtacctcgtgtggtcctgggactgcagatgctggtggaccagggcctgcga
ggacaccatctgttcctgctgtcccccaaaaacccgacccaggcctggtccacagacaca
gcagaagcccaaggcagcaagggcccataccagaggggctggtggacccccaggcccgag
gctggggctcaggtccaggctctagacaagggaactgggacaagaagcctgctcctgatg
ggtgctggctcccagtggttccgacttcagaggcccgcagagatgcctggccctgtgctg
aaggccttcctgccctctcacgacatccatgtttcctga

>gi568815587r:76087068_76306407|GENSCAN_predicted_peptide_5|97_aa
MTPSSGLEIRGRSESDTTAQNRKLGAPSVPMSDRDGGVGPRPCEWWTSKTFGQQSKVLHW
AISLPPACPVPGSILALEIWAQACRWRHHLPLVLTEL

>gi568815587r:76087068_76306407|GENSCAN_predicted_CDS_5|294_bp
atgaccccttcctcaggcctggagatcagaggaagatcagaaagtgacaccacagctcag
aacaggaagctaggtgctccctcagtgcccatgagtgacagagatggtggtgtgggtccg
agaccctgcgagtggtggacgtcgaagacctttggacaacagagcaaggtgctccactgg
gctatcagtctgccccctgcctgcccagtgcctggctccatcctggctctggagatctgg
gcccaggcttgtcgatggaggcaccatctgcccctggtgcttacagagctctag

>gi568815587r:76087068_76306407|GENSCAN_predicted_peptide_6|396_aa
MAGPQALLLLCQALSGSPPAQCPPPHRDLLPESGDISRPPGAWWLLSVGADIFMPYMGKP
VGGPTLQQYDLNLTSYSCKDPISMYGQIHRDLRTKDEANLTPILRELSGVQSQVTHSAFL
PANIEEASSTHQALSEAPIKTDTGLPHEEQAEKEDKHERTSADLQSRESDSRLQAAQKQN
SRPKGAREKNQASCVLAVVGERMEGEGLDHYDDDDDEGLLPPKGRAISMVAICEAMTVSH
QVRTAPVKWAMVSESSMHHPPRPSWSELCPRLPDHTVSPSTGSVQVLLPIQPCTKERGRS
RVFFDSTDLQRHLLCARTSAGHRGTKSACPSTAYGSASLDTVIPHPGLHSDSATLKAIWP
QQMPGPRPQVQREQGYGRSRGMNLSTPHENLSTVTM

>gi568815587r:76087068_76306407|GENSCAN_predicted_CDS_6|1191_bp
atggctggcccccaagctctgctgctgctctgccaggcgctgtcgggcagtcccccagct
cagtgtcctcctccacatagggatctgctcccagagtctggggatatctcccgcccccct
ggtgcctggtggctgctgtcagtaggggcagacatcttcatgccctacatgggaaagcca
gtgggggggcccaccctacagcagtatgacctcaacttaactagttatagctgcaaagac
cccatttccatgtatggccaaattcacagggacttaaggaccaaggatgaagccaacctg
actcctattctcagggagctgtcaggagttcaatcacaggtcactcattcagcctttctt
ccagcaaacattgaggaagcctcttctacacaccaggcactgagcgaggcaccgatcaag
acagacacaggcttgccccatgaggagcaagcagaaaaggaggacaagcacgaaaggaca
agtgctgacctgcagtcacgggaatcggactcccggctccaagctgcccagaaacagaac
tccaggcccaaaggcgcccgtgagaagaatcaggcttcttgtgtgttggctgtagtggga
gaaagaatggaaggagagggtctagaccactatgatgatgatgacgatgaagggctgctg
cccccaaaaggaagagccatctccatggtggccatctgtgaggctatgaccgtctcccac
caggttagaactgctccggtgaaatgggccatggtctcggagtcgtctatgcaccacccc
ccacgacccagctggtcagagctgtgcccccgcctccccgaccacacagtcagcccatcc
acaggctccgtccaagtcctcctgcctatccagccctgcacaaaggagcggggcaggagc
agggtgttctttgattccacagacctccagcgacatctcctctgtgccaggacctcagct
ggccacaggggaaccaagtcagcatgccctagcactgcctatggctctgccagcctggac
actgtcatcccacatccgggtttgcattctgactcggccactctcaaggccatttggccc
cagcaaatgcctggacctaggccccaggtgcagagagaacagggctacgggaggagcaga
ggaatgaacttgtctacacctcatgaaaacctctccacggtgacgatgtga

>gi568815587r:76087068_76306407|GENSCAN_predicted_peptide_7|171_aa
MKVRKTEEGVRGGPRLSTAAGLGPNLQPSINLTVLPTSAAAQTLHFGILAAICRIGSFCQ
AHMALGPERPPRGWRAPEAGLTSSTTGMADPRWDLAPAQTLKPPQASSSSDWPFSFFGHE
PRLREVKRLEEQGSPGDEKEEEKEMTQDRLFWGPPPSRLQGEAFVPKKLYP

>gi568815587r:76087068_76306407|GENSCAN_predicted_CDS_7|516_bp
atgaaggtgaggaagacagaggagggtgtcaggggtggtcccaggctgagcacggcagcg
gggctgggccccaacctgcaaccatccatcaatctgactgtgctgcccacttccgctgct
gcccagacactccactttgggatcctggctgccatctgtaggattggcagcttctgccag
gctcacatggctctgggcccagagaggccccccaggggttggagggccccagaggcaggt
cttacaagctccactacggggatggctgacccgaggtgggacttagcgccagctcagacc
cttaagcccccacaggcctcctcttcatctgattggcctttttctttctttggacacgaa
ccaagactcagagaggtgaagagactggaagaacaaggtagccctggggacgagaaagag
gaggagaaagaaatgacccaggacaggctgttctggggaccacccccatcccgactacag
ggggaggcttttgtacccaagaaattgtatccgtga