GENSCAN 1.0 Date run: 3-Nov-116 Time: 15:37:10 Sequence gi568815589r:107385754_107589052 : 203299 bp : 45.40% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 3315 3452 138 0 0 86 77 64 0.283 4.68 1.02 Intr + 8215 8297 83 1 2 57 48 55 0.010 -2.32 1.03 Intr + 18534 18696 163 1 1 27 93 122 0.057 5.63 1.04 Term + 34352 34436 85 2 1 121 44 51 0.151 1.13 1.05 PlyA + 34500 34505 6 1.05 2.13 PlyA - 34554 34549 6 1.05 2.12 Term - 39874 39596 279 1 0 45 38 174 0.130 3.55 2.11 Intr - 47628 47542 87 0 0 17 115 56 0.000 1.37 2.10 Intr - 72583 72401 183 1 0 48 75 224 0.030 17.08 2.09 Intr - 80388 80256 133 2 1 63 41 63 0.004 -0.25 2.08 Intr - 80646 80521 126 2 0 55 88 45 0.005 0.99 2.07 Intr - 100173 100038 136 1 1 119 52 110 0.056 10.13 2.06 Intr - 101439 101275 165 1 0 117 91 127 0.991 15.83 2.05 Intr - 102514 101542 973 1 1 107 86 1202 0.983 112.45 2.04 Intr - 103012 102932 81 1 0 24 62 104 0.686 1.23 2.03 Intr - 103297 103144 154 0 1 117 80 137 0.605 15.87 2.02 Intr - 105825 105794 32 0 2 121 93 28 0.951 3.53 2.01 Init - 106045 105998 48 1 0 82 69 61 0.658 4.55 2.00 Prom - 106264 106225 40 -4.06 3.03 PlyA - 106573 106568 6 1.05 3.02 Term - 110115 110033 83 1 2 78 44 99 0.861 2.36 3.01 Init - 132012 131925 88 0 1 41 72 84 0.124 2.90 3.00 Prom - 133695 133656 40 -2.46 4.00 Prom + 135319 135358 40 -2.26 4.01 Init + 142854 142958 105 2 0 62 81 90 0.962 5.92 4.02 Term + 149222 149341 120 1 0 -6 47 189 0.694 3.87 4.03 PlyA + 150963 150968 6 1.05 5.03 PlyA - 151200 151195 6 1.05 5.02 Term - 157685 157437 249 2 0 -61 34 449 0.906 20.30 5.01 Init - 179057 178902 156 2 0 65 94 65 0.381 4.79 5.00 Prom - 189874 189835 40 -0.06 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 6727 6911 185 2 2 97 48 104 0.842 5.01 S.002 Init - 72553 72401 153 1 0 103 75 190 0.937 19.18 S.003 Term - 100173 99998 176 1 2 119 35 138 0.943 9.52 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815589r:107385754_107589052|GENSCAN_predicted_peptide_1|156_aa XTVPAVSEKLRKRAADFTQSLLDHLPLPSPWQLRGNTCVSSTVKWRCEREGGKGQPDATE ALVNDCASTLSLNGVLPEGLKLSLCLDPAVNCQEIQRTEEHEELHRKYTISKIQAVDPPQ VKQPGSVNWTSSPALTFKEGLYGHAHQGISRNQNGS >gi568815589r:107385754_107589052|GENSCAN_predicted_CDS_1|471_bp nncacagtcccagctgtttctgaaaagctgcgcaagagagcagctgacttcacccagtct ctgctggaccaccttcctcttccatccccctggcagctaagaggaaacacatgtgtctcc tcaactgtaaaatggagatgtgaaagggagggtggcaagggacagcctgatgctacagag gccctagtaaacgattgtgcatctacactaagtctcaacggtgtcctgccagagggatta aagctgagcctgtgtctagatccagctgtcaattgccaggaaatacaaaggacagaggaa catgaagaactacaccgaaagtacacaatcagcaaaatccaggctgtggaccctccacaa gtcaaacagcctgggtccgtcaactggacttcttctccagccctcaccttcaaagaagga ttatatggtcatgcgcaccaaggcatctccaggaatcagaacggttcatga >gi568815589r:107385754_107589052|GENSCAN_predicted_peptide_2|798_aa MHSKNASVVGEEWKNQVDTSLSASVRRQPPGESDMAVSDALLPSFSTFASGPAGREKTLR QAGAPNNVSIAPGRRERPLCMPVVRALRSRLGRKPGLHQSPGALSRWREELSHMKRLPPV LPGRPYDLAAATVATDLESGGAGAACGGSNLAPLPRRETEEFNDLLDLDFILSNSLTHPP ESVAATVSSSASASSSSSPSSSGPASAPSTCSFTYPIRAGNDPGVAPGGTGGGLLYGRES APPPTAPFNLADINDVSPSGGFVAELLRPELDPVYIPPQQPQPPGGGLMGKFVLKASLSA PGSEYGSPSVISVSKGSPDGSHPVVVAPYNGGPPRTCPKIKQEAVSSCTHLGAGPPLSNG HRPAAHDFPLGRQLPSRTTPTLGLEEVLSSRDCHPALPLPPGFHPHPGPNYPSFLPDQMQ PQVPPLHYQELMPPGSCMPEEPKPKRGRRSWPRKRTATHTCDYAGCGKTYTKSSHLKAHL RTHTGEKPYHCDWDGCGWKFARSDELTRHYRKHTGHRPFQCQKCDRAFSSWGPLPGDSDA RLRRLLRGPERPLAVAGPRTRLIAFGFKVRIGFRSQARFPPFRQVVAKPVPRSVWGLVSE PAPQNSSFLKLRAQVEHPHPAAIVTTMPKRKGEGDAKGDKSKVKDEPQRKSTKLSAKPAP PKPEPKLKKAPAKKGEKAISHSLALLGYKPGQHVTILNYVRNCNTMAWDPGEPKSSLEGE FTGFRVRLVEGTSWWYAWGLHGEEDKGDFQELQRQPTLRASSACAPRGARGNCGPTSPVS LVDFASSDEAGAGLFRGM >gi568815589r:107385754_107589052|GENSCAN_predicted_CDS_2|2397_bp atgcattccaagaatgcttctgtggtcggcgaggagtggaaaaatcaggttgacaccagc ctaagcgccagcgttcgccggcagccacctggcgagtctgacatggctgtcagcgacgcg ctgctcccatctttctccacgttcgcgtctggcccggcgggaagggagaagacactgcgt caagcaggtgccccgaataacgtgagtatcgctccgggccgccgggaacgcccgctgtgt atgcccgtggtgcgagcactgcggagccgcctggggcgcaagccgggtcttcaccagtcc ccgggcgcgctgtcgcgctggcgggaggagctctcccacatgaagcgacttcccccagtg cttcccggccgcccctatgacctggcggcggcgaccgtggccacagacctggagagcggc ggagccggtgcggcttgcggcggtagcaacctggcgcccctacctcggagagagaccgag gagttcaacgatctcctggacctggactttattctctccaattcgctgacccatcctccg gagtcagtggccgccaccgtgtcctcgtcagcgtcagcctcctcttcgtcgtcgccgtcg agcagcggccctgccagcgcgccctccacctgcagcttcacctatccgatccgggccggg aacgacccgggcgtggcgccgggcggcacgggcggaggcctcctctatggcagggagtcc gctccccctccgacggctcccttcaacctggcggacatcaacgacgtgagcccctcgggc ggcttcgtggccgagctcctgcggccagaattggacccggtgtacattccgccgcagcag ccgcagccgccaggtggcgggctgatgggcaagttcgtgctgaaggcgtcgctgagcgcc cctggcagcgagtacggcagcccgtcggtcatcagcgtcagcaaaggcagccctgacggc agccacccggtggtggtggcgccctacaacggcgggccgccgcgcacgtgccccaagatc aagcaggaggcggtctcttcgtgcacccacttgggcgctggaccccctctcagcaatggc caccggccggctgcacacgacttccccctggggcggcagctccccagcaggactaccccg accctgggtcttgaggaagtgctgagcagcagggactgtcaccctgccctgccgcttcct cccggcttccatccccacccggggcccaattacccatccttcctgcccgatcagatgcag ccgcaagtcccgccgctccattaccaagagctcatgccacccggttcctgcatgccagag gagcccaagccaaagaggggaagacgatcgtggccccggaaaaggaccgccacccacact tgtgattacgcgggctgcggcaaaacctacacaaagagttcccatctcaaggcacacctg cgaacccacacaggtgagaaaccttaccactgtgactgggacggctgtggatggaaattc gcccgctcagatgaactgaccaggcactaccgtaaacacacggggcaccgcccgttccag tgccaaaaatgcgaccgagcattttccagctggggtcccctccctggggacagcgatgcc cgcttgcggcggcttctgcgagggccagagcgcccgctggctgtggccgggcccagaaca cgcctgatcgcttttgggttcaaagtcagaatcggtttccgttctcaggccaggtttccc cccttccgccaggtcgtagccaaaccagtccccagatccgtgtggggcctagtgtcggaa cccgcaccccagaatagctcgttcttgaagctccgtgctcaagtggagcacccacatcct gctgccattgtcaccaccatgcccaagagaaagggtgaaggggatgctaaaggagataaa tccaaagttaaggacgaaccacagagaaaatccacgaagttgtctgctaaacctgctcct ccaaagccagagcccaagcttaaaaaggcccctgcaaagaagggagagaaggctatatcg catagcctagcgctcctaggctacaaacctggacagcatgttaccatactgaattatgtg cgcaactgtaacacaatggcctgggaccctggagaacctaagtcatcgctcgagggagag tttactgggttcagggtgaggctggtggaaggaacatcgtggtggtacgcctggggtctc cacggggaagaggacaagggggacttccaggagcttcagcggcagccaacgctgcgagcc tcatctgcgtgcgcccctcgtggcgctcgcgggaactgcgggccgacgtcccctgtttct ctcgttgatttcgccagctctgatgaggccggagcgggcctcttcagaggaatgtag >gi568815589r:107385754_107589052|GENSCAN_predicted_peptide_3|56_aa MPVFMPDAEDENSLGQYVIDGKGIASGGHELFKERDEVVKKEKGWKMKYQTNDPLN >gi568815589r:107385754_107589052|GENSCAN_predicted_CDS_3|171_bp atgcctgttttcatgccagacgctgaggatgaaaactccttggggcagtacgtgatagat gggaagggcattgccagtggaggtcacgaacttttcaaagaaagagatgaagtggtgaag aaagagaagggatggaaaatgaagtaccaaaccaatgaccctctcaactag >gi568815589r:107385754_107589052|GENSCAN_predicted_peptide_4|74_aa MWFQYESNSFHDPLARAPQPSAGGFWKVFWHIQRPKRKPYLKKKRKIEKKYVDEASKKEI KDILIQYDMYPSGS >gi568815589r:107385754_107589052|GENSCAN_predicted_CDS_4|225_bp atgtggttccagtatgagtccaacagcttccatgaccctttagccagggctcctcaacct agtgctggtggcttctggaaggtcttttggcacatccagaggccaaaaagaaaaccctat ctcaaaaagaaaagaaaaatagaaaagaaatatgtcgatgaggcttccaagaaggagatc aaagacatcctcatccagtatgacatgtatccttctggcagctga >gi568815589r:107385754_107589052|GENSCAN_predicted_peptide_5|134_aa MVPLRERAFLKTVHPRSPLAVTTGKARRIIQCDPCLSLLPGDLQGLSLRQAQGGDFTCHN DTGGKFIYGEKFDDENLIRKHKGPGILSMANAGPNTKGFQFFMCTTKTEWSDGKHVVFDK VKDSMDIMDAMEHS >gi568815589r:107385754_107589052|GENSCAN_predicted_CDS_5|405_bp atggtccccctgagagagcgagccttcttaaaaactgtgcaccctaggagtcctttggca gtgaccaccggcaaagcacgtaggatcatccagtgtgacccctgtctgagcctgctgcct ggggatctccagggcctcagcctcaggcaggcccagggtggtgacttcacatgccataat gacactggtggcaaattcatctatggggagaaatttgatgatgagaacctcatccggaag cacaaaggtcctggtatcttgtccatggcaaatgctggacccaacacaaaaggcttccag tttttcatgtgcaccaccaagactgagtggtcggatggcaagcacgtggtctttgacaag gtgaaggacagcatggatatcatggatgccatggagcactcttag