GENSCAN 1.0    Date run: 6-Nov-116    Time: 05:37:18

Sequence gi568815595r:186050093_186420459 : 370367 bp : 44.24% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.08 PlyA -     85     80    6                               1.05
 1.07 Term -   1891   1826   66  1  0   57   42   102 0.965   0.44
 1.06 Intr -   2039   1938  102  2  0  114  105   210 0.999  25.67
 1.05 Intr -   7152   6983  170  1  2   76   69   167 0.999  13.27
 1.04 Intr -   7399   7331   69  2  0   85  115    90 0.999  10.55
 1.03 Intr -  14426  14325  102  0  0  103  116    37 0.946   8.15
 1.02 Intr -  15934  15721  214  1  1  108   76   115 0.940  10.59
 1.01 Init -  17033  17010   24  2  0   86   77     0 0.720  -2.32
 1.00 Prom -  18117  18078   40                              -3.46

 2.04 PlyA -  19727  19722    6                               1.05
 2.03 Term -  29912  29756  157  1  1   72   44   117 0.623   3.11
 2.02 Intr -  31083  30954  130  1  1   95   97   142 0.973  15.55
 2.01 Init -  38065  37963  103  1  1   77   23   131 0.745   5.90
 2.00 Prom -  39723  39684   40                              -6.46

 3.00 Prom +  46275  46314   40                              -2.36
 3.01 Init +  52702  52757   56  0  2   59  106    61 0.872   6.08
 3.02 Intr +  58200  58322  123  0  0   97   85    34 0.699   3.70
 3.03 Term +  58877  59084  208  2  1  110   41   166 0.755  10.91
 3.04 PlyA +  60139  60144    6                               1.05

 4.31 PlyA -  61243  61238    6                              -0.45
 4.30 Term -  62286  61987  300  0  0   37   54   132 0.840  -0.08
 4.29 Intr -  64076  63938  139  1  1   77  100   107 0.941  11.27
 4.28 Intr -  85284  85229   56  0  2   94   72    55 0.128   2.28
 4.27 Intr - 111571 111511   61  0  1   75  113    47 0.683   4.64
 4.26 Intr - 114926 114806  121  0  1   91   85    90 0.209   8.55
 4.25 Intr - 138287 138110  178  2  1   85  121   216 0.998  24.09
 4.24 Intr - 139901 139762  140  0  2   26   33    92 0.052  -2.32
 4.23 Intr - 154918 154774  145  1  1   63   91    81 0.410   5.76
 4.22 Intr - 161793 161703   91  2  1   58  107   154 0.051  14.30
 4.21 Intr - 186032 185892  141  1  0   65   67    90 0.161   4.07
 4.20 Intr - 192476 192412   65  2  2   91  115    28 0.293   3.32
 4.19 Intr - 201827 201667  161  0  2   73   89   226 0.745  20.91
 4.18 Intr - 203090 203001   90  0  0  110  105    39 0.991   7.77
 4.17 Intr - 207847 207762   86  0  2   91   99    38 0.956   4.66
 4.16 Intr - 210421 210347   75  0  0  128   51   112 0.996  10.13
 4.15 Intr - 211686 211607   80  0  2   53  102    43 0.940   0.55
 4.14 Intr - 215214 215155   60  0  0   90   82    82 0.970   6.73
 4.13 Intr - 217685 217593   93  2  0  120  114    48 0.999  10.76
 4.12 Intr - 218825 218709  117  2  0  109   76   119 0.562  13.46
 4.11 Intr - 222251 222163   89  0  2  103   73    40 0.489   3.69
 4.10 Intr - 225572 225455  118  2  1  101   70   133 0.898  12.84
 4.09 Intr - 229881 229759  123  0  0   78   84   161 0.999  15.48
 4.08 Intr - 230652 230578   75  0  0  114   76    52 0.983   6.31
 4.07 Intr - 234617 234568   50  0  2   77  115    77 0.884   7.70
 4.06 Intr - 234875 234837   39  0  0   96   40    60 0.532   0.20
 4.05 Intr - 238788 238618  171  1  0   89   84   107 0.952  10.31
 4.04 Intr - 247391 247329   63  0  0  126  119    76 0.987  13.19
 4.03 Intr - 248137 247972  166  1  1  106   86   160 0.989  17.13
 4.02 Intr - 256885 256809   77  2  2   77   94    64 0.630   5.13
 4.01 Init - 270367 270301   67  1  1   59   82    28 0.009   0.53
 4.00 Prom - 281754 281715   40                              -2.86

 5.00 Prom + 286035 286074   40                              -2.36
 5.01 Init + 301216 301222    7  0  1   63  116     0 0.223   1.47
 5.02 Term + 305185 305711  527  2  2   -1   39   440 0.426  24.63
 5.03 PlyA + 306035 306040    6                               1.05

 6.04 PlyA - 306139 306134    6                               1.05
 6.03 Term - 315469 315453   17  2  2  119   54     3 0.225  -1.60
 6.02 Intr - 315729 315677   53  2  2   88  101    54 0.206   5.25
 6.01 Init - 322821 322745   77  0  2   54   89    70 0.134   4.26
 6.00 Prom - 329062 329023   40                              -3.26

 7.00 Prom + 329707 329746   40                              -2.46
 7.01 Init + 332661 332754   94  2  1   48  100    50 0.034   2.74
 7.02 Intr + 349251 349325   75  1  0  110  105    55 0.216   8.89
 7.03 Term + 367035 367111   77  1  2   77   36   106 0.165   2.30
 7.04 PlyA + 369283 369288    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init +  97656  97823  168  2  0   95   37   109 0.882   6.03
S.002 Init - 161792 161703   90  2  0   74  107   148 0.945  15.79
S.003 Term - 335219 334832  388  1  1  102   47   174 0.913   9.11

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815595r:186050093_186420459|GENSCAN_predicted_peptide_1|248_aa
MGFWHVGQPGVPGDNRPSYHRQMSEPIVPAAPPPPQGFKQEYHDPLYEHGVPGMPGPPAH
GFQSPMGIKQEPRDYCVDSETIDRIVSTFAFLSEVPNCQSSYMRGGYFSSSHEGFSYEKD
PRLYFDDTCVVPERLEGKVKQEPTMYREGPPYQRRGSLQLWQFLVTLLDDPANAHFIAWT
GRGMEFKLIEPEEVARRWGIQKNRPAMNYDKLSRSLRYYYEKGIMQKEKVQGFCSETGDR
RYEEILAP

>gi568815595r:186050093_186420459|GENSCAN_predicted_CDS_1|747_bp
atggggttttggcacgttggccagccaggagttcctggagataatcgccccagttaccat
cggcaaatgtcagaacctattgtccctgcagctcccccgccccctcagggattcaaacaa
gaataccatgacccactctatgaacatggggtcccgggcatgccagggcccccagcacac
gggttccagtcaccaatgggaatcaagcaggagcctcgggattactgcgtcgattcagaa
actattgacaggattgtgtccacctttgctttcctttcagaagtgcctaactgccagtca
tcctacatgagagggggttatttctccagcagccatgaaggtttttcatatgaaaaagat
ccccgattatactttgacgacacttgtgttgtgcctgagagactggaaggcaaagtcaaa
caggagcctaccatgtatcgagaggggcccccttaccagaggcgaggttcccttcagctg
tggcagttcctggtcacccttcttgatgacccagccaatgcccacttcattgcctggaca
ggtcgaggcatggagttcaagctgatagaaccggaagaggttgctcggcgctggggcatc
cagaagaaccggccagccatgaactatgacaagctgagccgctctctccgctattactat
gaaaagggcatcatgcagaaggaaaaagtccagggcttctgtagtgagactggtgatcgt
cgctacgaggaaattttagccccctaa

>gi568815595r:186050093_186420459|GENSCAN_predicted_peptide_2|129_aa
MSLGQMKVNETEEGTRKTPKQNVKDGNCWKAVKALVLHAPPPTKIKRELHSPSSELSSCS
HEQALGANYGEKCLYNYWQLCPPQGMPLQLAQFKVWAPPPPPIRFQSLDHSSKHLRSPDH
HISPCRCQR

>gi568815595r:186050093_186420459|GENSCAN_predicted_CDS_2|390_bp
atgagtctgggccagatgaaggtaaatgaaactgaagaagggacaaggaaaaccccaaaa
cagaacgtgaaggatggcaactgctggaaggcggtcaaagccttggtgcttcatgcccca
cctccaaccaagatcaaacgggagctgcacagcccctcctctgagctgtcgtcttgtagc
catgagcaggctcttggtgctaactatggagaaaagtgcctctacaactattggcaactc
tgcccacctcagggcatgcccctgcagctggcccagttcaaggtgtgggccccgcccccg
ccccccattcgcttccagagcctggaccacagcagcaaacatttgcggtcccccgaccac
cacatcagcccctgcagatgccaaagatga

>gi568815595r:186050093_186420459|GENSCAN_predicted_peptide_3|128_aa
MTGVKLVLNSRLWKALMEKPLFRCAPIFSSRRTAGRPQPAGWEQGDWMSTFHHLKLARPA
GPGGAGALTHARSGPGRLGERLGRTAPKTVNRSAHQRLDSAPQLPTRLERWPGPPRLAAP
PAAIGPRP

>gi568815595r:186050093_186420459|GENSCAN_predicted_CDS_3|387_bp
atgactggagtgaagctggtcctgaactcgaggctgtggaaagcgctcatggaaaagccg
ctgttccggtgcgccccgattttctcgagtcgccgcacagccggacgccctcagcctgct
gggtgggagcagggcgactggatgtccactttccaccacctgaaacttgcaaggcctgcc
ggccccggcggcgcaggcgcgctcacgcacgcgcgcagcgggcctgggcgcctgggcgaa
aggctgggccgaaccgctccgaagacggtgaaccgctccgcgcaccagcggctggactct
gcgccgcagctgccaaccaggctggagcgctggcccggccccccgcggctcgctgctccg
cccgccgccattggcccgcggccctag

>gi568815595r:186050093_186420459|GENSCAN_predicted_peptide_4|1078_aa
MGEERWVSLTPEEFDQLQKYSEYSSKKIKDALTEFNEGGSLKQYDPHEPISYDVFKLFMR
AYLEVDLPQPLSTHLFLAFSQKPRHETSDHPTEGASNSEANSADTNIQNADNATKADEAC
APDTESNMAEKQAPAEDQVAATPLEPPVPRSSSSESPVVYLKDVVCYLSLLETGRPQDKL
EWNYTHDLDFISHIVMFRLYDSDENGLLDQAEMDCIVNQMLHIAQYLEWDPTELRPILKE
MLQGMDYDRDGFVSLQEWVHGGMTTIPLLVLLGMDDSGSKGDGRHAWTMKHFKKPTYCNF
CHIMLMGVRKQGLCCTYCKYTVHERCVSRNIPGCVKTYSKAKRSGEVMQHAWVEGNSSVK
CDRCHKSIKCYQSVTARHCVWCRMTFHRKCELSTLCDGGELRDHILLPTSICPITRDRPG
EKSDGCVSAKGELVMQYKIIPTPGTHPLLVLVNPKSGGRQGERILRKFHYLLNPKQVFNL
DNGGPTPGLNFFRDTPDFRVLACGGDGTVGWILDCIDKANFAKHPPVAVLPLGTGNDLAR
CLRWGGGYEGGSLTKILKDIEQSPLVMLDRWHLEVIPREEVENGDQVPYSIMNNYFSIGV
DASIAHRFHVMREKHPEKFNSRTEALEEIPFTGSMEVQEAEVGDLDTRDSKSSAKSRLAG
SREGVIDPRMKNKLWYFEFGTSETFAATCKKLHDHIELECWDYMHEPLHLAWFVTFNCPM
EFHENPATIRDEEQKWELSDTAGGNISDGSAEYMCTMVIFIIIIIYLGGGLWRRKEGHSV
WVQKGEQDQLVETRCDGVGVDLSNIFLEGIAILNIPSMYGGTNLWGENKKNRAVIRESRK
GVTDPKELKFCVQDLSDQLLEVVGLEGAMEMGQIYTGLKSAGRRLAQCASVTIRTNKLLP
MQVDGEPWMQPCCTNGKKFYAFMLLCTINLKVRGFHEDKSLPAQDNNSEVWLELLKQVPK
EAQQLSLAWQASSQSPENKEGSLNVIDKELLNTTFNSYKGKDNKRRGAGEVNHKEKKEEE
WREDISKYLKMRSPPWAQELAQVIAGCTLTFNAAAKMAVPRAVPLVQHKPPNSQNQRL
>gi568815595r:186050093_186420459|GENSCAN_predicted_CDS_4|3237_bp
atgggtgaagaacggtgggtctccctcactccagaagaatttgaccaactccagaaatat
tcagaatattcctccaagaagataaaagatgccttgactgaatttaatgagggtgggagc
ctcaaacaatatgacccacatgagccgattagctatgatgtcttcaagctgttcatgagg
gcgtacctggaggtggaccttccccagccactgagcactcacctcttcctggccttcagc
cagaagcccagacacgagacctctgaccacccgacggagggagccagcaacagtgaggcc
aacagcgcagatactaatatacagaatgcagataatgccaccaaagcagacgaggcctgt
gcccctgatactgaatcaaatatggctgagaagcaagcaccagctgaagaccaagtggct
gcgacccccctggaaccccccgtccctcggtcttcaagctcggaatccccagtggtatac
ctgaaggatgttgtgtgctacctgtccctgctggagacggggaggcctcaggataagctg
gagtggaattacactcatgacctggacttcatcagccacatagtcatgtttcgcctctat
gattcagatgagaacggtctcctggaccaagcggagatggattgcattgtcaaccaaatg
ctgcatattgcccagtacctggagtgggatcccacagagctgaggcctatattgaaggag
atgctgcaagggatggactacgaccgggacggctttgtgtctctacaggaatgggtccat
ggagggatgaccaccatcccattgctggtcctcctggggatggatgactctggctccaag
ggggatgggcggcacgcctggaccatgaagcacttcaagaaaccaacctactgcaacttc
tgccatatcatgctcatgggcgtccgcaagcaaggcctgtgctgcacttactgtaaatac
actgtccacgaacgctgtgtgtccagaaacattcctggttgtgtcaaaacgtactcaaaa
gccaaaaggagtggtgaggtgatgcagcacgcatgggtggaagggaactcctccgtcaag
tgtgaccggtgccacaaaagtatcaagtgctaccagagtgtcaccgcgcggcactgcgtg
tggtgccggatgacgtttcaccgcaaatgtgaattatcaacgttgtgtgacggtggggaa
ctcagagaccacatcttactgcccacctccatatgccccatcacccgggacaggccaggt
gagaagtctgatggctgcgtgtccgccaagggcgaacttgtcatgcagtataagatcatc
cccaccccgggtacccaccccctgctggtcttggtgaaccccaagagtggagggagacaa
ggagaaagaattcttcggaaattccactatctgctcaaccccaaacaagttttcaacctg
gacaatggggggcctactccagggttgaactttttccgtgatactccagacttccgtgtt
ttggcctgtggtggagatgggacagttggctggattttggattgcattgataaggccaac
tttgcaaagcatccaccagtggctgtcctgcctcttggaacaggaaatgaccttgcccgt
tgtctccgctggggaggaggttatgaagggggcagcttgacaaaaatcctgaaagacatt
gagcagagccccttggtgatgctggaccgctggcatctggaagtcatccccagagaggaa
gtggaaaacggggaccaggtcccatacagcatcatgaacaactatttctccattggtgtg
gacgcttccattgcacacagattccatgtgatgagagagaaacatcctgaaaaattcaac
agcaggacagaagctctagaggaaatccccttcactggctccatggaagtccaagaggca
gaggttggggatctggacactagagattcgaagtccagtgccaaatccaggctagcagga
tcaagggagggggtcatagaccccaggatgaagaacaagctgtggtactttgaatttggc
acctcggagacttttgcagcgacctgcaagaaactccacgaccacattgagttggagtgc
tgggattacatgcatgagccactgcacctggcctggttcgtcacttttaactgccctatg
gagttccatgaaaacccagccactatccgtgacgaggagcagaaatgggaactctcagac
actgctggtgggaacatcagtgatggcagtgctgaatacatgtgcacaatggtgatcttc
atcatcatcatcatctatctagggggtggcctatggaggaggaaggagggtcattctgta
tgggttcagaagggagaacaggaccaactggtggaaacaagatgtgatggggttggggtg
gacctgagcaacatcttcctggaaggcattgccattctcaacattcccagcatgtacgga
ggcaccaatctctggggagaaaacaagaagaaccgggctgtgatccgggaaagcaggaag
ggtgtcactgaccccaaagaactgaaattctgcgttcaagacctcagtgaccagctcctt
gaagtggtggggctagaaggagccatggagatggggcagatctacaccggcctgaagagt
gcaggcaggaggctggcccagtgcgcctctgtcaccatcaggacaaacaagctgctgcca
atgcaagtggatggagaaccctggatgcagccatgttgcacgaatgggaagaagttttat
gcctttatgcttctctgtaccatcaacctcaaagttcgtggctttcatgaagacaaatcc
ttgccagcccaggacaataattcagaagtctggctggagttgctgaagcaggttcctaag
gaagcacagcagctctcactggcatggcaagcaagcagccaaagccctgagaacaaggag
ggaagcctcaacgtaatagacaaagaacttctaaatactacctttaactcttataaaggc
aaagataacaaaagaagaggagccggggaggtaaatcataaagagaagaaggaagaagag
tggagagaagatatcagtaaatatttaaaaatgaggagcccaccctgggcccaggagctg
gctcaggtcatcgcaggctgtactctgacttttaatgctgcagccaagatggctgtgcca
agagctgtaccccttgtccagcacaaacctccaaattcccagaatcagagactctga

>gi568815595r:186050093_186420459|GENSCAN_predicted_peptide_5|177_aa
MEARSSSPPLLSPPTTTTITTITTTNNTTPTTMIITTTTTTITTITTTTTTMNTTPTTMI
ITTIIITPTTTTITTITTTTITSTATTTITTITTTTTLPPLPPPPSPHHYHHHNNHPYYC
HHYHQHHHQHHCPTTTIITPLSPSPAPAPAPASLSYHQHHHHHYHHHHHHYHHLHEK

>gi568815595r:186050093_186420459|GENSCAN_predicted_CDS_5|534_bp
atggaagcacgatcatcatcaccaccattattgtcaccccccaccaccactaccatcacc
actatcaccaccaccaacaacactactcctaccaccatgatcatcaccaccaccaccact
accatcaccactatcaccaccaccaccaccaccatgaacactactcccaccactatgatc
atcaccaccattatcataacccccacaaccactaccatcaccactatcaccaccaccacc
atcacctccactgccactaccaccatcaccactatcaccaccaccaccaccttaccacca
ttacccccaccaccatcaccccaccactaccaccaccacaataatcacccctactactgc
caccactatcaccagcaccaccaccagcatcactgtcctaccactaccatcattacccca
ttatcaccatcaccagcaccagcaccagcaccagcatcactgtcctaccaccagcatcac
caccaccattaccaccatcatcaccaccactaccatcaccttcatgagaaataa

>gi568815595r:186050093_186420459|GENSCAN_predicted_peptide_6|48_aa
MRRSSKVEGAWVLTPNRTESTSCYPSGDQASAAGLDDVFPEQHGLFSG

>gi568815595r:186050093_186420459|GENSCAN_predicted_CDS_6|147_bp
atgaggagaagcagcaaggtggaaggagcctgggtcctgacgcccaacaggactgagtct
acttcctgctaccccagtggagatcaggcatcagctgctggcttggatgatgtgttccca
gagcaacatggtcttttctctggctga

>gi568815595r:186050093_186420459|GENSCAN_predicted_peptide_7|81_aa
MTVIVAVVEVVVVRKALANICRTLENLHVPGSSHGNPIKLPKRSGEQEASKTKMEQVPIA
KEKNVHVYVCNYACVFMSQYV

>gi568815595r:186050093_186420459|GENSCAN_predicted_CDS_7|246_bp
atgacagtgattgttgctgtggtggaggtagttgtagtaagaaaagcattagctaacatt
tgtagaacactggaaaatcttcacgtgccaggaagttcccatggcaaccccattaaattg
cccaaacgcagtggtgagcaggaggcaagtaaaaccaagatggagcaagtgcctatcgcc
aaggaaaaaaatgtccatgtctatgtctgcaactatgcctgtgtcttcatgtctcaatac
gtgtaa