GENSCAN 1.0 Date run: 3-Nov-116 Time: 23:57:32 Sequence gi568815588r:14421193_14630459 : 209267 bp : 42.79% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 4419 4487 69 1 0 41 107 74 0.213 2.84 1.02 Term + 8267 8388 122 0 2 103 48 55 0.649 0.66 1.03 PlyA + 9692 9697 6 1.05 2.00 Prom + 24029 24068 40 -4.75 2.01 Init + 26934 27315 382 2 1 57 47 336 0.387 23.48 2.02 Intr + 30458 30567 110 0 2 30 93 18 0.522 -4.42 2.03 Intr + 32068 32166 99 0 0 78 55 73 0.655 2.39 2.04 Intr + 33488 33691 204 1 0 85 66 133 0.858 9.27 2.05 Term + 38872 39036 165 0 0 109 42 81 0.577 2.53 2.06 PlyA + 39482 39487 6 1.05 3.25 PlyA - 44710 44705 6 1.05 3.24 Term - 46110 46043 68 1 2 127 39 67 0.312 2.82 3.23 Intr - 52904 52676 229 2 1 83 76 109 0.420 5.72 3.22 Intr - 55419 55282 138 0 0 65 3 131 0.668 2.04 3.21 Intr - 57181 57100 82 0 1 72 37 98 0.437 1.82 3.20 Intr - 60393 60238 156 2 0 3 77 131 0.170 1.70 3.19 Intr - 60675 60609 67 1 1 74 115 15 0.120 -0.16 3.18 Intr - 64914 64794 121 0 1 61 68 98 0.572 4.25 3.17 Intr - 67823 67686 138 2 0 125 44 92 0.878 8.24 3.16 Intr - 75284 75146 139 1 1 28 42 110 0.036 -0.05 3.15 Intr - 99469 99399 71 1 2 89 76 80 0.056 3.96 3.14 Intr - 100114 100055 60 1 0 49 62 87 0.362 0.21 3.13 Intr - 100827 100677 151 2 1 89 110 181 0.789 19.64 3.12 Intr - 109323 109140 184 1 1 97 93 161 0.988 15.42 3.11 Intr - 111689 111578 112 2 1 89 28 111 0.974 4.23 3.10 Intr - 112924 112662 263 2 2 75 36 120 0.043 1.78 3.09 Intr - 124266 124141 126 1 0 68 53 140 0.153 8.23 3.08 Intr - 126407 126281 127 2 1 21 61 178 0.607 7.73 3.07 Intr - 127280 127079 202 1 1 53 73 59 0.186 -0.73 3.06 Intr - 130800 130640 161 0 2 25 82 122 0.108 3.16 3.05 Intr - 132195 132130 66 0 0 92 83 34 0.119 1.48 3.04 Intr - 135445 135363 83 2 2 86 48 53 0.114 -0.46 3.03 Intr - 138271 138126 146 0 2 63 94 76 0.000 4.71 3.02 Intr - 145364 145100 265 0 1 71 66 154 0.000 7.15 3.01 Init - 163256 163181 76 2 1 84 15 126 0.074 6.20 3.00 Prom - 166150 166111 40 -7.05 4.03 PlyA - 166174 166169 6 1.05 4.02 Term - 177112 176979 134 2 2 23 42 159 0.545 2.07 4.01 Init - 179197 179080 118 1 1 47 92 113 0.591 8.03 4.00 Prom - 185463 185424 40 -4.05 5.03 PlyA - 186084 186079 6 1.05 5.02 Term - 188822 188731 92 0 2 94 47 109 0.263 4.30 5.01 Init - 201963 201858 106 0 1 64 86 48 0.027 2.63 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 112420 112337 84 1 0 55 103 44 0.917 3.55 S.002 Term - 124266 124100 167 1 2 68 43 151 0.828 5.70 S.003 Init + 148428 148497 70 2 1 81 78 32 0.832 2.76 S.004 Term + 149847 150055 209 1 2 69 49 189 0.887 9.62 S.005 Term - 202819 202701 119 2 2 83 49 118 0.830 5.12 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815588r:14421193_14630459|GENSCAN_predicted_peptide_1|63_aa XKPAQVRVILWDTILHADTDIQPVTKNITHNKNIHSFQSGHFAFTTDVLLLVNNCAVAKA AWS >gi568815588r:14421193_14630459|GENSCAN_predicted_CDS_1|192_bp ncgaagcctgctcaagtaagggtgattctctgggacaccatcctccacgcagacacggac attcagcctgtgactaagaatataactcacaacaagaacatccattcttttcagtctgga cactttgccttcaccacggatgtgctgctcctggttaataattgtgctgtagctaaggct gcttggtcctga >gi568815588r:14421193_14630459|GENSCAN_predicted_peptide_2|319_aa MHTEGAMLADSITHYGIRARGPLLPVCPVTAERRDIRTRCPTGLSHTVVLKQVSSHLSPP WSAAAGDTFREVQVHHSSRAWLGKPLPTAVLNRSPVPVLGPWCHVAGAPAGPSLTPTLPA LPREDTAGVSAFASSPPLSVLPPPCKKCLLPSAVILTPPQPCGTITDYLFCFGGAAGREV RTSFFRELGSSDCSGKWKAKSKNRHISILIARSNANAATDMVRRICPLQSLPQLHFQHLA QAAISLTWAATEAVQEPPLPCRPELVVTWYSFLNKYENREGHSQKFSPPLKADLAKIDAS RYCHREWHGTGFDFKAENK >gi568815588r:14421193_14630459|GENSCAN_predicted_CDS_2|960_bp atgcacactgaaggagcaatgttggctgactccatcacccattatggcatccgagcccga ggccctcttctgccagtctgtcctgttactgctgaaagaagggacatcagaactcgatgc cctaccggactctctcacactgtggtattgaagcaggtgtcctctcatttgtccccaccg tggtctgctgctgcaggcgacaccttcagggaggtccaagtccaccatagcagcagggcc tggctgggcaagcccctgcccacagctgtcctcaacaggtccccagtccctgtgcttggc ccatggtgccatgttgcaggtgctcctgcagggccttctctgacacccaccctccctgct cttccccgtgaggacactgcaggggtttctgcttttgcctcctctcctcctctttctgtc ttgccaccaccatgtaagaagtgccttctgccttcggccgtgattctgacgcctccccag ccatgtggaactataacagattacctcttctgtttcggaggtgcagctggaagagaagta aggacgtccttctttcgtgagctgggatccagtgactgttcagggaaatggaaagcgaag agtaagaaccggcacatcagtatcctcatagcaagaagtaatgctaatgccgccaccgac atggtgagacggatttgcccccttcaaagccttccgcagctccacttccaacacctggcc caagccgccatctctctcacctgggctgctacagaggctgttcaagaacctcccctcccc tgcaggccagagctagttgttacctggtattctttccttaataaatatgagaacagagaa ggacacagccagaaattttctccaccacttaaagcagatctggcaaagatagatgcatca agatactgccatcgagaatggcatggcactgggtttgattttaaggcagaaaataaataa >gi568815588r:14421193_14630459|GENSCAN_predicted_peptide_3|1076_aa MERKTLSAEGHGVRKHNSHRQLDLGGSHQAAVQVGARAVVSSEARLRKSVSRLPQVMGRI HFPAVVGRGSPAVPCRTGPYFLSAGKDGRLQHDGFTGVTSHHLCYLLLLEAGDREPGDLN LPGMVSIRMEALWQVCQDHLSPCPWYSDLGAGKAGLMKAGCAGCLVKPGVACGLDGPGRS QVRGSSTEHTENAVETNFVDRETVGTALQSILKAGLREAKCVAKVDRKHSPTAVGLKYEK LVSQATKKVLKPFWGRTGENLIKPERGALRFQLLAARCCYLRPTHPHRPVYKGTSTSPGA GRRCSRKSIWSLRPPCASPWTPRVSVFPVVDPREMLTGAFQSVDTQKYSKYAKIGKALKS ETLLVFRIRDAQPVVTFKDVNQVFIKRYINGDVSDRAQDTPSPQINSFVTEALNHYEAPV VKKDKPVKKKEDVCPCVDSIYIYFTTLSHGKTSSWAEILNALHALLCSHKCLKSQPQPPP PAPARSARVSLIGDRLGCPSPHWSYLLPKCNNEVKVRGSCLFGFGIVEIIYGEGQKYGAF YGNSPPEDIDHKDSYLITRSIMAEPDYIEDDNPELIRPQKLINPVKTSRNHQDLHRELLM NQKRGLAPQNKPELQKVMEKRKRDQVIKQKEEEAQKKKSDLEIELLKRQQKLEQLELEKQ KLQEEQENAPEFVKLTPEKERECRAQPLPLLALNLEIGSLKSNSEVTPIGGKSLRVMEAD LAVKPLILAGALSLTGCSLLTFSMLPSGTECTASVNWPQPVYGQQPLIRTKLLKSVSSPI KAVVMAGGMGELKAYVHTKTNAQMFLATFFIIDKICKQPGCSSIDEWINRADMILLIPAI ALCHSVIFQCDECIWAMPRVSHLASLAGDQHAGTTGSGIPPGKRHKAPPTPPPPEKKLVR KEGRRDRAPPSHEAPFIIHTQLASKRDERLGGRHNSEVFLDVRVKDQEHLFITSGLKLRV EKGSKSDSGILVNLGIGNNGEDLRISQLTRLHGPSLHLQGQVLTFRGSWFWKRDHNKKHA PRPLKRHLPQGGGQRNGEVRAEALPQDPANVCCQGPKEESTCSTYSPYVPGSLLLP >gi568815588r:14421193_14630459|GENSCAN_predicted_CDS_3|3231_bp atggagcgaaagactctgagtgccgagggccatggggtgcgaaagcataattctcatcgg cagctggacctcggaggctctcaccaggctgctgtccaggtgggcgccagggctgtggtc tcatcagaggctcgactgaggaagagtgtttccaggctccctcaggtcatgggcagaatt catttccctgcagttgtgggacgtgggtcgcctgcagttccatgccgcacaggtccttac ttcctttcagctggcaaggatggaagattgcagcatgatggcttcacaggagtgacatct catcacctttgctatcttctgttgttagaagcaggtgacagagaaccgggggatttaaat ctccctggtatggtttccattagaatggaagctctttggcaggtctgccaggaccattta tcaccttgtccctggtactcagacctcggtgctggcaaagctgggctcatgaaggctggc tgcgcaggatgccttgtgaagccaggtgtggcttgtggcctagatggaccagggagatct caagtgaggggcagtagcacggagcacacagagaatgcagttgaaactaattttgtggat agagaaactgttggaactgctttacaaagtattttaaaggcaggtctccgagaggctaaa tgtgtggcaaaagtagacaggaagcacagtccaaccgctgtagggttgaaatatgaaaag ctagtgtcacaggccaccaagaaggttctgaagcctttttgggggagaactggggagaat ttgataaagccagaaagaggagccctgcgcttccagctacttgcagccaggtgctgttac ctacgccccacgcaccctcacagacctgtgtacaagggcacaagtacctcccctggagct ggcaggaggtgcagtcggaagagcatctggtccctgaggcctccctgtgcctccccctgg acccccagggtgtccgtgttcccagttgttgaccctcgggaaatgctcaccggagcattt cagagtgtagatactcagaaatattccaaatatgctaaaattggaaaagctttgaaatct gaaacacttctggtgtttcggataagggatgctcaacctgtagtgaccttcaaagatgtc aaccaggtattcatcaagagatatattaatggggatgttagtgatagagcccaggacacc cccagcccccagatcaatagctttgtaacagaagcgctaaaccactatgaagctccagtt gtgaagaaggacaaaccagtgaaaaagaaggaagatgtttgcccctgtgttgacagcata tacatatacttcaccacactgtcccacggaaaaacatcctcctgggctgaaatcctaaat gcactacacgcacttctgtgcagccacaagtgtctgaaaagtcaaccgcagcccccccct cctgcaccggctcgcagtgcccgtgtctctctaataggagacaggcttgggtgcccctca ccccactggtcctatctcctccccaaatgtaataacgaggtcaaggtcagaggcagttgt ctgtttgggtttggcatcgtggagattatctatggtgaagggcagaaatacggagccttt tatggaaacagcccacctgaggatattgaccataaggactcatatctcattacaagaagc atcatggccgagccagactacatagaagatgacaatcctgaactcattaggcctcagaaa ctgatcaatcctgtaaaaacctcccggaaccatcaagatcttcacagagaacttcttatg aatcaaaaaaggggtcttgctcctcagaacaaaccagaattgcagaaggtgatggaaaaa agaaaacgagaccaagtaataaagcagaaggaagaagaagcacagaagaagaaatctgac ttggaaatagagctattaaaacggcagcagaagttggagcagcttgaacttgagaagcag aaattgcaagaagagcaagaaaatgcccccgagtttgtgaagttaacaccagagaaggaa cgtgaatgtcgagcacagccactacccttgttggcacttaatttagaaataggatcttta aaaagcaacagtgaagtgactcctattgggggcaaatctctccgggtaatggaagctgac ctagcagtcaaacccttgattctggctggggctctgtccctcactggctgttccctattg accttctcaatgcttccctctggcactgagtgcacagcctctgtaaactggccacaacca gtctacggtcagcagcctctgatcaggacaaagttactgaaatcagtctcttctccaatc aaagctgtagtgatggctggtgggatgggggaattgaaagcttatgtccacacaaaaacc aatgcacagatgtttctggcaactttcttcataattgacaaaatttgcaagcaaccagga tgttcttccatagatgaatggataaaccgtgctgacatgattctattaataccagcaatc gctctttgccattctgttattttccagtgtgatgaatgcatttgggcaatgccaagggtt agccatcttgctagccttgcgggtgatcaacatgctggtactactggttctggaattcct ccagggaagaggcataaggcacccccaacacctccacccccagaaaaaaagcttgtgagg aaggaaggaaggagggacagggctcctcccagccatgaagcacctttcattattcacacg caactagccagtaaaagagacgagagactcggtgggagacacaactcagaggtctttctc gatgttcgagtcaaggaccaggaacacctgttcatcactagtggcttgaaacttcgggtg gagaaaggctcaaaatctgacagtggcattcttgtcaatttagggataggcaacaatggt gaggacctgaggatttctcagctcacacggctccacgggccatctctccacctccaggga caagttctcactttcaggggttcctggttttggaagagggatcacaataagaaacatgca ccaagacccttaaagcgacatctaccccaggggggtgggcagaggaacggggaagttaga gcagaggcactcccgcaggaccctgcaaatgtttgttgccagggacccaaggaggagtcc acgtgctccacttacagcccctatgttccaggctcgctccttctcccctag >gi568815588r:14421193_14630459|GENSCAN_predicted_peptide_4|83_aa MRWTVQFCLEFGCGVRTLCRLGLKVDRETVWFCPSVLAEAGSTKLLEENIGEEVQDTGLG SNYAAATLKAQSTNPKLNQIASN >gi568815588r:14421193_14630459|GENSCAN_predicted_CDS_4|252_bp atgaggtggactgttcagttctgtctggaatttggctgtggggttcggacactttgtcgc ttgggattgaaggtggatcgggaaaccgtatggttctgtcccagtgtccttgctgaagct ggttccacaaaactcctagaagaaaacataggggaagaggttcaggacactggtcttggc agtaattacgcggctgccacgctaaaagcacagtcaacaaacccaaaactaaaccagatt gcatcgaactaa >gi568815588r:14421193_14630459|GENSCAN_predicted_peptide_5|65_aa MNTKRKRVVIIKIKSIALTRMNMSSLKQNAVKLGVVFRSSGVNEDLVTTERTMWEFGYYS IRIPS >gi568815588r:14421193_14630459|GENSCAN_predicted_CDS_5|198_bp atgaacaccaaaagaaagagggttgtgataatcaaaatcaagtcaattgctttgacaaga atgaacatgagcagcttaaaacaaaatgctgtcaaattaggtgtggtattcaggtcatca ggggtcaatgaagacttggtaaccactgaacggaccatgtgggaattcggttattactca ataagaattccttcttag