GENSCAN 1.0 Date run: 3-Nov-116 Time: 12:48:05

Sequence gi568815589f:99728043_99963864 : 235822 bp : 39.04% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.06 PlyA -   1105   1100    6                               1.05
 1.05 Term -  11211  10945  267  0  0   44   35   175 0.208   2.31
 1.04 Intr -  16690  16644   47  2  2  113   47    52 0.027   0.81
 1.03 Intr -  26324  26172  153  0  0   90   56    56 0.036   1.72
 1.02 Intr -  32888  32653  236  1  2   22   11   234 0.116   5.51
 1.01 Init -  53486  53245  242  2  2   59   42   146 0.015   4.49
 1.00 Prom -  76084  76045   40                              -0.35

 2.00 Prom +  82310  82349   40                              -4.45
 2.01 Init +  93265  93603  339  0  0   19   12   247 0.037   7.51
 2.02 Intr +  93893  94117  225  1  0   89   59   131 0.034   7.56
 2.03 Intr +  94547  94754  208  1  1   71   21   180 0.052   7.33
 2.04 Intr +  95043  95219  177  1  0   67   69   200 0.630  14.97
 2.05 Intr +  97617  97790  174  1  0   90   96    78 0.833   7.79
 2.06 Intr +  98688  98759   72  1  0  103   86    12 0.575   0.96
 2.07 Intr +  99999 100255  257  1  2  102  -17   295 0.590  16.64
 2.08 Intr + 100454 100951  498  1  0  116  105   215 0.380  18.26
 2.09 Intr + 104647 104776  130  0  1   65   86   152 0.998  12.05
 2.10 Intr + 105240 105412  173  1  2   33   89   186 0.674  11.94
 2.11 Intr + 116607 116806  200  2  2   87   91   104 0.984   7.83
 2.12 Intr + 119395 119573  179  1  2  122  115   159 0.997  20.64
 2.13 Term + 135578 135825  248  0  2  123   37   221 0.646  15.47
 2.14 PlyA + 136495 136500    6                               1.05

 3.11 PlyA - 137165 137160    6                               1.05
 3.10 Term - 148067 147963  105  2  0   66   38    70 0.245  -2.77
 3.09 Intr - 153499 153105  395  2  2   -5   42   239 0.015   3.45
 3.08 Intr - 160287 160118  170  2  2    9   80   137 0.222   3.67
 3.07 Intr - 160612 160582   31  2  1   76   70    38 0.273  -2.93
 3.06 Intr - 161174 160850  325  2  1   64   61   172 0.333   6.52
 3.05 Intr - 166963 166838  126  1  0  115   84    49 0.418   7.16
 3.04 Intr - 168185 167969  217  1  1  101   23    54 0.039  -2.22
 3.03 Intr - 174021 173814  208  1  1   71   27   136 0.063   3.01
 3.02 Intr - 179118 178711  408  1  0   58   48   319 0.256  18.21
 3.01 Init - 180284 180236   49  2  1   52   19    62 0.257  -3.14
 3.00 Prom - 181227 181188   40                              -4.75

 4.00 Prom + 186120 186159   40                              -6.15
 4.01 Init + 187198 187320  123  0  0   27  105   122 0.795   8.02
 4.02 Intr + 195664 195834  171  0  0   55  100    77 0.523   4.82
 4.03 Intr + 200736 200801   66  2  0   76  110    63 0.228   5.48
 4.04 Intr + 223018 223243  226  0  1   66   40   339 0.409  23.54
 4.05 Intr + 231875 231990  116  0  2   51   80   107 0.150   5.45
 4.06 Intr + 232063 232113   51  0  0  103  101    19 0.112   2.89

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815589f:99728043_99963864|GENSCAN_predicted_peptide_1|314_aa
MGYSGGPSLITRGLKSRRGRRRRQSKRCDNKAGLERCYVADFEDGEKECGLPLEDGQGKS
VTLSKASRRECSPTDTLILVQLHLHEEAEEEITGLEESKIQEKEWKEQGSKDLGGFMIKS
KGEGHDLARRRDIHFILEPKIERQNTIYSVKEFRGEEKGTSFSFICLNHFGALVILSVAN
LHYVVCFTGFTLLPLLLGNAPSSTPVSWFECTEEPVEGFEMGVRGGHMDNQQHRLLALQA
LVSEFNPEVTHNHVLDKGSPKTYNIGERQFNGQAIFPYLFSNLYSRNLLLYQEATYLDWI
TSVPDKNALTELEI

>gi568815589f:99728043_99963864|GENSCAN_predicted_CDS_1|945_bp
atgggttattctggggggcctagtttaatcacaagaggccttaaaagcagaagagggagg
agaagaagacagtcaaagagatgtgataataaagcagggttggagagatgttatgttgct
gattttgaagatggagaaaaagaatgtgggttgcctctagaagatggacaaggcaagtca
gtgactctctccaaagcttccagaagggaatgcagccctaccgacacgttgattttagta
caacttcatctacatgaggaggcagaagaagagatcactggtctggaggaatcaaagatt
caagaaaaagaatggaaggagcaaggaagtaaggatttgggagggttcatgatcaagagt
aaaggtgaaggacatgaccttgccagaaggagagacattcatttcatcctggagccaaag
attgaaaggcaaaatacaatatacagtgtgaaggaattcagaggggaagagaaaggaact
tcattctccttcatctgcctgaatcattttggggctctggttatactatctgtagcgaat
ctccattatgttgtctgcttcacgggtttcactcttcttccactccttctgggaaatgct
ccttctagcactcctgtcagttggtttgaatgcactgaggagccagtggagggttttgag
atgggtgtgaggggtgggcatatggataatcaacagcatagactattggcacttcaggca
cttgtcagcgaattcaatcctgaagtgactcataaccatgtgcttgacaaaggctcacct
aagacatataatatcggagagagacaatttaatggacaagccatctttccatatctcttc
tctaatctctattcaagaaatttgttattgtatcaggaagcaacatatctggactggatt
acctctgttcctgataaaaatgctctcactgagttagaaatttag

>gi568815589f:99728043_99963864|GENSCAN_predicted_peptide_2|959_aa
MQEESAAAPQGVFGLVPDAFCTGCQVTGAGQKLAGGAFPFGRAVLSLTPLALAPCLGAAS
EARQQHSSNSTPPEPDSWAVPPDPWAGRCTLPQTHALASGSAAKREAPRPGPALLLLLRS
PYTDALTPAPSLAHTDTSAHTGSAHTLRSPARSHPSCPEPLPVQRGAAAGRPSRAHFATL
TVPAVAVEWARGDLWSSWDSSPHSKRRKASNKEEDRDLCYSGSSDVIQGISALVVMVMML
SAAAAAAAAAAAGSCSSGGKSGVIGDSRTLTRVGTVSLSSGRWLGVGKACFFTDRTRAFG
EVAPSVLLLVEMKVLGEPTAEEGSPASPGPEPGPLAVPGSTAGASPRRTSAPPTLSASAG
ETPSPTIQRARYPPGPHHLFSSQDFIPYMHDSIRFGNVDMPCVQAQYSPSPPGSSYAAQT
YSSEYTTEIMNPDYTKLTMDLGSTEITATATTSLPSISTFVEGYSSNYELKPSCVYQMQR
PLIKAGALWDEALPSAPGCIAPGPLLDPPMKAVPTVAGARFPLFHFKPSPPHPPAPSPAG
GHHLGYDPTAAAALSLPLGAAAAAGSQAAALESHPYGLPLAKRAAPLAFPPLGLTPSPTA
SSLLGESPSLPSPPSRSSSSGEGTCAVCGDNAACQHYGVRTCEGCKGFFKRTVQKNAKYV
CLANKNCPVDKRRRNRCQYCRFQKCLSVGMVKEVVRTDSLKGRRGRLPSKPKSPLQQEPS
QPSPPSPPICMMNALVRALTDSTPRDLDYSRYCPTDQAAAGTDAEHVQQFYNLLTASIDV
SRSWAEKIPGFTDLPKEDQTLLIESAFLELFVLRLSIRSNTAEDKFVFCNGLVLHRLQCL
RGFGEWLDSIKDFSLNLQSLNLDIQALACLSALSMITERHGLKEPKRVEELCNKITSSLK
DHQSKGQALEPTESKVLGALVELRKICTLGLQRIFYLKLEDLVSPPSIIDKLFLDTLPF

>gi568815589f:99728043_99963864|GENSCAN_predicted_CDS_2|2880_bp
atgcaagaggaaagtgcagctgcacctcagggcgtcttcgggctggtgccagacgccttc
tgcaccggctgccaggtcactggagctggtcagaagctggctggcggagccttccctttc
ggaagagctgtcctctcccttacccccctcgccctggctccgtgcctcggggcagcctcg
gaggcgcgccagcagcactcctccaactctactccacccgagcctgacagctgggcggtc
ccgcctgacccgtgggcaggccgctgcaccctcccgcagacgcacgccctggcgagcggt
tccgctgcaaaaagagaagcccccaggccggggccggccctcctgctcctcctccgctcc
ccatacacagacgcgctcacacccgctccctcactcgcacacacagacacaagcgcgcac
acaggctccgcacacacacttcgctctcccgcgcgctcacacccctcttgccctgagccc
ttgccggtgcagcgcggcgccgcagctggacgcccctcccgggctcactttgcaacgctg
acggtgccggcagtggccgtggagtgggcccgcggggatctctggagctcgtgggattcc
tccccccactcgaagaggcgaaaagcctctaacaaagaggaggaccgggatttgtgctat
agcggctccagcgatgtaattcagggtatttcggctctagttgtcatggtaatgatgctc
tcggcggcggcggcggcggcggcagcggcagcggcagggagttgcagctccggaggtaaa
tcgggtgtaattggcgactcccgcacactgacacgtgtggggacggtgtccctctcctct
ggacgttggctcggtgtgggaaaggcatgctttttcacggacagaactcgcgcttttgga
gaagttgctccgagtgttttactcttagtagaaatgaaagttctcggtgagcccactgcg
gaagagggcagcccggcaagcccgggccctgagcctggacccttagcggtgccgggcagc
actgccggcgcttcgcctcgccggacgtccgctcctcctacactctcagcctccgctgga
gagacccccagccccaccattcagcgcgcaagataccctccaggccctcatcaccttttt
tcaagtcaagatttcatcccatacatgcatgactcaatcagatttggaaatgtggatatg
ccctgcgtccaagcccaatatagcccttcccctccaggttccagttatgcggcgcagaca
tacagctcggaatacaccacggagatcatgaaccccgactacaccaagctgaccatggac
cttggcagcactgagatcacggctacagccaccacgtccctgcccagcatcagtaccttc
gtggagggctactcgagcaactacgaactcaagccttcctgcgtgtaccaaatgcagcgg
cccttgatcaaagcgggggcgttatgggacgaggcactgccctcggcgcccggctgcatc
gcacccggcccgctgctggacccgccgatgaaggcggtccccacggtggccggcgcgcgc
ttcccgctcttccacttcaagccctcgccgccgcatccccccgcgcccagcccggccggc
ggccaccacctcggctacgacccgacggccgctgccgcgctcagcctgccgctgggagcc
gcagccgccgcgggcagccaggccgccgcgcttgagagccacccgtacgggctgccgctg
gccaagagggcggccccgctggccttcccgcctctcggcctcacgccctcccctaccgcg
tccagcctgctgggcgagagtcccagcctgccgtcgccgcccagcaggagctcgtcgtct
ggcgagggcacgtgtgccgtgtgcggggacaacgccgcctgccagcactacggcgtgcga
acctgcgagggctgcaagggctttttcaagagaacagtgcagaaaaatgcaaaatatgtt
tgcctggcaaataaaaactgcccagtagacaagagacgtcgaaaccgatgtcagtactgt
cgatttcagaagtgtctcagtgttggaatggtaaaagaagttgtccgtacagatagtctg
aaagggaggagaggtcgtctgccttccaaaccaaagagcccattacaacaggaaccttct
cagccctctccaccttctcctccaatctgcatgatgaatgcccttgtccgagctttaaca
gactcaacacccagagatcttgattattccagatactgtcccactgaccaggctgctgca
ggcacagatgctgagcatgtgcaacaattctacaacctcctgacagcctccattgatgta
tccagaagctgggcagaaaagattccgggatttactgatctccccaaagaagatcagaca
ttacttattgaatcagcctttttggagctgtttgtcctcagactttccatcaggtcaaac
actgctgaagataagtttgtgttctgcaatggacttgtcctgcatcgacttcagtgcctt
cgtggatttggggagtggctcgactctattaaagacttttccttaaatttgcagagcctg
aaccttgatatccaagccttagcctgcctgtcagcactgagcatgatcacagaaagacat
gggttaaaagaaccaaagagagtcgaagagctatgcaacaagatcacaagcagtttaaaa
gaccaccagagtaagggacaggctctggagcccaccgagtccaaggtcctgggtgccctg
gtagaactgaggaagatctgcaccctgggcctccagcgcatcttctacctgaagctggaa
gacttggtgtctccaccttccatcattgacaagctcttcctggacaccctacctttctaa

>gi568815589f:99728043_99963864|GENSCAN_predicted_peptide_3|677_aa
MDVKFSKGKSEEKQDMLLRASLPSSVASFLAVSPTAHLGHTHQTPPPANGCAGLSRDPDA
SDPGDPGPTARQRQDARRKLFHPATPGGRPARIVCGLPIPASNKASKNETKRGSATSEKQ
EEPPWRSHFGSNALLAAQRLLRAPHGRQAHQACLEALEENMVSWAGPRVLVRVQPRNLVP
CVSAAPAVAERGQCGAWAVVSEGASPKPWQLPRGFEPASAQKTIGSPQPRAGPKMLSGSQ
GLELGTLGSSLVLYFTVAELAPRPQDKVLSTLTSPFLKKKEFLPMATTTPGLWQAYRFSL
LTTRLLLEDGGGVVSAIQDCLSYPLQCPSLISCLNQPSSGLELSLRTKVKDEESPHQNLS
AGVFSYLRMSSPEIPLTGSSCQHILDPPDGLQLDLILKSVTVAEFQKGFKFLFVTTTPGS
LSPLIKVSIEQESTYFIFRLGPFQERAEKAAGSSHKADYENRIGFFVSALCVCTIQMQSD
VIVAVIGDEDIIIDFEEIGQHLTELAGEGLTEENLGNTIQDIGTGKDFMSETPKAMATKA
KIDKWDLIKLKSFCTPKETIIRVNRPPTEWEKIFANYLSDKGLISRIYKELKQIYKKKTN
NPIKKWVKDMNRHFSKEDIYAGNRHMKKCSSSLAIREMQIKTTGNGSPNVVPFPTNSWTP
LQATKSESLVTNIKKLF

>gi568815589f:99728043_99963864|GENSCAN_predicted_CDS_3|2034_bp
atggatgtgaaatttagcaagggaaagtctgaggagaaacaggatatgctcctccgagcc
tcactaccatcgagcgtcgcgtcattcctcgcagtgtcacccacagcacacctgggccac
acacaccaaacgcccccaccagcgaacggatgcgcgggtctgtcccgggacccagacgcc
agcgaccctggcgacccggggcctacagcgcgccagagacaggacgcgcgccggaaactt
ttccaccccgcgaccccgggaggccgtcctgctcgcatcgtctgcggcctccctatacca
gcttccaataaagcttccaaaaacgaaaccaagcgaggctccgccacttctgaaaaacag
gaggagccaccatggcgctcgcacttcggatccaatgccctacttgccgcccagcgcctc
ttgcgcgcgccacacgggcgccaggcccaccaagcctgcctggaggccttggaggaaaat
atggtttcatgggctggaccaagggtcctcgtgcgtgtacagcctaggaacttggtgccc
tgtgtctcagctgctccagctgtggctgaaaggggccaatgtggagcttgggctgtggtt
tcagagggtgcaagccccaagccttggcagcttccacgtggttttgagcctgcaagtgca
cagaagactataggctcccctcagcccagggcaggtccaaaaatgctatctgggagccag
ggcctggagctgggaaccttaggcagctccttagtgctctattttactgtggctgagcta
gcacctaggccacaagataaagtcctttccactcttacctctcctttcctcaagaagaag
gagtttctccccatggccaccaccaccccaggcctatggcaagcatacagattctctctc
ctcaccacgcggctgctgctggaggatggaggaggggtggtgtcagcaattcaagactgt
ctttcctaccctcttcagtgcccttccttgataagctgtttaaaccagccttcttctggc
ttggagctgtcactcagaaccaaggtgaaggatgaagagtctcctcatcaaaatctctct
gctggggtcttttcctacctgaggatgtcttcaccagaaattcccctaactggcagctct
tgtcaacacattctggatccccctgatggactccagctagacttgattcttaaatcagtc
actgtagcagagtttcagaaaggatttaagttccttttcgtcaccactactcctggttct
ctttcaccattgatcaaagtgtccatagagcaggagagtacatactttattttcagatta
gggccgtttcaggagagagcggagaaagcagcagggtcctcacacaaggctgattatgag
aatagaattggattttttgtgtctgcactttgtgtctgcacaatacaaatgcaaagtgat
gttattgtagctgttattggtgatgaagatattattattgattttgaagaaattggccag
catttaacagagctggcaggggaaggccttacagaagaaaacctaggcaataccattcag
gacataggcacgggcaaggacttcatgtctgaaacaccaaaagcaatggcaacaaaagcc
aaaatagacaaatgggatctaattaaactaaagagcttctgcacaccaaaagaaactatc
atcagagtgaacaggccacctacagaatgggagaaaatttttgcaaactacttatctgac
aaagggctaatatccagaatctacaaagaactcaaacaaatttacaagaaaaaaacgaac
aaccccatcaaaaagtgggtgaaggatatgaacagacacttctcaaaagaagacatttat
gcaggcaacaggcacatgaaaaaatgctcatcgtcactggccatcagagaaatgcaaatc
aaaaccacaggtaatggttctccaaatgtggtccccttcccaacaaattcatggactcca
cttcaggcaaccaaatcagaatctctggtcacgaatatcaagaaactgttttaa

>gi568815589f:99728043_99963864|GENSCAN_predicted_peptide_4|251_aa
MSEDEEKVKLRRLEPAIQKFIKIVIPTDLERLRKHQINIEKSQVQALGTSKQPASSWSSH
DPFLGLINLLGWLTELRETIYQFFIKDITRDTAEEMHKYQRCRIWDKLHEEHINAGRTVQ
QLRSNIREIEKLCLKVRKDDLVLLKRMIDPVKEEASAATAEFLQLHLESVEELKKQFNDE
ETLLQPPLTRSMTVGGAFHTTEAEASSQSLTQIYALPEIPQDQNAAESWETLEADLIELS
QLVTDFSLLVN

>gi568815589f:99728043_99963864|GENSCAN_predicted_CDS_4|753_bp
atgtctgaagatgaagaaaaagtgaaattacgccgtcttgaaccagctatccagaaattc
attaagatagtaatcccaacagacctggaaaggttaagaaagcaccagataaatattgag
aagtcacaagtccaggcccttggaacttctaaacaaccagcttcaagttggagttcccac
gatccctttttgggcttgattaatttgctgggatggctcacagaactcagggagacaatt
taccagttctttataaaggatattacaagggatacagctgaagagatgcataagtatcaa
aggtgcagaatctgggacaagttgcatgaagagcatatcaatgcaggacgtacagttcag
caactccgatccaatatccgagaaattgagaaactttgtttgaaagtccgaaaggatgac
ctagtacttctgaagagaatgatagatcctgttaaagaagaagcatcagcagcaacagca
gaatttctccaactccatttggaatctgtagaagaacttaagaagcaatttaatgatgaa
gaaactttgctacagcctcctttgaccagatccatgactgttggtggagcatttcatact
actgaagctgaagctagttctcagagtttgactcagatatatgccttacctgaaattcct
caagatcaaaatgctgcagaatcgtgggaaaccttagaagcggacttaattgaacttagc
caactggtcactgacttctctctcctagtgaat