GENSCAN 1.0 Date run: 6-Nov-116 Time: 09:38:44

Sequence gi568815586r:117118045_117431069 : 313025 bp : 46.83% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.15 PlyA -    225    220    6                               1.05
 1.14 Term -  28233  28043  191  1  2   68   39   225 0.977  13.21
 1.13 Intr -  31579  31361  219  2  0   48   92   143 0.867   8.97
 1.12 Intr -  37366  37262  105  2  0   55   70   101 0.785   5.19
 1.11 Intr -  37904  37747  158  1  2  118  121   248 0.999  30.75
 1.10 Intr -  40019  39829  191  2  2   66  105   283 0.775  26.18
 1.09 Intr -  47573  47441  133  1  1   62   86   105 0.842   8.35
 1.08 Intr -  49033  48854  180  0  0   70   91   275 0.998  24.98
 1.07 Intr -  54563  54427  137  2  2   63   84    54 0.897   1.87
 1.06 Intr -  56297  56161  137  0  2   57   99   114 0.990   9.69
 1.05 Intr -  56753  56607  147  0  0   84   64   172 0.999  14.61
 1.04 Intr -  59597  59476  122  1  2   88   98   110 0.975  12.14
 1.03 Intr -  68527  68433   95  1  2   91  121    49 0.988   7.26
 1.02 Intr -  71318  71183  136  1  1  102   85    95 0.998  11.17
 1.01 Init -  72412  72174  239  1  2  101   83   542 0.794  50.48
 1.00 Prom -  95359  95320   40                              -4.26

 2.31 PlyA -  97705  97700    6                               1.05
 2.30 Term - 100120  99998  123  1  0  110   43   206 0.782  16.68
 2.29 Intr - 102225 102031  195  0  0  124   96   379 0.999  41.91
 2.28 Intr - 104819 104671  149  0  2   79   86   166 0.957  15.45
 2.27 Intr - 107093 106972  122  1  2   90   98   109 0.996  12.24
 2.26 Intr - 108726 108639   88  1  1   88   96    37 0.997   3.53
 2.25 Intr - 109597 109387  211  1  1   73   64   348 0.805  29.39
 2.24 Intr - 114087 113918  170  1  2  132   98   268 0.999  31.87
 2.23 Intr - 116714 116521  194  1  2   71   60   372 0.996  31.84
 2.22 Intr - 124661 124583   79  0  1  115   91    44 0.962   6.01
 2.21 Intr - 125391 125253  139  0  1   63   77   180 0.990  14.44
 2.20 Intr - 129478 129304  175  0  1   88   84    88 0.982   8.34
 2.19 Intr - 135710 135594  117  1  0   66   83   125 0.991   9.38
 2.18 Intr - 137993 137892  102  1  0   61   80    54 0.680   1.19
 2.17 Intr - 140411 140353   59  2  2   88  100    33 0.989   2.18
 2.16 Intr - 141086 140982  105  2  0   97   78   191 0.999  19.41
 2.15 Intr - 142565 142421  145  1  1  114  100   217 0.999  25.78
 2.14 Intr - 145930 145845   86  1  2   99   89   101 0.998   9.92
 2.13 Intr - 147466 147272  195  1  0  147   30   263 0.999  26.01
 2.12 Intr - 150100 149999  102  1  0  138   92   145 0.999  20.27
 2.11 Intr - 154515 154341  175  2  1  107   94   281 0.999  30.54
 2.10 Intr - 160054 159915  140  1  2   55   71   194 0.825  13.66
 2.09 Intr - 162822 162681  142  2  1  135   64   282 0.995  30.86
 2.08 Intr - 167288 167197   92  2  2   93   99   165 0.994  16.89
 2.07 Intr - 168222 168060  163  2  1   89   96   195 0.942  20.38
 2.06 Intr - 170175 170024  152  0  2   50   12   105 0.313  -1.94
 2.05 Intr - 172382 172254  129  2  0  113  111    75 0.981  13.19
 2.04 Intr - 173320 173300   21  1  0   87  127   -13 0.162   0.24
 2.03 Intr - 193548 193422  127  2  1  108  111   182 0.985  23.18
 2.02 Intr - 206244 206185   60  2  0  102  105    42 0.683   5.15
 2.01 Init - 213025 212301  725  1  2   92  115   722 0.925  69.60
 2.00 Prom - 217858 217819   40                              -2.96

 3.05 PlyA - 218767 218762    6                              -0.45
 3.04 Term - 221896 221775  122  2  2   57   45    81 0.374  -0.66
 3.03 Intr - 223233 223054  180  1  0   76   43   145 0.521   8.64
 3.02 Intr - 241035 240947   89  2  2   85  119    19 0.508   4.31
 3.01 Init - 242824 242673  152  1  2   89   95   188 0.622  17.12
 3.00 Prom - 254816 254777   40                              -5.46

 4.05 PlyA - 254841 254836    6                               1.05
 4.04 Term - 260407 260188  220  0  1   59   39    91 0.126  -2.19
 4.03 Intr - 265403 265280  124  0  1   53   81    76 0.048   3.04
 4.02 Intr - 271916 271786  131  1  2   25   34   141 0.191   2.74
 4.01 Init - 276303 276164  140  0  2   64  110    99 0.588   9.31

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815586r:117118045_117431069|GENSCAN_predicted_peptide_1|729_aa
MAAAAVDSAMEVVPALAEEAAPEVAGLSCLVNLPGEVLEYILCCGSLTAADIGRVSSTCR
RLRELCQSSGKVWKEQFRVRWPSLMKHYSPTDYVNWLEEYKVRQKAGLEARKIVASFSKR
FFSEHVPCNGFSDIENLEGPEIFFEDELVCILNMEGRKALTWKYYAKKILYYLRQQKILN
NLKAFLQQPDDYESYLEGAVYIDQYCNPLSDISLKDIQAQIDSIVELVCKTLRGINSRHP
SLAFKAGESSMIMEIELQSQVLDAMNYVLYDQLKFKGNRMDYYNALNLYMHQVLIRRTGI
PISMSLLYLTIARQLGVPLEPVNFPSHFLLRWCQGAEGATLDIFDYIYIDAFGKGKQLTV
KECEYLIGQHVTAALYGVVNVKKVLQRMVGNLLSLGKREGIDQSYQLLRDSLDLYLAMYP
DQVQLLLLQARLYFHLGIWPEKVLDILQHIQTLDPGQHGAVGYLVQHTLEHIERKKEEVG
VEVKLRSDEKHRDVCYSIGLIMKHKRYGYNCVIYGWDPTCMMGHEWIRNMNVHSLPHGHH
QPFYNVLVEDGSCRYAAQGCGCRLTPEMPSCGHLWELQPTAPALGAPESPQVYRYLAAPL
ASTHCKLAAALSYNQTCLQALPDVPLVAKEPPDENCSRRTGNKAVILNYLLEFCVLQIPM
SCLPMTENLEYNVEPQEISHPDVGRYFSEFTGTHYIPNAELEIRYPEDLEFVYETVQNIY
SAKKENIDE

>gi568815586r:117118045_117431069|GENSCAN_predicted_CDS_1|2190_bp
atggcggcggcagcagtcgacagcgcgatggaggtggtgccggcgctggcggaggaggcc
gcgccggaggtagcgggcctcagctgcctcgtcaacctgccgggtgaggtgctggagtac
atcctgtgctgcggctcgctgacggccgccgacatcggccgtgtctccagcacctgccgg
cggctgcgcgagctgtgccagagcagcgggaaggtgtggaaggagcagttccgggtgagg
tggccttcccttatgaaacactacagccccaccgactacgtcaattggttggaagagtat
aaagttcggcaaaaagctgggttagaagcgcggaagattgtagcctcgttctcaaagagg
ttcttttcagagcacgttccttgtaatggcttcagtgacattgagaaccttgaaggacca
gagattttttttgaggatgaactggtgtgtatcctaaatatggaaggaagaaaagctttg
acctggaaatactacgcaaaaaaaattctttactacctgcggcaacagaagatcttaaat
aatcttaaggcctttcttcagcagccagatgactatgagtcgtatcttgaaggtgctgta
tatattgaccagtactgcaatcctctctccgacatcagcctcaaagacatccaggcccaa
attgacagcatcgtggagcttgtttgcaaaacccttcggggcataaacagtcgccacccc
agcttggccttcaaggcaggtgaatcatccatgataatggaaatagaactccagagccag
gtgctggatgccatgaactatgtcctttacgaccaactgaagttcaaggggaatcgaatg
gattactataatgccctcaacttatatatgcatcaggttttgattcgcagaacaggaatc
ccaatcagcatgtctctgctctatttgacaattgctcggcagttgggagtcccactggag
cctgtcaacttcccaagtcacttcttattaaggtggtgccaaggcgcagaaggggcgacc
ctggacatctttgactacatctacatagatgcttttgggaaaggcaagcagctgacagtg
aaagaatgcgagtacttgatcggccagcacgtgactgcagcactgtatggggtggtcaat
gtcaagaaggtgttacagagaatggtgggaaacctgttaagcctggggaagcgggaaggc
atcgaccagtcataccagctcctgagagactcgctggatctctatctggcaatgtacccg
gaccaggtgcagcttctcctcctccaagccaggctttacttccacctgggaatctggcca
gagaaggtgcttgacatcctccagcacatccaaaccctagacccggggcagcacggggcg
gtgggctacctggtgcagcacactctagagcacattgagcgcaaaaaggaggaggtgggc
gtagaggtgaagctgcgctccgatgagaagcacagagatgtctgctactccatcgggctc
attatgaagcataagaggtatggctataactgtgtgatctacggctgggaccccacctgc
atgatgggacacgagtggatccggaacatgaacgtccacagcctgccgcacggccaccac
cagcctttctataacgtgctggtggaggacggctcctgtcgatacgcagcccaaggctgt
ggctgccggctgacgccggagatgccctcgtgtggccacctgtgggaactgcagcccact
gcgccggccctgggagcacctgagagcccgcaagtgtacaggtacttagcagcacccctg
gcctccacccactgcaagctggcagcagctctcagttacaaccaaacatgtctgcaggcc
ttaccagatgtccctttagtagcaaaagagcccccagatgagaactgctccagacgaact
ggcaacaaagcagttattctaaactacctgcttgaattctgtgtacttcagatacctatg
tcctgtttaccaatgacagaaaacttggaatataacgtggagcctcaagaaatctcacac
cctgacgtgggacgctatttctcagagtttactggcactcactacatcccaaacgcagag
ctggagatccggtatccagaagatctggagtttgtctatgaaacggtgcagaatatttac
agtgcaaagaaagagaacatagatgagtaa

>gi568815586r:117118045_117431069|GENSCAN_predicted_peptide_2|1493_aa
MEDHMFGVQQIQPNVISVRLFKRKVGGLGFLVKERVSKPPVIISDLIRGGAAEQSGLIQA
GDIILAVNGRPLVDLSYDSALEVLRGIASETHVVLILRGPEGFTTHLETTFTGDGTPKTI
RVTQPLGPPTKAVDLSHQPPAGKEQPLAVDGASGPGNGPQHAYDDGQEAGSLPHANGLAP
RPPGQDPAKKATRVSLQGRGENNELLKEIEPVLSLLTSGSRGVKGGAPAKAEMKDMGIQV
DRALGHLQIVVNLFRECLNSAPDLDGKSHKPLPLGVENDRVFNDLWGKGNVPVVLNNPYS
EKEQVFKTEGTPPTSGKQSPTKNGSPSKCPRFLKVKNWETEVVLTDTLHLKSTLETGCTE
YICMGSIMHPSQHARRPEDVRTKGQLFPLAKEFIDQYYSSIKRQVFGSKAHMERLEEVNK
EIDTTSTYQLKDTELIYGAKHAWRNASRCVGRIQWSKLQVFDARDCTTAHGMFNYICNHV
KYATNKGNLRSAITIFPQRTDGKHDFRVWNSQLIRYAGYKQPDGSTLGDPANVQFTEICI
QQGWKPPRGRFDVLPLLLQANGNDPELFQIPPELVLEVPIRHPKFEWFKDLGLKWYGLPA
VSNMLLEIGGLEFSACPFSGWYMGTEIGVRDYCDNSRYNILEEVAKKMNLDMRKTSSLWK
DQALVEINIAVLYSFQSDKVTIVDHHSATESFIKHMENEYRCRGGCPADWVWIVPPMSGS
ITPVFHQEMLNYRLTPSFEYQPDPWNTHVWKGTNGTPTKRRAIGFKKLAEAVKFSAKLMG
QAMAKRVKATILYATETGKSQAYAKTLCEIFKHAFDAKVMSMEEYDIVHLEHETLVLVVT
STFGNGDPPENGEKFGCALMEMRHPNSVQEERKYPEPLRFFPRKGPPLPNGDTEVHGLAA
ARDSQHRSYKVRFNSVSSYSDSQKSSGDGPDLRDNFESAGPLANVRFSVFGLGSRAYPHF
CAFGHAVDTLLEELGGERILKMREGDELCGQEEAFRTWAKKVFKAACDVFCVGDDVNIEK
ANNSLISNDRSWKRNKFRLTFVAEAPELTQGLSNVHKKRVSAARLLSRQNLQSPKSSRST
IFVRLHTNGSQELQYQPGDHLGVFPGNHEDLVNALIERLEDAPPVNQMVKVELLEERNTA
LGVISNWTDELRLPPCTIFQAFKYYLDITTPPTPLQLQQFASLATSEKEKQRLLVLSKGL
QEYEEWKWGKNPTIVEVLEEFPSIQMPATLLLTQLSLLQPRYYSISSSPDMYPDEVHLTV
AIVSYRTRDGEGPIHHGVCSSWLNRIQADELVPCFVRGAPSFHLPRNPQVPCILVGPGTG
IAPFRSFWQQRQFDIQHKGMNPCPMVLVFGCRQSKIDHIYREETLQAKNKGVFRELYTAY
SREPDKPKKYVQDILQEQLAESVYRALKEQGGHIYVCGDVTMAADVLKAIQRIMTQQGKL
SAEDAGVFISRMRDDNRYHEDIFGVTLRTYEVTNRLRSESIAFIEESKKDTDE

>gi568815586r:117118045_117431069|GENSCAN_predicted_CDS_2|4482_bp
atggaggatcacatgttcggtgttcagcaaatccagcccaatgtcatttctgttcgtctc
ttcaagcgcaaagttgggggcctgggatttctggtgaaggagcgggtcagtaagccgccc
gtgatcatctctgacctgattcgtgggggcgccgcagagcagagtggcctcatccaggcc
ggagacatcattcttgcggtcaacggccggcccttggtggacctgagctatgacagcgcc
ctggaggtactcagaggcattgcctctgagacccacgtggtcctcattctgaggggccct
gaaggtttcaccacgcacctggagaccacctttacaggtgatgggacccccaagaccatc
cgggtgacacagcccctgggtccccccaccaaagccgtggatctgtcccaccagccaccg
gccggcaaagaacagcccctggcagtggatggggcctcgggtcccgggaatgggcctcag
catgcctacgatgatgggcaggaggctggctcactcccccatgccaacggcctggccccc
aggcccccaggccaggaccccgcgaagaaagcaaccagagtcagcctccaaggcagaggg
gagaacaatgaactgctcaaggagatagagcctgtgctgagccttctcaccagtgggagc
agaggggtcaagggaggggcacctgccaaggcagagatgaaagatatgggaatccaggtg
gacagagcacttggtcacttacaaatagttgtcaacctcttccgagagtgtctgaattct
gccccagatttggacggcaagtcacacaaacctctgcccctcggcgtggagaacgaccga
gtcttcaatgacctatgggggaagggcaatgtgcctgtcgtcctcaacaacccatattca
gagaaggagcaggtgttcaagacagaggggacgccccccacctcaggaaaacagtccccc
acaaagaatggcagcccctccaagtgtccacgcttcctcaaggtcaagaactgggagact
gaggtggttctcactgacaccctccaccttaagagcacattggaaacgggatgcactgag
tacatctgcatgggctccatcatgcatccttctcagcatgcaaggaggcctgaagacgtc
cgcacaaaaggacagctcttccctctcgccaaagagtttattgatcaatactattcatca
attaaaaggcaagtatttggctccaaagcccacatggaaaggctggaagaggtgaacaaa
gagatcgacaccactagcacttaccagctcaaggacacagagctcatctatggggccaag
cacgcctggcggaatgcctcgcgctgtgtgggcaggatccagtggtccaagctgcaggta
ttcgatgcccgtgactgcaccacggcccacgggatgttcaactacatctgtaaccatgtc
aagtatgccaccaacaaagggaacctcaggtctgccatcaccatattcccccagaggaca
gacggcaagcacgacttccgagtctggaactcccagctcatccgctacgctggctacaag
cagcctgacggctccaccctgggggacccagccaatgtgcagttcacagagatatgcata
cagcagggctggaaaccgcctagaggccgcttcgatgtcctgccgctcctgcttcaggcc
aacggcaatgaccctgagctcttccagattcctccagagctggtgttggaagttcccatc
aggcaccccaagtttgagtggttcaaggacctggggctgaagtggtacggcctccccgcc
gtgtccaacatgctcctagagattggcggcctggagttcagcgcctgtcccttcagtggc
tggtacatgggcacagagattggtgtccgcgactactgtgacaactcccgctacaatatc
ctggaggaagtggccaagaagatgaacttagacatgaggaagacgtcctccctgtggaag
gaccaggcgctggtggagatcaatatcgcggttctctatagcttccagagtgacaaagtg
accattgttgaccatcactccgccaccgagtccttcattaagcacatggagaatgagtac
cgctgccgggggggctgccctgccgactgggtgtggatcgtgccccccatgtccggaagc
atcacccctgtgttccaccaggagatgctcaactaccggctcaccccctccttcgaatac
cagcctgatccctggaacacgcatgtctggaaaggcaccaacgggacccccacaaagcgg
cgagccattggcttcaagaagctagcagaagctgtcaagttctcggccaagctgatgggg
caggctatggccaagagggtgaaagcgaccatcctctatgccacagagacaggcaaatcg
caagcttatgccaagaccttgtgtgagatcttcaaacacgcctttgatgccaaggtgatg
tccatggaagaatatgacattgtgcacctggaacatgaaactctggtccttgtggtcacc
agcacctttggcaatggagatccccctgagaatggggagaaattcggctgtgctttgatg
gaaatgaggcaccccaactctgtgcaggaagaaaggaagtacccggaacccttgcgtttc
tttccccgtaaagggcctcccctccccaatggtgacacagaagtccacggtctggctgca
gcccgtgacagccagcacaggagctacaaggtccgattcaacagcgtctcctcctactct
gactcccaaaaatcatcaggcgatgggcccgacctcagagacaactttgagagtgctgga
cccctggccaatgtgaggttctcagtttttggcctcggctcacgagcataccctcacttt
tgcgccttcggacacgctgtggacaccctcctggaagaactgggaggggagaggatcctg
aagatgagggaaggggatgagctctgtgggcaggaagaggctttcaggacctgggccaag
aaggtcttcaaggcagcctgtgatgtcttctgtgtgggagatgatgtcaacattgaaaag
gccaacaattccctcatcagcaatgatcgcagctggaagagaaacaagttccgcctcacc
tttgtggccgaagctccagaactcacacaaggtctatccaatgtccacaaaaagcgagtc
tcagctgcccggctccttagccgtcaaaacctccagagccctaaatccagtcggtcaact
atcttcgtgcgtctccacaccaacgggagccaggagctgcagtaccagcctggggaccac
ctgggtgtcttccctggcaaccacgaggacctcgtgaatgccctgatcgagcggctggag
gacgcgccgcctgtcaaccagatggtgaaagtggaactgctggaggagcggaacacggct
ttaggtgtcatcagtaactggacagacgagctccgcctcccgccctgcaccatcttccag
gccttcaagtactacctggacatcaccacgccaccaacgcctctgcagctgcagcagttt
gcctccctagctaccagcgagaaggagaagcagcgtctgctggtcctcagcaagggtttg
caggagtacgaggaatggaaatggggcaagaaccccaccatcgtggaggtgctggaggag
ttcccatctatccagatgccggccaccctgctcctgacccagctgtccctgctgcagccc
cgctactattccatcagctcctccccagacatgtaccctgatgaagtgcacctcactgtg
gccatcgtttcctaccgcactcgagatggagaaggaccaattcaccacggcgtatgctcc
tcctggctcaaccggatacaggctgacgaactggtcccctgtttcgtgagaggagcaccc
agcttccacctgccccggaacccccaagtcccctgcatcctcgttggaccaggcaccggc
attgcccctttccgaagcttctggcaacagcggcaatttgatatccaacacaaaggaatg
aacccctgccccatggtcctggtcttcgggtgccggcaatccaagatagatcatatctac
agggaagagaccctgcaggccaagaacaagggggtcttcagagagctgtacacggcttac
tcccgggagccagacaaaccaaagaagtacgtgcaggacatcctgcaggagcagctggcg
gagtctgtgtaccgagccctgaaggagcaagggggccacatatacgtctgtggggacgtc
accatggctgctgatgtcctcaaagccatccagcgcatcatgacccagcaggggaagctc
tcggcagaggacgccggcgtattcatcagccggatgagggatgacaaccgataccatgag
gatatttttggagtcaccctgcgaacgtacgaagtgaccaaccgccttagatctgagtcc
attgccttcattgaagagagcaaaaaagacaccgatgagtaa

>gi568815586r:117118045_117431069|GENSCAN_predicted_peptide_3|180_aa
MAEPLAQLATLPRAARAGGCQSLGTVVPAQAQLYAFLGARLCGRIATASRRLPVQPSEFH
FAAAEIEVGQTKLPSHSHLAAFRKGANDSHRLPWETGPVHVSVPFTWSFNTKCLKQHIHQ
TLRMLNRKMTAYVMIISNSGAFQPPFLEVTTGTDILNNFPELFSASPSLCGNMDECMMKL

>gi568815586r:117118045_117431069|GENSCAN_predicted_CDS_3|543_bp
atggcggagcccttggcacagctcgctaccctcccgcgggcagcccgggctgggggatgc
cagtccctggggaccgttgtacctgcccaagctcaactttacgccttcctcggtgcccga
ctctgcggccgaatcgcaactgcgagccgcaggttgccagtgcagccatctgaattccat
tttgcagctgcggaaattgaggtggggcaaaccaaactacctagccatagtcacctagct
gcattccgaaaaggagccaatgacagtcatcgactcccatgggagacaggaccagtccac
gtgtccgtaccttttacctggtctttcaacacaaagtgccttaagcagcacatccaccag
acactacggatgctcaatagaaaaatgactgcttacgtgatgataattagtaatagtggt
gccttccagccccctttcctagaagtaaccactggtactgatatcttgaataattttcca
gagctgttcagtgcatctccaagcttatgtggaaacatggatgaatgcatgatgaagctg
tag

>gi568815586r:117118045_117431069|GENSCAN_predicted_peptide_4|204_aa
MGLILSEEWRAVNDDSFSKWVFDKNEKELKSGQLGSNEELKNTEKSSLGIELKENNGEFK
KPTPKLPELEMQARLAGTEPPGETLGVMVKNQFPRNMMILLSGHLLIPEQRGYLEHGSFN
AKARKVLGTSARCLPDIANIHVLNLTLDSHKERKNEFSDIIFSSENINSISIVVQAKTEK
LFFTSLLIHNNQVLLVPSVQLICP

>gi568815586r:117118045_117431069|GENSCAN_predicted_CDS_4|615_bp
atgggtctaattctaagtgaagagtggcgtgcagttaatgatgactcattttctaaatgg
gtgtttgacaagaacgagaaagaactaaaatcaggacaattaggaagtaatgaagagctt
aaaaacacagaaaaaagcagcctgggcatagagctaaaagaaaacaatggggagtttaaa
aagccaaccccgaagcttccagaactggagatgcaagctcgtctggctggcacagagccc
cctggagagactcttggtgtgatggttaaaaatcaatttcccagaaacatgatgatcctg
ctcagtggacatctactgatcccagaacagagagggtacctggaacatggaagtttcaat
gctaaagccaggaaagtcctgggcacatcagcaagatgtctcccagacattgcaaacatc
catgttctaaacttgactcttgattcccataaagaaagaaaaaacgaattctctgatatc
atcttcagctcagaaaacatcaactccatctccatagttgttcaagctaaaactgagaag
ttattcttcacctccttgctaattcacaataaccaagtcctgctggttccatctgttcaa
ctaatctgtccatag