GENSCAN 1.0 Date run: 3-Nov-116 Time: 03:53:24 Sequence gi568815586r:117046090_117290456 : 244367 bp : 47.56% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.15 Intr - 577 470 108 1 0 72 89 241 0.996 23.08 1.14 Intr - 749 652 98 0 2 86 11 188 0.931 10.63 1.13 Intr - 3069 2930 140 2 2 110 38 366 0.945 33.91 1.12 Intr - 10797 10717 81 2 0 25 80 203 0.005 11.95 1.11 Intr - 12992 12871 122 2 2 93 105 -56 0.005 -4.11 1.10 Intr - 14773 14614 160 0 1 60 80 136 0.863 9.99 1.09 Intr - 15316 15105 212 1 2 100 45 69 0.529 1.51 1.08 Intr - 19201 19093 109 0 1 115 33 28 0.188 0.29 1.07 Intr - 29251 29182 70 2 1 74 103 70 0.749 5.34 1.06 Intr - 34950 34700 251 2 2 60 44 168 0.092 6.68 1.05 Intr - 38861 38815 47 2 2 83 75 16 0.231 -2.99 1.04 Intr - 49394 49294 101 0 2 83 86 30 0.914 2.13 1.03 Intr - 53010 52837 174 1 0 60 51 118 0.967 5.21 1.02 Intr - 53381 53136 246 0 0 4 68 225 0.779 9.53 1.01 Init - 53881 53803 79 1 1 74 57 115 0.540 6.22 1.00 Prom - 77753 77714 40 -4.26 2.15 PlyA - 77909 77904 6 1.05 2.14 Term - 100188 99998 191 1 2 68 39 225 0.977 13.21 2.13 Intr - 103534 103316 219 2 0 48 92 143 0.867 8.97 2.12 Intr - 109321 109217 105 2 0 55 70 101 0.785 5.19 2.11 Intr - 109859 109702 158 1 2 118 121 248 0.999 30.75 2.10 Intr - 111974 111784 191 2 2 66 105 283 0.775 26.18 2.09 Intr - 119528 119396 133 1 1 62 86 105 0.842 8.35 2.08 Intr - 120988 120809 180 0 0 70 91 275 0.998 24.98 2.07 Intr - 126518 126382 137 2 2 63 84 54 0.897 1.87 2.06 Intr - 128252 128116 137 0 2 57 99 114 0.990 9.69 2.05 Intr - 128708 128562 147 0 0 84 64 172 0.999 14.61 2.04 Intr - 131552 131431 122 1 2 88 98 110 0.975 12.14 2.03 Intr - 140482 140388 95 1 2 91 121 49 0.988 7.26 2.02 Intr - 143273 143138 136 1 1 102 85 95 0.998 11.17 2.01 Init - 144367 144129 239 1 2 101 83 542 0.794 50.48 2.00 Prom - 167314 167275 40 -4.26 3.27 PlyA - 169660 169655 6 1.05 3.26 Term - 172075 171953 123 1 0 110 43 206 0.782 16.68 3.25 Intr - 174180 173986 195 0 0 124 96 379 0.999 41.91 3.24 Intr - 176774 176626 149 0 2 79 86 166 0.957 15.45 3.23 Intr - 179048 178927 122 1 2 90 98 109 0.996 12.24 3.22 Intr - 180681 180594 88 1 1 88 96 37 0.997 3.53 3.21 Intr - 181552 181342 211 1 1 73 64 348 0.805 29.39 3.20 Intr - 186042 185873 170 1 2 132 98 268 0.999 31.87 3.19 Intr - 188669 188476 194 1 2 71 60 372 0.996 31.84 3.18 Intr - 196616 196538 79 0 1 115 91 44 0.962 6.01 3.17 Intr - 197346 197208 139 0 1 63 77 180 0.990 14.44 3.16 Intr - 201433 201259 175 0 1 88 84 88 0.982 8.34 3.15 Intr - 207665 207549 117 1 0 66 83 125 0.991 9.38 3.14 Intr - 209948 209847 102 1 0 61 80 54 0.680 1.19 3.13 Intr - 212366 212308 59 2 2 88 100 33 0.989 2.18 3.12 Intr - 213041 212937 105 2 0 97 78 191 0.999 19.41 3.11 Intr - 214520 214376 145 1 1 114 100 217 0.999 25.78 3.10 Intr - 217885 217800 86 1 2 99 89 101 0.998 9.92 3.09 Intr - 219421 219227 195 1 0 147 30 263 0.999 26.01 3.08 Intr - 222055 221954 102 1 0 138 92 145 0.999 20.27 3.07 Intr - 226470 226296 175 2 1 107 94 281 0.999 30.54 3.06 Intr - 232009 231870 140 1 2 55 71 194 0.825 13.66 3.05 Intr - 234777 234636 142 2 1 135 64 282 0.995 30.86 3.04 Intr - 239243 239152 92 2 2 93 99 165 0.994 16.89 3.03 Intr - 240177 240015 163 2 1 89 96 195 0.943 20.38 3.02 Intr - 242130 241979 152 0 2 50 12 105 0.308 -1.94 3.01 Intr - 244324 244209 116 2 2 60 111 83 0.963 7.97 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586r:117046090_117290456|GENSCAN_predicted_peptide_1|666_aa MRPARSCSLRCLSATGLCRAHTGRHVGPRRAALRRYMRPRSGPTRNPRLRAFAGVPTRGR TRGQSRRCAAEASAGPERDARPGAPAAGTMGAAHSASEEVRELEGKTGFSSSFKWVPAGH LAGCGEELRGDAQCRSRSASDWPGNSLAGGLRTGEGAGPRLPKRRAGLCTKAVPLPGALF SFDVYNQVFTYGSLSLSPPRQAPSKPEVLRGPCKHRLKSKTRNNEVQLPGTKAHSAHHPH GNCDLRALEKGSEVRSKETGKATNPGALEEKAGPSVGFGALMQADMHQVLMRGALLGAQV SSDQIEQLHRRFKQLSGDQPTIRPCPHPAAMVSGLGYPNSLLPGSPVSVFTVHPPSRLTG LDTWALGKAPGEGQCLRMGQTKREALRAAIFIIRFIIRSANHLLTTLPSPAVQGTEDGVL WTSQGPHPARFPLELHVVPFSGIWALENGFYSDGQGESGSRVSGKSMAVRYLASKLTLPC PHWWADTLCLPYLGLTPFSLGSPACVLGPSQDTARGGPWSRRKSKENFNNVPDLELNPIR SKIVRAFFDNRNLRKGPSGLADEINFEDFLTIMSYFRPIDTTMDEEQVELSRKEKLRFLF HMYDSDSDGRITLEEYRNVKYAPLARPGLLVVEELLSGNPHIEKESARSIADGAMMEAAS VCMGQM >gi568815586r:117046090_117290456|GENSCAN_predicted_CDS_1|1998_bp atgcgcccagcccgctcctgcagtttgaggtgcctctcagccaccgggctgtgcagggcc cacacgggccgccacgtgggaccgcggcgcgccgccctccgccgttatatgaggccccgc tccggccccacgcggaacccgcggctccgagccttcgccggcgtcccgacccgaggccgg acccgaggccagtcccgccgctgcgcagccgaagccagtgcggggcctgagagggacgcg cgccccggggcccccgccgcgggcaccatgggcgctgcccactccgcgtctgaggaggtg cgggagctcgagggcaagaccggcttctcctcctctttcaaatgggtccccgctggccac ctcgcgggctgcggcgaggagctccgcggcgatgctcagtgccgcagccgctcagcttcg gactggcccgggaactccctggccgggggactgcggaccggggagggagcggggcccagg cttcccaagcgcagggcaggcctttgcacaaaagctgttcctctgcctggagcactgttt tcatttgatgtgtacaatcaggtgtttacgtacgggagcctttctctgtctccacccagg caagcacctagcaaaccagaggtgctcagagggccatgcaagcacaggttgaaatctaaa accagaaataatgaggtccagctgccaggaaccaaagcccattctgcccatcatccgcat ggcaactgtgacctcagggccctggagaagggctcagaggtgaggagcaaggaaactgga aaagctaccaacccaggagcccttgaggaaaaggcaggtccttccgtgggctttggggcc ctgatgcaggcagacatgcatcaggtgcttatgcgaggggccctgttgggggcccaggtc tcatcggatcagatcgagcagctccatcggagatttaagcagctgagtggagatcagcct accattcgcccctgccctcatccagctgccatggtctctggcctgggctaccccaacagc ctcctccctggttcccctgtgtctgtcttcactgtccatccaccatccaggctcacgggt ctggacacctgggccctggggaaggcacctggggaagggcagtgcctcaggatgggccag acaaagagggaagccctgcgtgcagccatcttcattattcgcttcattattcgctcagca aaccatttattgaccaccttgccaagcccagcagtgcaaggcactgaggacggggtgctg tggacatcacagggtccccatcctgcaagattccctctggagctccacgtggtccctttc tcagggatctgggccttggaaaatggattctattcggatggccagggagagtcgggcagc agggtatctggtaagagcatggctgtacgctacctggcttccaagctgactcttccatgc ccccactggtgggctgataccctctgccttccctacctgggcctgacacctttttcccta ggatcaccagcctgtgtgctgggtccttcccaagacactgcccgtggaggcccatggagc agaaggaaaagcaaggagaacttcaacaatgtcccggacctggagctcaaccccatccga tccaaaattgttcgtgccttcttcgacaacaggaacctgcgcaagggacccagtggcctg gctgatgagatcaatttcgaggacttcctgaccatcatgtcctacttccggcccatcgac accaccatggacgaggaacaggtggagctgtcccggaaggagaagctgagatttctgttc cacatgtacgactcggacagcgacggccgcatcactctggaagaatatcgaaatgtaaag tatgctcctctggcccgcccggggctcctggtggtcgaggagctgctgtcgggaaaccct cacatcgagaaggagtccgctcgctccatcgccgacggggccatgatggaggcggccagc gtgtgcatggggcagatg >gi568815586r:117046090_117290456|GENSCAN_predicted_peptide_2|729_aa MAAAAVDSAMEVVPALAEEAAPEVAGLSCLVNLPGEVLEYILCCGSLTAADIGRVSSTCR RLRELCQSSGKVWKEQFRVRWPSLMKHYSPTDYVNWLEEYKVRQKAGLEARKIVASFSKR FFSEHVPCNGFSDIENLEGPEIFFEDELVCILNMEGRKALTWKYYAKKILYYLRQQKILN NLKAFLQQPDDYESYLEGAVYIDQYCNPLSDISLKDIQAQIDSIVELVCKTLRGINSRHP SLAFKAGESSMIMEIELQSQVLDAMNYVLYDQLKFKGNRMDYYNALNLYMHQVLIRRTGI PISMSLLYLTIARQLGVPLEPVNFPSHFLLRWCQGAEGATLDIFDYIYIDAFGKGKQLTV KECEYLIGQHVTAALYGVVNVKKVLQRMVGNLLSLGKREGIDQSYQLLRDSLDLYLAMYP DQVQLLLLQARLYFHLGIWPEKVLDILQHIQTLDPGQHGAVGYLVQHTLEHIERKKEEVG VEVKLRSDEKHRDVCYSIGLIMKHKRYGYNCVIYGWDPTCMMGHEWIRNMNVHSLPHGHH QPFYNVLVEDGSCRYAAQGCGCRLTPEMPSCGHLWELQPTAPALGAPESPQVYRYLAAPL ASTHCKLAAALSYNQTCLQALPDVPLVAKEPPDENCSRRTGNKAVILNYLLEFCVLQIPM SCLPMTENLEYNVEPQEISHPDVGRYFSEFTGTHYIPNAELEIRYPEDLEFVYETVQNIY SAKKENIDE >gi568815586r:117046090_117290456|GENSCAN_predicted_CDS_2|2190_bp atggcggcggcagcagtcgacagcgcgatggaggtggtgccggcgctggcggaggaggcc gcgccggaggtagcgggcctcagctgcctcgtcaacctgccgggtgaggtgctggagtac atcctgtgctgcggctcgctgacggccgccgacatcggccgtgtctccagcacctgccgg cggctgcgcgagctgtgccagagcagcgggaaggtgtggaaggagcagttccgggtgagg tggccttcccttatgaaacactacagccccaccgactacgtcaattggttggaagagtat aaagttcggcaaaaagctgggttagaagcgcggaagattgtagcctcgttctcaaagagg ttcttttcagagcacgttccttgtaatggcttcagtgacattgagaaccttgaaggacca gagattttttttgaggatgaactggtgtgtatcctaaatatggaaggaagaaaagctttg acctggaaatactacgcaaaaaaaattctttactacctgcggcaacagaagatcttaaat aatcttaaggcctttcttcagcagccagatgactatgagtcgtatcttgaaggtgctgta tatattgaccagtactgcaatcctctctccgacatcagcctcaaagacatccaggcccaa attgacagcatcgtggagcttgtttgcaaaacccttcggggcataaacagtcgccacccc agcttggccttcaaggcaggtgaatcatccatgataatggaaatagaactccagagccag gtgctggatgccatgaactatgtcctttacgaccaactgaagttcaaggggaatcgaatg gattactataatgccctcaacttatatatgcatcaggttttgattcgcagaacaggaatc ccaatcagcatgtctctgctctatttgacaattgctcggcagttgggagtcccactggag cctgtcaacttcccaagtcacttcttattaaggtggtgccaaggcgcagaaggggcgacc ctggacatctttgactacatctacatagatgcttttgggaaaggcaagcagctgacagtg aaagaatgcgagtacttgatcggccagcacgtgactgcagcactgtatggggtggtcaat gtcaagaaggtgttacagagaatggtgggaaacctgttaagcctggggaagcgggaaggc atcgaccagtcataccagctcctgagagactcgctggatctctatctggcaatgtacccg gaccaggtgcagcttctcctcctccaagccaggctttacttccacctgggaatctggcca gagaaggtgcttgacatcctccagcacatccaaaccctagacccggggcagcacggggcg gtgggctacctggtgcagcacactctagagcacattgagcgcaaaaaggaggaggtgggc gtagaggtgaagctgcgctccgatgagaagcacagagatgtctgctactccatcgggctc attatgaagcataagaggtatggctataactgtgtgatctacggctgggaccccacctgc atgatgggacacgagtggatccggaacatgaacgtccacagcctgccgcacggccaccac cagcctttctataacgtgctggtggaggacggctcctgtcgatacgcagcccaaggctgt ggctgccggctgacgccggagatgccctcgtgtggccacctgtgggaactgcagcccact gcgccggccctgggagcacctgagagcccgcaagtgtacaggtacttagcagcacccctg gcctccacccactgcaagctggcagcagctctcagttacaaccaaacatgtctgcaggcc ttaccagatgtccctttagtagcaaaagagcccccagatgagaactgctccagacgaact ggcaacaaagcagttattctaaactacctgcttgaattctgtgtacttcagatacctatg tcctgtttaccaatgacagaaaacttggaatataacgtggagcctcaagaaatctcacac cctgacgtgggacgctatttctcagagtttactggcactcactacatcccaaacgcagag ctggagatccggtatccagaagatctggagtttgtctatgaaacggtgcagaatatttac agtgcaaagaaagagaacatagatgagtaa >gi568815586r:117046090_117290456|GENSCAN_predicted_peptide_3|1178_aa XKQSPTKNGSPSKCPRFLKVKNWETEVVLTDTLHLKSTLETGCTEYICMGSIMHPSQHAR RPEDVRTKGQLFPLAKEFIDQYYSSIKRQVFGSKAHMERLEEVNKEIDTTSTYQLKDTEL IYGAKHAWRNASRCVGRIQWSKLQVFDARDCTTAHGMFNYICNHVKYATNKGNLRSAITI FPQRTDGKHDFRVWNSQLIRYAGYKQPDGSTLGDPANVQFTEICIQQGWKPPRGRFDVLP LLLQANGNDPELFQIPPELVLEVPIRHPKFEWFKDLGLKWYGLPAVSNMLLEIGGLEFSA CPFSGWYMGTEIGVRDYCDNSRYNILEEVAKKMNLDMRKTSSLWKDQALVEINIAVLYSF QSDKVTIVDHHSATESFIKHMENEYRCRGGCPADWVWIVPPMSGSITPVFHQEMLNYRLT PSFEYQPDPWNTHVWKGTNGTPTKRRAIGFKKLAEAVKFSAKLMGQAMAKRVKATILYAT ETGKSQAYAKTLCEIFKHAFDAKVMSMEEYDIVHLEHETLVLVVTSTFGNGDPPENGEKF GCALMEMRHPNSVQEERKYPEPLRFFPRKGPPLPNGDTEVHGLAAARDSQHRSYKVRFNS VSSYSDSQKSSGDGPDLRDNFESAGPLANVRFSVFGLGSRAYPHFCAFGHAVDTLLEELG GERILKMREGDELCGQEEAFRTWAKKVFKAACDVFCVGDDVNIEKANNSLISNDRSWKRN KFRLTFVAEAPELTQGLSNVHKKRVSAARLLSRQNLQSPKSSRSTIFVRLHTNGSQELQY QPGDHLGVFPGNHEDLVNALIERLEDAPPVNQMVKVELLEERNTALGVISNWTDELRLPP CTIFQAFKYYLDITTPPTPLQLQQFASLATSEKEKQRLLVLSKGLQEYEEWKWGKNPTIV EVLEEFPSIQMPATLLLTQLSLLQPRYYSISSSPDMYPDEVHLTVAIVSYRTRDGEGPIH HGVCSSWLNRIQADELVPCFVRGAPSFHLPRNPQVPCILVGPGTGIAPFRSFWQQRQFDI QHKGMNPCPMVLVFGCRQSKIDHIYREETLQAKNKGVFRELYTAYSREPDKPKKYVQDIL QEQLAESVYRALKEQGGHIYVCGDVTMAADVLKAIQRIMTQQGKLSAEDAGVFISRMRDD NRYHEDIFGVTLRTYEVTNRLRSESIAFIEESKKDTDE >gi568815586r:117046090_117290456|GENSCAN_predicted_CDS_3|3537_bp ngaaaacagtcccccacaaagaatggcagcccctccaagtgtccacgcttcctcaaggtc aagaactgggagactgaggtggttctcactgacaccctccaccttaagagcacattggaa acgggatgcactgagtacatctgcatgggctccatcatgcatccttctcagcatgcaagg aggcctgaagacgtccgcacaaaaggacagctcttccctctcgccaaagagtttattgat caatactattcatcaattaaaaggcaagtatttggctccaaagcccacatggaaaggctg gaagaggtgaacaaagagatcgacaccactagcacttaccagctcaaggacacagagctc atctatggggccaagcacgcctggcggaatgcctcgcgctgtgtgggcaggatccagtgg tccaagctgcaggtattcgatgcccgtgactgcaccacggcccacgggatgttcaactac atctgtaaccatgtcaagtatgccaccaacaaagggaacctcaggtctgccatcaccata ttcccccagaggacagacggcaagcacgacttccgagtctggaactcccagctcatccgc tacgctggctacaagcagcctgacggctccaccctgggggacccagccaatgtgcagttc acagagatatgcatacagcagggctggaaaccgcctagaggccgcttcgatgtcctgccg ctcctgcttcaggccaacggcaatgaccctgagctcttccagattcctccagagctggtg ttggaagttcccatcaggcaccccaagtttgagtggttcaaggacctggggctgaagtgg tacggcctccccgccgtgtccaacatgctcctagagattggcggcctggagttcagcgcc tgtcccttcagtggctggtacatgggcacagagattggtgtccgcgactactgtgacaac tcccgctacaatatcctggaggaagtggccaagaagatgaacttagacatgaggaagacg tcctccctgtggaaggaccaggcgctggtggagatcaatatcgcggttctctatagcttc cagagtgacaaagtgaccattgttgaccatcactccgccaccgagtccttcattaagcac atggagaatgagtaccgctgccgggggggctgccctgccgactgggtgtggatcgtgccc cccatgtccggaagcatcacccctgtgttccaccaggagatgctcaactaccggctcacc ccctccttcgaataccagcctgatccctggaacacgcatgtctggaaaggcaccaacggg acccccacaaagcggcgagccattggcttcaagaagctagcagaagctgtcaagttctcg gccaagctgatggggcaggctatggccaagagggtgaaagcgaccatcctctatgccaca gagacaggcaaatcgcaagcttatgccaagaccttgtgtgagatcttcaaacacgccttt gatgccaaggtgatgtccatggaagaatatgacattgtgcacctggaacatgaaactctg gtccttgtggtcaccagcacctttggcaatggagatccccctgagaatggggagaaattc ggctgtgctttgatggaaatgaggcaccccaactctgtgcaggaagaaaggaagtacccg gaacccttgcgtttctttccccgtaaagggcctcccctccccaatggtgacacagaagtc cacggtctggctgcagcccgtgacagccagcacaggagctacaaggtccgattcaacagc gtctcctcctactctgactcccaaaaatcatcaggcgatgggcccgacctcagagacaac tttgagagtgctggacccctggccaatgtgaggttctcagtttttggcctcggctcacga gcataccctcacttttgcgccttcggacacgctgtggacaccctcctggaagaactggga ggggagaggatcctgaagatgagggaaggggatgagctctgtgggcaggaagaggctttc aggacctgggccaagaaggtcttcaaggcagcctgtgatgtcttctgtgtgggagatgat gtcaacattgaaaaggccaacaattccctcatcagcaatgatcgcagctggaagagaaac aagttccgcctcacctttgtggccgaagctccagaactcacacaaggtctatccaatgtc cacaaaaagcgagtctcagctgcccggctccttagccgtcaaaacctccagagccctaaa tccagtcggtcaactatcttcgtgcgtctccacaccaacgggagccaggagctgcagtac cagcctggggaccacctgggtgtcttccctggcaaccacgaggacctcgtgaatgccctg atcgagcggctggaggacgcgccgcctgtcaaccagatggtgaaagtggaactgctggag gagcggaacacggctttaggtgtcatcagtaactggacagacgagctccgcctcccgccc tgcaccatcttccaggccttcaagtactacctggacatcaccacgccaccaacgcctctg cagctgcagcagtttgcctccctagctaccagcgagaaggagaagcagcgtctgctggtc ctcagcaagggtttgcaggagtacgaggaatggaaatggggcaagaaccccaccatcgtg gaggtgctggaggagttcccatctatccagatgccggccaccctgctcctgacccagctg tccctgctgcagccccgctactattccatcagctcctccccagacatgtaccctgatgaa gtgcacctcactgtggccatcgtttcctaccgcactcgagatggagaaggaccaattcac cacggcgtatgctcctcctggctcaaccggatacaggctgacgaactggtcccctgtttc gtgagaggagcacccagcttccacctgccccggaacccccaagtcccctgcatcctcgtt ggaccaggcaccggcattgcccctttccgaagcttctggcaacagcggcaatttgatatc caacacaaaggaatgaacccctgccccatggtcctggtcttcgggtgccggcaatccaag atagatcatatctacagggaagagaccctgcaggccaagaacaagggggtcttcagagag ctgtacacggcttactcccgggagccagacaaaccaaagaagtacgtgcaggacatcctg caggagcagctggcggagtctgtgtaccgagccctgaaggagcaagggggccacatatac gtctgtggggacgtcaccatggctgctgatgtcctcaaagccatccagcgcatcatgacc cagcaggggaagctctcggcagaggacgccggcgtattcatcagccggatgagggatgac aaccgataccatgaggatatttttggagtcaccctgcgaacgtacgaagtgaccaaccgc cttagatctgagtccattgccttcattgaagagagcaaaaaagacaccgatgagtaa