GENSCAN 1.0 Date run: 5-Nov-116 Time: 02:32:17 Sequence gi568815593r:126444968_126695114 : 250147 bp : 42.70% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 6011 6223 213 2 0 33 55 173 0.172 5.41 1.02 Intr + 20459 20578 120 2 0 104 115 108 0.848 13.79 1.03 Intr + 24710 24821 112 2 1 58 91 116 0.919 8.36 1.04 Intr + 32725 32820 96 0 0 75 96 88 0.745 7.59 1.05 Intr + 35489 35560 72 1 0 98 81 15 0.581 0.48 1.06 Intr + 35660 35740 81 1 0 52 119 121 0.999 10.52 1.07 Intr + 38496 38607 112 2 1 111 20 101 0.940 4.63 1.08 Intr + 41794 42010 217 2 1 0 43 177 0.599 1.04 1.09 Intr + 43832 43925 94 2 1 132 55 56 0.825 5.75 1.10 Intr + 44580 44703 124 2 1 73 84 61 0.615 3.44 1.11 Term + 54770 55008 239 0 2 -4 39 203 0.345 1.55 1.12 PlyA + 55867 55872 6 1.05 2.00 Prom + 56272 56311 40 -4.05 2.01 Init + 56630 56684 55 1 1 75 79 60 0.707 5.30 2.02 Term + 57410 57537 128 0 2 31 48 104 0.290 -1.84 2.03 PlyA + 59259 59264 6 1.05 3.21 PlyA - 60113 60108 6 1.05 3.20 Term - 63976 63765 212 2 2 54 38 190 0.402 7.07 3.19 Intr - 81229 81076 154 1 1 71 57 101 0.214 4.02 3.18 Intr - 92858 92738 121 1 1 36 55 42 0.003 -4.72 3.17 Intr - 101432 101357 76 0 1 74 91 65 0.410 3.05 3.16 Intr - 105035 104962 74 1 2 119 84 67 0.402 7.53 3.15 Intr - 105326 105229 98 2 2 83 22 49 0.448 -4.31 3.14 Intr - 107170 107054 117 1 0 60 93 96 0.989 7.04 3.13 Intr - 109426 109320 107 2 2 115 116 95 0.999 14.01 3.12 Intr - 111048 110964 85 0 1 57 84 132 0.999 8.17 3.11 Intr - 114367 114273 95 2 2 29 105 93 0.969 3.86 3.10 Intr - 116157 116116 42 1 0 83 87 39 0.593 0.59 3.09 Intr - 123389 123292 98 1 2 98 111 93 0.941 11.33 3.08 Intr - 125892 125815 78 2 0 136 69 81 0.996 8.75 3.07 Intr - 130497 130453 45 2 0 60 106 49 0.587 0.51 3.06 Intr - 132244 132112 133 2 1 123 99 166 0.999 19.98 3.05 Intr - 138007 137884 124 1 1 89 80 132 0.956 11.74 3.04 Intr - 139045 138965 81 1 0 46 31 123 0.649 1.32 3.03 Intr - 147762 147697 66 0 0 73 95 53 0.860 2.68 3.02 Intr - 148437 148384 54 0 0 84 110 36 0.925 3.66 3.01 Init - 150231 150040 192 0 0 37 85 186 0.870 10.31 3.00 Prom - 150804 150765 40 -8.35 4.00 Prom + 151748 151787 40 -9.75 4.01 Init + 154168 154234 67 0 1 37 44 81 0.229 -0.11 4.02 Intr + 155281 155347 67 2 1 91 89 90 0.944 6.44 4.03 Intr + 155506 155730 225 1 0 70 29 154 0.825 3.88 4.04 Intr + 155982 156108 127 0 1 33 59 277 0.379 19.06 4.05 Intr + 158603 159216 614 1 2 111 99 626 0.996 56.25 4.06 Intr + 163397 163517 121 2 1 52 95 113 0.999 7.98 4.07 Intr + 172283 172366 84 1 0 88 90 98 0.998 9.10 4.08 Term + 179608 179877 270 0 0 87 42 287 0.999 18.50 4.09 PlyA + 179926 179931 6 1.05 5.02 PlyA - 182002 181997 6 1.05 5.01 Sngl - 183352 183050 303 1 0 86 40 396 0.997 30.18 5.00 Prom - 217074 217035 40 -3.75 6.02 PlyA - 217129 217124 6 -0.45 6.01 Sngl - 218683 218420 264 1 0 88 42 241 0.959 12.45 6.00 Prom - 221180 221141 40 -6.85 7.05 PlyA - 221528 221523 6 1.05 7.04 Term - 224391 224322 70 2 1 75 38 70 0.210 -2.87 7.03 Intr - 228632 228532 101 2 2 86 110 84 0.637 8.39 7.02 Intr - 229117 228910 208 0 1 30 56 162 0.497 5.36 7.01 Init - 230011 229851 161 1 2 30 102 91 0.384 4.04 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815593r:126444968_126695114|GENSCAN_predicted_peptide_1|493_aa XVPNRALPSGTVGRGPLSSRPQNGRASGSLHPTSGKPTGIQLQPVRAASEAAPCKATEAE LSEALGAHSLHHSEAENGVEEKKKACRSPTAQSPTPSVEADSPDQKKIISLWSKSSFDGA SLASDKNDCKTESKNDPKTERKKSSSSSQISIPAFSVTLIKKTKTALLVPNALIIATVTD RYIFVSLLSRDSTYKLLKSVCGHLENTSVGNSPNPSSAENSFRADRPSSLPLDFNDEFSD LDGVVQQRRQDMEGYSSSGSQTPESENSRVILENTLPRVYHIGTELFYEHQILAVNLTKV SLSSVSSVCALIISTFYMRYRINTLEEQLGLLTSIVDTHNTEQAAPSGLRSQVQFNVEVL CQELTANIVKLEKNSALGFVDKATDVPLPQSIYSPNNARAKLSLACGILRRQQEENSTSR IVNQVEKIVNLVTEKGNRTQRKHGSGKSRFGGTREKWLKLQQSRSSAEGLPAAEAQTSEE GAQPGGVKVAEEA >gi568815593r:126444968_126695114|GENSCAN_predicted_CDS_1|1482_bp nnagtgcccaacagggcactgcctagtggaactgtaggaagggggccactgtcctccaga ccccagaatggtagagcctctggcagcttgcaccctacatctggaaaacccacaggcatt caactccaacccgtgagagcagccagcgaggctgcaccctgcaaagccacagaggcagaa ctgtccgaggccttgggagcccactccttgcaccactcagaggctgagaatggtgtggag gagaaaaagaaagcctgcaggtcgccaacagcccaatcccctaccccatctgtggaggcg gactccccagaccagaagaaaatcattagcctatggtcaaaatccagttttgatggtgcc tctttagcaagtgataagaacgactgtaaaacagaaagcaaaaatgaccctaagactgaa agaaaaaagtcttcatcttccagccagatctctattccagctttctcggtaaccctaata aagaaaaccaaaactgctcttctagtgccaaacgccctgatcatagcaacagtcacagac aggtacatatttgtctccttactctccagagattcaacttacaaactactaaaatctgtg tgtggacacttagaaaatacaagtgttggtaacagtcccaatccatcttctgctgaaaac agtttccgagcagaccgcccttcatctctgcctctggatttcaatgatgaattctcagat ctggatggagtggttcaacaaagaaggcaagacatggaaggatatagcagttctggttct caaactcctgaatctgagaactctcgagttattttggaaaataccttgcctcgggtgtac cacattggcactgagttgttttatgaacatcagatcctggccgttaacctaacgaaagtt tctctttcatccgtttccagtgtctgtgcactaatcatctcgaccttctacatgagatac agaattaatactctggaggagcagctggggttactaacctccattgtggacacccataat actgaacaggcagcaccatctggcctgaggtcacaagtacaattcaatgtggaggttctc tgtcaagagcttacagctaacatagtgaaattagaaaagaattcagctttaggctttgtt gataaggccacagatgtacctttacctcaaagtatctacagtccaaacaacgctagggca aaactgtccttagcctgtggcattcttagaaggcagcaggaggaaaactcaacatccaga attgtgaaccaagtagaaaagattgttaacttggtaacagaaaagggaaacagaacacaa agaaagcatggaagtggcaaatctaggtttgggggaacaagagagaagtggttgaaacta caacagtctagaagctcagcggaggggcttcctgcagctgaagctcagacctctgaagaa ggggcacagccaggtggtgttaaagtcgcagaggaggcataa >gi568815593r:126444968_126695114|GENSCAN_predicted_peptide_2|60_aa MTSGGVPVTPKAPEGMRYNRIPCVWGASPHGLTASSGGHATALSQQLLGNGECGKNSGCK >gi568815593r:126444968_126695114|GENSCAN_predicted_CDS_2|183_bp atgaccagtgggggagtacctgtgacccccaaagccccagagggcatgcgttacaataga attccatgcgtctggggtgcatcccctcatggcctgacagcgtcttcaggaggtcatgcc actgccctaagccagcagcttctgggtaatggagaatgtggtaaaaacagtggatgcaag tag >gi568815593r:126444968_126695114|GENSCAN_predicted_peptide_3|683_aa MWRLPRALCVHAAKTSKLSGPWSRPAAFMSTLLINQPQYAWLKELGLREENEGVYNGSWG GRGEVITTYCPANNEPIARVRQASVADYEETVKKAREAWKIWADIPAPKRGEIVRQIGDA LREKIQVLGSLVSLEMGKILVEGVGEVQEYVDICDYAVGLSRMIGGPILPSERSGHALIE QWNPVGLVGIITAFNFPVAVYGWNNAIAMICGNVCLWKGAPTTSLISVAVTKIIAKVLED NKLPGAICSLTCGGADIGTAMAKDERVNLLSFTGSTQVGKQVGLMVQERFGRSLLELGGN NAIIAFEDADLSLVVPSALFAAVGTAGQRCTTARRLFIHESIHDEVVNRLKKAYAQIRVG NPWDPNVLYGPLHTKQAVSMFLGAVEEAKKEGGTVVYGGKVMDRPGNYVEPTIVTGLGHD ASIAHTETFAPILYVFKFKNEEEVFAWNNEVKQGLSSSIFTKDLGRIFRWLGPKGSDCGI VNVNIPTSGAEIGGAFGGEKHTGGGRESGSDAWKQYMRRSTWSLENSITRQHYGDGAKPL ETTHMIQSASTRLHLQQWGSQYLSSPDSEPKGVEEKPFLFYSRKEVGIFDILPRKRKKRR KPVGLAWSETGDVGAEEPEAIERGANRSSSSACELDLGHLGASLLREPTVWWFQGPRLLV WDHKYFLISRNALSLRSALDTHY >gi568815593r:126444968_126695114|GENSCAN_predicted_CDS_3|2052_bp atgtggcgccttcctcgcgcgctgtgtgtgcacgctgcaaagaccagcaagctctctgga ccttggagcaggcctgccgccttcatgtccactctcctcatcaatcagccccagtatgcg tggctgaaagagctggggctccgcgaggaaaacgagggcgtgtataatggaagctgggga ggccggggagaggttattacgacctattgccctgctaacaacgagccaatagcaagagtc cgacaggccagtgtggcagactatgaagaaactgtaaagaaagcaagagaagcatggaaa atctgggcagatattcctgctccaaaacgaggagaaatagtaagacagattggcgatgcc ttgcgggagaagatccaagtactaggaagcttggtgtctttggagatggggaaaatctta gtggaaggtgtgggtgaagttcaggagtatgtggatatctgtgactatgctgttggttta tcaaggatgattggaggacctatcttgccttctgaaagatctggccatgcactgattgag cagtggaatcccgtaggcctggttggaatcatcacggcattcaatttccctgtggcagtg tatggttggaacaacgccatcgccatgatctgtggaaatgtctgcctctggaaaggagct ccaaccacttccctcattagtgtggctgtcacaaagataatagccaaggttctggaggac aacaagctgcctggtgcaatttgttccttgacttgtggtggagcagatattggcacagca atggccaaagatgaacgagtgaacctgctgtccttcactgggagcactcaggtgggaaaa caggtgggcctgatggtgcaggagaggtttgggagaagtctgttggaacttggaggaaac aatgccattattgcctttgaagatgcagacctcagcttagttgttccatcagctctcttc gctgctgtgggaacagctggccagaggtgtaccactgcgaggcgactgtttatacatgaa agcatccatgatgaggttgtaaacagacttaaaaaggcctatgcacagatccgagttggg aacccatgggaccctaatgttctctatgggccactccacaccaagcaggcagtgagcatg tttcttggagcagtggaagaagcaaagaaagaaggtggcacagtggtctatgggggcaag gttatggatcgccctggaaattatgtagaaccgacaattgtgacaggtcttggccacgat gcgtccattgcacacacagagacttttgctccgattctctatgtctttaaattcaagaat gaagaagaggtctttgcatggaataatgaagtaaaacagggactttcaagtagcatcttt accaaagatctgggcagaatctttcgctggcttggacctaaaggatcagactgtggcatt gtaaatgtcaacattccaacaagtggggctgagattggaggtgcctttggaggagaaaag cacactggtggtggcagggagtctggcagtgatgcctggaaacagtacatgagaaggtct acttggtctcttgagaactctatcacaagacagcactatggggatggcgctaaaccatta gaaaccacccatatgatccaatcagcttccaccaggctccacctccaacaatggggatca caatatctatcatctccagactcagaacctaagggggtagaggaaaagcctttcctcttc tacagcaggaaggaggttggcatatttgatattcttcccaggaagaggaaaaagaggagg aagccagtggggctggcttggagtgaaacaggggatgtgggtgcagaagaaccagaagca atagaaagaggagcaaatcgatcatcatctagtgcctgtgagctggatttgggtcacctg ggtgcctccctgcttcgagaaccaacagtgtggtggtttcaaggacctcggcttctggtt tgggaccacaaatatttcctaatctctagaaatgctctcagtttacgatctgctttagac acgcactactaa >gi568815593r:126444968_126695114|GENSCAN_predicted_peptide_4|524_aa MGNCPQAARQLLCGNPEAVRGSVYSPGKTAGECTLSAESKNGLIQTPTDTNIRHAQAPYI MWGCTVGPSHPQVRHPWMGSGYRGRTIQLERLPWKQYVEWIVEEQEQKRGGLSRAYRGRI APREDGVGGRRYGRWAAFRLGFRHDGRTQRQAAAIASECERRKVLGGDSAMRAFQNTATA CAPVSHYRAVESVDSSEESFSDSDDDSCLWKRKRQKCFNPPPKPEPFQFGQSSQKPPVAG GKKINNIWGAVLQEQNQDAVATELGILGMEGTIDRSRQSETYNYLLAKKLRKESQEHTKD LDKELDEYMHGGKKMGSKEEENGQGHLKRKRPVKDRLGNRPEMNYKGRYEITAEDSQEKV ADEISFRLQEPKKDLIARVVRIIGNKKAIELLMETAEVEQNGGLFIMNGSRRRTPGGVFL NLLKNTPSISEEQIKDIFYIENQKEYENKKAARKRRTQVLGKKMKQAIKSLNFQEDDDTS RETFASDTNEALASLDESQEGHAEAKLEAEEAIEVDHSHDLDIF >gi568815593r:126444968_126695114|GENSCAN_predicted_CDS_4|1575_bp atgggaaattgtcctcaagcagctcgacaacttctgtgtgggaacccagaagcggttaga ggttcagtgtactctcctggcaaaactgctggcgagtgtaccctttctgcagaaagtaaa aatggccttattcagacccccacagataccaacatccgccatgctcaagccccttacatt atgtgggggtgcacggtgggtccttcccatccgcaggttcggcatccgtggatggggagt ggatacagagggcggactatacagcttgaaaggcttccctggaagcaatatgtagagtgg attgtggaggagcaagaacagaagcggggaggcttgtccagagcatacaggggaaggatc gcaccgcgggaagatggcgttggaggtcggcgatatggaagatgggcagctttccgactc ggattccgacatgacggtcgcacccagcgacaggccgctgcaattgccagtgagtgtgaa aggaggaaagtgctaggtggcgacagtgctatgagggccttccagaacacggcaactgca tgtgcaccagtatcacattatcgagctgttgaaagtgtggattcaagtgaagaaagtttt tctgattcagatgatgatagctgtctttggaaacgcaaacgacagaaatgttttaaccct cctcccaaaccagagccttttcagtttggccagagcagtcagaaaccacctgttgctgga ggaaagaagattaacaacatatggggtgctgtgctgcaggaacagaatcaagatgcagtg gccactgaacttggtatcttgggaatggagggcactattgacagaagcagacaatccgag acctacaattatttgcttgccaagaaacttaggaaggaatctcaagagcatacaaaagat ctagacaaggaactagatgaatatatgcatggtggcaaaaaaatgggatcaaaggaagag gaaaatgggcaaggtcatctcaaaaggaaacgacctgtcaaagacaggctagggaacaga ccagaaatgaactataaaggtcgatacgagatcacagcggaagattctcaagagaaagtg gctgatgaaatttcattcaggttacaggaaccaaagaaagacctgatagcccgagtagtg aggattattggtaacaaaaaggcaattgaacttctgatggaaaccgctgaagttgaacaa aatggtggtctctttataatgaatggtagtcgaagaagaacaccaggtggagtttttctg aatctcttgaaaaacactcctagtatcagcgaggaacaaattaaggacattttctacatt gaaaaccaaaaggaatatgaaaataaaaaagctgctaggaagaggagaacacaagtgttg gggaaaaagatgaaacaagctattaaaagtctaaattttcaagaagatgatgatacatca cgagaaacttttgcaagtgacacgaatgaggccttggcctctcttgatgagtcacaggaa ggacatgcagaagccaagttggaggcagaggaagccattgaagttgatcattctcatgat ttggacatcttttaa >gi568815593r:126444968_126695114|GENSCAN_predicted_peptide_5|100_aa MASLSELAYIYSALILHNDKINALIKAAGVNVEPFWPGVFAKALANVSIGNPICNVRAGG LAAAGDPTPSTAAASVEKKVEAKKEESEESDEDMGFGLFD >gi568815593r:126444968_126695114|GENSCAN_predicted_CDS_5|303_bp atggcctccctctccgagctcgcctacatctactcggccctcattctgcataatgataaa atcaatgccctcattaaagcagctggtgtaaatgttgaacctttctggcctggcgtgttt gcaaaggccctagccaatgtcagcatcgggaaccccatctgcaatgtaagggctggtgga cttgcagcagctggagatcctaccccctccactgctgctgcttcagttgagaagaaagtg gaagcaaagaaagaagaatccgaggaatctgatgaggacatgggctttggtctttttgac taa >gi568815593r:126444968_126695114|GENSCAN_predicted_peptide_6|87_aa MAAWSPAAAAPLLRGICGLPLHHGMFATQTEGELRVTQILKEKFPRATAIKVTDISGVVG RCMKLKLNQKNLRRRELSSSTRWFIRH >gi568815593r:126444968_126695114|GENSCAN_predicted_CDS_6|264_bp atggcggcatggagcccggccgcagcagcgcctctgctccgcgggatctgcgggcttcca cttcaccatgggatgtttgccacccagactgagggggagctcagagtgacccaaattctc aaagaaaagtttccacgagctacagctatcaaagtcactgacatttctggagttgtggga cgatgtatgaaattaaaattgaatcagaagaatttaaggagaagagaactgtccagcagc accagatggtttatcaggcactaa >gi568815593r:126444968_126695114|GENSCAN_predicted_peptide_7|179_aa MWMCVTFGAENLGQEDSFGRPAPCPRPHSMRRSTYDLGSSDQPAQGTSHQFQIGPREDRR PFYSQYAFYYPTPPDIRKKLQKLDSGPQTPQQDLINLTFKVYNNREEAAKRQCISELQLP ASTISLAQQLKTDAAQSLWKLRDYHGPRASSNSYSGGIRACPRDATGYSPFELLYGRSF >gi568815593r:126444968_126695114|GENSCAN_predicted_CDS_7|540_bp atgtggatgtgcgtgacatttggtgctgaaaacctgggacaggaggactcctttgggaga ccagccccctgtcctcgccctcactccatgaggagatccacctacgaccttgggtcctca gaccagccagcccaaggaacatctcaccaatttcaaatcggacccagagaggacagaagg ccgttttattctcaatatgcattttattacccgacccctcccgacattagaaaaaaactc caaaaattggattctggtcctcaaaccccacaacaggacttaattaacctcaccttcaag gtgtacaataatagagaagaagcagccaagcgacaatgcatctctgagttacagctacct gcctccactatctccttggctcaacagctgaagactgacgctgcccaatcgctttggaag ctccgggactatcacggaccccgagcttcgagtaactcttacagtggaggaattcgggcc tgtcctcgggatgctacagggtacagcccatttgagctcctgtatggacgctccttttga