GENSCAN 1.0 Date run: 5-Nov-116 Time: 13:16:28 Sequence gi568815596r:130952906_131182646 : 229741 bp : 46.00% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.00 Prom + 1173 1212 40 -2.06 1.01 Init + 10407 10666 260 2 2 60 43 238 0.634 11.11 1.02 Intr + 11067 11344 278 0 2 86 107 308 0.970 29.66 1.03 Term + 13294 13364 71 2 2 98 41 33 0.451 -2.30 1.04 PlyA + 13400 13405 6 -1.75 2.02 PlyA - 13989 13984 6 -4.83 2.01 Sngl - 14640 14293 348 0 0 91 38 274 0.534 18.74 2.00 Prom - 16283 16244 40 -5.26 3.00 Prom + 25827 25866 40 -1.36 3.01 Init + 38806 39010 205 0 1 97 44 152 0.098 10.98 3.02 Intr + 75040 75179 140 2 2 105 75 179 0.119 18.48 3.03 Term + 75563 75730 168 1 0 50 44 111 0.892 0.78 3.04 PlyA + 76457 76462 6 1.05 4.00 Prom + 76503 76542 40 -9.55 4.01 Init + 82187 82327 141 1 0 121 -19 313 0.046 22.03 4.02 Intr + 85640 85720 81 1 0 85 49 66 0.610 2.23 4.03 Intr + 85948 86127 180 0 0 96 55 240 0.994 21.56 4.04 Intr + 87111 87287 177 2 0 58 78 406 0.954 36.72 4.05 Intr + 87356 87535 180 1 0 57 58 494 0.998 43.36 4.06 Intr + 88325 88557 233 1 2 56 91 525 0.999 46.17 4.07 Intr + 88910 89039 130 2 1 98 78 293 0.997 30.00 4.08 Intr + 90547 90678 132 0 0 60 76 115 0.990 8.34 4.09 Intr + 91006 91118 113 0 2 38 34 84 0.528 -2.82 4.10 Intr + 91261 91637 377 1 2 54 52 453 0.526 32.96 4.11 Intr + 92464 92541 78 2 0 72 102 127 0.992 12.02 4.12 Term + 93133 93284 152 2 2 116 43 226 0.471 18.97 4.13 PlyA + 94329 94334 6 1.05 5.06 PlyA - 94993 94988 6 1.05 5.05 Term - 100110 99998 113 1 2 128 43 70 0.884 5.22 5.04 Intr - 102544 102367 178 1 1 130 111 218 0.999 27.79 5.03 Intr - 102790 102648 143 2 2 103 85 123 0.711 13.57 5.02 Intr - 119033 118950 84 0 0 92 91 54 0.575 5.89 5.01 Init - 129741 129672 70 0 1 72 111 57 0.540 7.61 5.00 Prom - 148218 148179 40 -0.86 6.00 Prom + 162255 162294 40 -5.46 6.01 Init + 166065 166122 58 2 1 59 94 57 0.795 2.87 6.02 Intr + 168029 168073 45 0 0 117 70 50 0.735 4.48 6.03 Intr + 170337 170426 90 1 0 46 96 46 0.553 1.17 6.04 Intr + 172848 173000 153 1 0 100 86 216 0.999 22.64 6.05 Intr + 173779 173881 103 2 1 59 89 42 0.998 0.63 6.06 Intr + 177816 177855 40 0 1 126 98 -11 0.997 1.93 6.07 Intr + 179997 180110 114 2 0 141 94 84 0.594 14.94 6.08 Intr + 186744 186953 210 2 0 94 62 29 0.435 0.01 6.09 Intr + 187262 187370 109 1 1 67 93 201 0.691 18.36 6.10 Term + 193732 193868 137 2 2 104 50 193 0.970 15.28 6.11 PlyA + 198104 198109 6 1.05 7.00 Prom + 216622 216661 40 -2.86 7.01 Sngl + 216683 217198 516 1 0 39 28 189 0.445 4.04 7.02 PlyA + 217248 217253 6 1.05 8.05 PlyA - 218196 218191 6 1.05 8.04 Term - 225655 224737 919 0 1 3 42 575 0.990 36.09 8.03 Intr - 226125 225668 458 0 2 74 19 468 0.685 30.52 8.02 Intr - 228450 228261 190 2 1 -46 18 252 0.438 4.39 8.01 Intr - 228526 228467 60 0 0 71 40 103 0.458 1.65 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 8352 8246 107 1 2 114 55 142 0.990 12.07 S.002 Init - 8998 8926 73 1 1 71 68 43 0.862 1.87 S.003 Init + 75042 75179 138 2 0 89 75 189 0.818 17.84 S.004 Sngl + 82187 82357 171 1 0 121 53 397 0.943 32.33 S.005 Sngl - 161946 161491 456 0 0 60 36 166 0.809 4.79 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596r:130952906_131182646|GENSCAN_predicted_peptide_1|202_aa MRVVSRLSALPPPARTPARRVSALGRAGPPACVLALDGVGGAANRLSWNPCGEKVGAHQP SSPPRLRRAWASPPKDGPVLTHACGHSPAAAAPPPPAQPSPSLWLELGPACVRAQQCPSE SSGQTSGDLGTSSSSSAGLSPGSDSDSSGVVCGGRGGNGGMRGAVSRSWSLESLRSATAG PASAVTNARCVKRHSRVCLRAS >gi568815596r:130952906_131182646|GENSCAN_predicted_CDS_1|609_bp atgagggtcgtgtcccgtctgagcgcgctccccccgccggcgcgcacgcctgcccgccgg gtctctgccctgggccgagcggggccacctgcttgcgtcctcgccctggatggcgtggga ggcgccgccaaccgtctgagctggaatccttgcggtgaaaaagtgggcgcccaccagccc tcctcaccaccgcggctgcgacgagcctgggcgtcgcctcctaaggatggccctgtccta acacacgcgtgcggacacagccccgcggccgccgcaccgccgcccccggcccagccttcc ccgagcctgtggctggagctcgggcccgcctgcgtgcgggcgcagcaatgccccagcgag tcaagcgggcagacgagtggcgatctcggcactagcagcagcagcagcgccgggctgtcc ccgggctccgactcggacagcagcggcgtggtgtgtggcggccgcggaggcaacgggggc atgcgcggcgccgtgtcccgctcctggagcctggagagcctgcgctcggccaccgccgga cctgcatcagcagtgaccaatgctcgatgtgtaaaaagacactcccgtgtatgccttaga gccagttga >gi568815596r:130952906_131182646|GENSCAN_predicted_peptide_2|115_aa MASPPRRDPMTLTMEKEVNMKTVSPRSPKREFGLSDDTSRTEIVKASAEYQQCGSTADGL EAPGARGEMRTAGQKEGPWKFHPVSYISCSDYASKGSPLHSQMNDISVEMPTQSI >gi568815596r:130952906_131182646|GENSCAN_predicted_CDS_2|348_bp atggcttctccaccaagaagagatcccatgacactgaccatggaaaaggaggtgaacatg aaaactgtcagccccagaagcccgaagagagaattcggcttgtcagatgacacatccaga acagaaattgtgaaagcttcagctgagtaccagcagtgtggcagcacagctgatgggctg gaggctccaggggcccgaggggagatgagaacagcggggcagaaagaagggccatggaaa ttccaccctgtcagctatatttcgtgttcggattatgcctcaaagggaagtcctctccac agccagatgaacgacatttcagtggaaatgccaacacagagcatttaa >gi568815596r:130952906_131182646|GENSCAN_predicted_peptide_3|170_aa MPEPSPASMDSCAARASQMSATPCSTAPSPIDHPRAEECEHMARDWQAAPPAAPVRDPLG EASWAPESAMPDGALDTAVCADEVGSEEDLYDDLHSSSHHYSHPGGGGEQLAINEAGNKL REKPFSLLTTGVQKYCCCYDNCVQVAVFLLFLLKQKKSMRILIQAKLVPI >gi568815596r:130952906_131182646|GENSCAN_predicted_CDS_3|513_bp atgcctgagccttcccccgcctccatggattcctgtgcagcccgagcctcccagatgagc gccaccccgtgctccacagcgcccagtcccatcgaccacccaagggctgaggagtgcgag cacatggcgcgggactggcaggcagctccacctgcagccccggtgcgggatccactgggt gaagccagctgggctcccgagtctgccatgcctgatggagctctggacacagctgtctgc gctgacgaagtggggagcgaggaggacctgtatgatgacctgcacagctccagccaccac tacagccaccctggagggggtggggagcagctggctatcaatgaggctgggaacaagttg agggaaaaaccattttctctcctgacaacaggagttcagaagtattgctgttgctatgac aactgcgtgcaagttgctgtgtttttgctattcctactgaagcagaagaaatccatgaga atcctcatccaggcaaaactggttcccatctga >gi568815596r:130952906_131182646|GENSCAN_predicted_peptide_4|657_aa MARAPQPRRGPAAPGNALRALLRCNLPPGAQRVVVSAVLALLVLINVPCSPQEFWIGNAN REQLAVPVTGSDSVLISDGSVVCAEALWDHVTMDDQELGFKAGDVIEVMDATNREWWWGR VADGEGWFPASFVRLRVNQDEPADDDAPLAGNSGAEDGGAEAQSSKDQMRTNVINEILST ERDYIKHLRDICEGYVRQCRKRADMFSEEQLRTIFGNIEDIYRCQKAFVKALEQRFNRER PHLSELGACFLEHQADFQIYSEYCNNHPNACVELSRLTKLSKYVYFFEACRLLQKMIDIS LDGFLLTPVQKICKYPLQLAELLKYTHPQHRDFKDVEAALHAMKNVAQLINERKRRLENI DKIAQWQSSIEDWEGEDLLVRSSELIYSGELTRVTQPQAKSQQRMFFLFDHQLIYCKKPP PSFLYAAGFCSSFKTWPSGLMSMKPSLSTGDLSYVWISLSAGEVEASPAAPPMAGERRWE QPGNGQGSGSAARAEASIWQDLLRRDVLYYKGRLDMDGLEVVDLEDGKDRDLHVSIKNAF RLHRGATGDSHLLCTRKPEQKQRWLKAFAREREQVQLDQETGFSITELQRKQAMLNASKQ QVTGKPKAVGRPCYLTRQKHPALPSNRPQQQVLVLAEPRRKPSTFWHSISRLAPFRK >gi568815596r:130952906_131182646|GENSCAN_predicted_CDS_4|1974_bp atggcgagggccccgcagccccggcgcggccccgcggcgcccgggaacgccctgcgcgcc ctgctgcgctgcaacctgccccccggcgcccagcgcgtggtggtctccgccgtgctggcg ctcctggttctcatcaacgtcccttgcagcccccaggaattctggattgggaatgctaac agggagcagttggcagtcccagtgacaggatcagattctgtgctcatcagcgatggcagt gtggtctgcgctgaagcactctgggaccatgtcaccatggacgaccaggagctgggcttc aaagctggggacgtcatcgaagtgatggatgccaccaacagagagtggtggtggggccgg gtcgccgatggcgagggctggtttccagccagcttcgttcggctgagggtgaatcaggac gagcccgcggatgacgacgcccctctggccgggaacagcggagcggaggacggcggggcg gaggcgcagagcagcaaggaccagatgcggaccaacgtcatcaacgagatcctcagcact gagcgggactacatcaagcacctgcgcgacatctgcgagggctacgtccggcagtgccgc aagcgcgcagacatgttcagcgaggagcagctgcgtaccatcttcgggaacatcgaggac atctaccgctgccagaaggccttcgtgaaggccctggagcagaggttcaaccgcgagcgc ccacacctgagcgagctgggtgcctgcttcctggagcatcaagccgacttccagatctac tcggagtactgcaataaccaccccaacgcctgcgtggagctctcccggctcaccaagctc agcaagtacgtgtacttcttcgaggcctgccggctgctgcagaagatgattgacatctcc ctggatggcttcctgctgactccggtgcagaagatctgcaagtaccctctgcagctggcc gagctgctcaaatacacgcacccccagcacagggacttcaaggatgttgaagccgccttg catgccatgaagaacgtggcccagctcatcaacgagcggaagcggagacttgagaacatc gacaagattgctcagtggcagagctccatagaggactgggagggagaagatctcttggtc aggagctcagaactcatctactcgggggagctgactcgagttacacagcctcaagccaaa agccagcagcgaatgttctttctctttgaccaccagctcatctactgtaagaagccacct cccagcttcctctacgctgccggcttctgctcctctttcaaaacatggcccagtggcctc atgtccatgaagccctccttgtccacgggtgatctttcctacgtgtggatctcactgtct gctggggaggtggaggcatcaccagcagcccctcctatggctggggagaggaggtgggag cagccaggaaatggtcaggggagcggttcagcagccagggctgaggccagcatctggcag gacctgctccgccgcgacgtgttgtactacaagggccggctggacatggacggcctggag gtggtggacctggaggacgggaaggacagagacctccatgtgagcatcaagaacgccttc cggctgcaccgtggcgccacaggggacagccacctgctgtgcaccaggaagcccgagcag aagcagcgctggctcaaggcctttgccagggagagggagcaggtgcagctggaccaggag acaggcttctccatcactgaactgcagaggaagcaggccatgctgaatgccagcaagcag caggtcacagggaagcccaaagctgttggccggccctgctacctgacgcgccagaagcac ccagccctgcccagcaaccggccccagcagcaggtcctggtgctggcggagcccaggcgc aagccatctaccttctggcacagcatcagccggctggcacccttccgcaagtga >gi568815596r:130952906_131182646|GENSCAN_predicted_peptide_5|195_aa MNPVYSPGSSGVPYANAKGIGYPAGFPMGYAAAAPAYSPNMYPGANPTFQTGYTPGTPYK VSCSPTSGAVPPYSSSPNPYQTAVYPVRSAYPQQSPYAQQGTYYTQPLYAAPPHVIHHTT VVQPNGMPATVYPAPIPPPRGNGVTMGMVAGTTMAMSAGTLLTAHSPTPVAPHPVTVPTY RAPGTPTYSYVPPQW >gi568815596r:130952906_131182646|GENSCAN_predicted_CDS_5|588_bp atgaatcctgtttatagtcctggatcttctggggttccctatgcaaatgccaaaggaatt ggttatccagctggttttcccatgggctatgcagcagcagctcctgcctattctcctaac atgtatcctggagcgaatcctaccttccaaacaggttacactcctggcacaccttacaaa gtgtcctgttcccccaccagcggggctgtgccaccgtactcctcctccccgaacccctac cagactgccgtgtaccctgtgcgaagtgcctacccccagcagagcccgtatgcacagcaa ggcacgtactacacacagccgctgtatgcagcacctcctcacgtcatccaccacaccacg gtggtgcagcccaacggcatgcctgcaacggtgtaccctgctcccatcccccctcctaga ggcaacggggtcaccatgggcatggtggctgggaccaccatggccatgtcagcaggtacc ctgctgactgctcactccccaactcctgtcgccccccacccggtcactgtgcccacgtac cgggccccaggaacgcccacttacagctatgtgccccctcagtggtga >gi568815596r:130952906_131182646|GENSCAN_predicted_peptide_6|352_aa MAWVRWLMPVIPALWEAEAGEEMAFVKSGWLLRQIEVSMPVKRGEIHYIHGVVEQQRYTC DVVTGTILKRWKKNWFDLWSDGHLIYYDDQTRQNIEDKVHMPMDCINIRTGQECRDTQPP DGKSKDCMLQIVCRDGKTISLCAESTDDCLAWKFTLQDSRTNTAYVGSAVMTDETSVVSS PPPYTAYAAPAPEVGRTLSLQVCAARTPSVTCPGATPQGTLIFHIVPGIAEFGPERVPLL GWLCLQNSGFGESHYHLVQPYHFTDGQRKAQQAYGYGPYGGAYPPGTQVVYAANGQAYAV PYQYPYAGLYGQQPANQVIIRERYRDNDSDLALGMLAGAATGMALGSLFWVF >gi568815596r:130952906_131182646|GENSCAN_predicted_CDS_6|1059_bp atggcctgggtgcggtggctcatgcctgtaatcccagcactttgggaggccgaggcaggt gaagagatggcgtttgtgaagagtggctggttgctgcgacagatagaagtatcaatgcca gtaaaacgaggtgagatacattatattcacggggttgtagagcagcagagatacacatgt gatgtggtcacaggtactattttgaagcgctggaagaagaactggtttgatctgtggtcg gatggtcacctgatctattatgatgaccagactcggcagaatatcgaggataaggtccac atgccaatggactgcatcaacatccgcacggggcaggaatgtcgggatactcagcccccg gatggaaagtcaaaagactgcatgctccagattgtttgtcgagatgggaaaacaattagt ctttgtgcagaaagcacagatgattgcttggcctggaaatttacactccaagattctagg acaaacacagcgtatgtgggctctgcagtcatgaccgatgagacatccgtggtttcctca cctccaccatacacggcctatgctgcaccggcccctgaggtagggagaaccctgagcctc caggtgtgtgcagccagaacaccatcagtcacctgcccaggtgccacacctcagggcacc ctcatcttccatattgtgcctggaattgctgaatttgggccagagagggttccgcttcta ggctggctttgcctgcagaatagtggttttggtgaatctcattatcacctggtccaacct tatcattttacagatgggcaaaggaaggcccagcaggcttatggctatgggccatacggt ggtgcgtacccgccaggaactcaagttgtctacgctgcgaatgggcaggcgtatgccgtg ccctaccagtacccatatgcaggactttatggacagcagcctgctaaccaagtcatcatt cgagagcgctatcgagacaacgacagcgacctggcactgggcatgctggcaggagcagcc acgggcatggccttagggtctctattttgggtcttctag >gi568815596r:130952906_131182646|GENSCAN_predicted_peptide_7|171_aa MEYLKPTVKTYCTDKISFKILLLIDNAPAHPSALVEMYEEVTVVFVLTNTTSILQPMDQR VILNFKSYYLRNTFHETIATIDSDSFDGSGQSKLKTFWKAFAILDVIKNICDSWEVVKTS TVTGGWRKLIPTLTEDFEGLKTSVKEVPASVVGTARGLELEVEPEDETELL >gi568815596r:130952906_131182646|GENSCAN_predicted_CDS_7|516_bp atggaatatctaaaacccactgttaagacctactgcacagataagatttctttcaaaata ttgctgcttattgacaatgcacctgctcacccaagtgctctggtggagatgtacgaggaa gttactgttgtttttgtgcttactaacacaacttccattctgcagcccatggatcaaaga gtaattttgaatttcaagtcttattatttaagaaatacatttcatgagactatagctacc atagatagtgattcttttgatggatctgggcaaagtaagttgaaaaccttctggaaagca ttcgccattctagatgtcattaagaacatctgtgattcatgggaggtggtaaaaacatca acagtaactggaggttggaggaagttgattccaacgctcacagaagactttgagggatta aagacttcagtgaaggaagtacctgcaagtgtggtgggaacagcaagaggactagaatta gaagtggagcctgaagatgagactgaattgctgtaa >gi568815596r:130952906_131182646|GENSCAN_predicted_peptide_8|542_aa XRITILVELSCEDSQGLNYLRVCPGNQPALVQGILERVVDGPVPHQTVRLEDLDESGEPQ AGEPVLPGILRPPYLAEGLGLGLPVPGAPGPWSSALGGTTCMRQRDSGDWLAMPSRAENY EVLYTIGTGSCGRCQKIQRKSDGKILVWKELHYGSMTEAEKQMLVSEVNLLCELKNPNIV HYYDRIIDRTNTTLYIVMEYCEEGDLASVITKGTKERQYLDEEFVLRVTTQLTLALKRSD GDHTVVRRDLKPASVFLDGKQNVKLGDLGLARILNHDTSFAKTFVGTPYYMSPEQTNHMS YNEKPDIWSLGRLLYELGALMPPFTAFSQKELAGKIREGKFRQILYRDSDELNEIIMRML KDYHRPSVEEILENPLIADLVAEERRRNLERRGRQLGEPEKLQDSSPVLSELKLKEIQLE ERERALKAREERLEQKEQEFCVRERLAEDRLARAENLLKNYSLLKKFLSLASSPELLSLP SSVIKKKGHFSGESKENVMRSENSESQLTSKSKCKDLKVLASCFMLPSCGLKPCQILRKI IN >gi568815596r:130952906_131182646|GENSCAN_predicted_CDS_8|1629_bp nnccgcatcaccatcctggtggaactctcctgtgaggacagccaaggcctgaactacctg cgggtttgcccaggcaaccagccagccctggtccaaggcatcctggagcgagttgtggat ggccccgtgccccaccagacagtgcgcctggaggacctggacgagagcggtgagccccag gctggggagccagtgctgcctggcatcctccggccaccctacctggctgaggggctgggc ctgggcctgccggtccctggagctcccggtccctggagctctgcacttgggggcacaact tgcatgaggcagcgcgactctggcgactggctggccatgccgtcccgggctgagaactat gaagtgttgtacaccattggcacaggctcctgtggccgctgccagaagatccagaggaag agtgacggcaagatactagtttggaaagaacttcattatggctccatgacagaagctgag aaacagatgcttgtttctgaagtgaatttgctttgtgaactgaaaaatccaaacatcgtt cattactatgatcgtattattgaccggaccaacacaacactgtacattgtaatggaatat tgtgaagaaggagacctggctagtgtaattacaaagggaaccaaggaaaggcaatactta gatgaagagtttgttcttcgagtgacgactcagttgactctggccctgaaacgaagtgat ggtgatcatactgtagtgcgtcgggatctgaaaccagccagtgttttcctggatggcaag caaaacgtcaagcttggagatttggggctagccagaatattaaaccacgacacgagtttt gcaaaaacatttgttggcacaccttattacatgtctcctgaacaaacgaatcacatgtcc tacaatgagaaaccagatatctggtcattgggccgtttgctgtatgagttaggtgcatta atgcctccatttacagcttttagccagaaagaacttgctgggaaaatcagagaaggcaaa ttcaggcaaattctataccgtgactctgacgaattgaatgaaattattatgaggatgtta aaggattaccatcgaccttctgttgaagaaattctcgagaaccctttaatagcagatttg gttgcagaagaacgaagaagaaatcttgagagaagagggcgacaattaggagagccagaa aaattgcaggattccagccctgtattgagtgagctgaaactaaaggaaattcagttagag gagcgagagcgagctctcaaagcaagagaagaaagattggagcagaaagaacaggagttt tgtgtccgtgagagactagcagaggacagactggctagagcagaaaatctgttgaagaat tacagcttgctaaagaagttcctgtctctggcaagtagtccagaacttcttagtcttcca tcctcagtaattaagaagaaaggtcatttcagtggggaaagtaaagagaatgtcatgagg agtgagaattctgagagtcagctcacatctaagtccaagtgcaaggacctgaaggtcctt gcttcatgcttcatgctgcccagctgcgggctcaagccctgtcagatattgagaaaaatt atcaactaa