GENSCAN 1.0    Date run: 5-Nov-116    Time: 05:19:27

Sequence gi568815592r:2790166_3000611 : 210446 bp : 44.88% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.05 Intr -   1321   1255   67  0  1   94   85    53 0.029   4.51
 1.04 Intr -   7807   7672  136  2  1   99   72   250 0.955  24.13
 1.03 Intr -   9888   9770  119  2  2   -2   43    92 0.016  -4.19
 1.02 Intr -  12167  12066  102  1  0   89  105    38 0.134   4.89
 1.01 Init -  14844  14705  140  0  2   89    3   105 0.134   1.71
 1.00 Prom -  18459  18420   40                              -3.56

 2.00 Prom +  19398  19437   40                              -6.36
 2.01 Init +  20982  21048   67  2  1   67   98    89 0.466   9.03
 2.02 Term +  29184  29278   95  1  2  108   53    59 0.330   2.39
 2.03 PlyA +  29786  29791    6                               1.05

 3.09 PlyA -  30311  30306    6                               1.05
 3.08 Term -  33159  33083   77  1  2   49   37    81 0.127  -2.90
 3.07 Intr -  43847  43481  367  2  1   74   69   149 0.787   6.32
 3.06 Intr -  45858  45691  168  0  0   36   86   347 0.991  29.44
 3.05 Intr -  46085  45943  143  0  2   74  100   100 0.999   9.87
 3.04 Intr -  47834  47717  118  2  1   75  100    67 0.880   6.64
 3.03 Intr -  50429  50254  176  0  2   79   82   195 0.524  17.66
 3.02 Intr -  51818  51647  172  2  1   62   90    25 0.547  -0.38
 3.01 Init -  58310  58221   90  2  0   78   49    78 0.315   3.39
 3.00 Prom -  64677  64638   40                              -6.66

 4.26 PlyA -  66269  66264    6                              -0.45
 4.25 Term -  70444  69957  488  2  2   53   55   259 0.147  13.96
 4.24 Intr -  72690  72646   45  1  0   65   98    51 0.122   2.18
 4.23 Intr -  76146  76110   37  0  1  119   49    38 0.020   0.84
 4.22 Intr -  86260  86128  133  0  1   78   72   104 0.004   8.45
 4.21 Intr - 100405 100086  320  1  2   81   23   218 0.027   9.36
 4.20 Intr - 101823 101668  156  0  0   67  107   305 0.997  30.51
 4.19 Intr - 103388 103246  143  0  2   45   75   100 0.985   4.47
 4.18 Intr - 105343 105226  118  1  1   78   77    40 0.984   1.94
 4.17 Intr - 106025 105888  138  2  0   62   89   110 0.796   9.16
 4.16 Intr - 110456 110279  178  1  1  117  115    47 0.988  10.22
 4.15 Intr - 113490 113385  106  1  1   93   47    62 0.373   1.87
 4.14 Intr - 116060 116030   31  2  1  114   80     2 0.021  -0.30
 4.13 Intr - 129279 129135  145  2  1   85   96     3 0.106   1.08
 4.12 Intr - 131030 130920  111  1  0   45   80    66 0.006   0.99
 4.11 Intr - 150322 150137  186  0  0   74   14   181 0.025   8.10
 4.10 Intr - 158534 158137  398  2  2  105   47   531 0.256  44.08
 4.09 Intr - 158904 158749  156  0  0  107   44    74 0.968   5.11
 4.08 Intr - 160670 160610   61  1  1   27   30    90 0.390  -4.16
 4.07 Intr - 161680 161575  106  2  1   94   34    42 0.417  -1.33
 4.06 Intr - 163021 162875  147  2  0   69   63   125 0.830   8.31
 4.05 Intr - 164544 164427  118  0  1   57   62    69 0.974   1.24
 4.04 Intr - 165505 165359  147  1  0   82   89    75 0.993   7.43
 4.03 Intr - 169177 169003  175  0  1   79   92   205 0.941  19.94
 4.02 Intr - 176425 176235  191  1  2   42  106    66 0.006   2.18
 4.01 Init - 183743 183657   87  2  0   57   98    52 0.079   3.75
 4.00 Prom - 190139 190100   40                              -6.36

 5.00 Prom + 193423 193462   40                              -3.06
 5.01 Init + 198506 198544   39  1  0   87   96    34 0.575   4.34
 5.02 Intr + 198883 198985  103  0  1   70   24    74 0.651  -1.05
 5.03 Intr + 199363 199545  183  2  0   94   33    58 0.315   0.66
 5.04 Intr + 209804 209920  117  0  0   63  113   115 0.870  11.94
 5.05 Intr + 210066 210202  137  1  2   41   86    89 0.579   4.29

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl +  91024  91344  321  0  0   40   45   173 0.845   4.19

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815592r:2790166_3000611|GENSCAN_predicted_peptide_1|188_aa
MVEDLCIIGIDTWAAVAQNVTPPRRHALTATAAQIHSCLPPKLPKPQRCPHAIKKIRIGL
VAAICAGETELRNSVSMRLGWARTQTVTQLHLLMYVLTTQEKGPPEVNAPERVFDTPDFA
ACPGQHGGRGGVGAGEEEEKEEEEEEEEEEEEEEEEEEEEEEICLGLTTNTETQSWPEPG
DLELVEGL

>gi568815592r:2790166_3000611|GENSCAN_predicted_CDS_1|564_bp
atggttgaggacctctgcattataggcatcgacacttgggctgctgtggcacagaacgtc
acaccacctagacggcatgccctcacagcaacagcggcgcagatccactcctgcctacca
cctaagctccccaagccccagaggtgtccccatgcaataaagaaaataagaattggatta
gtagcagccatctgtgcaggggagactgagctccgcaactctgtgtccatgaggcttggc
tgggctcgaacgcagaccgtcacccagctccatttacttatgtatgttctaacaacacag
gaaaaaggacctccagaagtgaatgccccagaacgagtgtttgacactcctgactttgca
gcctgcccagggcagcatggaggaaggggaggagtaggagcaggggaggaggaggagaag
gaggaggaggaggaagaggaggaggaggaggaggaagaggaggaggaggaggaggaggag
gaggagatctgccttgggttaacgacaaatactgaaactcagtcttggcctgaacctggt
gacctggagctggtggaaggcctg

>gi568815592r:2790166_3000611|GENSCAN_predicted_peptide_2|53_aa
MSEDASPPAKDPGAKSTEGILEGLLASKDSTILIPGPQPRAQFPVFRNINTWI

>gi568815592r:2790166_3000611|GENSCAN_predicted_CDS_2|162_bp
atgagcgaggatgcctctccaccagcaaaagatccaggtgcaaagagcactgaaggcatc
ttggagggactccttgccagcaaggactccactatacttattcctggtccccagccacga
gcacagtttccagtattcagaaacattaatacgtggatctga

>gi568815592r:2790166_3000611|GENSCAN_predicted_peptide_3|436_aa
MAPTPLLDVAAYASTSAKTFLAILVNIDFVATSGARTPTQSPGRAAAPPAAAGPGDASAC
YKSSGPRCLLPDLAPSSEPGACLGGLSVFTMEQLSSANTRFALDLFLALSENNPAGNIFI
SPFSISSAMAMVFLGTRGNTAAQLSKEFLVSTQKTYGADLASVDFQHASEDARKTINQWV
KGQTEGKIPELLASGMVDNMTKLVLVNAIYFKGNWKDKFMKEATTNAPFRLNKKDRKTVK
MMYQKKKFAYGYIEDLKCRVLELPYQGEELSMVILLPDDIEDESTGLKKIEEQLTLEKLH
EWTKPENLDFIEVNVSLPRFKLEESYTLNSDLARLGVQDLFNSSKADLSGMSGARDIFIS
KIVHKSFVEVNEEGTEAAAATAGIATFCMLMPEENFTADHPFLFFIRHNSSVLEATKSKI
KVQEDLMSGKGLLTGS

>gi568815592r:2790166_3000611|GENSCAN_predicted_CDS_3|1311_bp
atggcccctacccccttactggatgtggctgcttatgcttctaccagtgctaaaaccttc ctcgccatattggtgaatattgactttgtggcgacctcgggagctcggactcctacgcag tcaccgggaagggccgccgccccgcccgcggctgctggcccgggtgacgcttccgcctgc tataagagcagcggccctcggtgcctccttcctgacctcgcacccagctcggagcccgga gcgtgcctcggcggcctgtcggttttcaccatggagcagctgagctcagcaaacacccgc ttcgccttggacctgttcctggcgttgagtgagaacaatccggctggaaacatcttcatc tctcccttcagcatttcatctgctatggccatggtttttctggggaccagaggtaacacg gcagcacagctgtccaaggagttcttggtttcgactcagaaaacatatggtgctgacctg gccagtgtggattttcagcatgcctctgaagatgcaaggaagaccataaaccagtgggtc aaaggacagacagaaggaaaaattccggaactgttggcttcgggcatggttgataacatg accaaacttgtgctagtaaatgccatctatttcaagggaaactggaaggataaattcatg aaagaagccacgacgaatgcaccattcagattgaataagaaagacagaaaaactgtgaaa atgatgtatcagaagaaaaaatttgcatatggctacatcgaggaccttaagtgccgtgtg ctggaactgccttaccaaggcgaggagctcagcatggtcatcctgctgccggatgacatt gaggacgagtccacgggcctgaagaagattgaggaacagttgactttggaaaagttgcat gagtggactaaacctgagaatctcgatttcattgaagttaatgtcagcttgcccaggttc aaactggaagagagttacactctcaactccgacctcgcccgcctaggtgtgcaggatctc tttaacagtagcaaggctgatctgtctggcatgtcaggagccagagatatttttatatca aaaattgtccacaagtcatttgtggaagtgaatgaagagggaacagaggcggcagctgcc acagcaggcatcgcaactttctgcatgttgatgcccgaagaaaatttcactgccgaccat ccattccttttctttattcggcataattcctcagtgctggaggccacaaagtccaagatc aaggtacaggaagatttgatgtctggcaagggcctgcttactggttcctag >gi568815592r:2790166_3000611|GENSCAN_predicted_peptide_4|1306_aa MCDGRTAPTTTSYRVHAVNSVAVEKLCLRHATKITTCTVHRRFTLLQDSNAATDLTGGGA QAVMFAHLLLTSCCVAQFLTDHRRICWGLLHIKSAIMDVLAEANGTFALNLLKTLGKDNS KNVFFSPMSMSCALAMVYMGAKGNTAAQMAQILSFNKSGGGGDIHQGFQSLLTEVNKTGT QYLLRMANRLFGEKSCDFLSSFRDSCQKFYQAEMEELDFISAVEKSRKHINTWVAEKTEG KIAELLSPGSVDPLTRLVLVNAVYFRGNWDEQFDKENTEERLFKVSKAKRESCFLYGVGQ NYRSCGFGEMWNPVMLSEFCSEEGRMCYTRDWGPYGVFDCAYCRNEEKPVQMMFKQSTFK KTYIGEIFTQILVLPYVGKELNMIIMLPDETTDLRTVEKELTYEKFVEWTRLDMMDEEEV EVSLPRFKLEESYDMESVLRNLGMTDAFELGKADFSGMSQTDLSLSKVVHKSFVEVNEEG TEAAAATAAIMMMRCARFVPRFCADHPFLFFIQHSKTNGILFCGRFSSPNKVTIFAVSAT 
AHKGNADAKCKRTNLITAQKTPTGYHYKLRQPAFILLSGPTHILLIGRTKCKQQDLDSTP GVSVAVVMVMLVAGTSQVLESWKCGHHGTSWHRDADGIQGLSLEVPLMPTPYCSWKGVLR QTLREGSWVLHKKELRDFKEVCQMPTDALFLRAEALDAAGSQPTSLGCQVLNPVLKRPWR RGPCIMETLSNASGTFAIRLLKILCQDNPSHNVFCSPVSISSALAMVLLGAKGNTATQMA QALSLNTEEDIHRAFQSLLTEVNKAGTQYLLRTANRLFGEKTCQFLSTFKESCLQFYHAE LKELSFIRAAEESRKHINTWVSKKTEGKIEELLPGSSIDAETRLVLVNAIYFKGKWNEPF DETYTREMPFKINQEEQRPVQMMYQEATFKLAHVGEVRAQLLELPYARKELSLLVLLPDD GVELSTVEKSLTFEKLTAWTKPDCMKSTEVEVLLPKFKLQEDYDMESVLRHLGIVDAFQQ GKADLSAMSAERDLCLSKFVHKSFVEVNEEGTEAAAASSCFVVAECCMESGPRLEKNGPD PLDSSDERIQREAFESVLAETFFTKEPLRCWCPWRAWGITKSRLISISSGDTFHIGYDWS IILEDSGAQLASPSGPHTGGAGGAACQSRTVRSHSSALGWSMGLGAVEQGVVLVGEARAV QEPMEWVGGSGMAGCRSRALPRGKAAKARREIERSAGGLALLRDPVHPPQPLARVLSPPL PGASRAGCSECAARQAHVHPELQLASKRCTQPRFPLVPLPPHLPAS >gi568815592r:2790166_3000611|GENSCAN_predicted_CDS_4|3921_bp atgtgtgatggcaggacagcccccacaaccacaagttatcgggttcatgcggtcaacagt gtcgcagttgagaaactctgccttaggcatgcaaccaagatcaccacgtgcacagttcac cgtaggttcactctcctacaagactccaacgctgccactgacctaacaggaggtggagct caggcggtgatgttcgctcacctgctgctcacctcctgctgcgtggcccagttcctaaca gaccacagacggatctgctggggactcctgcatataaagtctgccatcatggatgttctc gcagaagcaaatggcacctttgccttaaaccttttgaaaacgctgggtaaagacaactcg aagaatgtgtttttctcacccatgagcatgtcctgtgccctggccatggtctacatgggg gcaaagggaaacaccgctgcacagatggcccagatactttctttcaataaaagtggcggt ggtggagacatccaccagggcttccagtctcttctcaccgaagtgaacaagactggcacg cagtacttgcttaggatggccaacaggctctttggggaaaagtcttgtgatttcctctca tcttttagagattcctgccaaaaattctaccaagcagagatggaggagcttgactttatc agcgccgtagagaagtccagaaaacacataaacacctgggtagctgaaaagacagaaggt aaaattgcggagttgctctctccgggctcagtggatccattgacaaggctggttctggtg aatgctgtctatttcagaggaaactgggatgaacagtttgacaaggagaacaccgaggag agactgtttaaagtcagcaaggcgaaaagggaaagctgctttctatatggtgtaggccag aactataggtcttgtgggtttggggagatgtggaatcccgtcatgctttctgagttctgc agtgaagaaggaagaatgtgctacaccagagactggggaccatacggcgtctttgactgt gcctattgcaggaatgaggagaaacctgtgcaaatgatgtttaagcaatctacttttaag aagacctatataggagaaatatttacccaaatcttggtgcttccatatgttggcaaggaa 
ctgaatatgatcatcatgcttccggacgagaccactgacttgagaacggtggagaaagaa ctcacttacgagaagttcgtagaatggacgaggctggacatgatggatgaagaggaggtg gaagtgtccctcccgcggtttaaactagaggaaagctacgacatggagagtgtcctgcgc aacctgggcatgactgatgccttcgagctgggcaaggcagacttctctggaatgtcccag acagacctgtctctgtccaaggtcgtgcacaagtcttttgtggaggtcaatgaggaaggc acggaggctgcagccgccacagctgccatcatgatgatgcggtgtgccagattcgtcccc cgcttctgcgccgaccaccccttccttttcttcatccagcacagcaagaccaacgggatt ctcttctgcggccgcttttcctctccaaataaagtgacaattttcgcagtgagcgctaca gctcataaaggcaacgcagacgccaaatgcaaacgaacaaaccttatcacagcacagaaa accccaacgggttaccactacaagctccggcagcctgcttttattcttttatctggcccc acccacatcctgctgattggtagaaccaagtgcaaacagcaggacctagactcaacccct ggtgtttcagtagcagtagtgatggtgatgctggtggcaggaacctcccaagtgctggag tcttggaagtgtgggcaccatggaacttcctggcacagggatgcagatggcatccaaggt ctgtctctagaagtgccactcatgccaacgccctactgttcttggaaaggggtcctgaga caaactctaagagaaggttcttgggtcttgcacaagaaagaactcagggactttaaggaa gtgtgtcaaatgcccacagacgccttgtttcttcgcgcggaggccctggacgccgcaggc tcccaacctacttctctgggctgtcaggttctgaacccggtcctgaagaggccctggcgc cgggggccctgcatcatggaaactctttctaatgcaagtggtacttttgccatacgcctt ttaaagatactgtgtcaagataacccttcgcacaacgtgttctgttctcctgtgagcatc tcctctgccctggccatggttctcctaggggcaaagggaaacaccgcaacccagatggcc caggcactgtctttaaacacagaggaagacattcatcgggctttccagtcgcttctcact gaagtgaacaaggctggcacacagtacctgctgagaacggccaacaggctctttggagag aaaacttgtcagttcctctcaacgtttaaggaatcctgtcttcaattctaccatgctgag ctgaaggagctttcctttatcagagctgcagaagagtccaggaaacacatcaacacctgg gtctcaaaaaagaccgaaggtaaaattgaagagttgttgccgggtagctcaattgatgca gaaaccaggctggttcttgtcaatgccatctacttcaaaggaaagtggaatgaaccgttt gacgaaacatacacaagggaaatgccctttaaaataaaccaggaggagcaaaggccagtg cagatgatgtatcaggaggccacgtttaagctcgcccacgtgggcgaggtgcgcgcgcag ctgctggagctgccctacgccaggaaggagctgagcctgctggtgctgctgcctgacgac ggcgtggagctcagcacggtggaaaaaagtctcacttttgagaaactcacagcctggacc aagccagactgtatgaagagtactgaggttgaagttctccttccaaaatttaaactacaa gaggattatgacatggaatctgtgcttcggcatttgggaattgttgatgccttccaacag 
ggcaaggctgacttgtcggcaatgtcagcggagagagacctgtgtctgtccaagttcgtg cacaagagttttgtggaggtgaatgaagaaggcaccgaggcagcggcagcgtcgagctgc tttgtagttgcagagtgctgcatggaatctggccccagactggaaaagaatggcccagat cctctggactcctcagatgagcggattcagagagaagcttttgagagcgtgctggcggag acatttttcacaaaagagcccttgcggtgctggtgtccgtggcgtgcctgggggatcacc aagagccgcctcatcagcataagctcaggggacacctttcacattgggtatgactggagc atcatccttgaagactcaggagcccagttggcttcacccagtggaccccacactgggggt gcaggtggagctgcctgccagtcccgcaccgtgcgctcgcattcctcagcccttgggtgg tcgatgggactgggcgctgtggagcagggggtggtgcttgtcggggaggctcgggccgta caggagcccatggagtgggtgggaggctcaggcatggcgggctgcaggtcccgagccctg ccccgcgggaaggcagctaaggcccggcgagaaatcgagcgcagcgccggtgggctggca ctgctgcgggacccagtacaccctccgcagccgctggcccgggtgctaagtcccccattg cccggggccagcagggccggctgctccgagtgcgcggcccgccaagcccacgtccacccg gaactccagctggccagcaagcgctgcacgcagccccggttcccgctcgtgcctctccct ccacacctccctgcaagctga >gi568815592r:2790166_3000611|GENSCAN_predicted_peptide_5|193_aa MAPRSRMNRESDRLRPERLLPGESPPLCNQPAAAEGAWVPSAEPAAHDALAYIKICIYGN VTGSPGYPKKGCFLLRGEANARNRKGVCQAVQAVFNGYGIGRSEINLACVSECAVQCGRN LAQLLERSLGRRGSQSCGSYWGVRWSEGSCLALTERAHTQGPGVESSGGICPQPYGAGAG IIVILQVGISGPK >gi568815592r:2790166_3000611|GENSCAN_predicted_CDS_5|579_bp atggcgccgcgctcccggatgaacagagaaagcgacaggctgcgcccggagcgcctgctg cctggcgagagcccgcccctctgcaaccagcccgctgcagcggagggcgcctgggtgccc tcggccgagccagcagcccacgacgcccttgcatacattaagatatgtatatatggaaat gttaccgggagtcccggttatcccaaaaagggttgtttcctgttgcgtggtgaggccaat gcacgaaaccgaaagggagtgtgtcaagcagtgcaggctgtattcaatggctatggaatt ggaagatctgaaatcaacttagcttgcgtcagcgagtgcgcggtccagtgcggccggaac ctggcgcaactcctagagcggtccttggggagacgcgggtcccagtcctgcggctcctac tggggagtgcgctggtcggaaggatcgtgcttggctctcacggaacgcgcccacacgcag ggtcccggtgtcgaatcgtctggtggaatctgccctcaaccctatggagcgggtgctggt atcatagtcatattacaagtaggaatcagcgggcccaag