GENSCAN 1.0 Date run: 7-Nov-116 Time: 01:24:15 Sequence gi568815578r:44843112_45060333 : 217222 bp : 41.56% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 1005 1072 68 2 2 85 92 36 0.243 4.30 1.02 Intr + 13708 13923 216 1 0 -19 53 490 0.148 31.50 1.03 Intr + 40753 40883 131 1 2 50 86 76 0.448 3.02 1.04 Intr + 42567 42775 209 1 2 -15 113 168 0.123 6.87 1.05 Intr + 43045 43249 205 0 1 17 52 134 0.049 0.55 1.06 Intr + 44537 44631 95 0 2 42 99 61 0.165 1.36 1.07 Intr + 58420 58722 303 0 0 112 99 309 0.895 30.26 1.08 Intr + 60882 61005 124 2 1 62 106 101 0.993 8.54 1.09 Intr + 61857 62020 164 1 2 77 93 34 0.985 1.57 1.10 Intr + 62890 62985 96 0 0 36 93 115 0.973 6.09 1.11 Term + 63271 63327 57 0 0 132 35 72 0.983 3.01 1.12 PlyA + 63394 63399 6 -0.45 2.00 Prom + 64584 64623 40 -2.95 2.01 Init + 67033 67225 193 0 1 100 101 286 0.939 30.28 2.02 Intr + 69038 69058 21 0 0 104 80 29 0.550 0.30 2.03 Intr + 69549 69796 248 1 2 101 40 246 0.356 17.36 2.04 Intr + 73645 73760 116 0 2 89 65 136 0.499 9.73 2.05 Intr + 75795 75934 140 0 2 64 60 174 0.963 11.39 2.06 Intr + 76072 76166 95 2 2 88 97 127 0.984 12.36 2.07 Intr + 78483 78620 138 2 0 111 94 208 0.135 23.44 2.08 Intr + 81050 81145 96 1 0 49 100 127 0.031 9.29 2.09 Intr + 87349 87615 267 0 0 116 66 281 0.487 25.51 2.10 Intr + 89231 89321 91 1 1 96 76 -2 0.375 -1.95 2.11 Intr + 89961 90074 114 1 0 66 95 41 0.262 2.10 2.12 Intr + 92280 92386 107 1 2 102 90 40 0.303 4.61 2.13 Intr + 93526 93619 94 0 1 36 97 106 0.155 4.92 2.14 Intr + 94950 95080 131 1 2 16 121 124 0.876 7.99 2.15 Intr + 95388 95481 94 2 1 54 89 49 0.962 0.22 2.16 Intr + 95563 95637 75 2 0 97 94 162 0.995 16.27 2.17 Intr + 96219 96561 343 1 1 57 71 81 0.409 -3.14 2.18 Term + 96638 97019 382 2 1 43 41 245 0.577 8.63 2.19 PlyA + 97653 97658 6 1.05 3.09 PlyA - 98023 98018 6 1.05 3.08 Term - 100102 99998 105 1 0 99 42 88 0.974 2.73 3.07 Intr - 100468 100342 127 0 1 98 93 100 0.997 11.26 3.06 Intr - 105766 105619 148 2 1 68 70 197 0.942 14.27 3.05 Intr - 108891 108722 170 2 2 100 83 147 0.999 14.07 3.04 Intr - 112109 111957 153 1 0 98 99 247 0.953 25.07 3.03 Intr - 113374 113275 100 2 1 91 88 127 0.999 11.25 3.02 Intr - 117458 117096 363 0 0 62 102 313 0.414 24.33 3.01 Init - 121096 120928 169 1 1 82 36 103 0.236 4.26 3.00 Prom - 121685 121646 40 -7.15 4.00 Prom + 122754 122793 40 -8.75 4.01 Init + 124078 124160 83 0 2 48 92 56 0.617 2.49 4.02 Intr + 128967 129047 81 0 0 91 96 134 0.991 12.33 4.03 Intr + 135332 135460 129 2 0 74 103 153 0.993 14.29 4.04 Intr + 138718 138832 115 1 1 98 90 86 0.991 9.33 4.05 Intr + 144021 144185 165 2 0 74 98 74 0.983 6.24 4.06 Intr + 151979 152146 168 1 0 103 90 200 0.970 20.92 4.07 Intr + 154058 154195 138 1 0 65 80 89 0.974 5.54 4.08 Intr + 157281 157409 129 2 0 80 70 160 0.994 13.37 4.09 Intr + 158056 158242 187 0 1 87 78 276 0.862 24.84 4.10 Intr + 164040 164120 81 1 0 79 94 18 0.244 0.19 4.11 Intr + 181862 182019 158 0 2 118 78 155 0.468 16.31 4.12 Term + 205767 205793 27 2 0 111 41 24 0.003 -2.90 4.13 PlyA + 206197 206202 6 1.05 5.02 PlyA - 207217 207212 6 1.05 5.01 Sngl - 214112 213759 354 2 0 76 45 196 0.919 9.90 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 13708 13927 220 1 1 -19 42 488 0.825 29.03 S.002 Term + 78483 78659 177 2 0 111 54 244 0.864 20.00 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815578r:44843112_45060333|GENSCAN_predicted_peptide_1|555_aa MKTLRDDVACPKFTQVVESRGTRRRRKRKRRRRRRRRGRRRKEEEEEEEEEEEEEEEEEE EEEEEEEEEGGGGGEEEGEGEGEWEEEEEEEAALRNVDTVLPDKAERQISVHTETFQFKK VLATKSNCLEILEMAYWEGPERKWRSERKWSYRHRRRRFRSRGSRRRRRRCSHCRHRCRR LSSGLRKEEVISLGASLGRVFVPCSPPTGQRSRGSRTLPHYLLRAPRDPSLRLGRQERPL QPCHLCSVLSGFQKLKFNQTRMSFLAPVPVFGSSWMSRVSRDLGRYKKCILPIRLWIEKQ EEHWTWSQGMTMDKSELVQKAKLAEQAERYDDMAAAMKAVTEQGHELSNEERNLLSVAYK NVVGARRSSWRVISSIEQKTERNEKKQQMGKEYREKIEAELQDICNDVLELLDKYLIPNA TQPESKVFYLKMKGDYFRYLSEVASGDNKQTTVSNSQQAYQEAFEISKKEMQPTHPIRLG LALNFSVFYYEILNSPEKACSLAKTAFDEAIAELDTLNEESYKDSTLIMQLLRDNLTLWT SENQGDEGDAGEGEN >gi568815578r:44843112_45060333|GENSCAN_predicted_CDS_1|1668_bp atgaagactctcagggatgatgtggcttgcccaaagttcacacaggtagtggaaagcaga ggaaccagaagaagaaggaagaggaagaggaggaggaggaggagaagaagaggaagaaga agaaaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaa gaagaagaagaagaggaagaagaagaaggaggaggaggaggagaagaagaaggggagggg gagggggaatgggaagaggaagaggaagaggaagcagctctgagaaatgtggacacagtg ttaccagacaaggcagagaggcagatttctgttcacacagaaacttttcagtttaaaaaa gtgttggcaactaagtcaaactgcttagaaatactggagatggcatattgggagggaccg gagcggaagtggcgatcggagcggaagtggagctaccgccaccgccgccgccgattccgg agccggggtagtcgccgccgccgccgccgctgcagccactgcaggcaccgctgccgccgc ctgagtagtgggcttaggaaggaagaggtcatctcgctcggagcttcgctcggaagggtc tttgttccctgcagccctcccacgggacagcggagccggggatcccggaccctccctcac tatctgcttcgggccccacgcgacccctccctgcgtcttggtcggcaagaacgtcccctt cagccctgtcacctctgctccgtgctttctggtttccagaaattaaaatttaatcaaacg cggatgagttttctagccccagtccccgtgtttggctcctcctggatgagcagagtctcc agagatttgggccgctacaaaaagtgcattttgcccattcggctgtggatagagaagcag gaagagcactggacttggagtcagggaatgacaatggataaaagtgagctggtacagaaa gccaaactcgctgagcaggctgagcgatatgatgatatggctgcagccatgaaggcagtc acagaacaggggcatgaactctccaacgaagagagaaatctgctctctgttgcctacaag aatgtggtaggcgcccgccgctcttcctggcgtgtcatctccagcattgagcagaaaaca gagaggaatgagaagaagcagcagatgggcaaagagtaccgtgagaagatagaggcagaa ctgcaggacatctgcaatgatgttctggagctgttggacaaatatcttattcccaatgct acacaaccagaaagtaaggtgttctacttgaaaatgaaaggagattattttaggtatctt tctgaagtggcatctggagacaacaaacaaaccactgtgtcgaactcccagcaggcttac caggaagcatttgaaattagtaagaaagaaatgcagcctacacacccaattcgtcttggt ctggcactaaatttctcagtcttttactatgagattctaaactctcctgaaaaggcctgt agcctggcaaaaacggcatttgatgaagcaattgctgaattggatacgctgaatgaagag tcttataaagacagcactctgatcatgcagttacttagggacaatctcactctgtggaca tcggaaaaccagggagacgaaggagacgctggggagggagagaactaa >gi568815578r:44843112_45060333|GENSCAN_predicted_peptide_2|914_aa MNASGSGYPLASLYVGDLHPDVTEAMLYEKFSPAGPILSIRVCRDVATRRSLGYAYINFQ QPADGEEQAQRAERALDTMNFEMLKGQPIRIMWSQRDPGLRKSGVGNIFIKNLEDSIDNK ALYDTFSTFGNILSCKVEDEGCTSLGRSVSTTCLVACDEHGSRGFGFVHFETHEAAQQAI NTMNGMLLNDRKVFVGHFKSRREREAELGARALEFTNIYVKNLPVDVDEQGLQDLFSQFG KMLSVKVMRDNSGHSRCFGFVNFEKHEEAQKAVVHMNGKEVSGRLLYAGRAQKRVERQNE LKRRFEQMKQDRLRRYQGVNLYVKNLDDSIDDDKLRKEFSPYGVITSAKVMTEGGHSKGF GFVCFSSPEEATKAVTEMNGRIVGTKPLYVALAQRKEERKAILTNQYMQRLSTMRTLSNP LLGSFQQPSSYFLPAMPQPPAQAAYYGCGPVTPTQPAPRWTSQPPRPSCASMVRPPVVPR RPPAHISSVRQASTQVPRTVPHTQRVANIGTQTTGPSGVGCCTPGRPLLPCKCSSAAHST YRVQEPAVHIPGQEPLTASMLAAAPLHEQKQMIGERLYPLIHDVHTQLAGKITGMLLEID NSELLLMLESPESLHAKVTDRAMGSCKQWWMGGFHRNPVDGGGSCIQDDRRGSGRAAGTP GYGAAEGVHALKPGGPFVSEGGGDRANGRHDWGGAQALRNRNPKGVLKYTEGIYTRAGEV LGTDPSTVQWMMTWISHNGPELQGPPSDVHAAPGLGTLQGKHPAASRLAKGLCNIINPCR NVLGYLPRDVQSRSLPSRVPERPSESGYYDPSRLGSAKNSPATNLTESSRLLCHSQQLTL PVLRSTDLQPVFTMSCPLLWLLLICSLCSSIVFSPNPDCWHFFTLSPGPSRFLMPCSPEL SNLQAHLPFVPSRG >gi568815578r:44843112_45060333|GENSCAN_predicted_CDS_2|2745_bp atgaacgccagcggttctggctacccgcttgcctcgctttacgtgggcgatctgcacccc gacgtgaccgaggccatgctctatgagaagttctctcccgccggccccatcctgtccatc cgcgtgtgccgcgatgtagccacccggcgctcgctgggctacgcctacatcaacttccag cagcccgcggacggtgaggaacaggctcagagagcggagcgggcactggacacaatgaac tttgagatgctcaaaggccagcctattcgcatcatgtggtcccagcgagacccaggactt cgcaagtcaggtgtgggcaacatcttcatcaagaacctggaggactccattgacaacaag gctttatatgataccttctccacctttgggaacatcctctcttgcaaggtagaggatgaa gggtgtacgtctttgggtagatcggtgtcaactacctgcctggtggcgtgtgacgagcat ggctcccggggtttcggctttgtccattttgagacccatgaggccgcacagcaggccatc aacaccatgaatgggatgctgctgaatgaccgcaaagtctttgtgggtcacttcaagtct cgacgggagcgggaggcggagctgggggcgcgggccctggagttcaccaacatctacgtg aagaacctcccggtggatgtggacgagcaaggcctgcaggacctcttctcccagtttggg aaaatgctgagtgtgaaggtgatgagggacaacagcggccactcgcggtgctttggcttt gtcaactttgagaagcatgaggaagcccagaaggccgtggtccatatgaacgggaaggag gtgagcgggcggctgctgtacgcgggccgggcccaaaagcgcgtggagcggcagaatgaa ctgaagcgcaggtttgagcagatgaagcaggaccggctgaggcgttaccagggtgtgaac ttgtatgtgaagaatctggacgactccattgatgacgacaaactgaggaaagagttctct ccctatggagtaattaccagtgcgaaggtgatgacagagggtggccacagcaaggggttt ggctttgtgtgtttttcctccccagaagaggcgacaaaggccgtgacagagatgaacggg cgcatcgtgggcaccaagccactctacgtggcactggcccagcgcaaagaggagcggaag gccatcttgaccaaccagtacatgcagcgcctctccaccatgcggaccctgagcaacccc ctcctgggctcctttcagcagccctccagctacttcctgcctgccatgccccagcctcca gcccaggctgcatactatggctgtggcccagtgacacccacccagcctgcccccaggtgg acatcccagccacctagaccttcctgtgcctcaatggtccggccaccagttgtgcctcgg cgccccccggcccacatcagcagtgtcaggcaggcctccacccaggtgccacgcacggtg cctcatacccagagagtagccaacattggtactcagaccacaggacccagtggggtagga tgctgtacaccaggccggccgctcctgccgtgcaaatgttcctcagcagcacatagcacc tatcgggtccaggagccggctgtgcacatcccaggacaggagcccctgaccgcgtccatg ctggctgcggcgcccctgcatgagcaaaagcagatgattggggagcgtctctaccccctt atccatgatgtccacacccagctggctggcaagatcacgggcatgctgctggagattgac aactcagagctgttgctcatgctggagtctccagaatccctccatgccaaggtgacagac agggccatgggatcttgcaagcagtggtggatgggagggttccacagaaaccctgtggat ggaggaggatcctgtatccaggatgatagacgaggcagtggccgtgctgcaggcacacca ggctatggagcagccgaaggcgtacatgcactgaaaccaggaggcccttttgtgtctgag ggaggtggggacagagcaaacggtagacatgactggggtggggctcaggctctaagaaac agaaaccctaaaggagtgctgaagtacacagagggcatatacacaagagcaggagaagtg ttaggaactgacccatcaacagtacagtggatgatgacatggatatcacacaatggacca gagctacaggggcccccatctgatgtgcatgcagcccctggcttgggcaccctgcagggc aaacacccagctgcttctcggctggctaaggggttgtgtaacataattaacccttgtagg aacgttttagggtaccttcccagggacgtacagtccaggtctttgccctctcgtgtaccg gagcggcccagtgaaagtggctactatgacccaagcagactgggttctgctaaaaattca ccagcaacaaatctgaccgagtcctcaaggctgctctgccactctcagcaactcactctt cctgttctccgctccacagacctccaacctgtcttcaccatgtcctgcccacttctttgg ctgcttttaatctgttccctgtgcagctccatcgtcttctccccaaatcctgactgctgg catttctttactctcagtcctgggccctctcggtttctcatgccatgttctcctgagcta tctaatttacaagcgcatcttccatttgtacccagtcgaggatga >gi568815578r:44843112_45060333|GENSCAN_predicted_peptide_3|444_aa MAYLCNAICMTHMVTLDPVDQALPGATTATSVPGPQCDASCMMLFAVVPFNGIALAATQG AQSMLPPRVQGSLSSKPSTRRGRDAPGLQVSQAPPPRRGFAVGRRYSPPALAPGRCAAPH GGGRKELPTRRPGHGMAPKFPDSVEELRAAGNESFRNGQYAEASALYGRALRVLQAQGSS DPEEESVLYSNRAACHLKDGNCRDCIKDCTSALALVPFSIKPLLRRASAYEALEKYPMAY VDYKTVLQIDDNVTSAVEGINRMTRALMDSLGPEWRLKLPSIPLVPVSAQKRWNSLPSEN HKEMAKSKSKETTATKNRVPSAGDVEKARVLKEEGNELVKKGNHKKAIEKYSESLLCSNL ESATYSNRALCYLVLKQYTEAVKDCTEALKLDGKNVKAFYRRAQAHKALKDYKSSFADIS NLLQIEPRNGPAQKLRQEVKQNLH >gi568815578r:44843112_45060333|GENSCAN_predicted_CDS_3|1335_bp atggcatatttgtgtaatgccatttgcatgacccacatggtcaccctggaccccgtggat caggccctgcctggtgccaccaccgccacttctgtgccaggcccccagtgtgatgcaagt tgcatgatgctctttgctgtagtgccttttaatggcatagctttggcagcaactcaggga gcccagagcatgctgccgccccgcgtgcaagggagcctaagttccaaaccgagcacgcgc agagggcgggacgctccgggcctccaggtctcgcaggccccgccccctcgccgcgggttc gctgttgggcggagatattcgccgccggcgcttgcgcccggaaggtgtgccgcaccacac gggggaggaaggaaggagctcccaactcgccggcctggccacgggatggcccccaaattc ccagactctgtggaggagctccgcgccgccggcaatgagagtttccgcaacggccagtac gccgaggcctccgcgctctacggccgcgcgctgcgggtgctgcaggcgcaaggttcttca gacccagaagaagaaagtgttctctactccaaccgagcagcatgtcacttgaaggatgga aactgcagagactgcatcaaagattgcacttcagcactggccttggttcccttcagcatt aagcccctgctgcggcgagcatctgcttatgaggctctggagaagtaccctatggcctat gttgactataagactgtgctgcagattgatgataatgtgacgtcagccgtagaaggcatc aacagaatgaccagagctctcatggactcgcttgggcctgagtggcgcctgaagctgccc tcaatccccttggtgcctgtttcagctcagaagaggtggaattccttgccttcggagaac cacaaagagatggctaaaagcaaatccaaagaaaccacagctacaaagaacagagtgcct tctgctggggatgtggagaaagccagagttctgaaggaagaaggcaatgagcttgtaaag aagggaaaccataagaaagctattgagaagtacagtgaaagcctcttgtgtagtaacctg gaatctgccacgtacagcaacagagcactctgctatttggtcctgaagcagtacacagaa gcagtgaaggactgcacagaagccctcaagctggatggaaagaacgtgaaggcattctac agacgggctcaagcccacaaagcactcaaggactataaatccagctttgcagacatcagc aacctcctacagattgagcctaggaatggtcctgcacagaagttgcggcaggaagtgaag cagaacctacactaa >gi568815578r:44843112_45060333|GENSCAN_predicted_peptide_4|486_aa MASFITDVQCLPNGLHILLSSSEPDIGRQLKKLDEDSLTKQPEEVFDVLEKLGEGSYGSV YKAIHKETGQIVAIKQVPVESDLQEIIKEISIMQQCDSPHVVKYYGSYFKNTDLWIVMEY CGAGSVSDIIRLRNKTLTEDEIATILQSTLKGLEYLHFMRKIHRDIKAGNILLNTEGHAK LADFGVAGQLTDTMAKRNTVIGTPFWMAPEVIQEIGYNCVADIWSLGITAIEMAEGKPPY ADIHPMRAIFMIPTNPPPTFRKPELWSDNFTDFVKQCLVKSPEQRATATQLLQHPFVRSA KGVSILRDLINEAMDVKLKRQESQQREVDQDDEENSEEDEMDSGTMVRAVGDEMGTVRVA STMTDGANTMIEHDDTLPSQLGTMVINAEDEEEEGTMKKMMFLHFLYKGTGSERLRLLLV HGHMARRDETMQPAKPSFLEYFEQKEKENQINSFGKSVPGPLKNSSDWKIPQDGDYEFGL NIKDTH >gi568815578r:44843112_45060333|GENSCAN_predicted_CDS_4|1461_bp atggcatcttttattactgatgttcagtgtcttcctaatggtcttcacatcctcctcagc tcttcagaacctgacattgggaggcagctgaaaaagttggatgaagatagtttaaccaaa caaccagaagaagtatttgatgtcttagagaaacttggagaagggtcctatggcagcgta tacaaagctattcataaagagaccggccagattgttgctattaagcaagttcctgtggaa tcagacctccaggagataatcaaagaaatctctataatgcagcaatgtgacagccctcat gtagtcaaatattatggcagttattttaagaacacagacttatggatcgttatggagtac tgtggggctggttctgtatctgatatcattcgattacgaaataaaacgttaacagaagat gaaatagctacaatattacaatcaactcttaagggacttgaataccttcattttatgaga aaaatacaccgagatatcaaggcaggaaatattttgctaaatacagaaggacatgcaaaa cttgcagattttggggtagcaggtcaacttacagataccatggccaagcggaatacagtg ataggaacaccattttggatggctccagaagtgattcaggaaattggatacaactgtgta gcagacatctggtccctgggaataactgccatagaaatggctgaaggaaagcccccttat gctgatatccatccaatgagggcaatcttcatgattcctacaaatcctcctcccacattc cgaaaaccagagctatggtcagataactttacagattttgtgaaacagtgtcttgtaaag agccctgagcagagggccacagccactcagctcctgcagcacccatttgtcaggagtgcc aaaggagtgtcaatactgcgagacttaattaatgaagccatggatgtgaaactgaaacgc caggaatcccagcagcgggaagtggaccaggacgatgaagaaaactcagaagaggatgaa atggattctggcacgatggttcgagcagtgggtgatgagatgggcactgtccgagtagcc agcaccatgactgatggagccaatactatgattgagcacgatgacacgttgccatcacaa ctgggcaccatggtgatcaatgcagaggatgaggaagaggaaggaactatgaaaaagatg atgtttctccactttctctataaaggaacaggttcagagagattgagattgttgcttgtt catggtcacatggctagaagggatgagaccatgcagcctgcgaaaccatcctttcttgaa tattttgaacaaaaagaaaaggaaaaccagatcaacagctttggcaagagtgtacctggt ccactgaaaaattcttcagattggaaaataccacaggatggagactacgagtttggttta aatatcaaggatactcactaa >gi568815578r:44843112_45060333|GENSCAN_predicted_peptide_5|117_aa MGDGNVGVRTIDTVAQRQPRTWSFCDLADSAYPNVLGAQLTWPLKPPHGSCVLKASWPSQ GPHPATHRMASNPLDGLLSASTPEGKRRLAQRLWISSSLVKPANSSVIPWAEPHLLQ >gi568815578r:44843112_45060333|GENSCAN_predicted_CDS_5|354_bp atgggagatggcaatgtgggtgtgaggaccatcgacactgtggcccagcgacagcctaga acgtggtctttctgtgaccttgcagactcagcctatcctaacgttttgggagctcagctg acatggccactaaagcctccacatggctcttgtgttctaaaggcctcctggcccagccag ggcccacaccctgctacccaccggatggccagtaacccactggatggcctgctctctgcc agcactccagaaggtaaacgacggcttgcccagagactttggatcagctccagtcttgtc aaaccagcaaactcctctgtgattccatgggctgaaccacatcttctccaatga