GENSCAN 1.0 Date run: 3-Nov-116 Time: 20:30:58 Sequence gi568815577f:29221281_29442830 : 221550 bp : 39.97% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 PlyA - 524 519 6 -0.45 1.02 Term - 1575 1162 414 0 0 38 33 340 0.877 17.78 1.01 Init - 1885 1694 192 1 0 78 55 175 0.459 12.21 1.00 Prom - 23441 23402 40 -3.05 2.06 PlyA - 23730 23725 6 1.05 2.05 Term - 32094 31666 429 0 0 46 41 241 0.775 9.52 2.04 Intr - 35400 35292 109 2 1 74 109 57 0.292 5.77 2.03 Intr - 42773 42666 108 1 0 119 67 8 0.012 0.28 2.02 Intr - 54701 54561 141 1 0 42 23 144 0.013 1.85 2.01 Init - 63215 63130 86 2 2 58 63 83 0.102 3.14 2.00 Prom - 71703 71664 40 -3.25 3.03 PlyA - 71790 71785 6 1.05 3.02 Term - 76971 76685 287 1 2 26 47 316 0.532 15.88 3.01 Init - 79688 79637 52 2 1 64 68 59 0.456 2.87 3.00 Prom - 94285 94246 40 -3.75 4.00 Prom + 99318 99357 40 -3.45 4.01 Init + 100001 100234 234 1 0 47 78 282 0.842 21.29 4.02 Intr + 104779 106113 1335 0 0 75 83 1099 0.999 95.41 4.03 Intr + 108207 108413 207 2 0 76 119 223 0.995 22.55 4.04 Term + 121119 121553 435 2 0 59 37 228 0.962 9.10 4.05 PlyA + 124544 124549 6 1.05 5.08 PlyA - 126687 126682 6 1.05 5.07 Term - 134502 134370 133 2 1 101 37 113 0.502 4.08 5.06 Intr - 136805 136402 404 2 2 100 69 189 0.759 10.50 5.05 Intr - 139074 138798 277 2 1 78 71 65 0.129 0.40 5.04 Intr - 143156 143060 97 0 1 93 39 104 0.183 4.15 5.03 Intr - 154160 154100 61 2 1 73 28 58 0.031 -4.31 5.02 Intr - 156252 156077 176 1 2 29 76 139 0.742 5.54 5.01 Init - 157246 157039 208 1 1 62 20 196 0.515 9.23 5.00 Prom - 171828 171789 40 -3.05 6.02 PlyA - 171945 171940 6 1.05 6.01 Sngl - 177744 177541 204 0 0 68 49 191 0.568 8.14 6.00 Prom - 187261 187222 40 -4.65 7.02 PlyA - 188139 188134 6 1.05 7.01 Sngl - 191178 190885 294 0 0 50 48 189 0.800 6.55 7.00 Prom - 194330 194291 40 -6.65 8.00 Prom + 194691 194730 40 -4.15 8.01 Init + 200214 200891 678 2 0 42 86 314 0.702 21.78 8.02 Intr + 201952 202075 124 0 1 20 90 96 0.578 2.24 8.03 Intr + 204307 204388 82 2 1 85 89 22 0.310 -0.22 8.04 Intr + 209201 209482 282 2 0 56 89 105 0.311 3.01 8.05 Intr + 213549 213711 163 0 1 64 97 29 0.396 0.46 8.06 Term + 214574 214699 126 1 0 58 38 103 0.290 -0.40 8.07 PlyA + 215467 215472 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815577f:29221281_29442830|GENSCAN_predicted_peptide_1|201_aa MESGRVNVTVKVGGDSLVLSVAQDALEGALQCLHHHLLDVIIFGRFLQMAGQVHDRHIGG EDMPVAVMAWTVVISHDAKVGMDDLDQGCEEVGGTGGIADDLEAVFELLMVHPHHEHGVI NRQGRDDDHFDSLLSVCPSLLHGSGSEDASGLPNIMSTGITPFAFGGIFLREHGDRISTD DKFPILSLANAMESYWNMQSM >gi568815577f:29221281_29442830|GENSCAN_predicted_CDS_1|606_bp atggaaagtggaagagtaaatgtcactgttaaagttggaggagacagcctggtgctcagt gtagcccaagatgcccttgaaggggccctccaatgcctgcatcaccaccttcttgatgta atcatatttggcaggtttctccagatggcaggtcaggtccatgaccgacatattggtggt gaggacatgccagtggcagtgatggcatggactgtggtcataagtcatgatgccaaagtt ggcatggatgaccttgaccaggggtgcgaagaagttggtggtacaggaggcatcgctgat gatcttgaggctgtttttgaacttctcatggttcacccccatcacgaacatggggtcatc aacagacagggcagagatgatgaccattttgactccctactctcagtatgccccagcctt ctccatgggagtgggagtgaagatgccagtgggctccccaacataatgagcactggcatc accccatttgcttttggtgggatcttcctccgggaacacggtgataggatttccactgat gacaagtttcccattctcagccttgccaatgccatggaatcatactggaatatgcagagc atgtag >gi568815577f:29221281_29442830|GENSCAN_predicted_peptide_2|290_aa MPGFNSDTLTAVADLHVITHNNNANYAIRMPNTKSEPECKLWTLGDNDVSVRFIDCNKCT TLVGDVNNGGGYACVGPPCSLRHNIEIKPIDNPTMASKCSSEHLVFNTQVFRAMMTVNHM SSCFENSPQTMGADGLFGSSTPLFHLPQHRDLVPCVTATPAIAKRVQHTAWPMFSEGASP KPWQFPHGVEPAVAQKSRIEVWEPPPRFQKMHGNAWMPRQKFATQVGPTWRISARAVQKG NVESEPPHRVPTGAPPSGAVRRGLLSSKPQNGRFNDSLPCAPGKAIDAQC >gi568815577f:29221281_29442830|GENSCAN_predicted_CDS_2|873_bp atgccaggattcaactcagacacactgactgctgttgcagacctccatgtgataactcat aataataacgccaactatgctattagaatgccgaacaccaagagtgaacctgaatgtaaa ctatggactctgggtgataatgatgtgtcagttaggtttattgactgcaacaaatgtacc actctggtaggggatgtcaataatggtggaggctatgcatgtgtggggcctccctgttcc ttgagacacaatattgaaattaagccaattgataaccccacaatggcctcaaagtgttca agtgaacacttagtgttcaacactcaagtgttcagagcgatgatgactgtgaatcatatg agctcttgctttgaaaatagtccacagacaatgggagcagatgggctttttggcagcagt acccctcttttccatctgccacagcatagggacttggtgccctgtgtcacagccactcca gccattgctaaaagggtccaacatacagcttggcccatgttttcagagggtgcaagcccc aaaccttggcagtttccacatggtgttgagcctgcagttgcacagaagtcaagaattgag gtttgggaacctcctcctagatttcagaagatgcatggaaacgcctggatgcccaggcaa aagtttgctacacaggtggggcccacatggagaatctctgccagggcagtacagaaggga aatgtggagtcagagcccccacacagagtgcctactggtgcaccacctagtggagctgtg agaagagggttactgtcctccaaaccccaaaatgggagattcaatgacagcctgccctgt gcacctggaaaagccatagacgctcaatgctag >gi568815577f:29221281_29442830|GENSCAN_predicted_peptide_3|112_aa MTPGVKAVVSYNQDNETVKQGRPRQVFPFRCRCQEASSQVQRPVVSAGPSRALVPLSSSG AYSGVYPTVRFEIMFPSHLTPLENQSTGCQRSGMHAQRRGLGTPTPRGFLRR >gi568815577f:29221281_29442830|GENSCAN_predicted_CDS_3|339_bp atgacaccaggtgtaaaggcagtggtgtcatataatcaagacaatgagacagtgaaacaa gggcgcccacgacaggtgttcccttttcgctgtcgctgccaggaagcttccagccaggtt cagcgcccagtggtgtcagccggcccatctcgagccttggttccgctgtcatcttctggt gcttattctggtgtttatccaactgtccgttttgaaattatgttcccaagccatttaaca cctctggagaaccaaagcactggctgtcagcggagcgggatgcacgcgcagcggagggga ctgggcacgccaaccccacgagggttcctgcggagatga >gi568815577f:29221281_29442830|GENSCAN_predicted_peptide_4|736_aa MSLSENSVFAYESSVHSTNVLLSLNDQRKKDVLCDVTIFVEGQRFRAHRSVLAACSSYFH SRIVGQADGELNITLPEEVTVKGFEPLIQFAYTAKLILSKENVDEVCKCVEFLSVHNIEE SCFQFLKFKFLDSTADQQECPRKKCFSSHCQKTDLKLSLLDQRDLETDEVEEFLENKNVQ TPQCKLRRYQGNAKASPPLQDSASQTYESMCLEKDAALALPSLCPKYRKFQKAFGTDRVR TGESSVKDIHASVQPNERSENECLGGVPECRDLQVMLKCDESKLAMEPEETKKDPASQCP TEKSEVTPFPHNSSIDPHGLYSLSLLHTYDQYGDLNFAGMQNTTVLTEKPLSGTDVQEKT FGESQDLPLKSDLGTREDSSVASSDRSSVEREVAEHLAKGFWSDICSTDTPCQMQLSPAV AKDGSEQISQKRSECPWLGIRISESPEPGQRTFTTLSSVNCPFISTLSTEGCSSNLEIGN DDYVSEPQQEPCPYACVISLGDDSETDTEGDSESCSAREQECEVKLPFNAQRIISLSRND FQSLLKMHKLTPEQLDCIHDIRRRSKNRIAAQRCRKRKLDCIQNLESEIEKLQSEKESLL KERDHILSTLGETKQNLTGLCQKVCKEAALSQEQIQILAKYSAADCPLSFLISEKDKSTP DGELALPSIFSLSDRPPAVLPPCARGNSEPGYARGQESQQMSTATSEQAGPAEQCRQSGG ISDFCQQMTDKCTTDE >gi568815577f:29221281_29442830|GENSCAN_predicted_CDS_4|2211_bp atgtctctgagtgagaactcggtttttgcctatgaatcttctgtgcatagcaccaatgtt ttactcagccttaatgaccagcggaagaaagatgtgctgtgcgatgtcaccatctttgtg gagggacagcggttccgcgctcaccggtccgtgctggcggcatgcagcagttacttccac tcaagaatcgtaggccaggctgatggagagctgaacattactcttccagaagaggtgaca gttaaaggatttgaacctttaattcagtttgcctacactgctaaactgattttaagtaaa gagaatgtggatgaagtgtgcaaatgtgtggagtttttaagtgtacataatattgaggaa tcctgctttcagtttctgaaatttaagtttttggactccactgcagaccagcaagaatgc ccaagaaaaaaatgcttttcatcacactgtcagaaaacagaccttaaactttcacttttg gaccagagggatctagaaactgatgaagtggaggaatttctggaaaataaaaatgttcag actcctcagtgtaaactccgcaggtatcaaggaaatgcaaaagcctcacctcctctacaa gacagtgccagtcagacatatgagtccatgtgcttagagaaggatgctgctctggccttg ccttctttatgccccaaatacagaaaattccaaaaagcatttggaactgacagagtccgt actggggaatctagtgtcaaagacattcatgcttctgttcagccaaatgaaaggtctgaa aatgaatgcctgggaggagtcccggagtgtagagatttgcaggtgatgttaaaatgtgac gaaagtaaattagcaatggaacctgaagaaacgaagaaagatcctgcttctcagtgccca actgaaaaatcagaagtgactcctttcccccacaattcttccatagaccctcatggactt tattctttgtctcttttacacacatatgaccaatatggtgacttgaattttgctggtatg caaaacacaacagtgttaacagaaaagcctttgtcaggtacagacgtccaagaaaaaaca tttggtgaaagtcaggatttacctttgaaatccgacttgggcaccagggaagatagtagt gttgcatctagtgataggagtagtgtggagcgagaagtggcagaacacctagcaaaaggc ttctggagtgacatttgcagcacggacactccttgccaaatgcagttatcacctgctgtg gccaaagatggctcagaacagatctcacagaaacggtctgagtgtccgtggttaggtatc aggattagtgagagcccagaaccaggtcaaaggactttcacaacattaagttctgtcaac tgcccttttataagtactctgagtactgaaggctgttcaagcaatttggaaattggaaac gatgattatgtttcagaaccccagcaagaaccttgcccatatgcttgtgtcattagcttg ggagacgactctgagacggacaccgaaggagacagtgaatcctgttcagccagagaacaa gaatgtgaggtaaaactgccattcaatgcacaacggataatttcactgtctcgaaatgat tttcagtccttgttgaaaatgcacaagcttactccagaacagctggattgtatccatgat attcgaagaagaagtaaaaacagaattgctgcacagcgctgtcgcaagagaaaacttgac tgtatacagaatcttgaatcagaaattgagaagctgcaaagtgaaaaggagagcttgttg aaggaaagagatcacattttgtcaactctgggtgagacaaagcagaacctaactggactt tgccagaaagtttgtaaagaagcagctctgagtcaagaacaaatacagatactcgccaag tactcagctgcagattgcccactttcatttttaatttctgaaaaagataaaagtactcct gatggtgaactggcgttaccatcaattttcagtttatctgaccggcctccagcagtgctg cctccctgtgccagaggaaacagtgagcctggctacgcgcgagggcaggagtcccagcag atgtccacagccacctctgagcaagctgggcctgcggaacagtgtcgtcagagtggtggg atctcagatttctgtcagcagatgactgataaatgtactactgatgagtaa >gi568815577f:29221281_29442830|GENSCAN_predicted_peptide_5|451_aa MKEIHQRSDQSIRDAWGASGKEGCAENRMERIKEGIDSDYFNKRPAIPTDNMSITPMARS KRHGDRGLGGDSDAAVLWARSPFKATGLCTIEQASLDAPFPGSVLRHVRGHFKTSVPETF NLSLQSGQKSYVEILTSNAMVFGGGALIGPPSDFAKGTGPSLMSTAASLICLYHTQNAVY QYLPDGQLRFVSPKYLRQVSINLESLSAKAKDVPMTASGSLGYMMTCTQVVRAQLGFIHF SLILYIDETSINMHKRYIGLVQKGGTTRSKGFQGRASPLPVTPAGKYKERAEEVRGGHQT TGTSSQGVWGLGEWEKRITLTPLDPVGVAKAGVIIYMNMGNELPICCPLLEEGINPEVWA LEGTNSSSSLKPSHRMKLFFMHHRESKDGSWSPYSDSWGNPTTSGIPKSCGSHLAITRLR TLIFNLLVKSVSSRIEAIKLQMVLRMEPQMS >gi568815577f:29221281_29442830|GENSCAN_predicted_CDS_5|1356_bp atgaaagagatacatcaacgcagtgatcagagcatcagggatgcttggggggcatcagga aaggaaggctgtgcagagaacaggatggagaggataaaggagggcattgattctgactac ttcaacaagaggcctgccatccccactgataatatgagcattacccccatggcccgttca aaacgtcatggcgatagaggtttagggggggacagtgatgctgctgtactgtgggccagg tcacccttcaaggccacaggcctttgtacaatcgaacaggcttctcttgacgcaccattt cctggcagtgtgctcaggcatgttcgtggccatttcaaaacttcagttccagagactttc aacttaagtcttcaaagtggacagaaatcatatgttgaaattctaacctccaatgcaatg gtatttggaggaggagccttgataggtccacctagtgacttcgcaaaaggtactggccct tctctcatgagcacagctgccagccttatatgtttatatcacactcagaatgccgtctat caatatctgcctgatggacagctacgttttgtgagcccaaagtatttgagacaggtctca attaatttagaaagtttatctgccaaggctaaggacgtgcctatgacagcgtcaggaagt cttgggtacatgatgacatgtacccaagtggtcagggcacagcttggttttatacatttt agcttgattttatacattgatgagacctcaatcaatatgcataagaggtacattggtttg gtccagaaaggtggaacaactcgaagtaagggattccagggaagagcatccccacttcca gtcactccagcagggaagtataaggaaagggcagaagaagtgaggggtggacatcagacc acagggactagctctcagggcgtgtggggactaggtgagtgggagaagcggatcacgctg actccactagatccagtaggagtagccaaagctggagttattatctacatgaatatgggg aacgagttacccatttgttgtcccctacttgaggagggaatcaaccctgaagtctgggca ttggaaggaacaaactcaagctccagccttaagccttcccacaggatgaaacttttcttt atgcatcacagagagagcaaggatggctcttggagtccttactcagactcatggggcaac cccacaaccagtggcatacctaagtcctgtggcagccatcttgctattactcgccttcgg acccttatttttaacctccttgtcaaatctgtttcctctagaatcgaggccatcaagcta cagatggtcttacgaatggaaccccaaatgagctaa >gi568815577f:29221281_29442830|GENSCAN_predicted_peptide_6|67_aa MQGLILGTPKSRDHEAEKESAATKEERQYSLVSQKASEGLIEKEGNDPVAICGKVSPGGL RTAHWST >gi568815577f:29221281_29442830|GENSCAN_predicted_CDS_6|204_bp atgcaaggactaatccttgggactccaaaatcaagagaccatgaagctgagaaggaatca gcagcaactaaggaggaaaggcagtacagcctggtatcccagaaggcaagtgaagggctc atcgagaaagaagggaatgaccctgtggccatctgtggcaaagtcagtcctgggggactg agaactgctcattggagtacctga >gi568815577f:29221281_29442830|GENSCAN_predicted_peptide_7|97_aa MQHTRRYHRRLGKGQLGIASLDTLCRAHVEHTQTICTPQMSQRLEAQHTYYHQPRETHKY HERSKNAFPPFMLTPASQFWRSQQQELGREAKEKGIA >gi568815577f:29221281_29442830|GENSCAN_predicted_CDS_7|294_bp atgcagcacactagacggtaccaccggaggcttggaaaaggccaactgggtattgcatct cttgacaccctgtgcagagcccacgtggagcacacacagaccatctgcaccccacaaatg agccagcgtttggaggcccagcacacatattatcaccagccaagagaaacccataaatat catgaacggagtaaaaatgcctttccccctttcatgctaacccctgcttcccagttctgg aggagccagcagcaggagttgggaagagaagcaaaggagaaaggaattgcataa >gi568815577f:29221281_29442830|GENSCAN_predicted_peptide_8|484_aa MIISIDAEKAFDKIQQPFMLKTLNKLGIDGTYFKIIRAIYDKPTANIILNGQKLEAFPLK TGTRQGCPLSLLLFNIVLEVLARAIRKEKEIKGIQLGKEEVKLSLFADDMIVYLENPIVS AQNLLKLISNFSKVSGYKINVQKSQAFLYTNNRQTESQIMSELPFTIASKRIKYLGIQLT RDVKDLFKENYKPLLKEIKEDTNKWKNIPCSWVGRINIVKMAILPKRVTEESASALQEAT AHCLHFLSLPTGESFRKEEHKHWDARGDFLKTQNKQCNYPIAGSKTWVKRRENDSFGNWI YSVFSNCLVIDFKSEKKESLNNSFCKCVKAECEEKKLFSGTRSWFTFKKKISCLWQNGIA ALSWMVICTSKFITNLAGEMLKTKAFISHLDPWGCQFYLLVLLPVEGVQVLGVLNINWTK RTKKERMKQLKQRFIEIRNILHKFVLDLICKPTVEQKNMYQLPMPGEDSNYLEDRQGHHN ASHL >gi568815577f:29221281_29442830|GENSCAN_predicted_CDS_8|1455_bp atgattatctcaatagatgcagaaaaagcctttgacaaaattcaacaacccttcatgcta aaaactctcaataaattaggtattgatgggacgtatttcaaaataataagagctatctat gacaaacccacagccaatatcatactgaatgggcaaaaactggaagcattccctttgaaa actggcacaagacagggatgccctctctcactgctcctattcaacatagtgttggaagtt ctggccagggcaatcaggaaggagaaggaaataaagggtattcaattaggaaaagaggaa gtcaaattgtccctgtttgcagacgacatgattgtttatctagaaaaccccatcgtctca gcccaaaatctccttaagctgataagcaacttcagcaaagtctcaggatacaaaatcaat gtacaaaaatcacaagcattcttatacaccaacaacagacaaacagagagccaaatcatg agtgaactcccattcacaattgcttcaaagagaataaaatacctaggaatccaacttaca agggatgtgaaggacctcttcaaggagaactacaaaccactgctcaaggaaataaaagag gatacaaacaaatggaagaacattccatgctcatgggtaggaagaatcaatatcgtgaaa atggccatactgcccaagagggtgactgaggagtctgcctctgcccttcaagaagccaca gctcattgtctgcacttcctttcattgccaactggggagagcttcagaaaggaggaacat aagcattgggatgcacgaggagactttctaaagacacagaataaacaatgcaattatcca atagcaggttctaaaacatgggtaaaaagaagagagaatgacagttttggaaactggata tactctgtattcagcaattgccttgtaatagacttcaagagtgagaaaaaagaatccctt aataatagtttctgcaaatgtgtgaaagctgagtgtgaggagaaaaagttgttttcagga actaggagctggttcacctttaaaaagaaaatcagctgcctctggcaaaatggaattgca gcattgtcctggatggtaatttgcactagtaaatttatcaccaacctggcaggggagatg ctgaagacaaaagcttttatctctcaccttgatccttggggatgccaattttatttgctg gttctgctaccggtggagggtgtccaggttctcggtgtcttgaacataaattggacaaaa cgtacaaagaaagaaagaatgaagcaactaaagcagagatttattgaaatcagaaacata ctccacaagtttgtacttgacctcatctgtaagcctactgttgagcagaaaaacatgtac cagttaccaatgccaggtgaagattccaactatctggaggatagacaaggtcatcataat gccagccacctgtaa