GENSCAN 1.0 Date run: 6-Nov-116 Time: 12:34:51 Sequence gi568815587f:62028162_62251973 : 223812 bp : 45.06% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 5477 5518 42 2 0 65 115 57 0.791 3.46 1.02 Term + 5747 5978 232 2 1 63 42 152 0.901 4.05 1.03 PlyA + 6721 6726 6 1.05 2.05 PlyA - 8964 8959 6 1.05 2.04 Term - 27767 27592 176 0 2 61 45 193 0.103 10.22 2.03 Intr - 36337 36181 157 1 1 83 6 101 0.070 0.98 2.02 Intr - 37037 36874 164 0 2 93 90 25 0.616 2.89 2.01 Init - 47870 47822 49 2 1 86 58 46 0.349 0.51 2.00 Prom - 69578 69539 40 -3.36 3.00 Prom + 69977 70016 40 -3.36 3.01 Init + 96732 96931 200 2 2 53 85 161 0.342 10.67 3.02 Intr + 96965 97068 104 2 2 89 43 18 0.511 -2.78 3.03 Intr + 99990 100140 151 1 1 93 72 130 0.583 11.12 3.04 Intr + 100609 100722 114 1 0 86 97 132 0.999 13.46 3.05 Intr + 101621 102429 809 2 2 102 65 902 0.633 80.39 3.06 Intr + 109671 109722 52 1 1 129 92 52 0.999 7.67 3.07 Intr + 110552 110609 58 2 1 101 94 50 0.999 5.79 3.08 Intr + 110727 110844 118 2 1 102 100 164 0.999 19.04 3.09 Intr + 111526 111658 133 2 1 78 90 16 0.956 0.50 3.10 Intr + 112047 112142 96 0 0 85 49 90 0.794 3.92 3.11 Intr + 112543 112660 118 1 1 84 105 148 0.999 16.57 3.12 Intr + 112752 112883 132 2 0 140 100 147 0.976 21.94 3.13 Intr + 113339 113350 12 1 0 133 91 3 0.511 0.08 3.14 Intr + 116821 116930 110 0 2 99 87 239 0.995 23.98 3.15 Intr + 117008 117128 121 2 1 108 81 306 0.947 32.40 3.16 Intr + 117468 117602 135 2 0 44 80 223 0.704 17.86 3.17 Intr + 118497 118741 245 2 2 108 78 518 0.994 49.10 3.18 Intr + 120315 120393 79 0 1 52 100 219 0.727 19.05 3.19 Intr + 120578 120685 108 1 0 89 50 231 0.607 19.88 3.20 Intr + 121896 122046 151 2 1 70 99 138 0.760 12.84 3.21 Term + 123601 123815 215 2 2 89 54 263 0.999 20.39 3.22 PlyA + 124146 124151 6 1.05 4.00 Prom + 131085 131124 40 -3.86 4.01 Init + 132012 132175 164 2 2 70 71 70 0.040 2.71 4.02 Intr + 134455 134607 153 1 0 61 94 49 0.046 1.99 4.03 Intr + 141071 141535 465 2 0 91 -69 579 0.450 34.74 4.04 Term + 141544 141703 160 1 1 19 51 234 0.696 10.21 4.05 PlyA + 144521 144526 6 1.05 5.00 Prom + 150387 150426 40 -4.96 5.01 Init + 162124 162178 55 0 1 112 84 116 0.998 13.15 5.02 Intr + 163895 164082 188 0 2 72 19 112 0.483 2.11 5.03 Intr + 164304 164352 49 2 1 107 38 87 0.788 3.85 5.04 Term + 178148 178281 134 0 2 88 42 99 0.192 3.55 5.05 PlyA + 178681 178686 6 1.05 6.00 Prom + 180475 180514 40 -6.26 6.01 Init + 180571 180625 55 0 1 91 109 177 0.999 19.56 6.02 Intr + 182252 182439 188 0 2 115 46 174 0.674 15.31 6.03 Intr + 199435 199471 37 0 1 73 82 28 0.052 -1.46 6.04 Intr + 201576 201685 110 1 2 68 76 59 0.124 2.70 6.05 Term + 204049 204258 210 0 0 54 48 81 0.137 -2.01 6.06 PlyA + 205851 205856 6 1.05 7.00 Prom + 212564 212603 40 -1.96 7.01 Init + 214147 214201 55 0 1 114 96 121 0.993 15.33 7.02 Intr + 215128 215315 188 2 2 88 53 108 0.922 6.71 7.03 Term + 216182 216349 168 1 0 101 54 23 0.552 -1.92 7.04 PlyA + 216628 216633 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815587f:62028162_62251973|GENSCAN_predicted_peptide_1|91_aa XINLFFLNAVTANSRIGGTVKVTPVSDGSVSSQSTQDDVKEKIKNYCLAATLLEQGAPFH LPLPQPTENGIWSLLSCPNSFLQGDPSRAKA >gi568815587f:62028162_62251973|GENSCAN_predicted_CDS_1|276_bp nnaattaatctcttcttcctcaatgctgtcacagcaaatagcaggattggaggaactgtt aaagtcactcctgtttctgatggcagtgtgagttctcaatctacacaggatgacgtgaaa gaaaagatcaagaactactgcttggcagccacacttcttgaacaaggagctccatttcac ctgccattgcctcaacccactgaaaatgggatttggtccctactgagctgtccaaattcc ttcctccaaggcgatccctcaagagcaaaggcctag >gi568815587f:62028162_62251973|GENSCAN_predicted_peptide_2|181_aa MGFHHVAQAGLELLTSGGYPSLVVFIRAFPGKGGRNIGREDQGGEIAHKKAEEIGWALWL MPVIPALWEAEELALKKFPNLPRLIVGYKTTIPDKILPHTLEEGQLHREVKKNLDRQALL AFLRIMAVYKWHPKQQHNQAHNKWRLNTGLQGCERRRSAGAEELKLTRQTRTPERVCQQQ I >gi568815587f:62028162_62251973|GENSCAN_predicted_CDS_2|546_bp atggggtttcaccatgttgcccaggctggtctcgaactcctgacctcaggtgggtatcca agtctggtggtgttcatcagggccttcccaggcaaaggaggaaggaacattggaagagaa gatcaagggggagaaatagcccacaagaaagcagaagaaataggctgggcgctgtggctc atgcctgtaatcccagcactttgggaggccgaggagctagccctaaagaaattccccaac ctacctcgcctaattgtaggttacaagacaaccattccagacaagatcctgccccatacc ctggaggaaggacagctgcacagagaggtcaagaagaatctagatagacaggccttgctg gcattcctgaggatcatggctgtctacaagtggcaccccaaacagcaacacaatcaggcg cacaacaagtggcgtctgaacacaggacttcaaggatgtgagcgaagaaggtctgctgga gcagaggaactgaaattgacaaggcaaacacggaccccggaacgagtctgccagcagcag atataa >gi568815587f:62028162_62251973|GENSCAN_predicted_peptide_3|1086_aa MSDSGMEYDSVNGSSCYALSAQRCGRRPTYSTLKRTGVIEEDDGAFSLVYGVEEVLERRK TGECLPRCLHGLKKRFLLLFQEPDSWYTDSFFFRVTSVLTYDRATMGTTAPGPIHLLELC DQKLMEFLCNMDNKDLVWLEEIQEEAERMFTREFSKEPELMPKTPSQKNRRKKRRISYVQ DENRDPIRRRLSRRKSRSSQLSSRRLRSKDSVEKLATVVGENGSVLRRVTRAAAAAAAAT MALAAPSSPTPESPTMLTKKPEDNHTQCQLVPVVEIGISERQNAEQHVTQLMSTEPLPRT LSPTPASATAPTSQGIPTSDEESTPKKSKARILESITVSSLMATPQDPKGQGVGTGRSAS KLRIAQVSPGPRDSPAFPDSPWRERVLAPILPDNFSTPTGSRTDSQSVRHSPIAPSSPSP QVLAQKYSLVAKQESVVRRASRRLAKKTAEEPAASGRIICHSYLERLLNVEVPQKVGSEQ KEPPEEAEPVAAAEPEVPENNGNNSWPHNDTEIANSTPNPKPAASSPETPSAGQQAGAAP SCCQGHTSCMAQEGWNLEAVQGAWSSCFYKASTHCSSCQSLAEIAVFTEAKTDQADGPRE PPQSARFASGKGRKRSYKQAVSELDEEQHLEDEELQPPRSKTPSSPCPASKVVRPLRTFL HTVQRNQMLMTPTSAPRSVMKSFIKRNTPLRMDPKCSFVEKERQRLENLRRKEEAEQLRR QKVEEDKRRRLEEVKLKREERLRKVLQARERVEQMKEEKKKQIEQKFAQIDEKTEKAKEE RLAEEKAKKKAAAKKMEEVEARRKQEEEARRLRWLQQVRAQEEEERRHQELLQKKKEEEQ ERLRKAAEAKRLAEQREQERREQERREQERREQERREQERREQERQLAEQERRREQERLQ AERELQEREKALRLQKEQLQRELEEKKKKEEQQRLAERQLQEEQEKKAKEAAGASKALNV TVDVQSPACTSYQMTPQGHRAPPKINPDNYGMDLNSDDSTDDEAHPRKPIPTWARGTPLS QAIIHQYYHPPNLLELFGTILPLDLEDIFKKSKPRYHKRTSSAVWNSPPLQGARVPSSLA YSLKKH >gi568815587f:62028162_62251973|GENSCAN_predicted_CDS_3|3261_bp atgtctgactcgggcatggagtatgactcagtaaatgggagcagctgctatgcgctttct gcacagaggtgtggtaggcgcccaacttacagcaccctgaaaaggacaggagtcatagag gaggacgatggggccttcagtctggtgtatggtgtggaagaagtgttggagaggaggaag actggagagtgtcttcctcggtgcttacatgggttgaagaagagatttttgctgctcttc caggagcctgattcttggtacactgattccttctttttcagggtcacctctgttttaact tacgacagagccaccatggggacgacggccccagggcccattcacctgctggagctatgt gaccagaagctcatggagtttctctgcaacatggataataaggacttggtgtggcttgag gaaatccaagaggaggccgagcgcatgttcaccagagaattcagcaaagagccagagctg atgcccaaaacaccttctcagaagaaccgacggaagaagagacggatttcttatgttcag gatgaaaacagagatcccatcaggagaaggttatcccgcagaaagtctcggagcagccag ctgagctcccgacgcctccgcagcaaggacagtgtagagaagctggctacagtggtcggg gagaacggctccgtcctgcggcgtgtgacccgtgctgcggctgcagctgccgcggctacc atggcattggctgcaccttcttcacccacccctgagtctcccacgatgctgactaagaag cccgaggataaccacacccagtgccagctggtgcctgtggtggagatcggcatcagtgag cgccagaatgctgagcagcatgtcacccagctcatgtccaccgagcctctgccccgcact ctgtccccgactccagcttcagccacagctccaacctcccagggcatcccgacatcagat gaggaatcaacacctaagaagtcgaaggccaggatactggagtccatcacagtgagctcc ctgatggctacaccccaggaccccaagggtcaaggggtcgggacggggcggtctgcgtct aagctcaggattgcgcaggtctcccctggcccacgggactcgccagcctttccagattct ccatggcgggagcgggtgctggctcccatcctgccggataacttctccacgcccacgggc tctcgcacggactctcaatcggtgcggcacagcccgatcgccccgtcttccccgagtccc caagtcttagcccagaagtactctctggtggccaaacaggaaagtgttgtccgcagggcg agcagaaggcttgccaagaagactgccgaagagccagctgcctctggccgcatcatctgt cacagttacctggagaggctcctgaatgttgaggtgccccagaaagttggttctgagcag aaggaaccccccgaggaggctgagcctgtggcggcagctgagccagaggtccctgagaac aatggaaataactcgtggccccacaatgacacggagattgccaacagcacacccaacccg aagcctgcagccagcagcccggaaacaccctctgcagggcagcaagcgggagctgccccc tcctgctgtcagggccacaccagctgcatggcccaggaggggtggaatttagaagctgtg caaggggcttggagctcctgcttctacaaggcgtccacacactgttcttcgtgtcagagc cttgctgaaattgctgtcttcacagaggccaagacggaccaagcagatggacccagagag ccaccgcagagtgccaggtttgcatccgggaaggggaggaagcgcagctacaagcaggcc gtgagtgagctggacgaggagcagcacctggaggatgaggagctgcagccccccaggagc aagaccccttcctcaccctgcccagccagcaaggtggtacggcccctccggacctttctg cacacagtgcagaggaaccagatgctcatgaccccgacctcagccccacgcagcgtcatg aagtcctttattaagcgcaacactcccctgcgcatggaccccaagtgcagcttcgtcgag aaggagcggcagcgcctggagaatctgcggcggaaggaggaggccgagcagctgcgcagg cagaaggtggaggaggacaagcggcggcggctggaggaggtgaagctgaagcgtgaggaa cgcctccgcaaggtgctgcaggcccgcgagcgggtggagcagatgaaggaggagaagaag aagcagattgagcagaagtttgctcagatcgacgagaagactgagaaggccaaggaggag cggctggcagaggagaaggccaagaaaaaggcggcggccaagaagatggaggaggtggaa gcacgcaggaagcaggaagaggaggcacgtaggctcaggtggctgcagcaggtgcgagca caggaggaggaagagcggcggcaccaagagctgctgcagaagaagaaggaagaggagcag gagcggctgcggaaggcggccgaggctaagcggctggcagagcagcgggagcaggagcgg cgggagcaggagcggcgcgagcaggagcggcgcgagcaggagcggcgggagcaggagcgg cgcgagcaggagcgacagctggcagagcaggagcgtcggcgggagcaggagcggctccag gccgagagggagctgcaggagcgggagaaggccctgcggctgcagaaggagcagctgcag agggaactggaggagaagaagaagaaggaagagcagcagcgtctggctgagcggcagctg caggaggagcaagagaagaaagccaaggaggcagcaggggccagcaaggccctgaatgtg actgtggacgtgcagtctccagcttgtacctcatatcagatgactccgcaagggcacagg gcccctcccaagatcaacccagataactacgggatggatctgaatagcgacgactccacc gatgatgaggcccatccccggaagcccatccccacctgggcccgaggcaccccgctcagc caggctatcattcaccagtactaccacccaccgaaccttctggagctctttggaaccatt ctcccactggacttggaggatatcttcaagaagagcaagccccgctatcacaagcgcacc agctctgctgtctggaactcaccgcccctgcagggcgccagggtccccagcagcctggcc tacagcctgaagaagcactga >gi568815587f:62028162_62251973|GENSCAN_predicted_peptide_4|313_aa MDKFLDTYTLPSLNQEEVESLNRPITSSEIEAAINSLPTKKSSGPDGFTAEFYQRDMNEA GNHHPQQTNTGTENQTLHVLTHKWEMNKENTWTQRGEHHTLGPVGRRPTSTDSSSLVAES LAGIRKMATNFSVYEKIWFDKFKYDIAERRLYEQMNGPVASTSRQKNGASVILHDIARAR ENIQKSLARSSGPRVSSGPNGEHSELVVLIASLEVENQRLSSVVQELQQAMSRLEAQLNV LEKSSAGHRATVPQTQHVSPIGALAKKPTTPAEDDKGNDIDLFGSNNEEEDEEAAQLREE WLPFPPVIRKALL >gi568815587f:62028162_62251973|GENSCAN_predicted_CDS_4|942_bp atggataaattcctggacacatacaccctcccaagtctaaaccaggaagaagtcgaatcc ctaaatagaccaataacaagttctgaaattgaggcagcaattaatagcctaccaaccaaa aaaagttcaggaccagatggattcacagccgaattctaccagagggacatgaatgaagct ggaaaccatcatcctcagcaaactaacacaggaacagaaaaccaaacactgcacgttctc actcataagtgggagatgaacaaggagaacacatggacacagcgaggggaacatcacaca ctggggcctgtcggcaggcgtcccacgtccaccgattcctcctccctcgttgctgagtcc ttggctggcatcagaaaaatggctacaaacttctcagtatatgagaagatctggtttgac aagttcaaatatgacattgcagaaaggagattgtacgagcaaatgaacgggcctgtggcc agcacctcccgccagaagaatggcgccagcgtgatcctccatgacattgcgagagccaga gagaacatccagaaatccctggccagaagctcaggccccagggtctccagcggccccaac ggagaacacagcgagcttgttgtcctgatcgccagtctggaagtggagaaccagaggctg agcagcgtggtgcaggagctgcagcaggccatgtccaggctggaggcccagctgaacgtg ctggagaagagctctgctggccaccgggccacagtccctcagacccagcacgtgtctccc attggagccctggccaagaagccaaccacaccagcagaggatgacaagggcaatgacatt gacctttttggcagcaacaatgaggaggaggacgaggaggcagcacagctgcgggaggaa tggctgccatttcccccagtaattagaaaagccttattgtga >gi568815587f:62028162_62251973|GENSCAN_predicted_peptide_5|141_aa MRLSVCLLLLTLALCCYRANAVVCQALGSEITGFLLAGKPVFKFQLAKFKAPLEAVAAKM EVKKCVDTMAYEKRVLITKTLVTYLDAVSPECAEDMGNVSLDITIHSSKSQLKQHLLLQA ALGGLQATAPGQAIYDSIQLL >gi568815587f:62028162_62251973|GENSCAN_predicted_CDS_5|426_bp atgaggctgtcggtgtgtctcctgctgctcacgctggccctttgctgctaccgggcaaat gcagtggtctgccaagctcttggttctgaaatcacaggcttcttattagctggaaaacct gtgttcaagttccaacttgccaaatttaaggcacctctggaagctgttgcagccaagatg gaagtgaagaaatgcgtggatacgatggcctatgagaaaagagtgctaattacaaaaaca ttggtcacctacctggatgccgtgtccccagagtgtgctgaggacatggggaatgtgtcc ttggacatcaccatccactcatccaagtctcagctgaaacagcacctgctcctacaggct gccttggggggcttacaggccactgcccctgggcaggccatctatgacagcattcagctg ttgtga >gi568815587f:62028162_62251973|GENSCAN_predicted_peptide_6|199_aa MKLLMVLMLAALLLHCYADSGCKLLEDMVEKTINSDISIPEYKELLQEFIDSDAAAEAMG KFKQCFLNQSHRTLKNFGLMMMNQRAPENILKRETFKLQIIMQQRFQPIPGEDTAPMPSR SYSTSTRQNRCTPAVCKHNIGSFVGCRVESLMKSTSLNEDLVHRDYGPADTMLPVPVSES IILWVLNILTIVTVIPEIG >gi568815587f:62028162_62251973|GENSCAN_predicted_CDS_6|600_bp atgaagctgctgatggtcctcatgctggcggccctcctcctgcactgctatgcagattct ggctgcaaactcctggaggacatggttgaaaagaccatcaattccgacatatctatacct gaatacaaagagcttcttcaagagttcatagacagtgatgccgctgcagaggctatgggg aaattcaagcagtgtttcctcaaccagtcacatagaactctgaaaaactttggactgatg atgatgaatcagagggcccctgaaaacatcttgaaaagagagaccttcaagcttcagata atcatgcaacaaaggttccagccaattccaggtgaagacaccgccccaatgccatcaaga agctactccacctccactagacagaacaggtgcacacctgctgtgtgcaagcataacatt gggtcctttgttggctgcagagttgagtcactgatgaagtccacctctctgaatgaggat ttggttcatagggattatggaccagcagatacgatgctgccagtccctgtctcagaatcc attattctttgggtattgaatatcctcaccattgtcactgttatccctgaaattgggtga >gi568815587f:62028162_62251973|GENSCAN_predicted_peptide_7|136_aa MKLSVCLLLVTLALCCYQANAEFCPALVSELLDFFFISEPLFKLSLAKFDAPPEAVAAKL GVKRCTDQMSLQKRSLIAEVLEPGDSKPWDSPGNNLLLLSPTLVTCFHEDPGPHSRITPG LLAWQEKAAPSPAWVL >gi568815587f:62028162_62251973|GENSCAN_predicted_CDS_7|411_bp atgaagctgtcggtgtgtctcctgctggtcacgctggccctctgctgctaccaggccaat gccgagttctgcccagctcttgtttctgagctgttagacttcttcttcattagtgaacct ctgttcaagttaagtcttgccaaatttgatgcccctccggaagctgttgcagccaagtta ggagtgaagagatgcacggatcagatgtcccttcagaaacgaagcctcattgcggaagtc ctggaaccaggagactccaagccctgggacagcccaggcaataacctcttgctgctctct ccaaccctcgtcacttgcttccatgaggatcctgggcctcattcccggatcaccccaggg ctattggcctggcaagaaaaggctgctccatcaccggcatgggtgctgtga