GENSCAN 1.0 Date run: 5-Nov-116 Time: 02:59:04 Sequence gi568815583f:76918147_77137173 : 219027 bp : 45.62% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 13696 13839 144 0 0 92 94 510 0.999 50.12 1.02 Intr + 14215 14320 106 0 1 84 78 122 0.796 10.59 1.03 Intr + 17380 17576 197 2 2 31 113 227 0.612 18.63 1.04 Intr + 25612 25725 114 0 0 13 107 122 0.808 7.24 1.05 Intr + 29275 29371 97 0 1 2 108 39 0.906 -3.12 1.06 Intr + 30264 30406 143 1 2 31 107 140 0.992 10.27 1.07 Term + 30924 31076 153 2 0 72 33 163 0.999 7.12 1.08 PlyA + 33540 33545 6 1.05 2.03 PlyA - 33554 33549 6 1.05 2.02 Term - 61631 61079 553 1 1 42 48 869 0.726 72.09 2.01 Init - 62107 61773 335 1 2 68 21 700 0.738 58.07 2.00 Prom - 68611 68572 40 -6.26 3.06 PlyA - 71820 71815 6 1.05 3.05 Term - 74325 74249 77 1 2 108 47 70 0.902 2.90 3.04 Intr - 75098 74952 147 0 0 129 84 57 0.950 9.61 3.03 Intr - 76283 76169 115 2 1 61 48 64 0.564 -0.28 3.02 Intr - 80469 80362 108 0 0 60 18 110 0.536 1.68 3.01 Init - 81670 81560 111 1 0 70 53 66 0.100 1.52 3.00 Prom - 82316 82277 40 -5.26 4.00 Prom + 82318 82357 40 -5.36 4.01 Init + 84380 84486 107 1 2 79 36 72 0.113 0.80 4.02 Intr + 85972 86063 92 1 2 101 41 67 0.181 2.94 4.03 Intr + 91443 91604 162 1 0 81 68 85 0.582 5.75 4.04 Intr + 95520 95624 105 1 0 64 91 54 0.505 3.49 4.05 Intr + 98668 98717 50 2 2 101 33 -4 0.114 -6.20 4.06 Intr + 100002 100102 101 2 2 106 83 193 0.972 19.51 4.07 Intr + 100311 100385 75 0 0 110 71 106 0.857 9.73 4.08 Intr + 104023 104155 133 1 1 50 56 69 0.457 0.55 4.09 Intr + 105424 105548 125 0 2 63 100 79 0.865 5.98 4.10 Intr + 107138 107172 35 2 2 134 94 18 0.996 4.87 4.11 Intr + 107352 107458 107 1 2 72 100 108 0.890 10.33 4.12 Intr + 109706 109768 63 1 0 90 83 153 0.788 13.91 4.13 Intr + 110408 110506 99 1 0 63 91 269 0.959 25.01 4.14 Intr + 111383 111428 46 1 1 95 91 52 0.985 4.18 4.15 Intr + 112356 112435 80 1 2 75 105 77 0.900 7.37 4.16 Intr + 113034 113158 125 2 2 135 74 186 0.732 21.28 4.17 Intr + 113388 113581 194 0 2 91 55 73 0.546 3.44 4.18 Intr + 113959 114065 107 2 2 10 80 43 0.470 -4.37 4.19 Intr + 114152 114248 97 1 1 92 93 241 0.991 24.58 4.20 Intr + 114716 114806 91 0 1 74 78 108 0.995 7.45 4.21 Intr + 117362 117417 56 2 2 145 63 35 0.896 5.32 4.22 Intr + 117656 117789 134 0 2 39 94 151 0.507 11.16 4.23 Intr + 117860 117959 100 1 1 83 44 9 0.630 -4.32 4.24 Term + 118861 119030 170 2 2 64 55 267 0.685 19.04 4.25 PlyA + 119163 119168 6 1.05 5.00 Prom + 121228 121267 40 -4.26 5.01 Init + 124897 124998 102 0 0 86 100 -31 0.382 -1.78 5.02 Term + 125267 125542 276 1 0 72 42 173 0.597 6.56 5.03 PlyA + 129522 129527 6 1.05 6.07 PlyA - 130329 130324 6 1.05 6.06 Term - 133794 133717 78 0 0 128 42 -14 0.601 -4.24 6.05 Intr - 134322 134239 84 0 0 77 115 75 0.979 9.12 6.04 Intr - 134783 134631 153 2 0 94 105 34 0.984 5.97 6.03 Intr - 136133 136032 102 2 0 71 99 98 0.999 9.57 6.02 Intr - 138109 137918 192 1 0 79 107 277 0.999 28.39 6.01 Init - 152808 152746 63 0 0 109 80 118 0.546 14.25 6.00 Prom - 168624 168585 40 -2.86 7.03 PlyA - 168664 168659 6 1.05 7.02 Term - 197173 196010 1164 1 0 90 35 912 0.958 77.95 7.01 Init - 215587 214859 729 1 0 82 116 579 0.973 54.90 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815583f:76918147_77137173|GENSCAN_predicted_peptide_1|317_aa MRLGPRTAALGLLLLCAAAAGAGKAEELHYPLGERRSDYDREALLGVQEDVDEYVKLGHE EQQKRLQAIIKKIDLDSDGFLTESELSSWIQMSFKHYAMQEAKQQFVEYDKNSDDTVTWD EYNIQMYDRVIDFDENTALDDAEEESFRKLHLKDKKRFEKANQDSGPGLSLEEFIAFEHP EEVDYMTEFVIQEALEEHDKNGDGFVSLEEFLGDYRWDPTANEDPEWILVEKDRFVNDYD KDNDGRLDPQELLPWVVPNNQGIAQEEALHLIDEMDLNGDKKLSEEEILENPDLFLTSEA TDYGRQLHDDYFYHDEL >gi568815583f:76918147_77137173|GENSCAN_predicted_CDS_1|954_bp atgcggctgggcccgaggaccgcggcgttggggctgctgctgctgtgcgccgccgcggcc ggcgccggcaaggccgaggagctgcactacccgctgggcgagcgccgcagcgactacgac cgcgaggcgctgctgggcgtccaggaagatgtggatgaatatgttaaactcggccacgaa gagcagcaaaaaagactgcaggcgatcataaagaaaatcgacttggactcagatggcttt ctcactgaaagtgaactcagttcatggattcagatgtcttttaagcattatgctatgcaa gaagcaaaacaacagtttgttgaatatgataaaaacagtgatgatactgtgacttgggat gaatataacattcagatgtatgatcgtgtgattgactttgatgagaacactgctctggat gatgcagaagaggagtcctttaggaagcttcacttaaaggacaagaagcgatttgaaaaa gctaaccaggattcaggtcccggtttgagtcttgaagaatttattgcttttgagcatcct gaagaagttgattatatgacggaatttgtcattcaagaagctttagaagaacatgacaaa aatggtgatggatttgttagtttggaagaatttcttggtgattacaggtgggatccaact gcaaatgaagatccagaatggatacttgttgagaaagacagattcgtgaatgattatgac aaagataacgatggcaggcttgatccccaagagctgttaccttgggtagtacctaataat cagggcattgcacaagaggaggcgcttcatctaattgatgaaatggatttgaatggtgac aaaaagctctctgaagaagagattctggaaaacccggacttgtttctcaccagtgaagcc acagattatggcagacagctccatgatgactatttctatcatgatgagctttaa >gi568815583f:76918147_77137173|GENSCAN_predicted_peptide_2|295_aa MIIQSLLSPLNLEVDPNIQAMHTQEKEHIKSLNNKFASFIDKVQFLEQQNKMLETKWSLV QQQKKARSNVDHQFESYINNLWQQLETLGQDKLKLEAGLGNMQGLVEDFKNKQLYEEEIQ ELPSQISHTPVVLSMDNSRCLDTDSIITEVKAQYEEIANRSRAEAESMHQSKHEELQTLA GKHGDDLQCTKTEISEMNWNISRLQAEIEGLKDQRASLEAAMVDAEQRGELAVKDANAKL SELEAALRQAKQDMVRQLPEYQELMNVRLVLDMEIPPTGSCWRARRAGWRLGCRT >gi568815583f:76918147_77137173|GENSCAN_predicted_CDS_2|888_bp atgatcatccagagcctgctgagcccccttaacctggaggtggaccccaacatccaggcc atgcacacccaggagaaggagcacatcaagagcctcaacaacaagtttgcctccttcatc gacaaggtacagttcctggagcagcagaacaagatgctggagaccaagtggagcctcgtg cagcagcagaagaaggctcggagcaacgtggaccaccagttcgagagctacatcaacaac ctttggcagcagctggagactctgggccaggacaagctgaagctggaggcagggcttggc aacatgcaggggctggttgaggacttcaagaacaagcaactgtatgaagaggagatccag gagctgccgtcccagatttcccacacgcctgtggtgctgtccatggacaacagccgctgc ctggacacggatagcatcatcactgaggtcaaggcacagtatgaggagatcgccaaccgc agccgggctgaggctgagagcatgcaccagagcaagcatgaggagctgcagacgctggct gggaagcacggcgatgacttgcagtgcacaaagactgagatctcggagatgaactggaac atcagccggctccaggctgagattgagggcctcaaagaccagagggcttccctggaggcc gccatggtagatgccgagcagcgcggggagctggctgttaaggacgccaatgccaagttg tccgagctggaggccgccctgcggcaggccaagcaggacatggtgcggcagctgcctgag taccaggagctgatgaacgtcaggctggtcctggacatggaaatcccacctacaggaagc tgctggagggcgaggagagccggctggcgtctgggatgcagaacatga >gi568815583f:76918147_77137173|GENSCAN_predicted_peptide_3|185_aa MGDLGEDGTVEILLLSEHSLPDPVIPPAPALNYGFLQVHQHLSIANCISGTVQTATNCYI YRLKDPMTKLIKIGLRDLNRPPNSLSLSFLICHTQSLESRRCREYFWNFQFGPQAGRPDS QFGFGVLTQHRVHHSAFHTDHLSYTVIIPTLQRGKLVIKECGSRAHTSPWLALRMGSRGC RKPTT >gi568815583f:76918147_77137173|GENSCAN_predicted_CDS_3|558_bp atgggggacctgggagaggatggcacagttgagatactgctcctctcagagcactccttg cctgacccagttattcctccagccccggcactgaactatggcttccttcaggtccatcag catttatccatcgccaactgtatatcaggtactgtgcagactgccactaattgctacatc tataggctaaaagaccccatgaccaagctcattaaaattgggctgagggaccttaataga ccccccaactcgctgagcctcagcttcctcatctgccacacccagtctttggaaagtaga cggtgtcgagaatatttctggaatttccagtttggtccccaggctgggcgaccagacagc caattcggctttggagtacttacccagcaccgggtacatcattctgcttttcacacggac cacctcagttacactgtcattatccccactttacagaggggcaaactagtgattaaagag tgtgggtcacgtgctcatacttccccatggttggccttgcgaatgggcagccggggatgc cgaaagccaaccacgtga >gi568815583f:76918147_77137173|GENSCAN_predicted_peptide_4|817_aa MPFSTCSVHILWYVGLWGPRTSFICAHSTGRGRLLREETEPLGRNVGLGDATQMDRTGLG LDATFPVNMHESPRLPFQEPGERQMPCMRLLQVLSIQSSEPLEVGDALIPISQRRNLGSG GYSHGQNYQAALPGYLPRPPLGSQGRFWCQFLDIAESSPGFRLHPCSPHTPKCRDFTAHT GYEVLLQRLLDGRKMCKDMEELLRQRAQAEERYGKELVQIARKAGGQTEIKENDDIAEDI GGREPGPVPVRCPLTLAHRGHSLSTGSREGYSCQLLVDSGVFAKRGALEEERFAGQVLSP VLDTLNLRYLSDMRPKSSLRASFDSLKQQMENVGSSHIQLALTLREELRSLEEFRERQKE QRKKYEAVMDRVQKSKLSLYKKAMESKKTYEQKCRDADDAEQAFERISANGHQKQVEKSQ NKARQCKDSATEAERVYRQSIAQLEKVRAEWEQEHRTTCEAFQLQEFDRLTILRNALWVH SNQLSMQCVKDDEVGAEGLGVGATEHLPPLPGMGGEVDTAVRGWQKLEDRGCSALGLVLA GAEAQSFLCSPHLQGVLVPLTEPNLLGPCTQKVFRKQEAVSQGRDPSGSKTPSRAQWPVR RPLYEEVRLTLEGCSIDADIDSFIQAKSTGTEPPAPVPYQNYYDREVTPLTSSPGIQPSC GMIKRFSGLLHGSPKTTSLAASAASTETLTPTPERNEGVYTAIAVQEIQGNPASPAQEYR ALYDYTAQCLASSSLLMVSLIFEHPLLLRRARVPECVHTSKGPSNVMRFQSLGQNPDELD LSAGDILEVILEGEDGWWTVERNGQRGFVPGSYLEKL >gi568815583f:76918147_77137173|GENSCAN_predicted_CDS_4|2454_bp atgcccttttccacatgctcggtgcatattttgtggtacgtgggcctctggggaccaaga acttcgttcatctgtgcccattccacaggacgtggccggctgctcagagaggaaactgag cctcttggaaggaatgtgggcctgggggatgccacacagatggatcggactgggctgggc ctggatgccaccttcccagttaacatgcatgaatcccctcggctgcccttccaggagccc ggtgagcgtcaaatgccctgcatgaggctcctgcaggtgttgtccattcagtcctcagag cccctggaagtgggtgatgctctcatcccgatctcacagaggaggaatctgggctcagga ggctattctcatggacagaattaccaggcagctcttcctggctaccttccacggccccca cttggttcccagggaagattttggtgccagttcttggacatagcagagagctctcctgga tttcgactccatccctgttcccctcacacccctaagtgcagggacttcacagcccacacg ggctacgaggtgctgctgcagcggcttctggatggcaggaagatgtgcaaagacatggag gagctactgaggcagagggcccaggcggaggagcggtacgggaaggagctggtgcagatc gcacggaaggcaggtggccagacggagatcaaagagaatgacgacattgctgaggatatt ggcgggagagagccaggccctgttcctgtcagatgtcccctgacgctggctcatcgtggg cacagcctcagcacagggagccgagagggctactcctgtcagctgctggtggacagtggt gtctttgccaagaggggagccctggaagaggagaggtttgcagggcaggtgctgagtccg gttttggacacgctgaatttgaggtatctgtcagatatgagacccaaaagctccctgagg gcctcctttgactccttgaagcagcaaatggagaatgtgggcagctcacacatccagctg gccctgaccctgcgtgaggagctgcggagtctcgaggagtttcgtgagaggcagaaggag cagaggaagaagtatgaggccgtcatggaccgggtccagaagagcaagctgtcgctctac aagaaggccatggagtccaagaagacatacgagcagaagtgccgggacgcggacgacgcg gagcaggccttcgagcgcattagcgccaacggccaccagaagcaggtggagaagagtcag aacaaagccaggcagtgcaaggactcggccaccgaggcagagcgggtatacaggcagagc attgcgcagctggagaaggtccgggctgagtgggagcaggagcaccggaccacctgtgag gcctttcagctgcaagagtttgaccggctgaccattctccgcaacgccctgtgggtgcac agcaaccagctctccatgcagtgtgtcaaggatgatgaggtgggggctgagggccttggt gtgggagccacagagcacctgccccctctgccggggatgggaggggaggtggacacagct gtcagaggttggcagaagctggaggaccgtggctgctctgctctaggcctggtgcttgca ggagccgaggcgcagtccttcctctgctccccacatctccagggtgtcctggttcccctt actgagcccaacctgctggggccttgcacacagaaggtgttcaggaaacaggaagctgtc agccagggccgtgacccctcaggatcaaagaccccgagccgcgcacaatggcctgtgagg aggccgctctacgaggaagtgcggctgacgctggaaggctgcagcatagacgccgacatc gacagtttcatccaggccaagagcacgggcacagagccccccgctccggtgccctaccag aactattacgatcgggaggtcaccccgctgaccagcagccctggcatacagccgtcctgc ggcatgataaagaggttctctggactgctgcacggaagtcccaagaccacttcgttggca gcttctgctgcgtccacagagaccctgacccccacccccgagcggaatgagggtgtctac acagccatcgcagtgcaggagatacagggaaacccggcctcaccagcccaggagtaccgg gcgctctacgattatacagcgcagtgccttgcgtcctcatctctcctcatggtttcactc atcttcgagcatcctctcctcctcaggagggcacgtgtgcccgagtgtgtccacactagc aagggcccttccaacgtcatgcgctttcaatctcttggccagaacccagatgagctggac ctgtccgcgggagacatcctggaggtgatcctggaaggggaggatggctggtggactgtg gagaggaacgggcagcgtggcttcgtccctggttcctacctggagaagctttga >gi568815583f:76918147_77137173|GENSCAN_predicted_peptide_5|125_aa MAFFFCLLSSCTGPSSIPHRDSSLLLWLGGGAPPMIKTNIPTVGQMGIVCLEMRCPERPR DAGSGSATWNAKPNLITRKSQTNTDKQHHLKRTVSLQTVPIITDKAMEMIQIKEDQGDMT TQLIT >gi568815583f:76918147_77137173|GENSCAN_predicted_CDS_5|378_bp atggcctttttcttctgtctcctttcgtcctgcacgggtcccagcagcattccccaccgt gacagctctctgctcctgtggcttggcgggggggcacctccaatgatcaaaacgaacatc cccactgtggggcagatgggcattgtgtgcctcgagatgcgatgccctgagaggccgcgt gacgcaggtagtggttcagccacctggaatgcaaaaccaaatctaatcacgaggaaatcg cagacaaacacagataagcagcaccacctaaaacgaactgtatccttacaaactgttcct atcatcacagacaaagctatggaaatgatccagattaaagaagaccaaggagacatgaca actcaactgattacctga >gi568815583f:76918147_77137173|GENSCAN_predicted_peptide_6|223_aa MGQCGITSSKTVLVFLNLIFWGAAGILCYVGAYVFITYDDYDHFFEDVYTLIPAVVIIAV GALLFIIGLIGCCATIRESRCGLATVENEVDRSIQKVYKTYNGTNPDAASRAIDYVQRQL HCCGIHNYSDWENTDWFKETKNQSVPLSCCRETASNCNGSLAHPSDLYAEGCEALVVKKL QEIMMHVIWAALAFAAIQVLYIHIRKSRTGTTALVSCTKHNNF >gi568815583f:76918147_77137173|GENSCAN_predicted_CDS_6|672_bp atgggccagtgcggcatcacctcctccaagaccgtgctggtctttctcaacctcatcttc tggggggcagctggcattttatgctatgtgggagcctatgtcttcatcacttatgatgac tatgaccacttctttgaagatgtgtacacgctcatccctgctgtagtgatcatagctgta ggagccctgcttttcatcattgggctaattggctgctgtgccacaatccgggaaagtcgc tgtggacttgccacggtggaaaatgaggttgatcgcagcattcagaaagtgtataagacc tacaatggaaccaaccctgatgctgctagccgggctattgattatgtacagagacagctg cattgttgtggaattcacaactactcagactgggaaaatacagattggttcaaagaaacc aaaaaccagagtgtccctcttagctgctgcagagagactgccagcaattgtaatggcagc ctggcccacccttccgacctctatgctgaggggtgtgaggctctagtagtgaagaagcta caagaaatcatgatgcatgtgatctgggccgcactggcatttgcagctattcaggtttta tatattcatatacgtaaatcaagaactggtactacagcactggtctcttgcactaaacac aacaacttttaa >gi568815583f:76918147_77137173|GENSCAN_predicted_peptide_7|630_aa MIPPKQPRQPKGAVDDAIAFGGKTDQEAPNASQPTPPPLPKKMIIRANTEPISKDLQKSM ESSLCVMANPTYDIDPNWDASSAGSSISYELKGLDIESYDSLERPLRKERPVPSAANSIS SLTTLSIKDRFSNSMESLSSRRGPSCRQGRGIQKPQRQALYRGLENREEVVGKIRSLHTD ALKKLAVKCEDLFMAGQKDQLRFGVDSWSDFRLTSDKPCCEAGDAVYYTASYAKDPLNNY AVKICKSKAKESQQYYHSLAVRQSLAVHFNIQQDCGHFLAEVPNRLLPWEDPDDPEKDED DMEETEEDAKGETDGKNPKPCSEAASSQKENQGVMSKKQRSHVVVITREVPCLTVADFVR DSLAQHGKSPDLYERQVCLLLLQLCSGLEHLKPYHVTHCDLRLENLLLVHYQPGGTAQGF GPAEPSPTSSYPTRLIVSNFSQAKQKSHLVDPEILRDQSRLAPEIITATQYKKCDEFQTG ILIYEMLHLPNPFDENPELKEREYTRADLPRIPFRSPYSRGLQQLASCLLNPNPSERILI SDAKGILQCLLWGPREDLFQTFTACPSLVQRNTLLQNWLDIKRTLLMIKFAEKSLDREGG ISLEDWLCAQYLAFATTDSLSCIVKILQHR >gi568815583f:76918147_77137173|GENSCAN_predicted_CDS_7|1893_bp atgatacctcccaagcagccacgacagcccaagggagctgtggacgatgccatcgccttt ggagggaaaacagaccaagaagcacccaatgcttcccaacctacaccacccccactgcca aagaagatgatcataagagccaatacagagccaatctccaaggacctccaaaaatccatg gaaagtagtctttgtgtcatggctaatcccacctatgatatcgaccccaactgggatgcc agcagtgctggttcttccatcagctatgaactcaaaggactggacattgagtcttatgac tccttggaaaggcctttgcgcaaggagagacctgtcccctcagcagcaaacagcatttcc agcttaaccactctcagtattaaggatagattttccaacagcatggaatccctctccagc cggcgtgggccctcttgcagacagggccgaggcatccagaagccgcagagacaagcactt tatcgaggacttgagaatcgggaggaagtagtgggtaaaatccgaagccttcatacagat gccttgaagaaactggctgttaaatgcgaagaccttttcatggctgggcagaaagaccag ctccgttttggagtggacagctggtcagacttcaggctaaccagtgacaaaccatgttgt gaggcaggtgatgcggtttactatactgcttcatatgcaaaagatccacttaataactat gcagtcaagatctgtaagagcaaagctaaagaatctcagcagtattatcacagcttggct gtccggcagagtctggctgtccattttaacattcagcaggactgtggtcatttccttgct gaagtccctaaccgtctgcttccctgggaggatccagatgaccctgaaaaggatgaggat gacatggaagagactgaagaagacgccaaaggagaaacggatgggaaaaacccaaagccc tgttctgaagcagcatcatcccagaaagagaatcagggagtcatgagcaagaagcagagg agccacgttgtggtcatcaccagggaggttccatgtcttactgtggctgattttgtgcga gactctctggcccagcatgggaaaagccctgatttgtatgagaggcaggtgtgtctgctg ctcttacagctatgctctggtcttgagcacctcaaaccctaccatgtcactcactgcgat ctacgcctagagaacctgctacttgtccactaccagcctggggggactgcccaaggcttt gggcctgcagagcccagccccacctcatcttatcccactaggcttatagtgagcaacttc tctcaggccaagcagaagagccatctggtggaccccgagatcctccgggaccagtctcgc cttgccccagagatcataacagctacccagtataaaaagtgtgatgagttccagacaggc atcctcatctatgagatgctgcacctacccaacccctttgatgagaacccagagctgaag gagagggaatacacacgagcagacctgcctcgcatcccattccgctccccctactcccgg ggtctgcagcagctggccagctgcctcctgaatcccaacccttctgagcggatcctcatt tcagacgccaaaggcatcctccagtgtctgctctggggcccccgcgaagatctcttccag actttcaccgcctgccctagcctagtacagaggaacaccctgctccaaaactggctagac atcaagcgaacactgctcatgatcaagtttgctgagaagtccctggacagggaaggtgga atcagccttgaggactggctttgtgctcagtatttggcttttgccactacagactccctc agttgtattgtgaaaattctgcagcaccgttaa