GENSCAN 1.0 Date run: 5-Nov-116 Time: 21:47:21

Sequence gi568815594f:67938544_68139647 : 201104 bp : 36.62% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.04 Intr -   6094   5976  119  2  2   91   97   100 0.449  10.46
 1.03 Intr -   8028   7907  122  2  2   79   74    56 0.076   2.52
 1.02 Intr -  30904  29530 1375  2  1   66   53   357 0.448  16.95
 1.01 Init -  32489  31055 1435  2  1   44   40   524 0.498  34.52
 1.00 Prom -  32582  32543   40                              -6.15

 2.02 PlyA -  32751  32746    6                               1.05
 2.01 Sngl -  33725  33165  561  2  0   40   40   447 0.900  30.98
 2.00 Prom -  35525  35486   40                              -6.85

 3.12 PlyA -  36525  36520    6                               1.05
 3.11 Term -  39486  39203  284  1  2   44   49   182 0.952   4.50
 3.10 Intr -  43533  43260  274  0  1   37   64   146 0.107   3.29
 3.09 Intr -  46482  46357  126  0  0   65   68    75 0.219   3.16
 3.08 Intr -  53302  53240   63  1  0  133   66    68 0.970   7.10
 3.07 Intr -  55572  55439  134  1  2  113  116    15 0.961   6.24
 3.06 Intr -  60443  60378   66  0  0   95   55    71 0.004   2.46
 3.05 Intr -  61240  61202   39  2  0  108   87    41 0.002   3.28
 3.04 Intr -  64538  64502   37  2  1   78   95     6 0.001  -2.78
 3.03 Intr -  76513  76395  119  2  2  103   99    82 0.910  10.06
 3.02 Intr -  86041  85891  151  1  1   86   94    88 0.660   8.01
 3.01 Init -  88183  88145   39  1  0   64   95    25 0.845   1.04
 3.00 Prom -  88310  88271   40                              -7.85

 4.00 Prom +  90345  90384   40                              -8.85
 4.01 Init +  91608  91672   65  2  2   37   87    42 0.524  -0.33
 4.02 Intr +  92787  92914  128  0  2   51  115   100 0.767   8.40
 4.03 Intr +  93609  93742  134  1  2   87   93   -15 0.337  -1.66
 4.04 Term +  94199  94549  351  1  0   68   48   202 0.474   7.70
 4.05 PlyA +  96710  96715    6                               1.05

 5.00 Prom +  98895  98934   40                              -6.55
 5.01 Sngl + 100001 101107 1107  1  0   49   49  1093 0.997  96.32
 5.02 PlyA + 102210 102215    6                               1.05

 6.09 PlyA - 102687 102682    6                               1.05
 6.08 Term - 104543 104380  164  0  2   52   53    65 0.436  -3.48
 6.07 Intr - 106624 106477  148  1  1   55   71   113 0.269   5.19
 6.06 Intr - 120925 120783  143  2  2   55  115   102 0.501   8.75
 6.05 Intr - 126401 126142  260  1  2   65   89    94 0.015   3.38
 6.04 Intr - 130276 130075  202  2  1   73  100    -4 0.073  -3.28
 6.03 Intr - 133943 133780  164  1  2  113  113    97 0.989  13.40
 6.02 Intr - 143509 143368  142  2  1   90  105    55 0.001   5.89
 6.01 Intr - 160495 160344  152  0  2   85  110    77 0.677   8.49

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term +  63390  63540  151  0  1   69   48   192 0.909   9.70
S.002 Sngl + 139090 139608  519  0  0   43   40   289 0.894  15.39
S.003 Init + 144760 144808   49  0  1   77   99    23 0.939   3.46
S.004 Term + 145272 145441  170  1  2   53   37   182 0.927   6.66

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815594f:67938544_68139647|GENSCAN_predicted_peptide_1|1017_aa
MGDFNTPLSTLDRSTRQKVNKDTQELNSALHQADLIDIYRTLHPKSTEYTFFSAPHHTYS
KIDHILGSKALLSKCKRTEIITNYLSDHSAIKLELRIKNLTQNHSTTWKLNNLLLSDYWV
HNEMKAEIKMFFETNENKDTTYQNLWDAFKAVCRGKFIALNAHKRKQERSKIDTLTSQLK
ELEKQEQTHSKASRRQEITKIRAELKEIETQKTLQKISESRSWFFERNNKIDRLLARLIK
KKREKNQIDAIKNDKGDITTDPTEIQTTIREYYKHFYANKLENLEEMDKFLDTYTLPRLN
QEEVESLNRPITGAEIVAIINSLPTKKSPGPDGFTAEFYQRYKEELVPFLLKLFQSVEKE
GLLPNSFYEASIILIPKLGRDTTKKENFRPISLMNIDAKLLNKILANRIQQHIKKLIHHD
QVGFIPGMQGWFNIHKSINVIQHINRAKDENHMIISIDAEKAFDKIQQPFMLKTLNKLVL
EVLARAIRQEKEIKGIQLGKEEVKLSLFADDMIVYLENPIVSAQNLLKLISNFSKVSGYK
INVQKSQAFLYTNNRQTESQIMSELPFTTASKRIKYLGIQLTRDVKDLFKENYKQLLKEI
KEDTSKWKNIPCSWVGRINIVKMAILPKVIYRFNAIPIKLPMPFFTELEKTTLKFIWNQK
RACIAKSILSQKNKAGGITLPDFKLYYKATVTKTAWYWYQNRDIDQWNRTEPSEITPHIY
NYLIFDKPEKNKQWGKDSLFNKWCWENWLAICRKLKLDPFLTPYTKINSRWIKDLNVRPK
TIKTLEENLGIIIQDIGMGKDFMSKTPKAMATKAKIDKWDLIKLKSFCTAKETTIRVNRQ
PRKWEKIFATYSSDKGLISRIYNELKQIYKKKTNNPIKKWAKDMNRHFSKEDIYAAKKHM
KKCSPSLAIREMQIKTTMRYHLTPVRMAIIKKSGNNRTVGFGTRSRNLKPWMIAVLIVLS
LTVVAVTIGLLVHFLVFDQKKEYYHGSFKILDPQINNNFGQSNTYQLKDLRETTENL

>gi568815594f:67938544_68139647|GENSCAN_predicted_CDS_1|3051_bp
atgggagactttaacaccccactgtcaacattagacagatcaacgagacagaaagtcaac
aaggatacccaggaattgaactcagctctgcaccaagcagacctaatagacatctacaga
actctccaccccaaatcaacagaatatacatttttttcagcaccacaccacacctattcc
aaaattgaccacatacttggaagtaaagctctcctcagcaaatgtaaaagaacagagatt
ataacaaactatctctcagaccacagtgcaatcaaactagaactcaggattaagaatctc
actcaaaaccactcaactacatggaaactgaacaacctgctcctgagtgactactgggta
cataacgaaatgaaggcagaaataaagatgttctttgaaaccaacgagaacaaagacaca
acataccagaatctctgggacgcattcaaagcagtgtgtagagggaaatttatagcacta
aatgcccacaagagaaagcaggaaagatccaaaattgacaccctaacatcacaattaaaa
gaactagaaaagcaagagcaaacacattcaaaagctagcagaaggcaagaaataactaaa
atcagagcagaactgaaggaaatagagacacaaaaaacccttcaaaaaattagtgaatcc
aggagctggttttttgaaaggaacaacaaaattgatagactgctagcaagactaataaag
aaaaaaagagagaaaaatcaaatagacgcaataaaaaatgataaaggggatatcaccaca
gatcccacagaaatacaaactaccatcagagaatactacaaacacttttacgcaaataaa
ctagaaaatctagaagaaatggataaattcctcgacacatacactctcccaagactaaac
caggaagaagttgaatctctgaatagaccaataacaggagctgaaattgtggcaataatc
aatagtttaccaaccaaaaagagtccaggaccagatggattcacagctgaattctaccag
aggtacaaggaggaactggtaccattccttctgaaactattccaatcagtagaaaaagag
ggactcctccctaactcattttatgaggccagcatcattctgataccaaagctgggcaga
gacacaaccaaaaaagagaattttagaccaatatccttgatgaacattgatgcaaaactc
ctcaataaaatactggcaaaccgaatccagcagcacatcaaaaagcttatccaccatgat
caagtgggtttcatccctgggatgcaaggctggttcaatatacacaaatcaataaatgta
atccagcatataaacagagccaaagacgaaaaccacatgattatctcaatagatgcagaa
aaagcctttgacaaaattcaacaacctttcatgctaaaaactctcaataaattagtgttg
gaagttctggccagggcaattaggcaggagaaggaaataaagggtattcaattaggaaaa
gaggaagtcaaattgtccctgtttgcagacgacatgattgtatatctggaaaaccccatt
gtctcagcccaaaatctccttaagctgataagcaacttcagcaaagtctcaggatacaaa
atcaatgtacaaaagtcacaagcattcttatacaccaacaacagacaaacagagagccaa
atcatgagtgaactcccattcacaactgcttcaaagagaataaaatacctaggaatccaa
cttacaagggatgtgaaggacctcttcaaggagaactacaaacaactgctcaaggaaata
aaagaggatacaagcaaatggaagaacattccatgctcatgggtaggaagaatcaatatc
gtgaaaatggccatactgcccaaggtaatttacagattcaatgccatccccattaagcta
ccaatgcctttcttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaa
agagcctgcattgccaagtcaatcctaagccaaaagaacaaagctggaggcatcacacta
cctgacttcaaactatactacaaggctacagtaaccaaaacagcatggtattggtaccaa
aacagagatatagatcaatggaacagaacagagccctcagaaataacgccgcatatctac
aactatctgatctttgacaaacctgagaaaaacaagcaatggggaaaggattccctattt
aataaatggtgctgggaaaactggctagccatatgtagaaagctgaaactggatcccttc
cttacaccttatacaaaaatcaattcaagatggattaaagatttaaacgttagacctaaa
accataaaaaccctagaagaaaacctaggcattatcattcaggacataggcatgggcaag
gacttcatgtctaaaacaccaaaagcaatggcaacaaaagccaaaattgacaaatgggat
ctaattaaactaaagagcttctgcacagcaaaagaaactaccatcagagtgaacaggcaa
cctagaaaatgggagaaaatttttgcaacctactcatctgacaaagggctaatatccaga
atctacaatgaactcaaacaaatttacaagaaaaaaacaaacaaccccatcaaaaagtgg
gcgaaggacatgaacagacacttctcaaaagaagacatttatgcagcaaaaaaacacatg
aaaaaatgctcaccatcactggccatcagagaaatgcaaatcaaaaccacaatgagatac
catctcacaccagttagaatggcaatcattaaaaagtcaggaaacaacaggacagtggga
tttggcacccgaagcagaaatctgaagccatggatgattgccgttctcattgtgttgtcc
ctgacagtggtggcagtgaccataggtctcctggttcacttcctagtatttgaccaaaaa
aaggagtactatcatggctcctttaaaattttagatccacaaatcaataacaatttcgga
caaagcaacacatatcaacttaaggacttacgagagacgaccgaaaatttg

>gi568815594f:67938544_68139647|GENSCAN_predicted_peptide_2|186_aa
MKLKTKARELREECRSLRSRCDQLEERVSAMEDEMNEMKREGKFREKRIKRNEQSLQEIW
DYVKRPNLRLIGVTESDGENGTKLENTLQDIIQENFPNLARQSNVQIQEIQRMPQRYSSR
RATPRHIIVRFTKVEMKEKMLRAAREKGRVTFKGKPIRLTADLSAETLQARREWGPIFNI
LKEKNF

>gi568815594f:67938544_68139647|GENSCAN_predicted_CDS_2|561_bp
atgaagctgaaaaccaaggctcgagaactacgtgaagaatgcagaagcctcaggagccga
tgcgatcaactggaagaaagggtgtcagcaatggaagatgaaatgaatgaaatgaagcga
gaagggaagtttagagaaaaaagaataaaaagaaacgagcaaagcctccaagaaatatgg
gactatgtgaaaagaccaaatctacgtctgattggtgtaactgaaagtgatggggagaat
ggaaccaagttggaaaacactctgcaggatattatccaggagaacttccccaacctagca
aggcagtccaacgttcagattcaggaaatacagagaatgccacaaagatactcctcgaga
agagcaactccaagacacataattgtcagattcaccaaagttgaaatgaaggaaaaaatg
ttaagggcagcaagagagaaaggtcgggttaccttcaaagggaagcccatcagactaaca
gcggatctctcggcagaaaccctacaagccagaagagagtgggggccaatattcaacatt
cttaaagaaaagaatttttaa

>gi568815594f:67938544_68139647|GENSCAN_predicted_peptide_3|443_aa
MSTSNIAFGAVLKINKLRLQYVMPGHITCECRGILSETVQAFSTIHATSIRRVGEAAVTK
LLKGLKFYYYQTSFQIPSIEYNPDFSVEHSKLSTDLKQKVSNEMRTTVTELVFCSDMNAA
QAEHILNSCCGLGKEFPSTKRIADGRIARKGPFPNMLREVEVEIISNDICNQVHVYVSSG
MICAGFLSGKLDACKGDSGGPLVIARDRNAWYLVGIPPSATRAIIELLSYSQAPLTSSFV
VTFINDMSSGLPDETHALLTSTFQAASSGEARQLVQQTCSAKEQPDCFFKQVSDLFPPDS
VRPPNMGLQTPPTGAFSWHQVGAPLGWSCQRKEQSSIFALLQPLLVIPSAQKLLLLISNF
GKVSGHKINVQKSLYTNNRQSESQIMNVLSFTIATKRIKYLGIQLTREMKDLFKENYKLL
LKEIGEDTNKRKNLGKERIHVHG

>gi568815594f:67938544_68139647|GENSCAN_predicted_CDS_3|1332_bp
atgagcacttcaaatatagcatttggtgccgtcttaaagattaataaactcaggctccag
tatgtgatgccaggccatataacttgtgagtgccgaggaattctgtctgaaacagtccag
gctttttctaccatacatgcaacctctattcggagagttggtgaggctgctgtgaccaaa
ctcttaaaaggcctgaagttttactattaccagacctccttccagatccccagtattgaa
tataatcctgatttttcagtagaacactcaaaacttagcaccgacctgaaacaaaaagtc
agtaacgagatgagaaccactgttactgaattggtgttctgctcagatatgaatgcagca
caagcagaacacattctgaacagctgttgtggtttagggaaggagtttccatccaccaaa
agaattgctgatggtcggattgcaagaaaaggtccctttcccaatatgctacgggaagtt
gaggtagagatcataagtaatgatatatgtaatcaagttcatgtgtatgtatcatcagga
atgatatgtgctggattcttgtcaggaaaactagatgcctgtaagggtgattctggaggt
cctctggttattgcacgtgatagaaatgcctggtaccttgttggaatacctccatctgca
acgagagccatcattgaacttcttagttattctcaagctcctctcaccagctcatttgta
gtgactttcataaatgatatgtcctctgggcttcctgatgaaacacatgccctgctgact
tcaactttccaggctgccagctctggagaggctaggcagttagtgcagcaaacctgctct
gccaaggagcagccagactgtttctttaagcaggtctctgatctatttcctcctgactcg
gtgagacctcccaacatgggtctccagacacctccaacaggagcattcagctggcatcag
gttggtgcccccctcggatggagctgccagaggaaggagcagtcttcaatctttgctctt
ttgcagcctttactagtgataccttcagcccaaaagcttcttctgctgataagcaacttc
ggcaaagtctcaggacacaaaatcaatgtgcaaaaatctctgtacaccaacaacaggcaa
tcagagagccaaatcatgaatgtactctcattcacaattgctacaaaaagaataaaatac
ctaggaatacagctaacaagggaaatgaaggacctcttcaaggagaactacaaactacta
ctcaaagaaattggagaggacacaaacaaacggaaaaacttggggaaagaaagaatccat
gttcatggatag

>gi568815594f:67938544_68139647|GENSCAN_predicted_peptide_4|225_aa
MGMRSFQKDLGPGEDNFKNLERSGDLPREINPLSSCYLLHEKDPPTTSNPQTNQPKEHLT
NFKSVRLIPNLSSFPPTCPHSPNPKRCFQSSFSTDPSDLSLHPKIAPPQVYNNREELQLL
ASSLRESLATSPAHKNFKMPKPQWPGVPSGLHPSGSCFTCWKSGHWAKECLWPGIPPKLY
PIYEGLHWKSDCPTHLATTPRAPGTLAQDSLTDSFPDLLSLAAED

>gi568815594f:67938544_68139647|GENSCAN_predicted_CDS_4|678_bp
atgggaatgcggagttttcaaaaagatttaggaccaggagaagataactttaaaaatctt
gaaagatcgggggacctccctcgggagatcaatcccctgtcctcctgctatttgctccat
gagaaagatccacctacgacatcgaatcctcagaccaaccagcccaaggaacatctcacc
aattttaaatcagtgcgactcatcccaaacctttcttctttccctcccacctgtccccac
agtcccaaccccaagcgttgctttcaatcttccttttctaccgacccatctgacctctcc
cttcatcccaagattgctcctcctcaggtgtacaataatagagaagagttgcaattactt
gcctcttctctgagagaaagcctagccacatctccagcacacaagaacttcaaaatgcct
aaaccacagtggccaggtgttccttcaggacttcatccctcaggatcttgcttcacgtgc
tggaaatctggccactgggccaaggaatgcctgtggcccgggatccctcctaagctgtat
cccatctatgagggactccactggaaatcggactgtccaactcacctggcaaccactccc
agagcccctggaactctggcccaagactctctgactgactccttcccagatcttctcagc
ttagcagctgaagactga

>gi568815594f:67938544_68139647|GENSCAN_predicted_peptide_5|368_aa
MCLLPRGFEPQAPEDLAQRSLVELREMLKLQERLLRNEKFICKLPDKGKKIFDSFAKLKA
AIAECEEVRRKNELFHPVSLDCKLRQKAIAEVDVGTDKARNSDPILDTSSLVPGCSSVDN
IKSSQTSQNQGLGRPTLEGDEETSEVEYTVNKGPASSNRDRVPPSSEASEHHPQHRVSSQ
AEDTSSSFDNLFIDRLQRITIADQGEQQSEENASTKNLTGLSSGTQKKPHYMEVLEMRAK
NPGPQLRKFKTNVLPFRQNDSSSHCQKSGSPISSKERRRRDKQHLDDITAARLLPLHHMP
TQLLSIEESLALQKQQKQNYEEMQAKLAAQKLAERPNIKMRSYNPEGESSGRYREVRDED
DDWSSDEF

>gi568815594f:67938544_68139647|GENSCAN_predicted_CDS_5|1107_bp
atgtgcttgctgccccgcggcttcgagccccaagctcccgaggacttggcgcagcggagt
ttggtggagctgcgggaaatgttgaagctccaggagagacttttgcgcaacgaaaaattc
atttgcaaattgcccgacaaaggtaaaaagatctttgactcttttgccaaactgaaagcc
gccattgcagaatgtgaagaagttagaagaaaaaatgaactgtttcaccctgttagttta
gactgtaagctaaggcaaaaagcaattgcagaagttgatgtgggtacagataaggcccgg
aattctgacccgatacttgatacttcatcactagttcctggatgttcctctgtagataac
atcaagtcatctcaaacctcacaaaaccagggacttggacgtcctactcttgaaggtgat
gaagagacttcagaggttgagtacacagtgaataagggcccagcttccagcaacagagac
agggtaccaccttcatctgaagctagtgagcatcacccgcagcatcgtgtttcaagtcag
gcagaagatacttccagcagctttgacaacctttttattgacaggttacagaggatcacc
attgcggaccaaggtgaacaacagtcagaagaaaacgcaagtactaagaacttgacaggc
ctttctagtgggactcagaagaaacctcattatatggaagtgctagaaatgcgagccaaa
aacccagggccccagctgcgtaaatttaaaaccaatgtgttaccttttcgacaaaatgat
tcatctagtcattgccagaagagtgggtctcctatttcctcaaaagagcggcggcgcagg
gataagcagcatcttgatgacatcacagcagctcggcttctaccacttcaccatatgccc
acgcagctgctctccatagaagaatccttggcacttcagaaacagcagaaacagaattat
gaggagatgcaagcaaagctcgcagcgcaaaaattagctgaaagaccgaatattaaaatg
cggagttataatccagaaggggagtcttcagggagataccgagaagtaagggatgaagat
gacgattggtcctctgatgaattctga

>gi568815594f:67938544_68139647|GENSCAN_predicted_peptide_6|458_aa
XPVEFSEAEFSRAEYQRKQQFWDSVRLALFTLAIVAIIGIAIGIVTHFVVEEICLMYVGL
LNILVSLSGKVPFWLYLGSRLATPPTSSQLFFIGGKERSPDEQGVDILIVLIFRYPSTDS
AEQIKKKIEKALYQSLKTKQLSLTINKPSFRLTRCGIRMTSSNMPLPASSSTQRIVQGRE
TAMEGEWPWQASLQLIGSGHQCGASLISNTWLLTAAHCFWKNKDPTQWIATFGATITPPA
VKRNVRKIILHENYHRETNENDIALVQLSTGVEFSNIVQRVCLPDSSIKLPPKTSVFVTG
FGSIVDDGPIQNTLRQARVETISTDVCNRKDVYDGLITPGMLCAGFMEGKIDACKCWLVG
AVAVVTLYGTLLGVCEWPGPLAMGPGKKSVSAYIFLSITRPGSAGSSSLRTDTNELPGQI
SSAQVSLPEEKQIKGHRMFYENLWCVKSLPLKEMNIRA

>gi568815594f:67938544_68139647|GENSCAN_predicted_CDS_6|1377_bp
nnacctgttgaattttcagaagctgaattctcacgagctgaatatcaaagaaagcagcaa
ttttgggactcagtacggctagctcttttcacattagcaattgtagcaatcataggaatt
gcaattggtattgttactcattttgttgttgaggagatctgtttgatgtatgttggctta
ctcaatattttggtctctctcagtgggaaagttcctttctggctgtatctgggcagccgt
cttgccactcccccaacaagttctcagttatttttcattggtggcaaagagaggagtcca
gatgaacaaggtgtggatattcttatagtgctcatatttcgatacccatctactgatagt
gctgaacaaatcaagaaaaaaattgaaaaggctttatatcaaagtttgaagaccaaacaa
ttgtctttgaccataaacaaaccatcatttagactcacacgctgtggaataaggatgaca
tcttcaaacatgccattaccagcatcctcttctactcaaagaattgtccaaggaagggaa
acagctatggaaggggaatggccatggcaggccagcctccagctcatagggtcaggccat
cagtgtggagccagcctcatcagtaacacatggctgctcacagcagctcactgcttttgg
aaaaataaagacccaactcaatggattgctacttttggtgcaactataacaccacccgca
gtgaaacgaaatgtgaggaaaattattcttcatgagaattaccatagagaaacaaatgaa
aatgacattgctttggttcagctctctactggagttgagttttcaaatatagtccagaga
gtttgcctcccagactcatctataaagttgccacctaaaacaagtgtgttcgtcacagga
tttggatccattgtagatgatggacctatacaaaatacacttcggcaagccagagtggaa
accataagcactgatgtgtgtaacagaaaggatgtgtatgatggcctgataactccagga
atgttatgtgctggattcatggaaggaaaaatagatgcatgtaagtgctggcttgtaggg
gctgtggctgttgtgactctttacggcaccctccttggtgtctgtgagtggcccggtccc
ctagccatgggcccaggcaaaaaatctgtgtctgcgtacattttcttatccatcactcgg
cctgggtctgcaggcagctcctccttaagaacagacacaaatgaattgccaggccagata
tcatcagcacaagtatcattgcctgaagaaaagcaaatcaagggccatagaatgttttat
gaaaacctctggtgtgttaaaagcttgcctctgaaagagatgaatatcagggcctga