GENSCAN 1.0 Date run: 4-Nov-116 Time: 17:30:38 Sequence gi568815578f:50091034_50292068 : 201035 bp : 50.99% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Term + 5702 5791 90 1 0 88 38 118 0.751 4.52 1.02 PlyA + 6241 6246 6 1.05 2.00 Prom + 16523 16562 40 0.74 2.01 Init + 17699 17721 23 1 2 99 59 19 0.342 -0.73 2.02 Intr + 22118 22318 201 2 0 51 53 147 0.280 5.90 2.03 Term + 24853 24919 67 1 1 88 48 71 0.308 0.51 2.04 PlyA + 28240 28245 6 1.05 3.09 PlyA - 32754 32749 6 1.05 3.08 Term - 34146 34025 122 1 2 97 32 198 0.999 13.74 3.07 Intr - 37154 36942 213 0 0 76 80 331 0.996 29.79 3.06 Intr - 38657 38513 145 2 1 90 85 312 0.999 30.96 3.05 Intr - 39914 39823 92 0 2 98 113 70 0.828 10.21 3.04 Intr - 48424 48303 122 0 2 -3 61 159 0.551 4.24 3.03 Intr - 49732 49630 103 2 1 64 65 66 0.843 1.13 3.02 Intr - 52588 52469 120 2 0 130 97 87 0.986 14.27 3.01 Init - 62604 62484 121 0 1 107 110 199 0.971 24.35 3.00 Prom - 87364 87325 40 -4.56 4.00 Prom + 91579 91618 40 -4.96 4.01 Sngl + 100001 101038 1038 1 0 41 43 2102 0.909 196.13 4.02 PlyA + 101610 101615 6 1.05 5.06 PlyA - 101676 101671 6 1.05 5.05 Term - 111960 111657 304 2 1 27 41 183 0.834 2.14 5.04 Intr - 119401 119215 187 2 1 108 80 111 0.946 11.05 5.03 Intr - 122551 122433 119 0 2 126 54 63 0.959 6.81 5.02 Intr - 124801 124672 130 2 1 105 75 -10 0.842 -0.85 5.01 Init - 127154 127073 82 2 1 75 82 71 0.783 6.33 5.00 Prom - 127452 127413 40 -13.43 6.00 Prom + 127600 127639 40 -6.66 6.01 Sngl + 128843 129139 297 1 0 111 49 211 0.979 15.30 6.02 PlyA + 129449 129454 6 1.05 7.00 Prom + 136647 136686 40 -3.36 7.01 Init + 146425 146552 128 0 2 92 47 133 0.060 9.26 7.02 Intr + 152506 152604 99 1 0 93 92 111 0.203 11.23 7.03 Intr + 162363 162574 212 0 2 16 64 134 0.002 2.36 7.04 Intr + 178670 178838 169 0 1 63 42 63 0.067 -1.80 7.05 Intr + 179692 179828 137 1 2 55 46 99 0.062 2.61 7.06 Intr + 182938 183084 147 2 0 85 65 26 0.154 0.21 7.07 Intr + 186810 186947 138 1 0 62 88 32 0.033 1.04 7.08 Intr + 187710 187733 24 1 0 82 115 16 0.061 1.70 7.09 Intr + 188778 189012 235 1 1 61 -33 162 0.377 -1.85 7.10 Term + 192226 192400 175 1 1 107 42 136 0.229 8.13 7.11 PlyA + 193616 193621 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815578f:50091034_50292068|GENSCAN_predicted_peptide_1|29_aa TPANCAISYSFLAFFEFFQQSEIARDFYS >gi568815578f:50091034_50292068|GENSCAN_predicted_CDS_1|90_bp accccagctaactgtgccatctcctactcctttctggccttcttcgagttcttccaacag tcggaaattgcgagggacttttactcctaa >gi568815578f:50091034_50292068|GENSCAN_predicted_peptide_2|96_aa MGLQAQEVPAPSSPPPYPSPGPDAQARAHAHTPPLTPAPETCTTRGLEGSFPSLLEKGNI CVLEGPATCKSPHIGKLLASKIDNNNSRAELIAGRG >gi568815578f:50091034_50292068|GENSCAN_predicted_CDS_2|291_bp atggggctccaagcccaagaggtgccggccccttcttcaccccccccttacccgtccccc ggccctgatgcgcaggcgcgcgcgcacgcacatacgccgccgctgacgcccgccccggag acctgcacgacccgtggcctggaaggctcttttccttcccttttagagaaaggaaatatc tgcgtgctcgagggcccagcaacctgcaagtccccacacatcgggaaacttcttgcaagt aaaattgacaataataattcccgtgccgagctcatcgcaggaagaggttag >gi568815578f:50091034_50292068|GENSCAN_predicted_peptide_3|345_aa MAGAENWPGQQLELDEDEASCCRWGAQHAGARELAALYSPGKRLQEWCSVILCFSLIAHN LVHLLLLARWEDTPLVILGVGRSQKSRKAEARKDVELINTDTRDAALGSKIKLSGSLGGR LGTATEWTSKVASKGTFGFSHVEVISHCDTHIVVVIAGALIADFLSGLVHWGADTWGSVE LPIVGKAFIRPFREHHIDPTAITRHDFIETNGDNCLVTLLPLLNMAYKFRTHSPEALEQL YPWECFVFCLIIFGTFTNQIHKWSHTYFGLPRWVTLLQDWHVILPRKHHRIHHVSPHETY FCITTGWLNYPLEKIGFWRRLEDLIQGLTGEKPRADDMKWAQKIK >gi568815578f:50091034_50292068|GENSCAN_predicted_CDS_3|1038_bp atggcgggcgccgagaactggccgggccagcagctggagctggacgaggacgaggcgtct tgttgccgctggggcgcgcagcacgccggggcccgcgagctggctgcgctctactcgcca ggcaagcgcctccaggagtggtgctctgtgatcctgtgcttcagcctcatcgcccacaac ctggtccatctcctgctgctggcccgctgggaggacacacccctcgtcatactcggtgtt ggcaggtcccagaaatcaaggaaggcagaagcacgaaaagatgtcgagttaataaacaca gacacacgtgatgcagcattgggaagcaaaatcaagctaagcgggagtcttgggggtcga ctgggaacagccaccgagtggaccagcaaggtggccagtaaagggacctttggatttagc cacgtggaggtcatcagccactgtgacacgcacatcgttgtcgtgattgcaggggctctc attgctgacttcttgtctggcctggtacactggggtgctgacacatggggctctgtggag ctgcccattgtggggaaggctttcatccgacccttccgggagcaccacattgacccgaca gctatcacacggcacgacttcatcgagaccaacggggacaactgcctggtgacactgctg ccgctgctaaacatggcctacaagttccgcacccacagccctgaagccctggagcagcta tacccctgggagtgcttcgtcttctgcctgatcatcttcggcaccttcaccaaccagatc cacaagtggtcgcacacgtactttgggctgccacgctgggtcaccctcctgcaggactgg catgtcatcctgccacgtaaacaccatcgcatccaccacgtctcaccccacgagacctac ttctgcatcaccacaggctggctcaactaccctctggagaagataggcttctggcgacgc ctggaggacctcatccagggcctgacgggcgagaagcctcgggcagatgacatgaaatgg gcccagaagatcaaataa >gi568815578f:50091034_50292068|GENSCAN_predicted_peptide_4|345_aa MQRLVAWDPACLPLPPPPPAFKSMEVANFYYEADCLAAAYGGKAAPAAPPAARPGPRPPA GELGSIGDHERAIDFSPYLEPLGAPQAPAPATATDTFEAAPPAPAPAPASSGQHHDFLSD LFSDDYGGKNCKKPAEYGYVSLGRLGAAKGALHPGCFAPLHPPPPPPPPPAELKAEPGFE PADCKRKEEAGAPGGGAGMAAGFPYALRAYLGYQAVPSGSSGSLSTSSSSSPPGTPSPAD AKAPPTACYAGAAPAPSQVKSKAKKTVDKHSDEYKIRRERNNIAVRKSRDKAKMRNLETQ HKVLELTAENERLQKKVEQLSRELSTLRNLFKQLPEPLLASSGHC >gi568815578f:50091034_50292068|GENSCAN_predicted_CDS_4|1038_bp atgcaacgcctggtggcctgggacccagcatgtctccccctgccgccgccgccgcctgcc tttaaatccatggaagtggccaacttctactacgaggcggactgcttggctgctgcgtac ggcggcaaggcggcccccgcggcgccccccgcggccagacccgggccgcgcccccccgcc ggcgagctgggcagcatcggcgaccacgagcgcgccatcgacttcagcccgtacctggag ccgctgggcgcgccgcaggccccggcgcccgccacggccacggacaccttcgaggcggct ccgcccgcgcccgcccccgcgcccgcctcctccgggcagcaccacgacttcctctccgac ctcttctccgacgactacgggggcaagaactgcaagaagccggccgagtacggctacgtg agcctggggcgcctgggggccgccaagggcgcgctgcaccccggctgcttcgcgcccctg cacccaccgcccccgccgccgccgccgcccgccgagctcaaggcggagccgggcttcgag cccgcggactgcaagcggaaggaggaggccggggcgccgggcggcggcgcaggcatggcg gcgggcttcccgtacgcgctgcgcgcttacctcggctaccaggcggtgccgagcggcagc agcgggagcctctccacgtcctcctcgtccagcccgcccggcacgccgagccccgctgac gccaaggcgcccccgaccgcctgctacgcgggggccgcgccggcgccctcgcaggtcaag agcaaggccaagaagaccgtggacaagcacagcgacgagtacaagatccggcgcgagcgc aacaacatcgccgtgcgcaagagccgcgacaaggccaagatgcgcaacctggagacgcag cacaaggtcctggagctcacggccgagaacgagcggctgcagaagaaggtggagcagctg tcgcgcgagctcagcaccctgcggaacttgttcaagcagctgcccgagcccctgctcgcc tcctccggccactgctag >gi568815578f:50091034_50292068|GENSCAN_predicted_peptide_5|273_aa MLRGHFHTCLIGPQDTGKPQAFRELGAGTTLVSDSCVFFQNCPRIPTLLFLHVCFNTSGE TIGTVCTCCFITHHRTTCIPHPNNGPAMKVGLGAALFSSVSPGLAQHLVCGKETEVSEVN DEDNCPQRVFLSTQETTQVKVLGDHAGCHSTARTGHIKMQGPRSQIIKNFKARAGKVTLQ MRSLAAISSIKSSKSAPSAMGRGEEHNIASLIFLPELQNLNLTTMKHQTNPQGKTCCNAV QQLPCILKSVKVMKVKERLRNCSTFKATREMWN >gi568815578f:50091034_50292068|GENSCAN_predicted_CDS_5|822_bp atgctacgaggacacttccacacctgcctcattgggccccaggacactggtaaacctcag gcattcagggaacttggggcagggaccaccctggtctcagattcttgtgtgttcttccag aattgtccacgaatacccacacttctcttcttgcatgtatgttttaacacaagtggtgaa acaataggcaccgtttgcacctgctgtttcatcactcatcatcggaccacctgtatcccc caccccaataatggacccgccatgaaggtggggcttggggccgccttattctcttctgta tccccgggcttagcacagcacctggtgtgtggtaaggaaactgaggtcagtgaggttaac gatgaggacaactgccctcaaagagtgttcctgagcactcaggagacaactcaggtgaaa gtactaggtgaccatgccggatgccacagtacagccaggactggccacatcaagatgcag ggccctcgttcacaaatcattaagaatttcaaggccagagcggggaaagtaaccctacag atgcgaagtctggcagctatctcctcaatcaagtcatcaaagagtgcaccatcagcaatg ggacgaggagaagaacataacatcgcttctttgatttttctgccagagctgcagaacctg aatctcaccacaatgaagcatcagacaaacccacagggaaagacatgctgtaatgctgta caacagctgccctgtatcctcaaaagtgtcaaggtcatgaaggtcaaggaaagactaagg aactgttctaccttcaaggcaactagagagatgtggaactaa >gi568815578f:50091034_50292068|GENSCAN_predicted_peptide_6|98_aa MAAKAPAISSSYSQAGKALTCPVKNEETIPRSPLADFPISLATDAFHVQPLPIAASGEGD LISICASNVVGGVHPHPLSGDLAAYERRPLNLGHFDYS >gi568815578f:50091034_50292068|GENSCAN_predicted_CDS_6|297_bp atggctgccaaagctcctgccatctcttcctcatacagccaagcaggaaaggccctcaca tgtcctgttaagaatgaagaaaccattcccagaagccccttggcagacttccccatctca ctggccacagatgcattccatgtccagcctctgccaattgctgcaagtggggaaggggac ctaatcagcatctgtgcctcgaatgtcgtgggtggagtccacccacacccactgagcggg gatctggctgcctacgagcgcagacccctgaacttgggtcattttgactattcctag >gi568815578f:50091034_50292068|GENSCAN_predicted_peptide_7|487_aa MHIFVQPPLCAGQSAYGEFPGSTGSTKKGKLNRRGQHDPKACGHRSAAPDMPEEATGSGA LGPDKSGFGFMLDDFGCLQDELAKRQLRCEQNGLIGSRHNSTTERVSQRTMSHQLSNETI VVWKPEIRTHRMTGGMTDQMWKMFQPANSIGSACGLNPELDKSSLLCLVIARLPGQNLRP GLPTHPTQPSTLSQRAPVGTQVRKDVAPPTEADVRSLHRFRLRNPMAASAGSHGATLWPA FKDHIVATAPPAPALLAAWEPADASSELPFPDQVCIQGSEPETPSQLGETSSREVPQGPV ITTYPGQWVREITLVGRAVARVLTWPPAGPMGTVWPGFMADIPVPSQCLADEEIQCDEID ESALQGAGNSGSIDNESPTISSWLKYKNCHGLWRNDKNSKELPCTGHPTPYTLTSSPQLA WIYRPPLPHCPDSFLLFLKKPWVFPPQGLCRSLEPLSTMDVHGSLTSFRSELGCHSRQPP SLTLPLA >gi568815578f:50091034_50292068|GENSCAN_predicted_CDS_7|1464_bp atgcacatctttgtgcagccgcctttgtgtgcaggacagtcagcttatggggaattcccg ggatcaacaggaagtaccaagaaggggaaactgaaccgcaggggccagcatgaccccaag gcctgtgggcacagatcagctgcccctgacatgccagaggaggccacgggatctggggct ttggggcctgacaagtctgggtttggattcatgctggatgattttgggtgtctacaggat gaattagccaaacgacaactccggtgtgagcagaacgggctcattgggtcaagacacaat tccaccacagaacgggtttcacaaaggaccatgtcccatcaactcagcaatgaaaccatc gtggtctggaaacctgagatcaggacccacagaatgactggcggaatgacagaccaaatg tggaagatgttccaaccagcaaattccatcggctctgcctgtggactgaatccagaactc gacaagtcttccctcctctgtctggtcattgctcgcctgcctgggcagaatctccgccct ggactccccacccaccccacacagccctccacactcagccagagggctcctgttggaacc caagtcaggaaggacgtggcccctccaaccgaggcggatgtacggtctctccatagattc cgcctgcgtaatcccatggccgcttcagctgggtctcatggggccaccctgtggcctgcg ttcaaggaccacatcgtagcaacagcaccccctgctccagccctgctggctgcctgggag ccggcagatgcttcctctgagctgccttttccagatcaagtttgcatccaaggaagcgag cctgagactcccagccagctgggggagacaagttcgagggaagtaccccaagggcctgta attaccacttaccctggtcaatgggtccgagagattacccttgtaggcagggctgtggcc agggtgctcacctggcccccagcaggtcccatgggcactgtctggccgggcttcatggct gacattccagtgccatcccagtgcctggcagatgaggagattcaatgtgatgaaattgat gagagtgccctgcagggagccggaaactcagggagcatcgataatgagtcccccaccatc agcagctggcttaaatataaaaactgtcatggcctctggagaaatgacaagaattcgaag gagcttccctgcactggccaccctacaccctacacccttacttcctcccctcaacttgcc tggatttatcgacccccactaccccactgccctgatagcttcctgctgttcctcaagaag ccctgggtctttccacctcaaggtctttgcaggtccctggagcccttgtccaccatggat gtccacggctccctcacctccttcaggtctgagctcggatgccactctcgacagccacct tctctgacccttcccctggcttaa