GENSCAN 1.0    Date run: 4-Nov-116    Time: 02:24:28

Sequence gi568815597f:151440367_151681704 : 241338 bp : 44.05% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.03 Intr -    720    562  159  1  0  118  115    85 0.994  14.16
 1.02 Intr -   1871   1715  157  2  1   44   92   188 0.037  14.38
 1.01 Init -  18682  18584   99  1  0   69   33   166 0.936   7.46
 1.00 Prom -  25751  25712   40                              -4.06

 2.00 Prom +  25980  26019   40                              -3.16
 2.01 Init +  39658  39683   26  0  2   81   78    19 0.021  -0.64
 2.02 Intr +  62505  62814  310  0  1  116   31    99 0.088   3.32
 2.03 Intr +  70712  70835  124  1  1  -10   36   199 0.044   5.06
 2.04 Intr +  78140  79026  887  0  2  119   71   802 0.125  72.64
 2.05 Intr +  79800  79900  101  2  2  102   84   104 0.999  10.31
 2.06 Intr +  80048  80117   70  2  1  116   89    23 0.713   4.38
 2.07 Intr +  80230  80325   96  0  0   81  116   110 0.989  13.31
 2.08 Intr +  83068  83195  128  0  2  103   64   134 0.993  11.98
 2.09 Intr +  83860  83992  133  1  1   36   97   182 0.990  14.55
 2.10 Intr +  84308  84538  231  1  0   97    1   278 0.944  17.97
 2.11 Intr +  85276  85424  149  0  2  103   55   132 0.999  10.43
 2.12 Intr +  86609  86741  133  2  1   63   63   197 0.979  15.35
 2.13 Intr +  88984  89202  219  0  0  107  -12   223 0.814  12.70
 2.14 Intr +  89543  89749  207  1  0   78   78   309 0.996  28.17
 2.15 Intr +  90123  90380  258  2  0   98   59   383 0.972  34.06
 2.16 Intr +  92036  92206  171  1  0   75   60   195 0.995  15.54
 2.17 Intr +  93609  93770  162  2  0   57   78   200 0.980  16.07
 2.18 Intr +  94676  94765   90  1  0   77   91   144 0.998  13.79
 2.19 Intr +  95234  95359  126  1  0   73  105    11 0.502   2.18
 2.20 Intr +  95414  95532  119  1  2   84   46   105 0.981   5.16
 2.21 Intr +  95871  95979  109  0  1   76   94   209 0.999  20.59
 2.22 Intr +  96364  96527  164  0  2   37   60   288 0.968  19.67
 2.23 Intr +  99850 100060  211  1  1   50   95   111 0.306   6.92
 2.24 Intr + 102212 102266   55  1  1   58   78    40 0.111  -1.45
 2.25 Intr + 117072 117397  326  1  2   34   47   356 0.003  21.59
 2.26 Intr + 122219 122320  102  1  0  102  121    68 0.976  11.87
 2.27 Intr + 123538 123624   87  0  0   89   64    87 0.980   6.57
 2.28 Intr + 124159 124248   90  0  0  115   90    35 0.989   6.59
 2.29 Intr + 125797 125862   66  0  0   95   92    31 0.907   3.30
 2.30 Intr + 129291 129404  114  2  0   93   78   103 0.999  10.44
 2.31 Intr + 133904 134032  129  1  0  106   43   113 0.811   9.49
 2.32 Intr + 134545 134639   95  0  2   55   78   119 0.954   6.36
 2.33 Intr + 138355 138460  106  1  1  111  111   162 0.869  21.02
 2.34 Intr + 139283 139366   84  1  0   68   85   141 0.958  11.82
 2.35 Intr + 140576 140676  101  1  2   59  109    91 0.993   7.21
 2.36 Term + 141278 141341   64  2  1   79   49    83 0.792   0.86
 2.37 PlyA + 143190 143195    6                               1.05

 3.00 Prom + 155273 155312   40                              -3.36
 3.01 Init + 171836 172146  311  1  2  100   59   549 0.492  49.99
 3.02 Intr + 198522 198753  232  0  1  110   89   171 0.566  17.18
 3.03 Intr + 217869 218061  193  2  1  102   84   108 0.906  10.87
 3.04 Intr + 220432 220496   65  2  2   65   92    74 0.999   3.94
 3.05 Intr + 221800 221904  105  0  0  121   99    55 0.987  10.31
 3.06 Intr + 225567 225645   79  2  1   63  111    50 0.972   3.92
 3.07 Intr + 228106 228269  164  2  2  102   96   132 0.693  15.09
 3.08 Term + 234666 234740   75  2  0   64   32    68 0.081  -3.36
 3.09 PlyA + 237545 237550    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init -   1838   1715  124  2  1   64   92   201 0.909  18.43
S.002 Init +  78172  79026  855  0  0  107   71   799 0.863  74.74

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815597f:151440367_151681704|GENSCAN_predicted_peptide_1|139_aa
MGPVRAGVGAGGRRCAPPPTASGGAARGRGGRGAHILFECFSYSMADTDLFMECEEEELE
PWQKISDVIEDSVVEDYNSVDKTTTVSVSQQPVSAPVPIAAHASVAGHLSTSTTVSSSGA
QNSDSTKKTLVTLIANNNX

>gi568815597f:151440367_151681704|GENSCAN_predicted_CDS_1|417_bp
atggggccggtacgcgcgggggtgggggcggggggccggcggtgcgcgcccccgcccacc
gcgagtggcggcgcggcccgcggcaggggcggccgcggggcccatatcttatttgaatgt
ttttcttatagcatggcggacaccgacctgttcatggaatgtgaggaggaggagttggag
ccatggcagaaaatcagtgatgtcattgaggactctgtagttgaagattataattcagtg
gataaaactaccacagtttctgtgagccagcagccagtctcggctccagtgcccatcgct
gcccatgcttctgttgctgggcacctctctacatccaccaccgttagtagcagcggggca
cagaacagcgacagtacaaagaagactcttgtcacactaattgccaacaacaatgnn

>gi568815597f:151440367_151681704|GENSCAN_predicted_peptide_2|1880_aa
MVVCAYSTRPLLEWSPESVQVVWGADTETKLGVQEAYWLATPVKDKEGGRNGQRKTLDRL
PYPDKVKEGRSKIGQEKPQAVVQILPSGSLPSGKHQGKGYHIEKSTVGRNGQSPEDALAD
GSRDLGRRSLATIRDAPKMASAPLAAPPRRSAARLLLFMEQAPNMAEPRGPVDHGVQIRF
ITEPVSGAEMGTLRRGGRRPAKDARASTYGVAVRVQGIAGQPFVVLNSGEKGGDSFGVQI
KGANDQGASGALSSDLELPENPYSQVKGFPAPSQSSTSDEEPGAYWNGKLLRSHSQASLA
GPGPVDPSNRSNSMLELAPKVASPGSTIDTAPLSSVDSLINKFDSQLGGQARGRTGRRTR
MLPPEQRKRSKSLDSRLPRDTFEERERQSTNHWTSSTKYDNHVGTSKQPAQSQNLSPLSG
FSRSRQTQDWVLQSFEEPRRSAQDPTMLQFKSTPDLLRDQQEAAPPGSVDHMKATIYGIL
REGSSESETSVRRKVSLVLEKMQPLVMVSSGSTKAVAGQGELTRKVEELQRKLDEEVKKR
QKLEPSQVGLERQLEEKTEECSRLQELLERRKGEAQQSNKELQNMKRLLDQGEDLRHGLE
TQVMELQNKLKHVQGPEPAKEVLLKDLLETRELLEEVLEGKQRVEEQLRLRERELTALKG
ALKEEVASRDQEVEHVRQQYQRDTEQLRRSMQDATQACDKSRDHAVLEAERQKMSALVRG
LQRELEETSEETGHWQSMFQKNKEDLRATKQELLQLRMEKEEMEEELGEKIEVLQRELEQ
ARASAGDTRQVEVLKKELLRTQEELKELQAERQSQEVAGRHRDRELEKQLAVLRVEADRG
RELEEQNLQLQKTLQQLRQDCEEASKARGAKMVAEAEATVLGQRRAAVETTLRETQEEND
EFRRRILGLEQQLKETRGLVDGGEAVEARLRDKLQRLEAEKQQLEEALNASQEEEGSLAA
AKRALEARLEEAQRGLARLGQEQQTLNRALEEEGKQREVLRRGKAELEEQKRLLDRTVDR
LNKELEKIGEDSKQALQQLQAQLEDYKEKARREVADAQRQAKDWASEAEKTSGGLSRLQD
EIQRLRQALQASQAERDTARLDKELLAQRLQGLEQEAENKKRSQDDRARQLKGLEEKVSR
LETELDEEKNTVELLTDRVNRGRDQVDQLRTELMQERSARQDLECDKISLERQVMGEGRI
LRDGPQENKDLKTRLASSEGFQKPSASLSQLESQNQLLQERLQAEEREKTVLQSTNRKLE
RKVKELSIQIEDERQHVNDQKDQLSLRVKALKRQVDEAEEEIERLDGLRKKAQREVEEQH
EVNEQLQARIKSLEKDSCSLRNRELGARPIFSPAPDGPIQGGGSARARPVGARQRGGQVA
CVLRPRGKMNGTRNWCTLVDVHPEDQAAKKLLRENNGQGIQMVSIIATEMLMPKNRIAIY
ELLFKEGVMVAKKDVHMPKHRELADKDVPNLHVMKAMQPLKSRGYVKEQFAWRHFHWYVT
NEGIQYLRDYLHLPRRLCLPLYAAAVQRLEVLSLKAGRKTYAMVSSHSAGHSLASELVES
HDGHEEIIKVYLKGRSGDKMIHEKNINQLKSEVQYIQEARNCLQKLREDISSKLDRNLGD
SLHRQEIQVVLEKPNGFSQSPTALYSSPPEVDTCINEDVESLRKTVQDLLAKLQEAKRQH
QSDCVAFEVTLSRYQREAEQSNVALQREEDRVEQKEAEVGELQRRLLGMETEHQALLAKV
REGEVALEELRSNNADCQAEREKAATLEKEVAGLREKIHHLDDMLKSQQRKVRQMIEQLQ
NSKAVIQSKDATIQELKEKIAYLEAENLEMHDRMEHLIEKQISHGNFSTQARAKTENPGS
IRISKPPSPKPMPVIRVVET

>gi568815597f:151440367_151681704|GENSCAN_predicted_CDS_2|5643_bp
atggtggtgtgcgcctatagtaccagaccacttctggagtggtcacctgagtcagttcag
gtggtctggggagcagatactgagacaaagttaggagtgcaagaggcttactggctggca
acacctgtgaaagataaagaaggaggcaggaatgggcagagaaagactttggataggctt
ccctatcctgataaagtaaaggaaggcagaagcaaaatcgggcaagaaaaacctcaagct
gtggtgcagatccttccaagtggcagtctacccagtggaaagcaccagggcaaaggttat
cacatagagaaatcaacagtgggcagaaacggccagagcccagaagatgctctggccgac
ggctcccgggatcttggccggcggtcactcgcgaccatccgcgacgcccccaaaatggcc
tccgcgcccctcgccgccccgccccgacgctccgcagcccgactcctcctatttatggag
caggcacccaacatggctgagccccggggccccgtagaccatggagtccagattcgcttc
atcacagagccagtgagtggtgcagagatgggcactctacgtcgaggtggacgacgccca
gctaaggatgcaagagccagtacctacggggttgctgtgcgtgtgcagggaatcgctggg
cagccctttgtggtgctcaacagtggggagaaaggcggtgactcctttggggtccaaatc
aagggggccaatgaccaaggggcctcaggagctctgagctcagatttggaactccctgag
aacccctactctcaggtcaagggatttcctgccccctcgcagagcagcacatctgatgag
gagcctggggcctactggaatggaaagctactccgttcccactcccaggcctcactggca
ggccctggcccagtggatcctagtaacagaagcaacagcatgctggagctagccccgaaa
gtggcttccccaggtagcaccattgacactgctcccctgtcttcagtggactcactcatc
aacaagtttgacagtcaacttggaggccaggcccggggtcggactggccgccgaacacgg
atgctaccccctgaacagcgcaaacggagcaagagcctggacagccgcctcccacgggac
acctttgaggaacgggagcgccagtccaccaaccactggacctctagcacaaaatatgac
aaccatgtgggcacttcgaagcagccagcccagagccagaacctgagtcctctcagtggc
tttagccgttctcgtcagactcaggactgggtccttcagagttttgaggagccgcggagg
agtgcacaggaccccaccatgctgcagttcaaatcaactccagacctccttcgagaccag
caggaggcagccccaccaggcagtgtggaccatatgaaggccaccatctatggcatcctg
agggagggaagctcagaaagtgaaacctctgtgaggaggaaggttagtttggtgctggag
aagatgcagcctctagtgatggtttcttctggttctactaaggccgtggcagggcagggt
gagcttacccgaaaagtggaggagctacagcgaaagctggatgaagaggtgaagaagcgg
cagaagctagagccatcccaagttgggctggagcggcagctggaggagaaaacagaagag
tgcagccgactgcaggagctgctggagaggaggaagggggaggcccagcagagcaacaag
gagctccagaacatgaagcgcctcttggaccagggtgaagatttacgacatgggctggag
acccaggtgatggagctgcagaacaagctgaaacatgtccagggtcctgagcctgctaag
gaggtgttactgaaggacctgttagagacccgggaacttctggaagaggtcttggagggg
aaacagcgagtagaggagcagctgaggctgcgggagcgggagttgacagccctgaagggg
gccctgaaagaggaggtagcctcccgtgaccaggaggtggaacatgtccggcagcagtac
cagcgagacacagagcagctccgcaggagcatgcaagatgcaacccaggcatgtgacaag
agcagggaccatgcagtgctggaggccgagaggcagaagatgtcagcccttgtgcgaggg
ctgcagagggagctggaggagacttcagaggagacagggcattggcagagtatgttccag
aagaacaaggaggatcttagagccaccaagcaggaactcctgcagctgcgaatggagaag
gaggagatggaagaggagcttggagagaagatagaggtcttgcagagggaattagagcag
gcccgagctagtgctggagatactcgccaggttgaggtgctcaagaaggagctgctccgg
acacaggaggagcttaaggaactgcaggcagaacggcagagccaggaggtggctgggcga
caccgggaccgggagttggagaagcagctggcggtcctgagggtcgaggctgatcgaggt
cgggagctggaagaacagaacctccagctacaaaagaccctccagcaactgcgacaggac
tgtgaagaggcttccaaggcaaggggagctaagatggtggccgaggcagaggcaacagtg
ctggggcagcggcgggccgcagtggagacgacgcttcgggagacccaggaggaaaatgac
gaattccgccggcgcatcctgggtttggagcagcagctgaaggagactcgaggtctggtg
gatggtggggaagcggtggaggcacgactacgggacaagctgcagcggctggaggcagag
aaacagcagctggaggaggccctgaatgcgtcccaggaagaggaggggagtctggcagca
gccaagcgggcactggaggcacgcctagaggaggctcagcgggggctggcccgcctgggg
caggagcagcagacactgaaccgggccctggaggaggaagggaagcagcgggaggtgctc
cggcgaggcaaggctgagctggaggagcagaagcgtttgctggacaggactgtggaccga
ctgaacaaggagttggagaagatcggggaggactctaagcaagccctgcagcagctccag
gcccagctggaggattataaggaaaaggcccggcgggaggtggcagatgcccagcgccag
gccaaggattgggccagtgaggctgagaagacctctggaggactgagccgacttcaggat
gagatccagaggctgcggcaggccctgcaggcatcccaggctgagcgggacacagcccgg
ctggacaaagagctactggcccagcgactgcaggggctggagcaagaggcagagaacaag
aagcgttcccaggacgacagggcccggcagctgaagggtctcgaggaaaaagtctcacgg
ctggaaacagagttagatgaggagaagaacaccgtggagctgctaacagatcgggtgaat
cgtggccgggaccaggtggatcagctgaggacagagctcatgcaggaaaggtctgctcgg
caggacctggagtgtgacaaaatctccttggagagacaggtgatgggggaggggaggatt
cttagggatggaccccaggagaacaaggacctgaagacccggttggccagctcagaaggc
ttccagaagcctagtgccagcctctctcagcttgagtcccagaatcagttgttgcaggag
cggctacaggctgaagagagggagaagacagttctgcagtctaccaatcgaaaactggag
cggaaagttaaagaactatccatccagattgaagacgagcggcagcatgtcaatgaccag
aaagaccagctaagcctgagggtgaaggctttgaagcgtcaggtggatgaagcagaagag
gaaattgagcgactggacggcctgaggaagaaggcccagcgtgaggtggaggagcagcat
gaggtcaatgaacagctccaggcccggatcaagtctctggagaaggactcctgttctctg
cgaaaccgcgaactgggggcgcggcctatcttcagccccgcccctgatgggcctatacaa
gggggcggttccgcgcgcgcccgcccagttggagccagacagcggggtggacaagtggcg
tgtgtgctgcgaccccgagggaagatgaacgggacgcggaactggtgtaccctggtggac
gtgcacccagaggaccaggcggcgaaaaaattactaagagaaaataatggacaaggaatt
cagatggtctccatcatagccaccgagatgttgatgcctaagaaccggattgccatttat
gaactcctttttaaggagggagtcatggtggccaagaaggatgtccacatgcctaagcac
cgggagctggcagacaaggatgtgcccaatcttcatgtcatgaaggccatgcagcctctc
aagtcccgaggctacgtgaaggaacagtttgcctggagacatttccactggtacgttacc
aatgagggcatccagtatctccgtgattaccttcatctgccccggagattgtgcctgcca
ctctatgctgcagccgtccagagactggaagtcctcagcctaaaggcgggcaggaagacc
tatgccatggtgtccagccactcagctggtcattctctggcttcagaactggtggagtcc
catgatggacatgaggagatcattaaggtgtacttgaaggggaggtctggagacaagatg
attcacgagaagaatattaaccagctgaagagtgaggtccagtacatccaggaggccagg
aactgcctacagaagctccgggaggatataagtagcaagcttgacaggaacctaggagat
tctctccatcgacaggagatacaggtggtgctagaaaagccaaatggctttagtcagagt
cccacagccctgtacagcagcccacctgaggtggacacctgtataaatgaggatgttgag
agcttgaggaagacggtgcaggacttgctggccaagcttcaggaggccaagcggcaacac
cagtcagactgtgtggcttttgaggtcacactcagccggtaccagagggaagcagaacaa
agtaatgtggcccttcagagagaggaggacagagtggagcagaaagaggcagaagtcgga
gagctgcagaggcgcttgctagggatggagacggagcatcaggccttactggcgaaagtg
agggaaggggaggtggccctagaggaacttcggagcaacaatgctgactgccaagcagaa
cgagaaaaggctgctaccctggaaaaggaagtggccgggttgcgggagaagatccaccac
ttggatgacatgctcaagagccagcagcggaaagtccggcaaatgatagagcagctccag
aattcaaaagctgtgatccagtcaaaggacgccaccatccaggagctcaaggagaaaatc
gcctatctggaggcagagaatttagagatgcatgaccggatggaacacctgatagaaaaa
caaatcagtcatggcaacttcagcacccaggcccgggccaagacagagaacccgggcagt
attaggatatccaagccgcctagcccgaagcccatgcctgtcatccgagtggtggaaacc
tga

>gi568815597f:151440367_151681704|GENSCAN_predicted_peptide_3|407_aa
MADEDGEGIHPSAPHRNGGGGGGGGSGLHCAGNGGGGGGGPRVVRIVKSESGYGFNVRGQ
VSEGGQLRSINGELYAPLQHVSAVLPGGAADRAGVRKGDRILEVNHVNVEGATHKQVVDL
IRAGEKELILTVLSVPPHEADNLDPSDDSLGQSFYDYTEKQAVPISVPRYKHVEQNGEKF
VVYNVYMAGRQLCSKRYREFAILHQNLKREFANFTFPRLPGKWPFSLSEQQLDARRRGLE
EYLEKVCSIRVIGESDIMQEFLSESDENYNGVSDVELRVALPDGTTVTVRVKKNSTTDQV
YQAIAAKVGMDSTTVNYFALFEVISHSFVRKLAPNEFPHKLYIQNYTSAVPGTCLTIRKW
LFTTEEEILLNDNDLAVTYFFHQPHLPPLLDGHLVLQILRSFGNFVI

>gi568815597f:151440367_151681704|GENSCAN_predicted_CDS_3|1224_bp
atggcggacgaggacggggaagggattcatccctcagcccctcacaggaacggaggtggc
ggcggcggcggggggtctgggctccactgcgccgggaacggcggcgggggaggcggcggc
ccgcgggtcgtgcgcatcgtcaagtccgagtccggctacggcttcaacgtgcggggccaa
gtgagcgagggcgggcaactgcggagcatcaacggggagctgtacgcgccgctgcagcat
gtgagcgccgtgctgcccgggggggcggccgatcgggccggggtgcgcaagggggaccgc
atcctggaggtgaaccacgtgaatgttgagggggcgacacacaagcaggtggtggacctg
attcgagcaggcgagaaggaattgatcttgacagtgttatctgtacctcctcatgaggca
gataacctagatcccagtgacgactcgttgggacaatcattttatgattacacagaaaag
caagcagtgcccatatcggtccccagatacaaacatgtggagcagaatggtgagaagttt
gtggtatataatgtttacatggcagggaggcagctgtgttctaagcggtaccgggagttt
gctatcctacaccagaacctgaagagagagtttgccaactttacatttcctcgactccca
gggaagtggccattttcattatcagaacaacaattagatgcccgacgtcggggattggaa
gaatatctagaaaaagtgtgttcaatacgagtaattggtgagagtgacatcatgcaggaa
ttcctatcagaatccgatgagaactacaatggtgtgtccgacgtagagctgagagtagca
ttaccagatggaacaacggttacagtcagggttaaaaagaacagtactacagaccaagta
tatcaggctatcgcagcaaaggttggcatggacagtacgacagtgaattactttgcctta
tttgaagtgatcagtcactcctttgtacgtaaattggcacctaatgagtttcctcacaaa
ctctacattcagaattatacatcagctgtgccaggcacctgcttgaccattcgaaagtgg
ctttttacaacagaagaagaaattctcttaaatgacaatgaccttgctgttacctacttc
tttcatcagccacatctaccacctctcctagatggccacctggtgctacagattcttcgg
tcttttgggaatttcgtcatataa