GENSCAN 1.0 Date run: 3-Nov-116 Time: 10:54:55 Sequence gi568815597f:151418520_151637343 : 218824 bp : 44.65% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.10 Intr - 5032 4878 155 0 2 81 94 113 0.619 10.92 1.09 Intr - 5767 5430 338 1 2 67 123 199 0.761 15.62 1.08 Intr - 6542 6436 107 0 2 96 75 33 0.653 2.73 1.07 Intr - 9495 9304 192 1 0 94 82 152 0.892 14.66 1.06 Intr - 9894 9604 291 1 0 46 82 182 0.589 10.51 1.05 Intr - 11192 11084 109 2 1 60 110 -1 0.973 -0.84 1.04 Intr - 12322 12147 176 2 2 100 109 85 0.916 11.46 1.03 Intr - 22567 22409 159 2 0 118 115 85 0.997 14.16 1.02 Intr - 23718 23562 157 0 1 44 92 188 0.037 14.38 1.01 Init - 40529 40431 99 2 0 69 33 166 0.936 7.46 1.00 Prom - 47598 47559 40 -4.06 2.00 Prom + 47827 47866 40 -3.16 2.01 Init + 61505 61530 26 1 2 81 78 19 0.021 -0.64 2.02 Intr + 84352 84661 310 1 1 116 31 99 0.088 3.32 2.03 Intr + 92559 92682 124 2 1 -10 36 199 0.044 5.06 2.04 Intr + 99987 100873 887 1 2 119 71 802 0.125 72.64 2.05 Intr + 101647 101747 101 0 2 102 84 104 0.999 10.31 2.06 Intr + 101895 101964 70 0 1 116 89 23 0.713 4.38 2.07 Intr + 102077 102172 96 1 0 81 116 110 0.989 13.31 2.08 Intr + 104915 105042 128 1 2 103 64 134 0.993 11.98 2.09 Intr + 105707 105839 133 2 1 36 97 182 0.990 14.55 2.10 Intr + 106155 106385 231 2 0 97 1 278 0.944 17.97 2.11 Intr + 107123 107271 149 1 2 103 55 132 0.999 10.43 2.12 Intr + 108456 108588 133 0 1 63 63 197 0.979 15.35 2.13 Intr + 110831 111049 219 1 0 107 -12 223 0.814 12.70 2.14 Intr + 111390 111596 207 2 0 78 78 309 0.996 28.17 2.15 Intr + 111970 112227 258 0 0 98 59 383 0.972 34.06 2.16 Intr + 113883 114053 171 2 0 75 60 195 0.995 15.54 2.17 Intr + 115456 115617 162 0 0 57 78 200 0.980 16.07 2.18 Intr + 116523 116612 90 2 0 77 91 144 0.998 13.79 2.19 Intr + 117081 117206 126 2 0 73 105 11 0.502 2.18 2.20 Intr + 117261 117379 119 2 2 84 46 105 0.981 5.16 2.21 Intr + 117718 117826 109 1 1 76 94 209 0.999 20.59 2.22 Intr + 118211 118374 164 1 2 37 60 288 0.968 19.67 2.23 Intr + 121697 121907 211 2 1 50 95 111 0.306 6.92 2.24 Intr + 124059 124113 55 2 1 58 78 40 0.111 -1.45 2.25 Intr + 138919 139244 326 2 2 34 47 356 0.003 21.59 2.26 Intr + 144066 144167 102 2 0 102 121 68 0.976 11.87 2.27 Intr + 145385 145471 87 1 0 89 64 87 0.980 6.57 2.28 Intr + 146006 146095 90 1 0 115 90 35 0.989 6.59 2.29 Intr + 147644 147709 66 1 0 95 92 31 0.907 3.30 2.30 Intr + 151138 151251 114 0 0 93 78 103 0.999 10.44 2.31 Intr + 155751 155879 129 2 0 106 43 113 0.811 9.49 2.32 Intr + 156392 156486 95 1 2 55 78 119 0.954 6.36 2.33 Intr + 160202 160307 106 2 1 111 111 162 0.869 21.02 2.34 Intr + 161130 161213 84 2 0 68 85 141 0.958 11.82 2.35 Intr + 162423 162523 101 2 2 59 109 91 0.993 7.21 2.36 Term + 163125 163188 64 0 1 79 49 83 0.792 0.86 2.37 PlyA + 165037 165042 6 1.05 3.00 Prom + 177120 177159 40 -3.36 3.01 Init + 193683 193993 311 2 2 100 59 549 0.432 49.99 3.02 Intr + 194390 194492 103 2 1 72 32 63 0.143 -0.72 3.03 Term + 209352 209369 18 2 0 133 39 22 0.166 0.12 3.04 PlyA + 209914 209919 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 23685 23562 124 0 1 64 92 201 0.909 18.43 S.002 Init + 100019 100873 855 1 0 107 71 799 0.863 74.74 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597f:151418520_151637343|GENSCAN_predicted_peptide_1|595_aa MGPVRAGVGAGGRRCAPPPTASGGAARGRGGRGAHILFECFSYSMADTDLFMECEEEELE PWQKISDVIEDSVVEDYNSVDKTTTVSVSQQPVSAPVPIAAHASVAGHLSTSTTVSSSGA QNSDSTKKTLVTLIANNNAGNPLVQQGGQPLILTQNPAPGLGTMVTQPVLRPVQVMQNAN HVTSSPVASQPIFITTQGFPVRNVRPVQNAMNQVGIVLNVQQGQTVRPITLVPAPGTQFV KPTVGVPQVFSQMTPVRPGSTMPVRPTTNTFTTVIPATLTIRSTVPQSQSQQTKSTPSTS TTPTATQPTSLGQLAVQSPGQSNQTTNPKLVSIASFVTVKRPGVTGENSNEVAKLVNTLN TIPSLGQSPGPVVVSNNSSAHGSQRTSGPESSMKVTSSIPVFDLQDGGRKICPRCNAQFR VTEALRGHMCYCCPEMVEYQKKGKSLDSEPSVPSAAKPPSPEKTAPVASTPSSTPIPALS PPTKVPEPNENVGDAVQTKLIMLVDDFYYGRDGGKVAQLTNFPKVATSFRCPHCTKRLKN NIRFMNHMKHHVELDQQNGEVDGHTICQHCYRQFSTPFQLQCHLENVHSPYESTX >gi568815597f:151418520_151637343|GENSCAN_predicted_CDS_1|1785_bp atggggccggtacgcgcgggggtgggggcggggggccggcggtgcgcgcccccgcccacc gcgagtggcggcgcggcccgcggcaggggcggccgcggggcccatatcttatttgaatgt ttttcttatagcatggcggacaccgacctgttcatggaatgtgaggaggaggagttggag ccatggcagaaaatcagtgatgtcattgaggactctgtagttgaagattataattcagtg gataaaactaccacagtttctgtgagccagcagccagtctcggctccagtgcccatcgct gcccatgcttctgttgctgggcacctctctacatccaccaccgttagtagcagcggggca cagaacagcgacagtacaaagaagactcttgtcacactaattgccaacaacaatgctggc aatcctttggtccagcaaggtggacagccactcatcctgacccagaatccagccccaggt ctgggcacaatggttactcaaccagtattgaggcctgttcaggtcatgcagaatgccaat catgtgactagttcccctgtggcctcacaaccaatatttatcactacgcagggatttcct gtaaggaatgtccggcctgtacaaaatgcaatgaatcaggttgggattgtgctgaacgta cagcaaggccaaacggttagaccaattacactagttccagccccaggtacccagtttgtt aagccgacagttggagttccacaagtgttctcccagatgacccctgtgaggccaggctcc acaatgcctgtgaggcccaccaccaacaccttcaccaccgtcatcccggccactcttacc attcgaagcaccgtcccacagtcccagtcccagcagaccaagtccactcccagcacttct accactcccactgccacacagccaacctcactggggcaactagctgttcagtctccaggc cagtcaaaccagaccacgaatcccaagctagtgagcattgccagctttgtcactgtgaag cgacctggtgttacaggcgaaaatagcaatgaagtggccaaattggtgaatacccttaac accatcccttccctgggccagagtcctgggccagtggtggtgtccaacaacagctctgct catggctctcaaagaaccagcggacctgagtcttcaatgaaagtgacctcttccatccca gtatttgacctccaggatggtggacggaaaatatgtccacgatgtaatgctcaatttcgt gttactgaagctttgagaggtcacatgtgttactgttgcccagaaatggttgaataccag aagaaaggaaagtccctggattcagaacccagtgtcccatcagcagcaaagcccccatcc cctgagaaaacagctcctgttgcttccacaccctcttctacacctattcctgctctgtca ccgcctaccaaagtaccagaaccaaatgagaacgtgggcgatgccgtccagaccaaactc attatgcttgtagatgacttctactatggacgggatggtggcaaagtagcccagctcaca aatttccctaaggtcgccacatctttccgatgcccacattgtaccaaaaggctaaaaaac aatattcgattcatgaaccatatgaaacaccacgtagaactcgatcagcagaacggtgag gtagatggtcacactatctgccagcactgttaccgccagttttccactcccttccagctt cagtgccacttggaaaatgttcatagtccctatgaatctactann >gi568815597f:151418520_151637343|GENSCAN_predicted_peptide_2|1880_aa MVVCAYSTRPLLEWSPESVQVVWGADTETKLGVQEAYWLATPVKDKEGGRNGQRKTLDRL PYPDKVKEGRSKIGQEKPQAVVQILPSGSLPSGKHQGKGYHIEKSTVGRNGQSPEDALAD GSRDLGRRSLATIRDAPKMASAPLAAPPRRSAARLLLFMEQAPNMAEPRGPVDHGVQIRF ITEPVSGAEMGTLRRGGRRPAKDARASTYGVAVRVQGIAGQPFVVLNSGEKGGDSFGVQI KGANDQGASGALSSDLELPENPYSQVKGFPAPSQSSTSDEEPGAYWNGKLLRSHSQASLA GPGPVDPSNRSNSMLELAPKVASPGSTIDTAPLSSVDSLINKFDSQLGGQARGRTGRRTR MLPPEQRKRSKSLDSRLPRDTFEERERQSTNHWTSSTKYDNHVGTSKQPAQSQNLSPLSG FSRSRQTQDWVLQSFEEPRRSAQDPTMLQFKSTPDLLRDQQEAAPPGSVDHMKATIYGIL REGSSESETSVRRKVSLVLEKMQPLVMVSSGSTKAVAGQGELTRKVEELQRKLDEEVKKR QKLEPSQVGLERQLEEKTEECSRLQELLERRKGEAQQSNKELQNMKRLLDQGEDLRHGLE TQVMELQNKLKHVQGPEPAKEVLLKDLLETRELLEEVLEGKQRVEEQLRLRERELTALKG ALKEEVASRDQEVEHVRQQYQRDTEQLRRSMQDATQACDKSRDHAVLEAERQKMSALVRG LQRELEETSEETGHWQSMFQKNKEDLRATKQELLQLRMEKEEMEEELGEKIEVLQRELEQ ARASAGDTRQVEVLKKELLRTQEELKELQAERQSQEVAGRHRDRELEKQLAVLRVEADRG RELEEQNLQLQKTLQQLRQDCEEASKARGAKMVAEAEATVLGQRRAAVETTLRETQEEND EFRRRILGLEQQLKETRGLVDGGEAVEARLRDKLQRLEAEKQQLEEALNASQEEEGSLAA AKRALEARLEEAQRGLARLGQEQQTLNRALEEEGKQREVLRRGKAELEEQKRLLDRTVDR LNKELEKIGEDSKQALQQLQAQLEDYKEKARREVADAQRQAKDWASEAEKTSGGLSRLQD EIQRLRQALQASQAERDTARLDKELLAQRLQGLEQEAENKKRSQDDRARQLKGLEEKVSR LETELDEEKNTVELLTDRVNRGRDQVDQLRTELMQERSARQDLECDKISLERQVMGEGRI LRDGPQENKDLKTRLASSEGFQKPSASLSQLESQNQLLQERLQAEEREKTVLQSTNRKLE RKVKELSIQIEDERQHVNDQKDQLSLRVKALKRQVDEAEEEIERLDGLRKKAQREVEEQH EVNEQLQARIKSLEKDSCSLRNRELGARPIFSPAPDGPIQGGGSARARPVGARQRGGQVA CVLRPRGKMNGTRNWCTLVDVHPEDQAAKKLLRENNGQGIQMVSIIATEMLMPKNRIAIY ELLFKEGVMVAKKDVHMPKHRELADKDVPNLHVMKAMQPLKSRGYVKEQFAWRHFHWYVT NEGIQYLRDYLHLPRRLCLPLYAAAVQRLEVLSLKAGRKTYAMVSSHSAGHSLASELVES HDGHEEIIKVYLKGRSGDKMIHEKNINQLKSEVQYIQEARNCLQKLREDISSKLDRNLGD SLHRQEIQVVLEKPNGFSQSPTALYSSPPEVDTCINEDVESLRKTVQDLLAKLQEAKRQH QSDCVAFEVTLSRYQREAEQSNVALQREEDRVEQKEAEVGELQRRLLGMETEHQALLAKV REGEVALEELRSNNADCQAEREKAATLEKEVAGLREKIHHLDDMLKSQQRKVRQMIEQLQ NSKAVIQSKDATIQELKEKIAYLEAENLEMHDRMEHLIEKQISHGNFSTQARAKTENPGS IRISKPPSPKPMPVIRVVET >gi568815597f:151418520_151637343|GENSCAN_predicted_CDS_2|5643_bp atggtggtgtgcgcctatagtaccagaccacttctggagtggtcacctgagtcagttcag gtggtctggggagcagatactgagacaaagttaggagtgcaagaggcttactggctggca acacctgtgaaagataaagaaggaggcaggaatgggcagagaaagactttggataggctt ccctatcctgataaagtaaaggaaggcagaagcaaaatcgggcaagaaaaacctcaagct gtggtgcagatccttccaagtggcagtctacccagtggaaagcaccagggcaaaggttat cacatagagaaatcaacagtgggcagaaacggccagagcccagaagatgctctggccgac ggctcccgggatcttggccggcggtcactcgcgaccatccgcgacgcccccaaaatggcc tccgcgcccctcgccgccccgccccgacgctccgcagcccgactcctcctatttatggag caggcacccaacatggctgagccccggggccccgtagaccatggagtccagattcgcttc atcacagagccagtgagtggtgcagagatgggcactctacgtcgaggtggacgacgccca gctaaggatgcaagagccagtacctacggggttgctgtgcgtgtgcagggaatcgctggg cagccctttgtggtgctcaacagtggggagaaaggcggtgactcctttggggtccaaatc aagggggccaatgaccaaggggcctcaggagctctgagctcagatttggaactccctgag aacccctactctcaggtcaagggatttcctgccccctcgcagagcagcacatctgatgag gagcctggggcctactggaatggaaagctactccgttcccactcccaggcctcactggca ggccctggcccagtggatcctagtaacagaagcaacagcatgctggagctagccccgaaa gtggcttccccaggtagcaccattgacactgctcccctgtcttcagtggactcactcatc aacaagtttgacagtcaacttggaggccaggcccggggtcggactggccgccgaacacgg atgctaccccctgaacagcgcaaacggagcaagagcctggacagccgcctcccacgggac acctttgaggaacgggagcgccagtccaccaaccactggacctctagcacaaaatatgac aaccatgtgggcacttcgaagcagccagcccagagccagaacctgagtcctctcagtggc tttagccgttctcgtcagactcaggactgggtccttcagagttttgaggagccgcggagg agtgcacaggaccccaccatgctgcagttcaaatcaactccagacctccttcgagaccag caggaggcagccccaccaggcagtgtggaccatatgaaggccaccatctatggcatcctg agggagggaagctcagaaagtgaaacctctgtgaggaggaaggttagtttggtgctggag aagatgcagcctctagtgatggtttcttctggttctactaaggccgtggcagggcagggt gagcttacccgaaaagtggaggagctacagcgaaagctggatgaagaggtgaagaagcgg cagaagctagagccatcccaagttgggctggagcggcagctggaggagaaaacagaagag tgcagccgactgcaggagctgctggagaggaggaagggggaggcccagcagagcaacaag gagctccagaacatgaagcgcctcttggaccagggtgaagatttacgacatgggctggag acccaggtgatggagctgcagaacaagctgaaacatgtccagggtcctgagcctgctaag gaggtgttactgaaggacctgttagagacccgggaacttctggaagaggtcttggagggg aaacagcgagtagaggagcagctgaggctgcgggagcgggagttgacagccctgaagggg gccctgaaagaggaggtagcctcccgtgaccaggaggtggaacatgtccggcagcagtac cagcgagacacagagcagctccgcaggagcatgcaagatgcaacccaggcatgtgacaag agcagggaccatgcagtgctggaggccgagaggcagaagatgtcagcccttgtgcgaggg ctgcagagggagctggaggagacttcagaggagacagggcattggcagagtatgttccag aagaacaaggaggatcttagagccaccaagcaggaactcctgcagctgcgaatggagaag gaggagatggaagaggagcttggagagaagatagaggtcttgcagagggaattagagcag gcccgagctagtgctggagatactcgccaggttgaggtgctcaagaaggagctgctccgg acacaggaggagcttaaggaactgcaggcagaacggcagagccaggaggtggctgggcga caccgggaccgggagttggagaagcagctggcggtcctgagggtcgaggctgatcgaggt cgggagctggaagaacagaacctccagctacaaaagaccctccagcaactgcgacaggac tgtgaagaggcttccaaggcaaggggagctaagatggtggccgaggcagaggcaacagtg ctggggcagcggcgggccgcagtggagacgacgcttcgggagacccaggaggaaaatgac gaattccgccggcgcatcctgggtttggagcagcagctgaaggagactcgaggtctggtg gatggtggggaagcggtggaggcacgactacgggacaagctgcagcggctggaggcagag aaacagcagctggaggaggccctgaatgcgtcccaggaagaggaggggagtctggcagca gccaagcgggcactggaggcacgcctagaggaggctcagcgggggctggcccgcctgggg caggagcagcagacactgaaccgggccctggaggaggaagggaagcagcgggaggtgctc cggcgaggcaaggctgagctggaggagcagaagcgtttgctggacaggactgtggaccga ctgaacaaggagttggagaagatcggggaggactctaagcaagccctgcagcagctccag gcccagctggaggattataaggaaaaggcccggcgggaggtggcagatgcccagcgccag gccaaggattgggccagtgaggctgagaagacctctggaggactgagccgacttcaggat gagatccagaggctgcggcaggccctgcaggcatcccaggctgagcgggacacagcccgg ctggacaaagagctactggcccagcgactgcaggggctggagcaagaggcagagaacaag aagcgttcccaggacgacagggcccggcagctgaagggtctcgaggaaaaagtctcacgg ctggaaacagagttagatgaggagaagaacaccgtggagctgctaacagatcgggtgaat cgtggccgggaccaggtggatcagctgaggacagagctcatgcaggaaaggtctgctcgg caggacctggagtgtgacaaaatctccttggagagacaggtgatgggggaggggaggatt cttagggatggaccccaggagaacaaggacctgaagacccggttggccagctcagaaggc ttccagaagcctagtgccagcctctctcagcttgagtcccagaatcagttgttgcaggag cggctacaggctgaagagagggagaagacagttctgcagtctaccaatcgaaaactggag cggaaagttaaagaactatccatccagattgaagacgagcggcagcatgtcaatgaccag aaagaccagctaagcctgagggtgaaggctttgaagcgtcaggtggatgaagcagaagag gaaattgagcgactggacggcctgaggaagaaggcccagcgtgaggtggaggagcagcat gaggtcaatgaacagctccaggcccggatcaagtctctggagaaggactcctgttctctg cgaaaccgcgaactgggggcgcggcctatcttcagccccgcccctgatgggcctatacaa gggggcggttccgcgcgcgcccgcccagttggagccagacagcggggtggacaagtggcg tgtgtgctgcgaccccgagggaagatgaacgggacgcggaactggtgtaccctggtggac gtgcacccagaggaccaggcggcgaaaaaattactaagagaaaataatggacaaggaatt cagatggtctccatcatagccaccgagatgttgatgcctaagaaccggattgccatttat gaactcctttttaaggagggagtcatggtggccaagaaggatgtccacatgcctaagcac cgggagctggcagacaaggatgtgcccaatcttcatgtcatgaaggccatgcagcctctc aagtcccgaggctacgtgaaggaacagtttgcctggagacatttccactggtacgttacc aatgagggcatccagtatctccgtgattaccttcatctgccccggagattgtgcctgcca ctctatgctgcagccgtccagagactggaagtcctcagcctaaaggcgggcaggaagacc tatgccatggtgtccagccactcagctggtcattctctggcttcagaactggtggagtcc catgatggacatgaggagatcattaaggtgtacttgaaggggaggtctggagacaagatg attcacgagaagaatattaaccagctgaagagtgaggtccagtacatccaggaggccagg aactgcctacagaagctccgggaggatataagtagcaagcttgacaggaacctaggagat tctctccatcgacaggagatacaggtggtgctagaaaagccaaatggctttagtcagagt cccacagccctgtacagcagcccacctgaggtggacacctgtataaatgaggatgttgag agcttgaggaagacggtgcaggacttgctggccaagcttcaggaggccaagcggcaacac cagtcagactgtgtggcttttgaggtcacactcagccggtaccagagggaagcagaacaa agtaatgtggcccttcagagagaggaggacagagtggagcagaaagaggcagaagtcgga gagctgcagaggcgcttgctagggatggagacggagcatcaggccttactggcgaaagtg agggaaggggaggtggccctagaggaacttcggagcaacaatgctgactgccaagcagaa cgagaaaaggctgctaccctggaaaaggaagtggccgggttgcgggagaagatccaccac ttggatgacatgctcaagagccagcagcggaaagtccggcaaatgatagagcagctccag aattcaaaagctgtgatccagtcaaaggacgccaccatccaggagctcaaggagaaaatc gcctatctggaggcagagaatttagagatgcatgaccggatggaacacctgatagaaaaa caaatcagtcatggcaacttcagcacccaggcccgggccaagacagagaacccgggcagt attaggatatccaagccgcctagcccgaagcccatgcctgtcatccgagtggtggaaacc tga >gi568815597f:151418520_151637343|GENSCAN_predicted_peptide_3|143_aa MADEDGEGIHPSAPHRNGGGGGGGGSGLHCAGNGGGGGGGPRVVRIVKSESGYGFNVRGQ VSEGGQLRSINGELYAPLQHVSAVLPGGAADRAGVRKGDRILEVQYRVPPKFSSSASQFR SPPLPSAPVDCPFCPPDQAPGNH >gi568815597f:151418520_151637343|GENSCAN_predicted_CDS_3|432_bp atggcggacgaggacggggaagggattcatccctcagcccctcacaggaacggaggtggc ggcggcggcggggggtctgggctccactgcgccgggaacggcggcgggggaggcggcggc ccgcgggtcgtgcgcatcgtcaagtccgagtccggctacggcttcaacgtgcggggccaa gtgagcgagggcgggcaactgcggagcatcaacggggagctgtacgcgccgctgcagcat gtgagcgccgtgctgcccgggggggcggccgatcgggccggggtgcgcaagggggaccgc atcctggaggtccagtaccgtgtcccccccaagttctcatcttcagcatcgcagtttcgt tctccaccactcccctccgcccccgtggactgccccttctgccctccagaccaagcccct ggtaaccactga