GENSCAN 1.0 Date run: 2-Nov-116 Time: 23:24:01 Sequence gi568815596r:159671854_159898198 : 226345 bp : 37.93% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 PlyA - 447 442 6 1.05 1.02 Term - 15541 15522 20 2 2 114 42 14 0.356 -3.10 1.01 Init - 40689 40461 229 0 1 97 107 188 0.929 18.67 1.00 Prom - 53678 53639 40 -2.65 2.00 Prom + 55023 55062 40 -5.95 2.01 Init + 59228 59317 90 1 0 82 64 67 0.344 2.24 2.02 Intr + 66061 66228 168 0 0 58 63 66 0.266 0.32 2.03 Intr + 66341 66505 165 1 0 11 101 146 0.689 7.44 2.04 Intr + 71208 71400 193 2 1 43 110 80 0.644 3.84 2.05 Intr + 75952 77050 1099 2 1 51 84 421 0.600 26.06 2.06 Intr + 80549 80718 170 2 2 56 99 120 0.963 8.57 2.07 Intr + 87373 87505 133 2 1 72 111 102 0.820 9.68 2.08 Term + 90486 90549 64 0 1 103 42 10 0.362 -5.72 2.09 PlyA + 90834 90839 6 1.05 3.06 PlyA - 91251 91246 6 1.05 3.05 Term - 100200 99998 203 1 2 119 38 88 0.911 3.47 3.04 Intr - 108325 108152 174 2 0 75 94 148 0.921 13.09 3.03 Intr - 109145 109029 117 0 0 59 115 64 0.193 5.72 3.02 Intr - 111616 111506 111 2 0 93 53 59 0.164 2.33 3.01 Init - 126345 126279 67 0 1 87 102 273 0.880 27.89 3.00 Prom - 128936 128897 40 -7.45 4.07 PlyA - 129040 129035 6 1.05 4.06 Term - 133369 133191 179 2 2 102 32 77 0.982 0.37 4.05 Intr - 135287 135120 168 0 0 69 106 81 0.988 7.00 4.04 Intr - 136718 136596 123 0 0 81 111 68 0.990 8.04 4.03 Intr - 138822 138673 150 1 0 45 101 77 0.937 3.91 4.02 Intr - 143720 143552 169 2 1 71 64 89 0.971 3.40 4.01 Init - 145072 144953 120 1 0 52 95 85 0.612 5.84 4.00 Prom - 147497 147458 40 -5.55 5.00 Prom + 148263 148302 40 -4.75 5.01 Init + 151130 151353 224 1 2 88 72 177 0.380 14.18 5.02 Intr + 151603 151850 248 1 2 -48 77 180 0.041 -0.52 5.03 Term + 154234 155102 869 2 2 73 37 219 0.121 7.14 5.04 PlyA + 156250 156255 6 1.05 6.19 PlyA - 156954 156949 6 1.05 6.18 Term - 159513 159296 218 1 2 17 49 101 0.760 -4.58 6.17 Intr - 159933 159817 117 1 0 90 100 62 0.922 7.12 6.16 Intr - 163792 163627 166 1 1 112 47 118 0.527 8.71 6.15 Intr - 169102 168876 227 2 2 54 95 173 0.590 11.38 6.14 Intr - 170521 170392 130 1 1 51 92 71 0.585 3.15 6.13 Intr - 178287 178127 161 1 2 66 101 63 0.467 4.19 6.12 Intr - 178614 178509 106 0 1 93 79 35 0.206 1.97 6.11 Intr - 182682 182543 140 1 2 125 -28 141 0.098 5.46 6.10 Intr - 183086 183051 36 0 0 106 109 46 0.741 5.82 6.09 Intr - 186623 186509 115 2 1 97 115 -10 0.685 1.70 6.08 Intr - 189036 188968 69 0 0 131 86 72 0.883 9.76 6.07 Intr - 203790 203591 200 1 2 49 98 101 0.931 5.35 6.06 Intr - 206640 206471 170 2 2 78 109 66 0.990 6.37 6.05 Intr - 209387 209126 262 0 1 56 116 116 0.881 6.82 6.04 Intr - 210462 210271 192 1 0 58 110 188 0.957 16.54 6.03 Intr - 213440 213300 141 0 0 57 70 83 0.839 2.80 6.02 Intr - 218524 218360 165 2 0 77 71 107 0.974 6.91 6.01 Intr - 222231 222061 171 1 0 81 84 109 0.973 8.79 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 182682 182498 185 1 2 125 42 144 0.814 10.22 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596r:159671854_159898198|GENSCAN_predicted_peptide_1|82_aa MAPAPLPDPAQNQPPPLSAPPCSSSPHFADFLASRESGGAAGTESCGRAGGTPPRGGALT AGTVEELPPGSQRGATEVWGKD >gi568815596r:159671854_159898198|GENSCAN_predicted_CDS_1|249_bp atggcgcccgcccctctcccggatccggcgcagaaccagccaccaccgttaagtgcccct ccctgctcctcctccccccacttcgcggacttcctggcttctcgcgagagtggcggcgcg gcggggaccgagagctgcgggcgggctggaggtaccccaccgcggggtggtgctctgact gcgggaaccgttgaagagctgcccccggggagtcagcgaggcgccactgaggtctgggga aaggattga >gi568815596r:159671854_159898198|GENSCAN_predicted_peptide_2|693_aa MGFRYVAQAGLTLLGSTILPTFPPRVLGLQLEASVAGGAFAQVLLGPAGCVLPTRTGRLC STRAISSDPTPAKGEPDMDRQGVHGQAAASVGTEERSGTQKLGDTRNYRAPKRLSQPWFR ELLGLDSPKGQSSSVLVAGSMSTSASASASPFQSAWYSESEITQGARSRSQNQQRDHDSK RPKLSCTNCTTSAGRNVGNGLNTLSDVQDRVPSYSQGARPKENSMSTLQLNTSSTNHQLP SEHQTILSSRDSRNSLRSNFSSRESESSRSNTQPGFSYSSSRDEAPIISNSERVVSSQRP FQESSDNEGRRTTRRLLSRIASSMSSTFFSRRSSQDSLNTRSLNSENSYVSPRILTASQS RSNVPSASEVPDNRASEASQGFRFLRRRWGLSSLSHNHSSESDSENFNQESEGRNTGPWL SSSLRNRCTPLFSRRRREGRDESSRIPTSDTSSRSHIFRRESNEVVHLEAQNDPLGAAAN RPQASAASSSATTGGSTSDSAQGGRNTGISGILPGSLFRFAVPPALGSNLTDNVMITVDI IPSGWNSADGKSDKTKSAPSRDPERLQKIKESLLLEDSEEEEGDLCRICQMAAASSSNLL IEPCKCTGSLQYVHQDCMKKWLQAKINSGSSLEAVTTCELCKEKLELNLEDFDIHELHRA HANEQVSIFCLIWLTQQNSFELKNCFYQLQISV >gi568815596r:159671854_159898198|GENSCAN_predicted_CDS_2|2082_bp atggggtttcgctatgttgcccaggcaggtctcacactcctgggctcaaccattctgccc accttccctcccagagtgctgggattacagctagaagcctctgtggctggtggtgccttt gcccaggttttgctcgggcctgctgggtgtgttctgcctactcggactggcaggctttgt tcgactcgtgctatcagctcggatcccacgcctgccaagggtgagccagacatggatcgg cagggggtgcatgggcaagcagcagcttctgtgggcactgaggaacgcagtggcacccaa aagcttggagacacaaggaactacagagccccaaagaggttgtcacagccctggttcagg gagctcctaggtctggactccccaaagggtcagagctcttccgtccttgtcgccggcagc atgtctacatcagcatcagcatctgcgtcaccatttcaatctgcatggtatagtgaatct gagataactcagggagcacgctcaagatcgcagaaccagcaacgggatcatgattcaaaa agacctaaactttcctgtacaaactgtactacctcagctgggagaaatgttggaaatggt ttaaacacattatcagatgttcaagacagagttccttcatattcacaaggagcaagacca aaagaaaactcaatgagcactttacagttgaatacatcatccacaaaccaccaattgcct tctgaacatcagaccatactaagttctagggactccagaaattctttaagatcaaatttt tcttcaagagaatcagaatcttcccgaagcaatacgcagcctggattttcttacagttca agtagagatgaagccccaatcataagcaattcagaaagggttgtttcatctcaaagacca tttcaagaatcttctgacaatgaaggtaggcggacaacgaggagattgctgtcacgcata gcttctagcatgtcatctacttttttttcacgaagatctagtcaggattccttgaataca agatcattgaattctgaaaattcttacgtttctccaagaatcttgacagcttcacagtcc cgtagtaatgtaccatcagcttctgaagttcccgataatagggcatctgaagcttctcag ggatttcgatttcttaggcgaagatggggtttgtcatctcttagccacaatcatagctct gagtcagattcagaaaattttaaccaagaatctgaaggtagaaatacaggaccatggtta tcttcctcacttagaaatagatgcacacctttgttctctagaaggaggcgagagggaaga gatgaatcttcaaggatacctacctctgatacatcatctagatctcatatttttagaaga gaatcaaatgaagtggttcaccttgaagcacagaatgatcctcttggagctgctgccaac agaccacaagcatctgcagcatcaagcagtgccacaacaggtggctctacatcagattcg gctcaaggtggaagaaatacaggaatatcagggattcttcctggttccttattccggttt gcagtccctccagcacttgggagtaatttgaccgacaatgtcatgatcacagtagatatt attccttcaggttggaattcagctgatggtaaaagtgataaaactaaaagtgcgccttca agagatccagaaagattgcagaaaataaaagagagcctccttttagaggactcagaagaa gaagaaggtgacttatgtagaatttgtcaaatggcagctgcatcatcatctaatttgctg atagagccatgcaagtgcacaggaagtttgcagtatgtccaccaagactgtatgaaaaag tggttacaggccaaaattaactctggttcttcattagaagctgtaaccacctgtgaacta tgtaaagagaagttggagcttaacctggaggattttgatattcatgaactacatagagct catgcaaatgaacaagttagtatattttgcctaatttggttaacacaacagaacagtttt gaattgaagaactgtttttaccaactacaaatttctgtgtga >gi568815596r:159671854_159898198|GENSCAN_predicted_peptide_3|223_aa MLRAALPALLLPLLGLAAAAVADCPSSTWIQFQDSCYIFLQEAIKVESIEDVRNQCTDHG ADMISIHNEEENAFILDTLKKQWKGPDDILLGMFYDTDDASFKWFDNSNMTFDKWTDQDD DEDLVDTCAFLHIKTGEWKKGNCEVSSVEGTLCKTANNHILISALVIASTVILTVLGAII WFLYKKHSDSRFTTVFSTAPQSPYNEDCVLVVGEENEYPVQFD >gi568815596r:159671854_159898198|GENSCAN_predicted_CDS_3|672_bp atgctccgggccgcgctgcccgcgctcctgctgccgttgctgggcctcgccgctgctgcc gtcgcggactgtccttcatctacttggattcagttccaagacagttgttacatttttctc caagaagccatcaaagtagaaagcatagaggatgtcagaaatcagtgtactgaccatgga gcggacatgataagcatacataatgaagaagaaaatgcttttatactggatactttgaaa aagcaatggaaaggcccagatgatatcctactaggcatgttttatgacacagatgatgcg agtttcaagtggtttgataattcaaatatgacatttgataagtggacagaccaagatgat gatgaggatttagttgacacctgtgcttttctgcacatcaagacaggtgaatggaaaaaa ggaaattgtgaagtttcttctgtggaaggaacactatgcaaaacagctaataaccacatt ttaatatcagcattggtgattgctagcacggtaattttgacagttttgggagcaatcatt tggttcctgtacaaaaaacattctgattctcgtttcaccacagttttttcaaccgcaccc caatcaccttataatgaagactgtgttttggtagttggagaagaaaatgaatatcctgtt caatttgactaa >gi568815596r:159671854_159898198|GENSCAN_predicted_peptide_4|302_aa MCSQSGGHLASVHNQNGQLFLEDIVKRDGFPLWVGLSSHDGSESSFEWSDGSTFDYIPWK GQTSPGNCVLLDPKGTWKHEKCNSVKDGAICYKPTKSKKLSRLTYSSRCPAAKENGSRWI QYKGHCYKSDQALHSFSEAKKLCSKHDHSATIVSIKDEDENKFVSRLMRENNNITMRVWL GLSQHSVDQSWSWLDGSEVTFVKWENKSKSGVGRCSMLIASNETWKKVECEHGFGRVVCK VPLGPDYTAIAIIVATLSILVLMGGLIWFLFQRHRLHLAGFSSVRYAQGVNEDEIMLPSF HD >gi568815596r:159671854_159898198|GENSCAN_predicted_CDS_4|909_bp atgtgttctcaaagtggaggtcacttggcaagcgttcacaaccaaaatggccagctcttt ctggaagatattgtaaaacgtgatggatttccactatgggttgggctctcaagtcatgat ggaagtgaatcaagttttgaatggtctgatggtagtacatttgactatatcccatggaaa ggccaaacatctcctggaaattgtgttctcttggatccaaaaggaacttggaaacatgaa aaatgcaactctgttaaggatggtgctatttgttataaacctacaaaatctaaaaagctg tcccgtcttacatattcatcaagatgtccagcagcaaaagagaatgggtcacggtggatc cagtacaagggtcactgttacaagtctgatcaggcattgcacagtttttcagaggccaaa aaattgtgttcaaaacatgatcactctgcaactatcgtttccataaaagatgaagatgag aataaatttgtgagcagactgatgagggaaaataataacattaccatgagagtttggctt ggattatctcaacattctgttgaccagtcttggagttggttagatggatcagaagtgaca tttgtcaaatgggaaaataaaagtaagagtggtgttggaagatgtagcatgttgatagct tcaaatgaaacttggaaaaaagttgaatgtgaacatggttttggaagagttgtctgcaaa gtgcctctgggccctgattacacagcaatagctatcatagttgccacactaagtatctta gttctcatgggcggactgatttggttcctcttccaaaggcaccgtttgcacctggcgggt ttctcatcagttcgatatgcacaaggagtgaatgaagatgagattatgcttccttctttc catgactaa >gi568815596r:159671854_159898198|GENSCAN_predicted_peptide_5|446_aa MGRNQCKKAENSKNQNTSSPPKDHNSSPASEQNWMEKEFDELTEVGFRRWVITNCSELKE HVLTQCKEAKNLEKRPNLRLTGVPESDEENETKLENTLQDIIWENFTNLARQANTEIQKT QRTTQRYSSRGATPKHIIIKFTKVEMKEKMLRTAREKVLEVLARAIWQEKEMKGIQMGRG EVKLSLFADDMIVYLESPITSAQILLKLIRNLSKVSRYKINVQKSQAFLYTNNRQIESQI MNELPFTTVTKRIKYLGIQLTWDVKDLFKENYKPLLKEIKEDTNKWKNIPCSWIGRINIV KMAILPKVIYRFNAIPMKLPLTFFTELEKTTLNFIWNQKRARIAKTILDKKNKDGGIMLP DFKLYYEATVTKTSWYWYQNRDIDQWNRTEASEITPHIYNHLIFDKSDTNKHWVKYSLFN KWCWENWLAICRKLKLDPPSSHLIQK >gi568815596r:159671854_159898198|GENSCAN_predicted_CDS_5|1341_bp atggggagaaaccagtgcaaaaaggctgaaaattccaaaaaccagaacacttcttctcct ccaaaggatcacaactcttcaccagctagcgaacaaaactggatggagaaagagtttgat gaattgacagaagtgggcttcagaaggtgggtaattacaaactgctccgagctaaaggag catgttctaacccaatgcaaggaagctaagaaccttgaaaaaagaccaaatctacgtttg actggtgtacctgaaagtgatgaggagaatgaaaccaagttggaaaacacacttcaggat attatctgggagaacttcaccaacctagcaagacaggccaacactgaaattcagaagaca cagagaacaacccaaagatactcctcgagaggagcaaccccaaaacacataatcatcaaa tttaccaaggttgaaatgaaggaaaaaatgttaagaacagccagagagaaagtattggaa gttctggccagggcaatctggcaagagaaagaaatgaagggcattcaaatgggaagaggg gaagtcaaattgtctctgtttgcagatgacatgattgtatatttagaaagccccatcacc tcagcccaaattctccttaagctgataaggaacctcagcaaagtctcaagatacaaaatc aatgtgcaaaaatcacaagcattcctgtacaccaataacagacaaatagagagccaaatc atgaatgaactcccattcacaactgttacaaagagaataaaatacctaggaatacaactt acctgggatgtgaaggacctcttcaaggagaactacaaaccactgctcaaggaaataaaa gaggacacaaacaaatggaaaaacattccatgctcatggataggtagaatcaatatcgtg aaaatggccatactgcccaaagtaatttatagattcaatgctatccccatgaagctacca ctgactttcttcacagaactggaaaaaactactttaaacttcatatggaaccaaaaaaga gcccggatagccaagacaatcctggacaagaagaacaaagatggaggcatcatgctacct gacttcaaactatactatgaggctacagtaaccaaaacatcatggtactggtaccaaaac agagatatagaccaatggaacagaacagaggcctcagaaataacaccacacatctacaac catctgatctttgacaaatctgacacaaacaagcactgggtaaaatattccctatttaat aaatggtgttgggaaaactggctagccatatgcagaaaactgaaactggaccccccttcc tcacaccttatacaaaaataa >gi568815596r:159671854_159898198|GENSCAN_predicted_peptide_6|928_aa XIYTRDGNSYGRPCEFPFLIDGTWHHDCILDEDHSGPWCATTLNYEYDRKWGICLKPENG CEDNWEKNEQFGSCYQFNTQTALSWKEAYVSCQNQGADLLSINSAAELTYLKDRPSAPTI GGSSCARMDAESGLWQSFSCEAQLPYVCRKPLNNTVELTDVWTYSDTRCDAGWLPNNGFC YLLVNESNSWDKAHAKCKAFSSDLISIHSLADVEVVVTKLHNEDIKEEVWIGLKNINIPT LFQWSDGTEVTLTYWDENEPNVPYNKTPNCVSYLGEVRNLLWFLYILPVLPNDNIKTSLV GKFAYKLENKRFEQEYLNDLMKKYDKSLRKYFWTGLRDVDSCGEYNWATVGGRRRAVTFS NWNFLEPASPGGCVAMSTGKSVGKWEVKDCRSFKALSICKKMSGPLGPEEASPKPDDPCP EGWQSFPASLSCYKVSTIIMPNEFQQDYDIRDCAAVKVFHRPWRRGWHFYDDREFIYLRP FACDTKLEWVCQIPKGRTPKTPDWYNPDRAGIHGPPLIIEGSEYWFVADLHLNYEEAVLY CASNHSFLATITSFCFLKIKPVSLTFSQASDTCHSYGGTLPSVLSQIEQDFITSLLPDME ATLWIGLRWTAYEKINKWTDNRELTYSNFHPLLVSGRLRIPENFFEEESRYHCALILNLQ KSPFTGTWNFTSCSERHFVSLCQKYSEVKSRQTLQNASETVKYLNNLYKIIPKTLTWHSA KRECLKSNMQLVSITDPYQQAFLSVQALLHNSSLWIGLFSQDDELNFGWSDGKRLHFSRW AETNGQLEDCVVLDTDGFWKTVDCNDNQPGAICYYSGNPKSHILSIRDEKENNFVLEQLL YFNYMASWVMLGITYRSFTESMAGQPQETYNHVRGEWETSRFYYGGAGERVKEEVLHTFE QPELIRTHSLSREQQGGKSPPTRPLLQD >gi568815596r:159671854_159898198|GENSCAN_predicted_CDS_6|2787_bp nagatctataccagagatgggaactcttatgggagaccttgtgaatttccattcttaatt gatgggacctggcatcatgattgcattcttgatgaagatcatagtgggccatggtgtgcc accaccttaaattatgaatatgaccgaaagtggggcatctgcttaaagcctgaaaacggt tgtgaagataattgggaaaagaacgagcagtttggaagttgctaccaatttaatactcag acggctctttcttggaaagaagcttatgtttcatgtcagaatcaaggagctgatttactg agcatcaacagtgctgctgaattaacttaccttaaagacaggcccagtgcacctactata ggtggctccagctgtgcaagaatggatgctgagtctggtctgtggcagagcttttcctgt gaagctcaactgccctatgtctgcaggaaaccattaaataatacagtggagttaacagat gtctggacatactcagatacccgctgtgatgcaggctggctgccaaataatggattttgc tatctgctggtaaatgaaagtaattcctgggataaggcacatgcgaaatgcaaagccttc agtagtgacctaatcagcattcattctctagcagatgtggaggtggttgtcacaaaactc cataatgaggatatcaaagaagaagtgtggataggccttaagaacataaacataccaact ttatttcagtggtcagatggtactgaagttactctaacatattgggatgagaatgagcca aatgttccctacaataagacgcccaactgtgtttcctacttaggagaggtacgaaacctc ctgtggtttctttatattcttcctgttttacctaatgataatattaagacttctttggtt gggaagtttgcgtataagctggaaaataaaagatttgagcaagaatacctaaatgatttg atgaaaaagtatgataaatctctaagaaaatacttctggactggcctgagagatgtagat tcttgtggagagtataactgggcaactgttggtggaagaaggcgggctgtaaccttttcc aactggaattttcttgagccagcttccccgggcggctgcgtggctatgtctactggaaag tctgttggaaagtgggaggtgaaggactgcagaagcttcaaagcactttcaatttgcaag aaaatgagtggaccccttgggcctgaagaagcatcccctaagcctgatgacccctgtcct gaaggctggcagagtttccccgcaagtctttcttgttataaggtgtctactattatcatg ccaaatgagtttcagcaggattatgacatcagagactgtgctgctgtcaaggtatttcat aggccatggcgaagaggctggcatttctatgatgatagagaatttatttatttgaggcct tttgcttgtgatacaaaacttgaatgggtgtgccaaattccaaaaggccgtactccaaaa acaccagactggtacaatccagaccgtgctggaattcatggacctccacttataattgaa ggaagtgaatattggtttgttgctgatcttcacctaaactatgaagaagccgtcctgtac tgtgccagcaatcacagctttcttgcaactataacatctttttgttttctaaagatcaaa cccgtgtctctcacattttctcaagcaagcgatacctgtcactcctatggtggcaccctt ccttcagtgttgagccagattgaacaagactttattacatccttgcttccggatatggaa gctactttatggattggtttgcgctggactgcctatgaaaagataaacaaatggacagat aacagagagctgacgtacagtaactttcacccattattggttagtgggaggctgagaata ccagaaaatttttttgaggaagagtctcgctaccactgtgccctaatactcaacctccaa aaatcaccgtttactgggacgtggaattttacatcctgcagtgaacgccactttgtgtct ctctgtcagaaatattcagaagttaaaagcagacagacgttgcagaatgcttcagaaact gtaaagtatctaaataatctgtacaaaataatcccaaagactctgacttggcacagtgct aaaagggagtgtctgaaaagtaacatgcagctggtgagcatcacggacccttaccagcag gcattcctcagtgtgcaggcgctccttcacaactcttccttatggatcggactcttcagt caagatgatgaactcaactttggttggtcagatgggaaacgtcttcattttagtcgctgg gctgaaactaatgggcaactcgaagactgtgtagtattagacactgatggattctggaaa acagttgattgcaatgacaatcaaccaggtgctatttgctactattcaggaaatccaaaa tcacatattctgagtattcgagatgaaaaggagaataactttgttcttgagcaactgctg tacttcaattatatggcttcatgggtcatgttaggaataacttatagaagctttacagaa agcatggctgggcagcctcaggaaacttacaatcatgtcagaggcgaatgggaaacaagc aggttttactatggtggagcaggagagagagtgaaggaggaggtgctacacacttttgaa caaccagaactcattagaactcactcactatcacgagaacagcaagggggaaaatcacct cccaccaggcccctcctccaagactga