GENSCAN 1.0 Date run: 7-Nov-116 Time: 16:37:40 Sequence gi568815590r:100158559_100388174 : 229616 bp : 39.72% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 3721 3862 142 2 1 72 115 100 0.487 9.59 1.02 Intr + 7256 7415 160 2 1 81 71 177 0.953 14.37 1.03 Intr + 19258 19383 126 0 0 85 102 49 0.879 5.96 1.04 Intr + 24817 24878 62 0 2 79 44 22 0.623 -6.49 1.05 Intr + 25398 25504 107 0 2 51 91 87 0.944 4.24 1.06 Intr + 26070 26175 106 1 1 32 99 139 0.953 7.75 1.07 Intr + 28562 28692 131 2 2 80 60 81 0.688 3.92 1.08 Intr + 32832 32938 107 1 2 46 115 130 0.672 10.51 1.09 Intr + 45198 45329 132 2 0 31 100 145 0.008 9.92 1.10 Intr + 47562 48131 570 2 0 62 67 400 0.575 27.33 1.11 Term + 48288 48554 267 2 0 -13 47 282 0.351 8.51 1.12 PlyA + 50725 50730 6 1.05 2.00 Prom + 51787 51826 40 -7.45 2.01 Init + 54495 54870 376 2 1 53 96 358 0.308 28.64 2.02 Intr + 55261 55360 100 2 1 45 97 61 0.830 0.95 2.03 Intr + 61721 61873 153 2 0 81 91 123 0.788 10.17 2.04 Intr + 66615 66781 167 0 2 46 75 169 0.984 10.08 2.05 Intr + 72598 72734 137 2 2 122 -42 99 0.743 -0.33 2.06 Intr + 74872 74979 108 0 0 6 100 180 0.702 10.56 2.07 Intr + 80682 80846 165 2 0 81 53 88 0.697 3.84 2.08 Intr + 81845 82213 369 1 0 46 115 267 0.975 19.58 2.09 Intr + 82333 82451 119 0 2 99 2 91 0.508 -0.06 2.10 Intr + 84905 84996 92 2 2 75 76 50 0.410 1.22 2.11 Term + 86037 86197 161 1 2 150 32 65 0.517 4.22 2.12 PlyA + 90047 90052 6 1.05 3.09 PlyA - 90968 90963 6 1.05 3.08 Term - 100688 99998 691 1 1 71 47 620 0.996 48.27 3.07 Intr - 101439 101296 144 2 0 54 80 168 0.996 11.08 3.06 Intr - 103197 102984 214 1 1 24 89 123 0.328 2.85 3.05 Intr - 105637 105476 162 2 0 107 97 158 0.918 17.63 3.04 Intr - 106227 106113 115 0 1 46 82 155 0.997 9.80 3.03 Intr - 110420 110227 194 0 2 81 93 100 0.816 8.09 3.02 Intr - 116603 116395 209 1 2 83 68 127 0.985 8.00 3.01 Init - 129523 128943 581 1 2 63 86 192 0.685 11.17 3.00 Prom - 143925 143886 40 -3.45 4.00 Prom + 148675 148714 40 -6.25 4.01 Init + 151645 151706 62 0 2 93 -1 160 0.341 6.37 4.02 Intr + 151981 152077 97 1 1 91 38 112 0.011 5.59 4.03 Intr + 157639 157726 88 0 1 67 84 71 0.367 3.22 4.04 Intr + 158226 158382 157 1 1 61 89 58 0.081 1.35 4.05 Term + 166053 166155 103 0 1 68 36 101 0.022 -0.33 4.06 PlyA + 168027 168032 6 1.05 5.06 PlyA - 168194 168189 6 1.05 5.05 Term - 175725 175572 154 2 1 70 49 138 0.246 4.51 5.04 Intr - 185228 185106 123 1 0 41 46 124 0.047 2.28 5.03 Intr - 191641 191608 34 2 1 68 85 41 0.042 -1.84 5.02 Intr - 196013 195753 261 0 0 76 89 87 0.151 4.14 5.01 Init - 196250 196211 40 2 1 58 86 46 0.156 1.80 5.00 Prom - 208101 208062 40 -3.35 6.00 Prom + 225445 225484 40 -3.35 6.01 Init + 226299 226448 150 2 0 79 89 48 0.759 4.09 6.02 Term + 227922 228062 141 2 0 111 39 218 0.982 16.15 6.03 PlyA + 228209 228214 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815590r:100158559_100388174|GENSCAN_predicted_peptide_1|636_aa XMTTKDYPSLWGFGTTKTFKIPIEHLDFKYIEKCSDVKHLEKILCVLRSGEEGYYPELTE FCEKHLQALAPESRALRKDKPAATAASFTAEEWEKIDGDIKSWVSEIKKEEDKMHFHETE TFPAMKDNLPPVRGSNSCLHVGKEKYSKRPTKKKTPRDYAEWDKFDVEKECLKIDEDYKE KTVIDKSHLSKIETRIDTAGLTEKEKDFLATREKEKGNEAFNSGDYEEAVMYYTRSISAL PTVVAYNNRAQAEIKLQNWNSAFQDCEKVLELEPGNVKALLRRATTYKHQNKLREATEDL SKVLDVEPDNDLAKKQILSLKSAKIALSESLISCRERAEIVEKQTQALIMRVADLQQKKT DGSWSITVDYCKLNQVVTPIAAAVPDMVSLLEQINTSPGTWYAATDLANVFFSIPVHKAH QKQSAVSWQGYQYTFTVLPQGYINSAALCHSLIRKDLDCFLLWQDITLVHYIDDVMLTGP SEQEVANTLDLLVRHLHARGWEINPTKIQGPSTSVKFPGFRWYGACRDIPSKVKQDIPSK VKEDIPSKGPEQEKALPQVQAAVQAALPLGPYDLADPMVLEVSVADRDAVWSLWQAPIDE SQWRPLGFWSKALPSSADNCFPLETALGLLLGFGGN >gi568815590r:100158559_100388174|GENSCAN_predicted_CDS_1|1911_bp nctatgaccaccaaagattatccatcattgtggggctttggaacaacaaaaacattcaaa attcccattgaacatctagatttcaaatacattgaaaaatgttcagatgttaaacatctg gaaaaaattctttgcgtgctcagatctggtgaggaaggatattatcctgaacttacagaa ttttgtgaaaagcatcttcaagccttggcccctgaaagcagagctttgaggaaagataaa ccagcagcaacagcagccagttttacagctgaagaatgggaaaaaattgatggtgatata aagagttgggtatcagaaattaaaaaagaagaagataaaatgcactttcatgaaactgag acatttccagcaatgaaagataatttgcctccagttcgtggttcaaacagctgtcttcat gtaggcaaggaaaaatattctaaaagaccaactaaaaagaaaactccaagggattacgcg gaatgggataaatttgacgtggagaaggaatgtttaaaaattgatgaagattacaaagaa aagacggtaatagacaagtcacacttgtctaaaattgagacaagaatagatacagcaggt ctaactgagaaagaaaaggattttcttgccactcgtgaaaaggagaaaggaaatgaagct ttcaactcaggagattatgaagaagcagtgatgtattataccaggagcatatcagcgctt cccactgtagttgcctataacaatcgagctcaagcagaaatcaaattacagaactggaat agtgcttttcaggattgtgaaaaggtcttggagttagaacctggaaacgtaaaggctctt ctgcgtcgtgctactacatataaacatcaaaacaagctccgggaagctacagaagatttg agtaaagtactagatgttgagcctgataatgatttggccaagaagcagatactgagcctc aaatctgccaagattgccctgagtgagagtcttatctcctgtagagaaagagctgaaatt gtggaaaaacagacacaagctctcatcatgcgagtggctgacctgcaacaaaagaagacc gatggatcttggagcataacagtggattattgtaagcttaaccaagtggtgactccaatt gcagctgctgtaccagatatggtttcattgcttgagcaaattaacacatctcctggtacc tggtatgcagccactgacttggcaaatgtctttttctccattcctgtccataaggcccac cagaagcaatctgccgtcagctggcaaggttaccaatatacctttactgtcctacctcag gggtatatcaactctgcggctttgtgtcatagtcttattcggaaagaccttgattgcttt ttgctttggcaagatatcacactggtacattatattgatgacgttatgctgactggacct agcgagcaagaagtagcaaacacactggacttattggtgagacatttgcatgcaagagga tgggaaataaatccaactaaaattcagggaccttctacctcagtaaaatttcctgggttc cggtggtatggggcctgtcgagatattccttctaaggtgaagcaagatattccttctaag gtgaaggaagatattccttctaagggtccagaacaggagaaggctctgccacaggtccaa gctgctgtgcaagctgctctgccacttgggccatatgacctagcagatccaatggtgctt gaggtgtcagtggcagatagggatgccgtttggagcctttggcaggcccccatagatgaa tcacagtggaggcctctaggattttggagcaaggccctgccatcttctgcagataactgc tttcctcttgagacagctcttggcctgttactgggctttggtggaaactga >gi568815590r:100158559_100388174|GENSCAN_predicted_peptide_2|648_aa MQTLTSRIHFLTEPAEPAGAARAAQPCVMGNIQKKLTGKAEGGKRPARGAPQRGQTPEAG ADKRSPRRASAAAAAGGGATGHPGGGQGAENPAGLKSQGNELFRSGQFAEAAGKYSAAIA LLEPAGSEIADDLSILYSNRAACYLKEGNCSGCIQDCNRALELHPFSMKPLLRRAMAYET LEQYGKAYVDYKTVLQIDCGLQLANDSVNRLSRILMELDGPNWREKLSPIPAVPASVPLQ AWHPAKEMISKQAGDSSSHRQQGITDEKTFKALKEEGNQCVNDKNYKDALSKYSECLKIN NKECAIYTNRQLCQFEEAKQDCDQALQLADGNVKAFYRRALAHKGLKNYQKSLIDLNKVI LLDPSIIEAKMELEEVTRLLNLKDKTAPFNKEKERRKIEIQEVNEGKEEPGRPAGEVSMG CLASEKGGKSSRSPEDPEKLPIAKPNNAYEFGQIINALSTRKDKEACAHLLAITAPKDLP MFLSNKLEGDTFLLLIQSLKNNLIEKDPSLVYQHLLYLSKAERFKMMLTLISKGQKELIE QLFEDLSDTPNNHFTLEDIQALKRQCLTDTVTIHALNGTRCSLKLRLHLSHSSTKGLSSE NNTFSFMSAISPFSSLPSCQPINLRSYSSPKEKSQPFIFPYTIAFKTG >gi568815590r:100158559_100388174|GENSCAN_predicted_CDS_2|1947_bp atgcaaaccctcacttcccgcatccacttcctcacagagcccgcggagccggcgggagcc gcgcgcgccgcccagccgtgcgtcatgggcaacatccagaagaagctgactggcaaagcc gaaggcggcaagcggccggcaaggggcgcgccgcagcggggccagaccccggaggccggc gcggacaagcggagcccacggcgggcctctgcggcggcggcggcgggcggcggcgccacc gggcatccgggcggcgggcagggcgcggagaaccctgccggcctgaagagccagggcaac gagctgttccgaagcgggcagttcgccgaggcggccggcaagtactcggcggcaatcgcg ctcctggagccagcaggaagtgaaattgcagatgatctaagtatcttatattcaaataga gcagcatgttacctaaaagaaggaaactgcagtggctgcattcaagattgtaacagggct ctggaacttcatccattctctatgaaacctcttctgaggcgggcgatggcctatgaaact ctagagcagtatgggaaagcttatgtggattataaaacagtgttgcagatagactgtgga ctccagctagcaaatgacagtgttaacaggctatcaagaattttaatggagctggatgga ccaaattggcgggagaagctgtcacctattcctgctgtgcctgcttctgtgccactgcaa gcttggcatccggcaaaagagatgatctcaaaacaagcaggagactccagcagccatcgc cagcagggcatcacagatgaaaaaacatttaaagcccttaaggaagaaggaaatcaatgt gtaaatgacaaaaactataaagacgccctcagtaaatacagcgaatgcttaaagattaac aataaggaatgtgccatatatacaaacaggcaactgtgccagtttgaagaagcaaagcag gactgtgatcaggcacttcagctagctgatgggaacgtgaaagccttctatagacgagct ctggctcataaaggactcaagaattatcagaaaagcttaattgatctcaataaagttatc ctactagatccaagtattattgaggcaaagatggaactggaagaggtaactagactcctt aatcttaaggataagacagcaccattcaacaaagaaaaggagagaaggaaaattgagatt caagaggtgaatgaaggcaaggaggagcctggaagacctgcaggggaggtctccatggga tgccttgcttctgagaagggaggcaaaagcagcaggtcaccagaagaccctgagaaactt ccgatagccaagcctaataatgcctatgaatttggtcagattataaatgctctcagtacc aggaaggataaagaagcctgtgcacatcttttagccatcactgcaccaaaagatttgccg atgtttttaagtaacaaacttgaaggggatacattccttctcctcattcagtctctgaaa aataatcttattgaaaaagatccctcattggtgtatcagcatcttttatacctgagtaaa gcagaaaggtttaagatgatgttgacactaattagcaagggccaaaaggagctaattgaa cagctgtttgaggacctttcggacacaccaaacaaccattttactttagaagatatacag gccctaaaaaggcaatgtcttacagatactgtaactatacatgccctaaatggaactcgc tgttccctgaaactcaggcttcacttatcccatagttccacaaaaggactttcctctgag aataataccttctctttcatgtctgcaatatctcccttttcatcgctgccatcctgtcag cctataaacctacgcagctattcttcacctaaagaaaagtctcaacctttcatctttccc tataccattgctttcaaaactggataa >gi568815590r:100158559_100388174|GENSCAN_predicted_peptide_3|769_aa MSLHRQMGSDRDLQSSASSVSLPSVKKAPKKRRISIGSLFRRKKDNKRKSRELNGGVDGI ASIESIHSEMCTDKNSIFSTNTSSDNGLTSISKQIGDFIECPLCLLRHSKDRFPDIMTCH HRSCVDCLRQYLRIEISESRVNISCPECTERFNPHDIRLILSDDVLMEKYEEFMLRRWLV ADPDCRWCPAPDCGYAVIAFGCASCPKLTCGREGCGTEFCYHCKQIWHPNQTCDAARQER AQSLRLRTIRSSSISYSQESGAAAVIYIIIFSVSPSGCTFWGKKPWSRKKKILWQLGTLV GAPVGIALIAGIAIPAMIIGIPVYVGRKIHNRYEGKDVSKHKRNLAIAGGVTLSVIVSPV VAAVTVGIGVPIMLAYVYGVVPISLCRSGGCGVSAGNGKGVRIEFDDENDINVGGTNTAV DTTSVAEARHNPSIGEGSVGGLTGSLSASGSHMDRIGAIRDNLSETASTMALAGASITGS LSGSAMVNCFNRLEVQADVQKERYSLSGESGTVSLGTVSDNASTKAMAGSILNSYIPLDK EGNSMEVQVDIESKPSKFRHNSGSSSVDDGSATRSHAGGSSSGLPEGKSSATKWSKEATA GKKSKSGKLRKKGNMKINETREDMDAQLLEQQSTNSSEFEAPSLSDSMPSVADSHSSHFS EFSCSDLESMKTSCSHGSSDYHTRFATVNILPEVENDRLENSPHQCSISVVTQTASCSEV SQLNHIAEEHGNNGIKPNVDLYFGDALKETNNNHSHQTMELKVAIQTEI >gi568815590r:100158559_100388174|GENSCAN_predicted_CDS_3|2310_bp atgagtttacatcggcaaatgggttcagatcgagatcttcagtcctctgcttcatctgtg agcttgccttcagtcaaaaaggcacccaaaaaaagaagaatttcaataggctccctgttt cggaggaaaaaagataacaaacgtaaatcaagggagctaaatggcggggtggatggaatt gcaagtattgaaagtatacattctgaaatgtgtactgataagaactccattttctctaca aatacctcttctgacaatggattaacttccatcagcaaacaaattggagacttcatagag tgccctttgtgccttttgcggcattctaaagacagatttcctgatataatgacttgtcat cacagatcttgtgtggattgcttacgacaatatttaaggatagaaatctctgaaagcaga gttaatattagttgcccagaatgtactgaacggtttaatccccatgatattcgcttgata ttaagtgatgatgtcttgatggaaaaatacgaagaatttatgcttagacggtggcttgtt gcagatcctgattgtaggtggtgtccagctccagactgtggatatgctgtgatagcattt ggatgtgccagctgtccaaaattaacttgtgggcgagagggctgtggaacagagttttgc taccactgtaaacagatttggcaccccaaccagacctgtgatgctgctcgacaagagaga gcccagagcttacgtttgagaactatacgttcttcatccattagttatagtcaagagtct ggagcagcagcagttatttacattataattttttccgttagtccatcaggatgtactttt tgggggaagaaaccctggagccgaaagaagaaaatattgtggcaactgggaacactggtt ggtgctcctgtcggaatcgctttaatagctggcattgctattcctgcaatgattattggc attcctgtgtatgtgggccgcaagattcacaatcgctatgaaggcaaggatgtttcaaag cacaaacggaatttggccatagcaggtggtgtaacgttgtctgtaatcgtgtctccagta gtagctgcagtgactgtaggtatcggtgttcctattatgttagcttatgtctatggcgta gttccaatttctctttgtcgaagcggaggttgtggagtctcagcaggcaatggaaaagga gttaggattgaatttgatgatgaaaatgatataaatgttggtggaactaacacagctgta gacacaacatcagtagcagaagcaagacacaacccaagcataggggagggaagtgttggt gggctgactggcagtttgagtgcaagtggaagccacatggatcgaataggagccatccga gacaacctgagtgaaacggccagcaccatggcactagctggagccagtataacggggagt ctgtcaggaagtgccatggtaaactgttttaacaggttggaagtacaagcagatgtacag aaagaacggtacagtctaagtggagaatctggcacagtcagcttgggaacagttagtgat aatgccagcaccaaagcaatggcaggatccattctgaattcctacatcccattggacaaa gaaggcaacagtatggaggtgcaagtagatattgagtcaaagccatccaaattcaggcac aacagtggaagcagtagtgtggatgatggcagtgccacccgaagtcatgctggcggttca tccagtggcttgcctgaaggtaaatctagtgccaccaagtggtccaaagaagcaacagca gggaaaaaatcaaaaagtggtaaactgaggaaaaagggtaacatgaagataaatgagacg agagaggacatggatgcacagttgttagaacaacaaagcacgaactcaagtgaatttgag gctccatccctcagtgacagtatgccttctgtagcagattctcactctagtcatttttct gaatttagttgttctgacctagaaagcatgaaaacttcttgtagtcatggttccagtgat tatcacacccgctttgctactgttaacattcttcctgaggtagaaaatgaccgtctggaa aattccccacatcagtgtagcatttctgtggttacccaaactgcttcctgttcagaagtt tcacagttgaatcatattgctgaagaacatggtaacaatggaataaaacctaatgttgat ttatattttggcgatgcactaaaagaaacaaataacaaccactcacatcagacaatggaa ttaaaagttgcaattcagactgaaatttag >gi568815590r:100158559_100388174|GENSCAN_predicted_peptide_4|168_aa MASRPPGPLRAALDAAVPLRTVHLLDWQAEGEKCRDVAALGEEMSFPRGRQHPWACGLAG SGVKQQTFAVSVTAHKSSVDPKALGWSVGLAAVEQGVVLLGEAQAAQQLMEWVGGSGMAG CRSGALPRGKAAKGRLLTYICIDFSQTKNFQPEKPKYESTRLLQENSK >gi568815590r:100158559_100388174|GENSCAN_predicted_CDS_4|507_bp atggcgtcacgcccaccgggcccgctgcgggcggctctggacgcggctgttcccttgcgc acggttcacctgctagattggcaggccgagggagagaagtgccgggatgttgcagctctc ggcgaggaaatgtcctttcccagaggacggcagcacccttgggcttgtggtctcgctggc tcaggagtgaagcagcagaccttcgcggtgagtgttacagctcataaaagcagcgtggac ccaaaggcccttgggtggtctgtgggattggctgccgtggagcagggggtggtgctcctg ggggaggctcaggccgcacagcaactcatggagtgggtgggaggctcaggcatggcgggc tgcaggtccggagccctgccccgcgggaaggcagctaagggtcggcttctaacatacatt tgcattgatttttcacaaacgaagaatttccaacctgagaaacctaaatatgaatctacg cgtttgcttcaagaaaattcaaaataa >gi568815590r:100158559_100388174|GENSCAN_predicted_peptide_5|203_aa MVPWSQEDQEQPQASSSLKQKDNAVSTSQGCLKDQMSSTCKALRAVPVLYKYLLFTVTLF GRISLSMVILRALGAASQTVDIDEGNENRREKQAGCCQQAALSGLDDVHSHRQLSLVVLL VVLLNMISIRVMRPMQERLGRGQGSEFTETDERAFKIVEKSKLAHKKWWQNREALCCGYD EKNMLREFGKGQGESQDGEGGTS >gi568815590r:100158559_100388174|GENSCAN_predicted_CDS_5|612_bp atggttccctggagtcaagaggaccaagagcagccccaagcttcctcatccctaaaacag aaggataatgcagtatctacctcacagggctgtttaaaggatcaaatgagttctacgtgc aaagcccttagagcagtgcctgtgctttacaagtatttgctctttactgtaacactgttt ggaaggatatcactgtccatggtgatcttgagggcccttggagcagcttcccaaaccgtg gatattgatgaagggaatgagaatagaagagagaaacaggctgggtgctgccaacaggca gccctcagtggattggatgatgtccactcacatcggcaattgtccttggttgtattgctg gttgtattgctgaacatgatcagcatacgagtgatgaggccaatgcaggagaggcttggc agaggacaagggagtgaattcactgagactgacgagagagctttcaaaatagttgaaaag tcaaaattagcacacaagaaatggtggcagaaccgtgaagcattatgttgtggctatgat gaaaagaacatgctccgagagtttggcaaagggcaaggtgaaagtcaagatggagaaggt ggaacctcctga >gi568815590r:100158559_100388174|GENSCAN_predicted_peptide_6|96_aa MTSLSYITIKDLTRGHSIPAESESRAPCYGASPTKEGKGSLYGAKHLGHVGLLSPARNAM RADLGPATLAFICAETYTNSDEASQPEVGEPPPGQC >gi568815590r:100158559_100388174|GENSCAN_predicted_CDS_6|291_bp atgacatctctgagttatatcactattaaagacctgaccaggggtcatagtatcccagct gagtctgaaagcagggccccctgttatggggcctctcccactaaagagggaaaagggagc ctgtacggggcaaaacatttgggccacgtgggcctcctctcccctgcccgcaatgccatg cgagctgaccttggacctgcgacccttgccttcatctgtgccgagacctacacaaacagt gatgaagcatcgcagccggaggtgggagagcctccaccaggacagtgctag