GENSCAN 1.0 Date run: 5-Nov-116 Time: 02:51:10 Sequence gi568815597f:28453640_28677633 : 223994 bp : 47.28% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 5478 5619 142 2 1 78 100 70 0.701 7.50 1.02 Intr + 12046 12210 165 2 0 122 82 77 0.998 10.43 1.03 Intr + 12743 13129 387 0 0 78 111 120 0.641 8.06 1.04 Intr + 19915 20512 598 2 1 101 84 221 0.724 14.60 1.05 Intr + 22468 22652 185 1 2 -12 52 264 0.870 12.23 1.06 Intr + 26812 26965 154 2 1 50 82 96 0.248 4.33 1.07 Intr + 38049 38148 100 0 1 58 77 99 0.790 5.91 1.08 Intr + 42681 42768 88 2 1 70 77 14 0.379 -1.96 1.09 Term + 42895 42923 29 2 2 98 54 64 0.525 2.24 1.10 PlyA + 44022 44027 6 1.05 2.00 Prom + 50803 50842 40 -3.96 2.01 Init + 64517 64669 153 1 0 64 100 74 0.130 4.21 2.02 Intr + 76219 76300 82 0 1 50 110 96 0.240 7.21 2.03 Intr + 78164 78351 188 0 2 76 96 103 0.981 9.31 2.04 Intr + 78532 78711 180 0 0 31 90 232 0.999 17.76 2.05 Intr + 81411 81507 97 2 1 135 94 131 0.999 17.98 2.06 Intr + 81619 81741 123 2 0 102 123 137 0.999 19.16 2.07 Intr + 82232 82387 156 0 0 129 85 194 0.999 23.18 2.08 Intr + 82623 82742 120 1 0 104 97 56 0.996 8.57 2.09 Intr + 83108 83260 153 0 0 132 101 110 0.999 16.74 2.10 Term + 84193 84368 176 2 2 125 45 188 0.999 16.12 2.11 PlyA + 84563 84568 6 1.05 3.00 Prom + 99163 99202 40 -3.76 3.01 Init + 99472 99498 27 0 0 76 86 40 0.576 0.97 3.02 Intr + 100001 100098 98 1 2 101 106 173 0.979 19.21 3.03 Intr + 106994 107093 100 2 1 102 67 27 0.991 2.11 3.04 Intr + 107707 107759 53 0 2 91 96 67 0.993 5.51 3.05 Intr + 111064 111195 132 1 0 119 69 153 0.999 16.26 3.06 Intr + 113655 113774 120 0 0 70 75 120 0.994 8.51 3.07 Intr + 117537 117699 163 0 1 79 91 241 0.999 23.48 3.08 Intr + 118228 118261 34 0 1 99 116 59 0.999 7.60 3.09 Term + 123861 123997 137 1 2 128 42 196 0.999 17.18 3.10 PlyA + 124782 124787 6 -0.45 4.05 PlyA - 124908 124903 6 1.05 4.04 Term - 125041 124992 50 2 2 91 43 12 0.674 -5.53 4.03 Intr - 125948 125894 55 2 1 106 106 52 0.851 7.35 4.02 Intr - 126342 126273 70 2 1 107 98 29 0.823 4.98 4.01 Init - 127513 127455 59 1 2 58 94 37 0.563 2.08 4.00 Prom - 127593 127554 40 -7.26 5.00 Prom + 132101 132140 40 -0.36 5.01 Init + 138873 139105 233 2 2 111 80 400 0.985 39.23 5.02 Term + 140055 140478 424 0 1 134 41 508 0.954 45.47 5.03 PlyA + 141785 141790 6 1.05 6.07 PlyA - 142528 142523 6 1.05 6.06 Term - 149935 149900 36 1 0 120 41 39 0.652 -0.26 6.05 Intr - 151821 151733 89 1 2 104 94 71 0.909 8.99 6.04 Intr - 159722 159608 115 2 1 111 77 163 0.994 17.52 6.03 Intr - 164391 164314 78 0 0 48 94 123 0.994 8.65 6.02 Intr - 168526 168332 195 1 0 82 48 183 0.663 13.31 6.01 Init - 170378 170307 72 2 0 66 87 42 0.683 2.97 6.00 Prom - 171578 171539 40 -2.56 7.06 PlyA - 175138 175133 6 1.05 7.05 Term - 189331 189114 218 2 2 43 55 157 0.195 5.21 7.04 Intr - 200972 200363 610 2 1 97 55 130 0.002 2.59 7.03 Intr - 202575 202400 176 1 2 4 55 115 0.235 -0.54 7.02 Intr - 203115 202981 135 1 0 61 36 90 0.070 1.64 7.01 Intr - 215718 215342 377 2 2 76 -9 176 0.016 1.36 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 201217 201056 162 1 0 72 52 146 0.911 7.34 S.002 Init - 203230 202981 250 1 1 45 36 143 0.821 2.43 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597f:28453640_28677633|GENSCAN_predicted_peptide_1|615_aa MVLDSVEAGDTTPPTKRKSKFSGFGKIFKPWKWRKKKSSDKFKETSEGGEDPGKPSDAML KNGHTTPIGNARSSSPVQVEEEPVRLASLRKAIPEEDLKKRLGSTGSQPNSEAESVPENV PKPPLLPPKRPLSSSHEASEGQAKDATSSGGTARFIISTSITTAPAATTAATSLAKTVNL SVTPSPAPRTLPAAPASTNTTATPSLTHMVPAKQPPIPPPKPAHRNSNPVIAELSQAINS GTLLSKPSPPLPPKRGIPSTSVPTLESAAAITTKTPSDEREKSTCSMGSELLPMISPRSP SPPLPTHIPPEPPRTPPFPAKTFQVVPEIEFPPSLDLHQEIPQQEDQKKEVPKRILDQNF GEPHIPSRLPPLPLHIRIQQALTSPLPMTPILEGSHRAHSLLFENSDSFSEDSSTLGRTR SLPITIEMLKVPDDEEEEEQTCPSTFSEEMTPTSVIPKLPQCLREEEEKESDSDSEGPIQ YRDEEDEDESYQSALANKVKRKDTLAMKLNHRPSEPELNLNSWPCKSKEEWNEIRHQIGN TLIRKILRFNEYVEVTDAQDYDRRADKPWTKLTPADKCWDYRCEPLRPAEKSFKKEADEE YQMQQKATIVHDAKG >gi568815597f:28453640_28677633|GENSCAN_predicted_CDS_1|1848_bp atggtcctggacagtgtggaagcaggagacacaacacctcctaccaaaaggaagagcaag ttctcaggctttggcaagatcttcaagccctggaaatggaggaaaaaaaaaagtagtgat aaatttaaagagacttcagaaggtggtgaggatccaggaaagccaagcgatgccatgtta aagaatggccataccacccccatagggaatgccagatcatctagtccagtccaagtagag gaagagccagtaagattagcaagtcttaggaaagctattccagaagaggacctaaagaaa cgactaggctcaactggaagccagcctaattctgaagcagagtctgttcctgagaatgta cccaaaccacctttacttcctcccaaaagacccttgtcctcttctcatgaagcaagtgaa gggcaagcaaaggatgccacttcctctggcggcacggcaaggttcatcatctccacctcc atcaccacagcacccgctgccaccactgctgccacaagccttgcaaagactgttaatctc tctgtcaccccttccccagcacccaggactctgcctgctgctcctgccagcactaacact actgctaccccaagcctcactcatatggtccctgccaagcagccccctatccctccccct aaaccagctcacagaaatagcaaccctgtcattgctgaactgtcccaagcaataaacagt ggtacattgttatcaaaaccgtccccacccttaccacctaagagaggcattccatcaacc tcagtacccaccttggagtctgctgctgccatcaccacaaaaacaccaagtgatgaaaga gagaagagcacgtgttctatgggctcggaactactaccaatgatctcacctcgctctccg tcccccccactgcctactcatatacctccagagcctccacgcacccctccattccctgct aagacttttcaagttgtgccagaaattgagtttccaccatccttagatctacaccaggag attccccagcaggaagatcagaaaaaggaagtccccaagaggatactggaccagaacttt ggggagccccatataccctctaggctgcctccactcccactgcatattcgaatccagcag gccctcaccagcccacttcccatgactcctattctggagggttctcacagagctcattcg ttgctttttgaaaacagtgacagcttttctgaggacagcagtacgctgggtcggaccagg tctcttcccatcactattgaaatgctaaaagttccagacgatgaagaagaagaggagcaa acctgtccatccacattcagtgaagaaatgacacctacctcagtcattcctaaattacca cagtgtctacgggaggaagaagagaaggagagcgactctgattcagaaggtcccattcag taccgagatgaagaagatgaagatgaaagctatcagagtgctctcgccaacaaagtgaag aggaaagacacactggcaatgaagttgaaccacagacccagtgaaccagagttgaacctg aattcttggccttgtaaaagcaaggaggagtggaatgaaatacggcaccagattggaaac acactgatccggaagattctgaggtttaatgaatatgtagaggtaacagatgctcaagat tatgaccggcgagccgacaaaccttggaccaaactgacccctgctgacaagtgctgggat tacaggtgtgaaccattgcgcccagccgagaagagttttaagaaggaagcagatgaggag tatcagatgcaacagaaagctaccatcgtccatgatgccaaaggttga >gi568815597f:28453640_28677633|GENSCAN_predicted_peptide_2|475_aa MQILGWPPGAAGLFVFLSHPGRGQRPAERMLWGSSRPGGERRSSLWPTCTKDRKMSPKRI AKRRSPPADAIPKSKKVKVSHRSHSTEPGLVLTLGQGDVGQLGLGENVMERKKPALVSIP EDVVQAEAGGMHTVCLSKSGQVYSFGCNDEGALGRDTSVEGSEMVPGKVELQEKVVQVSA GDSHTAALTDDGRVFLWGSFRDNNGVIGLLEPMKKSMVPVQVQLDVPVVKVASGNDHLVM LTADGDLYTLGCGEQGQLGRVPELFANRGGRQGLERLLVPKCVMLKSRGSRGHVRFQDAF CGAYFTFAISHEGHVYGFGLSNYHQLGTPGTESCFIPQNLTSFKNSTKSWVGFSGGQHHT VCMDSEGKAYSLGRAEYGRLGLGEGAEEKSIPTLISRLPAVSSVACGASVGYAVTKDGRV FAWGMGTNYQLGTGQDEDAWSPVEMMGKQLENRVVLSVSSGGQHTVLLVKDKEQS >gi568815597f:28453640_28677633|GENSCAN_predicted_CDS_2|1428_bp atgcagatcctgggatggccgccaggggccgccgggctctttgttttcctttctcacccg ggtcggggccagaggcctgcagagcgcatgctctggggcagttcgcggcccggcggggag cgccggagttccttgtggccgacgtgcaccaaggacaggaagatgtcacccaagcgcata gctaaaagaaggtcccccccagcagatgccatccccaaaagcaagaaggtgaaggtctca cacaggtcccacagcacagaacccggcttggtgctgacactaggccagggcgacgtgggc cagctggggctgggtgagaatgtgatggagaggaagaagccggccctggtatccattccg gaggatgttgtgcaggctgaggctgggggcatgcacaccgtgtgtctaagcaaaagtggc caggtctattccttcggctgcaatgatgagggtgccctgggaagggacacatcagtggag ggctcggagatggtccctgggaaagtggagctgcaagagaaggtggtacaggtgtcagca ggagacagtcacacagcagccctcaccgatgatggccgtgtcttcctctggggctccttc cgggacaataacggtgtgattggactgttggagcccatgaagaagagcatggtgcctgtg caggtgcagctggatgtgcctgtggtaaaggtggcctcaggaaacgaccacttggtgatg ctgacagctgatggtgacctctacaccttgggctgcggggaacagggccagctaggccgt gtgcctgagttatttgccaaccgtggtggccggcaaggcctcgaacgactcctggtcccc aagtgtgtgatgctgaaatccaggggaagccggggccacgtgagattccaggatgccttt tgtggtgcctatttcacctttgccatctcccatgagggccacgtgtacggcttcggcctc tccaactaccatcagcttggaactccgggcacagaatcttgcttcataccccagaaccta acatccttcaagaattccaccaagtcctgggtgggcttctctggtggccagcaccataca gtctgcatggattcggaaggaaaagcatacagcctgggccgggctgagtatgggcggctg ggccttggagagggtgctgaggagaagagcatacccaccctcatctccaggctgcctgct gtctcctcggtggcttgtggggcctctgtggggtatgctgtgaccaaggatggtcgtgtt ttcgcctggggcatgggcaccaactaccagctgggcacagggcaggatgaggacgcctgg agccctgtggagatgatgggcaaacagctggagaaccgtgtggtcttatctgtgtccagc gggggccagcatacagtcttattagtcaaggacaaagaacagagctga >gi568815597f:28453640_28677633|GENSCAN_predicted_peptide_3|287_aa MAASLWMGDLEPYMDENFISRAFATMGETVMSVKIIRNRLTGIPAGYCFVEFADLATAEK CLHKINGKPLPGATPAKRFKLNYATYGKQPDNSPEYSLFVGDLTPDVDDGMLYEFFVKVY PSCRGGKVVLDQTGVSKGYGFVKFTDELEQKRALTECQGAVGLGSKPVRLSVAIPKASRV KPVEYSQMYSYSYNQYYQQYQNYYAQWGYDQNTGSYSYSYPQYGYTQSTMQTYEEVGDDA LEDPMPQLDVTEANKEFMEQSEELYDALMDCHWQPLDTVSSEIPAMM >gi568815597f:28453640_28677633|GENSCAN_predicted_CDS_3|864_bp atggcggccagcctgtggatgggcgacctggaaccctacatggatgagaacttcatctcc agagcctttgccaccatgggggagaccgtaatgagcgtcaaaattatccgaaaccgcctc actgggatcccagctggctactgctttgtagaatttgcagatttggccacagctgagaag tgtttgcataaaattaatgggaaaccccttccaggagccacacctgcgaaacgttttaaa ctgaactatgccacttacgggaaacaaccagataacagccctgagtattccctctttgtg ggggacctgaccccggacgtggatgatggcatgctgtatgaattcttcgtcaaagtctac ccctcctgtcggggaggcaaggtggttttggaccagacaggcgtgtctaagggttatggt tttgtgaaattcacagatgaactggaacagaagcgagccctgacggagtgccagggagca gtgggactggggtctaagcctgtgcggctgagcgtggcaatccctaaagcgagccgtgta aagccagtggaatatagtcagatgtacagttatagctacaaccagtattatcagcagtac cagaactactatgctcagtggggctatgaccagaacacaggcagctacagctacagttac ccccagtatggctatacccagagcaccatgcagacatatgaagaagttggagatgatgca ttggaagaccccatgccacagctggatgtgactgaggccaacaaggagttcatggaacag agtgaggagctgtatgacgctctgatggactgtcactggcagcccctggacacagtgtct tcagagatccctgccatgatgtag >gi568815597f:28453640_28677633|GENSCAN_predicted_peptide_4|77_aa MTDLVYRDPGVLKQVVMMDRLPPSFLAVCDTEEVSLFLELCFKIHVTCKAVLICDYGPME LGQSLWEAEGKDPGHFR >gi568815597f:28453640_28677633|GENSCAN_predicted_CDS_4|234_bp atgactgacttagtctacagagatcccggcgtacttaagcaggtagtaatgatggacagg ttgcccccaagtttcctggcagtgtgtgatactgaggaggtgagcttgtttctggagctg tgctttaagattcatgttacatgtaaagctgtcctcatttgtgactatggacctatggag ttgggacaatctctatgggaagcagaaggcaaggaccccggtcattttaggtag >gi568815597f:28453640_28677633|GENSCAN_predicted_peptide_5|218_aa MEAEGCRYQFRVALLGDAAVGKTSLLRSYVAGAPGAPEPEPEPEPTVGAECYRRALQLRA GPRVKLQLWDTAGHERFRCITRSFYRNVVGVLLVFDVTNRKSFEHIQDWHQEVMATQGPD KVIFLLVGHKSDLQSTRCVSAQEAEELAASLGMAFVETSVKNNCNVDLAFDTLADAIQQA LQQGDIKLEEGWGGVRLIHKTQIPRSPSRKQHSGPCQC >gi568815597f:28453640_28677633|GENSCAN_predicted_CDS_5|657_bp atggaggccgagggctgccgctaccaatttcgggtcgcgctgctgggggacgcggcggtg ggcaagacgtcgctgctgcggagctacgtggcaggcgcgcctggcgccccggagccggag cccgagcccgagcccacggtgggcgccgagtgctaccgccgcgcgctgcagctgcgggcc gggccgcgggtcaagctgcaactctgggacaccgcgggccacgagcgcttcaggtgcatc accaggtccttttaccggaatgtggtgggtgtcctgctggtctttgatgtgacaaacagg aagtcctttgaacacatccaagactggcaccaggaggtcatggccactcagggcccggac aaggtcatcttcctgctggttggccacaagagtgacctgcagagcacccgctgtgtctca gcccaggaggccgaggagctagctgcctccctgggcatggccttcgtggagacctcggtt aaaaacaactgcaatgtggacctggcctttgacaccctcgctgatgctatccagcaggcc ctgcagcagggggacatcaagctagaagagggctgggggggtgtccggctcatccacaag acccaaatccccaggtcccccagcaggaagcagcactcaggcccatgccagtgttga >gi568815597f:28453640_28677633|GENSCAN_predicted_peptide_6|194_aa MAASHFTGLTAVADVIKDLDTQIALIGLGPHSSKKKQDLDKLYELKSKARQIMNQFGPSA LINLSNFSSIKPEPASTPPQGSMANSTAVVLTKKKLQDLVREVDPNEQLDEDVEEMLLQI ADDFIESVVTAACQLARHRKSSTLEVKDVQLHLERQWNMWIPGFGSEEIRPYKKACTTEA HKQRMALIRKTTKK >gi568815597f:28453640_28677633|GENSCAN_predicted_CDS_6|585_bp atggctgcctctcatttcaccgggctcacagctgttgctgatgtaattaaagatctagac actcagatagctttaattggccttggtcctcacagctccaaaaagaaacaggatctcgat aagctctatgagctgaagtccaaagctcggcagattatgaaccagtttggcccctcagcc ctaatcaacctctccaatttctcatccataaaaccggaaccagccagcacccctccacaa ggctccatggccaatagtactgcagtggtattgaccaagaagaaattacaggacttagta agagaagtggatcctaatgagcagttggatgaagatgtggaggagatgctgctgcagatt gctgatgattttatcgagagtgtggtgacagcagcctgtcagcttgcgcggcatcgcaag tctagcaccctggaggtgaaagatgtccagctgcatttagagcgccagtggaacatgtgg atcccaggatttggctctgaagaaatccgaccctacaaaaaagcttgcaccacagaagct cacaaacagagaatggcattgatccggaaaacaaccaagaaataa >gi568815597f:28453640_28677633|GENSCAN_predicted_peptide_7|505_aa XTQASNQLLIQGPGAPRTPPLGIPQAPRTPRWGPGRLALSVRSHSGPSWPLSRVLPGPSV RGRRGPPSGGLAGSRRPGLGPEPRRRGRLRWGRPGGSGGPSGSVPPPPRGAATSARSPAA AAARTGPTDLANAFFSIPVHKAHERQFAFSLQGRQYTFTILPQGYINSHKAVTRKTASFE WSPKQDKALQQVQAVVQAALPFGPYDPADPMVLEVSVADRVAVWCLWQAPKGQRFILAGI DTYSKYGFAYLACNASAKTIIRGLTECLIHHHGIPHSTASDHGTDFTAKQVQQWALAHGI HLSYHVPHYPEAAGLIEQRNGLLKSQLQRQLGDNSLQGWGKVLQKAVYALNQHPIYGTVS PITRIQGSRNQEVEMEVAPLTITLSDPLANFLLPVPMALCSADLEVLVLEGGMLPPGGTR IPLNWKLRLPPRPVEEQEREEGVRTQTYSGSVFEGPGCGRDGAVRIQALGKGLTERIRLG EDAAVGTRALRRDLGNGDGCEFSGI >gi568815597f:28453640_28677633|GENSCAN_predicted_CDS_7|1518_bp nngacccaggcctccaaccagctcctcatccaagggcccggtgctccccggaccccgcct ttggggataccccaggctccccggacccctcgatggggtcctgggcgcctggccctctca gtccggagtcactccggcccctcctggcccctgagccgagtacttccgggcccctcggtc cgggggcgccggggcccccccagtggtggactcgccggttcccgccgccctggtctgggg cccgagccccggcggcgcggccggctgcgctgggggcggcccggcggctcgggcggaccc agcggcagcgtccccccacccccccgcggcgccgcgacgtccgctcgctccccggcggcg gcggcggcacgcacagggcctactgatttggcaaatgcctttttctccattcctgtccac aaggcccacgagaggcaattcgccttcagcttgcaaggccggcaatatactttcactatc ttacctcaggggtatatcaactctcacaaagctgtgacccgaaagactgccagttttgag tggagtccaaaacaggacaaggctctgcaacaggtccaggctgttgtgcaagctgctctg ccatttggaccatatgacccagcagatccaatggtgcttgaggtgtcagtggcagatagg gttgctgtttggtgcctttggcaggcccccaaagggcagaggtttatcctcgctggaata gacacttactctaaatatgggtttgcctatcttgcatgcaatgcttctgccaagactatc atccgtggacttacagaatgccttatccaccatcacggtattccacacagcactgcctct gaccacggcactgactttacagctaaacaagtgcaacagtgggctcttgctcatggaatt cacttgtcttaccatgttccccattatcctgaagcagctggattgatagaacagcggaat ggccttttgaagtcacaattacaacgccaactaggtgacaattctttgcagggctgggga aaagttctccagaaggctgtgtatgctctgaatcagcatccaatatatggtactgtttct cccataaccaggattcaagggtccaggaatcaagaggtggaaatggaagtggctccactc accatcacccttagtgatccactggcaaacttcttgcttcctgttcccatggcattatgt tctgctgacctagaggtcttagttctagagggaggaatgctgccaccaggaggcacaagg attccattaaactggaagttaagattgcctcctaggccagttgaggaacaagagcgggag gagggagtcaggacccagacctacagtgggtcagtgttcgaaggaccaggttgtggacga gacggggctgtgaggattcaggctcttgggaagggactgactgagaggatcaggctcgga gaagacgcggctgtggggacccgggctctaagacgggacttagggaacggggacggatgc gagttctccggcatctga