GENSCAN 1.0 Date run: 5-Nov-116 Time: 14:36:43 Sequence gi568815597r:28505372_28722081 : 216710 bp : 47.64% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 12785 12937 153 1 0 64 100 74 0.132 4.21 1.02 Intr + 24487 24568 82 0 1 50 110 96 0.240 7.21 1.03 Intr + 26432 26619 188 0 2 76 96 103 0.981 9.31 1.04 Intr + 26800 26979 180 0 0 31 90 232 0.999 17.76 1.05 Intr + 29679 29775 97 2 1 135 94 131 0.999 17.98 1.06 Intr + 29887 30009 123 2 0 102 123 137 0.999 19.16 1.07 Intr + 30500 30655 156 0 0 129 85 194 0.999 23.18 1.08 Intr + 30891 31010 120 1 0 104 97 56 0.996 8.57 1.09 Intr + 31376 31528 153 0 0 132 101 110 0.999 16.74 1.10 Term + 32461 32636 176 2 2 125 45 188 0.999 16.12 1.11 PlyA + 32831 32836 6 1.05 2.00 Prom + 47431 47470 40 -3.76 2.01 Init + 47740 47766 27 0 0 76 86 40 0.576 0.97 2.02 Intr + 48269 48366 98 1 2 101 106 173 0.979 19.21 2.03 Intr + 55262 55361 100 2 1 102 67 27 0.991 2.11 2.04 Intr + 55975 56027 53 0 2 91 96 67 0.993 5.51 2.05 Intr + 59332 59463 132 1 0 119 69 153 0.999 16.26 2.06 Intr + 61923 62042 120 0 0 70 75 120 0.994 8.51 2.07 Intr + 65805 65967 163 0 1 79 91 241 0.999 23.48 2.08 Intr + 66496 66529 34 0 1 99 116 59 0.999 7.60 2.09 Term + 72129 72265 137 1 2 128 42 196 0.999 17.18 2.10 PlyA + 73050 73055 6 -0.45 3.05 PlyA - 73176 73171 6 1.05 3.04 Term - 73309 73260 50 2 2 91 43 12 0.674 -5.53 3.03 Intr - 74216 74162 55 2 1 106 106 52 0.851 7.35 3.02 Intr - 74610 74541 70 2 1 107 98 29 0.823 4.98 3.01 Init - 75781 75723 59 1 2 58 94 37 0.563 2.08 3.00 Prom - 75861 75822 40 -7.26 4.00 Prom + 80369 80408 40 -0.36 4.01 Init + 87141 87373 233 2 2 111 80 400 0.985 39.23 4.02 Term + 88323 88746 424 0 1 134 41 508 0.954 45.47 4.03 PlyA + 90053 90058 6 1.05 5.07 PlyA - 90796 90791 6 1.05 5.06 Term - 98203 98168 36 1 0 120 41 39 0.652 -0.26 5.05 Intr - 100089 100001 89 1 2 104 94 71 0.909 8.99 5.04 Intr - 107990 107876 115 2 1 111 77 163 0.994 17.52 5.03 Intr - 112659 112582 78 0 0 48 94 123 0.994 8.65 5.02 Intr - 116794 116600 195 1 0 82 48 183 0.663 13.31 5.01 Init - 118646 118575 72 2 0 66 87 42 0.683 2.97 5.00 Prom - 119846 119807 40 -2.56 6.05 PlyA - 123406 123401 6 1.05 6.04 Term - 137599 137382 218 2 2 43 55 157 0.195 5.21 6.03 Intr - 149240 148631 610 2 1 97 55 130 0.002 2.59 6.02 Intr - 150843 150668 176 1 2 4 55 115 0.235 -0.54 6.01 Init - 151498 151249 250 1 1 45 36 143 0.833 2.43 6.00 Prom - 151951 151912 40 -6.66 7.00 Prom + 159186 159225 40 -5.46 7.01 Init + 163413 163468 56 2 2 30 89 151 0.652 8.16 7.02 Intr + 184703 184815 113 2 2 31 87 124 0.952 6.62 7.03 Intr + 186214 186338 125 2 2 124 86 68 0.970 10.50 7.04 Intr + 187571 187674 104 1 2 83 115 100 0.998 11.17 7.05 Intr + 191556 191713 158 0 2 72 54 114 0.958 6.05 7.06 Intr + 197067 197198 132 1 0 102 66 133 0.987 13.12 7.07 Intr + 198821 198958 138 0 0 81 111 52 0.976 7.24 7.08 Intr + 205149 205271 123 1 0 61 91 69 0.927 5.06 7.09 Term + 208702 209402 701 2 2 97 36 637 0.884 53.10 7.10 PlyA + 212817 212822 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 149485 149324 162 1 0 72 52 146 0.911 7.34 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597r:28505372_28722081|GENSCAN_predicted_peptide_1|475_aa MQILGWPPGAAGLFVFLSHPGRGQRPAERMLWGSSRPGGERRSSLWPTCTKDRKMSPKRI AKRRSPPADAIPKSKKVKVSHRSHSTEPGLVLTLGQGDVGQLGLGENVMERKKPALVSIP EDVVQAEAGGMHTVCLSKSGQVYSFGCNDEGALGRDTSVEGSEMVPGKVELQEKVVQVSA GDSHTAALTDDGRVFLWGSFRDNNGVIGLLEPMKKSMVPVQVQLDVPVVKVASGNDHLVM LTADGDLYTLGCGEQGQLGRVPELFANRGGRQGLERLLVPKCVMLKSRGSRGHVRFQDAF CGAYFTFAISHEGHVYGFGLSNYHQLGTPGTESCFIPQNLTSFKNSTKSWVGFSGGQHHT VCMDSEGKAYSLGRAEYGRLGLGEGAEEKSIPTLISRLPAVSSVACGASVGYAVTKDGRV FAWGMGTNYQLGTGQDEDAWSPVEMMGKQLENRVVLSVSSGGQHTVLLVKDKEQS >gi568815597r:28505372_28722081|GENSCAN_predicted_CDS_1|1428_bp atgcagatcctgggatggccgccaggggccgccgggctctttgttttcctttctcacccg ggtcggggccagaggcctgcagagcgcatgctctggggcagttcgcggcccggcggggag cgccggagttccttgtggccgacgtgcaccaaggacaggaagatgtcacccaagcgcata gctaaaagaaggtcccccccagcagatgccatccccaaaagcaagaaggtgaaggtctca cacaggtcccacagcacagaacccggcttggtgctgacactaggccagggcgacgtgggc cagctggggctgggtgagaatgtgatggagaggaagaagccggccctggtatccattccg gaggatgttgtgcaggctgaggctgggggcatgcacaccgtgtgtctaagcaaaagtggc caggtctattccttcggctgcaatgatgagggtgccctgggaagggacacatcagtggag ggctcggagatggtccctgggaaagtggagctgcaagagaaggtggtacaggtgtcagca ggagacagtcacacagcagccctcaccgatgatggccgtgtcttcctctggggctccttc cgggacaataacggtgtgattggactgttggagcccatgaagaagagcatggtgcctgtg caggtgcagctggatgtgcctgtggtaaaggtggcctcaggaaacgaccacttggtgatg ctgacagctgatggtgacctctacaccttgggctgcggggaacagggccagctaggccgt gtgcctgagttatttgccaaccgtggtggccggcaaggcctcgaacgactcctggtcccc aagtgtgtgatgctgaaatccaggggaagccggggccacgtgagattccaggatgccttt tgtggtgcctatttcacctttgccatctcccatgagggccacgtgtacggcttcggcctc tccaactaccatcagcttggaactccgggcacagaatcttgcttcataccccagaaccta acatccttcaagaattccaccaagtcctgggtgggcttctctggtggccagcaccataca gtctgcatggattcggaaggaaaagcatacagcctgggccgggctgagtatgggcggctg ggccttggagagggtgctgaggagaagagcatacccaccctcatctccaggctgcctgct gtctcctcggtggcttgtggggcctctgtggggtatgctgtgaccaaggatggtcgtgtt ttcgcctggggcatgggcaccaactaccagctgggcacagggcaggatgaggacgcctgg agccctgtggagatgatgggcaaacagctggagaaccgtgtggtcttatctgtgtccagc gggggccagcatacagtcttattagtcaaggacaaagaacagagctga >gi568815597r:28505372_28722081|GENSCAN_predicted_peptide_2|287_aa MAASLWMGDLEPYMDENFISRAFATMGETVMSVKIIRNRLTGIPAGYCFVEFADLATAEK CLHKINGKPLPGATPAKRFKLNYATYGKQPDNSPEYSLFVGDLTPDVDDGMLYEFFVKVY PSCRGGKVVLDQTGVSKGYGFVKFTDELEQKRALTECQGAVGLGSKPVRLSVAIPKASRV KPVEYSQMYSYSYNQYYQQYQNYYAQWGYDQNTGSYSYSYPQYGYTQSTMQTYEEVGDDA LEDPMPQLDVTEANKEFMEQSEELYDALMDCHWQPLDTVSSEIPAMM >gi568815597r:28505372_28722081|GENSCAN_predicted_CDS_2|864_bp atggcggccagcctgtggatgggcgacctggaaccctacatggatgagaacttcatctcc agagcctttgccaccatgggggagaccgtaatgagcgtcaaaattatccgaaaccgcctc actgggatcccagctggctactgctttgtagaatttgcagatttggccacagctgagaag tgtttgcataaaattaatgggaaaccccttccaggagccacacctgcgaaacgttttaaa ctgaactatgccacttacgggaaacaaccagataacagccctgagtattccctctttgtg ggggacctgaccccggacgtggatgatggcatgctgtatgaattcttcgtcaaagtctac ccctcctgtcggggaggcaaggtggttttggaccagacaggcgtgtctaagggttatggt tttgtgaaattcacagatgaactggaacagaagcgagccctgacggagtgccagggagca gtgggactggggtctaagcctgtgcggctgagcgtggcaatccctaaagcgagccgtgta aagccagtggaatatagtcagatgtacagttatagctacaaccagtattatcagcagtac cagaactactatgctcagtggggctatgaccagaacacaggcagctacagctacagttac ccccagtatggctatacccagagcaccatgcagacatatgaagaagttggagatgatgca ttggaagaccccatgccacagctggatgtgactgaggccaacaaggagttcatggaacag agtgaggagctgtatgacgctctgatggactgtcactggcagcccctggacacagtgtct tcagagatccctgccatgatgtag >gi568815597r:28505372_28722081|GENSCAN_predicted_peptide_3|77_aa MTDLVYRDPGVLKQVVMMDRLPPSFLAVCDTEEVSLFLELCFKIHVTCKAVLICDYGPME LGQSLWEAEGKDPGHFR >gi568815597r:28505372_28722081|GENSCAN_predicted_CDS_3|234_bp atgactgacttagtctacagagatcccggcgtacttaagcaggtagtaatgatggacagg ttgcccccaagtttcctggcagtgtgtgatactgaggaggtgagcttgtttctggagctg tgctttaagattcatgttacatgtaaagctgtcctcatttgtgactatggacctatggag ttgggacaatctctatgggaagcagaaggcaaggaccccggtcattttaggtag >gi568815597r:28505372_28722081|GENSCAN_predicted_peptide_4|218_aa MEAEGCRYQFRVALLGDAAVGKTSLLRSYVAGAPGAPEPEPEPEPTVGAECYRRALQLRA GPRVKLQLWDTAGHERFRCITRSFYRNVVGVLLVFDVTNRKSFEHIQDWHQEVMATQGPD KVIFLLVGHKSDLQSTRCVSAQEAEELAASLGMAFVETSVKNNCNVDLAFDTLADAIQQA LQQGDIKLEEGWGGVRLIHKTQIPRSPSRKQHSGPCQC >gi568815597r:28505372_28722081|GENSCAN_predicted_CDS_4|657_bp atggaggccgagggctgccgctaccaatttcgggtcgcgctgctgggggacgcggcggtg ggcaagacgtcgctgctgcggagctacgtggcaggcgcgcctggcgccccggagccggag cccgagcccgagcccacggtgggcgccgagtgctaccgccgcgcgctgcagctgcgggcc gggccgcgggtcaagctgcaactctgggacaccgcgggccacgagcgcttcaggtgcatc accaggtccttttaccggaatgtggtgggtgtcctgctggtctttgatgtgacaaacagg aagtcctttgaacacatccaagactggcaccaggaggtcatggccactcagggcccggac aaggtcatcttcctgctggttggccacaagagtgacctgcagagcacccgctgtgtctca gcccaggaggccgaggagctagctgcctccctgggcatggccttcgtggagacctcggtt aaaaacaactgcaatgtggacctggcctttgacaccctcgctgatgctatccagcaggcc ctgcagcagggggacatcaagctagaagagggctgggggggtgtccggctcatccacaag acccaaatccccaggtcccccagcaggaagcagcactcaggcccatgccagtgttga >gi568815597r:28505372_28722081|GENSCAN_predicted_peptide_5|194_aa MAASHFTGLTAVADVIKDLDTQIALIGLGPHSSKKKQDLDKLYELKSKARQIMNQFGPSA LINLSNFSSIKPEPASTPPQGSMANSTAVVLTKKKLQDLVREVDPNEQLDEDVEEMLLQI ADDFIESVVTAACQLARHRKSSTLEVKDVQLHLERQWNMWIPGFGSEEIRPYKKACTTEA HKQRMALIRKTTKK >gi568815597r:28505372_28722081|GENSCAN_predicted_CDS_5|585_bp atggctgcctctcatttcaccgggctcacagctgttgctgatgtaattaaagatctagac actcagatagctttaattggccttggtcctcacagctccaaaaagaaacaggatctcgat aagctctatgagctgaagtccaaagctcggcagattatgaaccagtttggcccctcagcc ctaatcaacctctccaatttctcatccataaaaccggaaccagccagcacccctccacaa ggctccatggccaatagtactgcagtggtattgaccaagaagaaattacaggacttagta agagaagtggatcctaatgagcagttggatgaagatgtggaggagatgctgctgcagatt gctgatgattttatcgagagtgtggtgacagcagcctgtcagcttgcgcggcatcgcaag tctagcaccctggaggtgaaagatgtccagctgcatttagagcgccagtggaacatgtgg atcccaggatttggctctgaagaaatccgaccctacaaaaaagcttgcaccacagaagct cacaaacagagaatggcattgatccggaaaacaaccaagaaataa >gi568815597r:28505372_28722081|GENSCAN_predicted_peptide_6|417_aa MTVDYCKLNQVVTPVAAAEPDVALLLEQINTSPGTCYAATDLANAFFSIPVHKAHERQFA FSLQGRQYTFTILPQGYINSHKAVTRKTASFEWSPKQDKALQQVQAVVQAALPFGPYDPA DPMVLEVSVADRVAVWCLWQAPKGQRFILAGIDTYSKYGFAYLACNASAKTIIRGLTECL IHHHGIPHSTASDHGTDFTAKQVQQWALAHGIHLSYHVPHYPEAAGLIEQRNGLLKSQLQ RQLGDNSLQGWGKVLQKAVYALNQHPIYGTVSPITRIQGSRNQEVEMEVAPLTITLSDPL ANFLLPVPMALCSADLEVLVLEGGMLPPGGTRIPLNWKLRLPPRPVEEQEREEGVRTQTY SGSVFEGPGCGRDGAVRIQALGKGLTERIRLGEDAAVGTRALRRDLGNGDGCEFSGI >gi568815597r:28505372_28722081|GENSCAN_predicted_CDS_6|1254_bp atgacagtggattattgtaagcttaaccaagtggtgactccagttgcagctgctgaacca gatgtggctttattgcttgagcaaattaacacatctcctggtacctgttatgcagctact gatttggcaaatgcctttttctccattcctgtccacaaggcccacgagaggcaattcgcc ttcagcttgcaaggccggcaatatactttcactatcttacctcaggggtatatcaactct cacaaagctgtgacccgaaagactgccagttttgagtggagtccaaaacaggacaaggct ctgcaacaggtccaggctgttgtgcaagctgctctgccatttggaccatatgacccagca gatccaatggtgcttgaggtgtcagtggcagatagggttgctgtttggtgcctttggcag gcccccaaagggcagaggtttatcctcgctggaatagacacttactctaaatatgggttt gcctatcttgcatgcaatgcttctgccaagactatcatccgtggacttacagaatgcctt atccaccatcacggtattccacacagcactgcctctgaccacggcactgactttacagct aaacaagtgcaacagtgggctcttgctcatggaattcacttgtcttaccatgttccccat tatcctgaagcagctggattgatagaacagcggaatggccttttgaagtcacaattacaa cgccaactaggtgacaattctttgcagggctggggaaaagttctccagaaggctgtgtat gctctgaatcagcatccaatatatggtactgtttctcccataaccaggattcaagggtcc aggaatcaagaggtggaaatggaagtggctccactcaccatcacccttagtgatccactg gcaaacttcttgcttcctgttcccatggcattatgttctgctgacctagaggtcttagtt ctagagggaggaatgctgccaccaggaggcacaaggattccattaaactggaagttaaga ttgcctcctaggccagttgaggaacaagagcgggaggagggagtcaggacccagacctac agtgggtcagtgttcgaaggaccaggttgtggacgagacggggctgtgaggattcaggct cttgggaagggactgactgagaggatcaggctcggagaagacgcggctgtggggacccgg gctctaagacgggacttagggaacggggacggatgcgagttctccggcatctga >gi568815597r:28505372_28722081|GENSCAN_predicted_peptide_7|549_aa MLRRLPALAARRPPARRRRLFIDGHFYNRIYEAGSENNTAVVAVETHTIHKIEEGIDTGT IEANEDMEIAYPITCGESKAILLWKKFVCPGINVKCVKFNDQLISPKHFVHLAGKSTLKD WKRAIRLGGIMLRKMMDSGQIDFYQHDKVCSNTCRSTKFDLLISSARAPVPGQQTSVVQT PTSADGSITQIAISEESMEEAGLEWNSALTAAVTMATEEGVKKDSEEISEDTLMFWKGIA DVGLMEEVVCNIQKEIEELLRGVQQRLIQAPFQVTDAAVLNNVAHTFGLMDTVKKVLDNR RNQVEQGEEQFLYTLTDLERQLEEQKKQGQDHRLKSQTVQNVVLMPVSTPKPPKRPRLQR PASTTVLSPSPPVQQPQFTVISPITITPVGQSFSMGNIPVATLSQGSSPVTVHTLPSGPQ LFRYATVVSSAKSSSPDTVTIHPSSSLALLSSTAMQDGSTLGNMTTMVSPVELVAMESGL TSAIQAVESTSEDGQTIIEIDPAPDPEAEDTEGKAVILETELRTEEKVVAEMEEHQHQVH NVEIVVLED >gi568815597r:28505372_28722081|GENSCAN_predicted_CDS_7|1650_bp atgctccgtcgcctgcccgccctggccgctcgccgcccgcccgcccgacggagacgtttg tttatcgatggacacttttacaacaggatttatgaagctgggtcggagaacaacacggca gttgtagcagtagaaactcacacgatacacaaaattgaagaagggattgatacaggcact atagaagcaaatgaggatatggaaattgcttaccccataacttgtggggagagcaaagcc atcctcctctggaagaagtttgtatgtccaggaataaacgtgaagtgtgtcaagttcaat gatcagttgatcagccccaagcactttgttcatctggctggcaagtccactctgaaggac tggaagagagctattcgtctgggtgggatcatgctcaggaaaatgatggactccggacag attgatttttaccaacatgacaaagtttgctccaatacctgcagaagcaccaaatttgat cttctgatcagcagtgcaagagctccagtgccaggacagcagacaagtgtggtgcagaca cccacttcggctgatggtagcatcacgcagattgccatctcagaagagagcatggaagag gcagggctggaatggaactcagctctcaccgctgctgtcaccatggccacggaggagggt gtaaagaaagactcagaggaaatttcagaggacactttgatgttctggaaaggaatagct gatgtagggctgatggaagaggttgtctgcaatatacagaaggaaatagaggagctactc aggggagttcagcagcggctcatccaggctcccttccaagtcacagatgctgctgttctc aacaatgtagcacacacatttggcctaatggacacagtcaagaaggttttagacaacaga aggaaccaagtagagcagggagaagaacagtttctctatactctgacagacttggaacgc cagttggaggagcagaagaagcaaggccaggatcacaggctgaaatctcagacagttcaa aatgtggtactgatgcctgtgagcactcctaagcctccaaaaaggccccggctccagcgg ccagcctccaccactgtcttgagcccttctcctcctgtccagcagcctcagttcacagtc atctcacccatcaccatcaccccagtgggtcagtcattttccatgggcaatattccagtg gccaccctcagccagggctccagtcctgtgactgtccacacactgccttctggccctcag ctcttccgctatgccacagtggtctcctctgccaagagcagctcaccagacacagtgacc atccacccttcatctagcttggcgctgctgagctctactgccatgcaggatgggagtaca ctgggcaacatgaccaccatggttagccctgtggaattggtggccatggagtccggccta acctcggcaattcaggctgttgaaagcacctcagaggatgggcagaccatcattgagatt gatccagccccggacccagaagctgaagatactgagggcaaagcagtcatcttggagaca gagctgaggactgaggagaaagttgtggctgagatggaagaacaccagcatcaagttcac aatgtggagattgtggtcttagaggattaa