GENSCAN 1.0 Date run: 3-Nov-116 Time: 16:30:55 Sequence gi568815576f:31844734_32056789 : 212056 bp : 46.45% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 285 504 220 0 1 113 90 352 0.999 36.20 1.02 Intr + 2101 2234 134 0 2 96 100 42 0.876 5.64 1.03 Intr + 12712 12820 109 1 1 134 98 86 0.972 14.49 1.04 Intr + 16635 16700 66 2 0 92 92 56 0.924 5.50 1.05 Intr + 25857 26011 155 2 2 73 94 99 0.937 7.87 1.06 Intr + 28522 28599 78 1 0 80 99 62 0.940 5.17 1.07 Intr + 29540 29672 133 2 1 52 94 181 0.979 15.75 1.08 Intr + 31424 31532 109 1 1 56 50 175 0.556 10.36 1.09 Intr + 34792 34959 168 2 0 88 32 187 0.365 13.02 1.10 Intr + 48864 49018 155 1 2 58 81 228 0.512 18.89 1.11 Intr + 52749 52920 172 2 1 78 55 130 0.786 8.22 1.12 Intr + 57009 57069 61 1 1 104 72 62 0.666 3.99 1.13 Intr + 61251 61333 83 0 2 109 94 18 0.175 3.78 1.14 Intr + 61472 61760 289 0 1 101 58 570 0.295 51.60 1.15 Term + 72757 72832 76 1 1 134 46 47 0.567 2.41 1.16 PlyA + 73558 73563 6 1.05 2.03 PlyA - 77782 77777 6 1.05 2.02 Term - 81281 81157 125 0 2 79 42 56 0.266 -1.35 2.01 Init - 83481 83430 52 0 1 77 115 32 0.625 6.12 2.00 Prom - 86501 86462 40 -1.86 3.00 Prom + 89137 89176 40 -4.26 3.01 Init + 100001 100087 87 1 0 98 89 230 0.999 24.74 3.02 Term + 111406 112059 654 0 0 86 49 1025 0.995 92.40 3.03 PlyA + 112558 112563 6 1.05 4.04 PlyA - 112717 112712 6 1.05 4.03 Term - 118046 117927 120 2 0 -2 48 125 0.141 -2.03 4.02 Intr - 123569 123517 53 0 2 127 81 27 0.169 4.53 4.01 Init - 139287 139146 142 0 1 86 80 60 0.553 5.30 4.00 Prom - 159059 159020 40 -2.46 5.06 PlyA - 159868 159863 6 1.05 5.05 Term - 180547 180395 153 1 0 60 55 127 0.843 4.52 5.04 Intr - 186944 186889 56 0 2 91 69 57 0.840 2.80 5.03 Intr - 189952 189799 154 1 1 65 49 110 0.372 4.45 5.02 Intr - 191007 190832 176 1 2 39 20 103 0.556 -1.74 5.01 Init - 191148 191049 100 0 1 70 78 31 0.620 0.73 5.00 Prom - 192069 192030 40 -6.46 6.00 Prom + 192910 192949 40 -4.66 6.01 Init + 194756 195109 354 1 0 90 43 552 0.953 48.04 6.02 Intr + 195944 196127 184 1 1 94 80 114 0.564 10.56 6.03 Intr + 198505 198683 179 2 2 -51 59 327 0.210 15.54 6.04 Intr + 205210 205281 72 0 0 94 131 30 0.837 7.50 6.05 Term + 205678 205737 60 0 0 96 40 41 0.566 -2.10 6.06 PlyA + 209446 209451 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815576f:31844734_32056789|GENSCAN_predicted_peptide_1|669_aa XLIESLKFWRTRFLLLPACVTATKRITEGEAHCDIYGDRPRADEDEWQLLDGFVRFVEGL NRIRRRHRSDRMMRKGTAMKGLQMTGPISTHSLESTAPPVGKKGTSALSALLEMEASQKC LGEQQAAVHGGKSSAQSAESSSVAMTPTYMDSPRKDGAFFMEFVRSPRTASSAFYPQVSV DQTATPMLDGTSLGICTGQSMDRGNSQTFGNSQNIGEQGYSSTNSSDSSSQQLVASSLTS SSTLTEILEAMKHPSTGVQLLSEQKGLSPYCFISAEVVHWLVNHVEGIQTQAMAIDIMQK MLEEQLITHASGEAWRTFIYGFYFYKIVTDKEPDRVAMQQPATTWHTAGVDDFASFQRKW FEVAFVAEELVHSEIPAFLLPWLPSRPASYAKQRTVTLDVDVNNRTDRLEWCSCYYHGNF SLNAAFEIKLHWMAVTAAVLFEMVQGWHRKATSCGFLLVPVLEGPFALPSYLYGDPLRAQ LFIPLNISCLLKEGSEHLFDSFEPETYWDRMHLFQEAIAHRFGFVQDKYSASAFNFPAEN KPQYIHVTGTVFLQLPYSKRKFSGQQRRRRNSTSSTNQNMFCEERVGYNWAYNTMLTKTW RSSATGDEKFADRLLKDFTDFCINRDNRLVTFWTSCLEKMHASAPNQESGCHHWKPDWRT TEKLASSGV >gi568815576f:31844734_32056789|GENSCAN_predicted_CDS_1|2010_bp nncttaattgagtccctgaagttctggaggacccgcttcctgctgctgccagcctgtgtc accgccaccaagcgcatcacggagggggaggcccactgcgacatctatggggacaggccc cgtgcagacgaggacgagtggcaactcctggatggttttgtccgctttgtggagggcttg aatcgcattcgcaggcggcatcgctcggatcgcatgatgcggaaagggaccgccatgaaa ggcttgcagatgactgggcccatttccacgcattctctggagtcaactgcacccccagtg gggaagaagggaacctcagctctctctgccctgttggagatggaggccagtcagaagtgc ctgggagaacagcaggcagctgtgcatggtgggaagagctccgcccagtcagccgagagc agcagcgttgccatgactcccacctacatggacagcccacgaaaggacggggccttcttt atggagtttgtccgcagcccacgcacagcatcgtccgccttctaccctcaggtatctgtg gaccaaacagccactcctatgttggacggcaccagtttgggcatatgcacaggccaatcc atggacagaggcaacagccagacctttgggaactcccagaacataggagaacagggctac tcctccacaaactccagtgacagcagctctcagcagctggtggcaagctccttgacctca tcctctaccctgacagagatcctggaagccatgaagcacccctcgacaggagtccagctg ctctctgaacagaagggcctctcaccgtactgcttcatcagcgcggaggtggtacactgg ttggtgaaccacgtggaggggatccagacacaggcgatggccattgacatcatgcagaaa atgctggaagagcagctcatcacacatgcatctggcgaagcctggcggaccttcatctac ggcttctatttctacaagatagtaacggacaaagagcccgaccgagtggccatgcagcag cccgccaccacctggcacacagcaggagtggacgacttcgccagcttccagcgcaagtgg tttgaggtggcctttgtggcagaagagctcgtgcactctgagattcctgcctttctcctg ccctggctgcctagccggccagcctcctatgcaaagcagaggactgtgaccctggatgtt gacgtgaacaaccgcacagaccggctggagtggtgcagctgttattaccatggcaacttt tctctgaatgcagcctttgagatcaagctgcactggatggcggtgaccgcagcagtactc ttcgagatggtccaaggttggcatcggaaagccacctcctgtggcttcttgttagtccca gttttggaggggccttttgcactgcccagttacctgtatggcgacccccttcgtgcccag ctcttcatcccactcaacatcagctgcttgctcaaggagggcagcgagcacctgtttgat agctttgaacccgaaacgtactgggatcgaatgcacctcttccaggaagccattgcacac aggtttgggtttgtacaagataaatattctgcctctgcttttaacttccctgctgagaac aagcctcagtatatccacgttacaggaacagtgtttctgcagctgccctactccaagcgc aagttctcagggcagcagcggcggcggcggaactccaccagctccaccaaccagaacatg ttctgcgaggagcgggtcggctacaactgggcctacaacaccatgctcaccaaaacatgg cgctccagcgccacaggggatgaaaagtttgctgatcggctgctgaaggacttcacggac ttctgcatcaaccgtgacaaccggctggtcacgttctggacaagttgcctggagaagatg catgccagtgccccgaaccaggagagtggctgccaccactggaagccagattggaggaca acagagaaactggccagctctggggtctga >gi568815576f:31844734_32056789|GENSCAN_predicted_peptide_2|58_aa MHAQHSFKHTETVKPIAVQGSIMNFLDIYIGFPQASGFHTCSSHPPLPSSVTTAKIIS >gi568815576f:31844734_32056789|GENSCAN_predicted_CDS_2|177_bp atgcatgcccagcacagcttcaagcacacagaaacagtaaaaccgattgcagtccaaggt tccatcatgaatttcctggacatctacattggcttcccacaggcctccggctttcacact tgctcatcccatcccccgcttccatcttctgtaaccacagccaaaataatctcttaa >gi568815576f:31844734_32056789|GENSCAN_predicted_peptide_3|246_aa MGDREQLLQRARLAEQAERYDDMASAMKAVTELNEPLSNEDRNLLSVAYKNVVGARRSSW RVISSIEQKTMADGNEKKLEKVKAYREKIEKELETVCNDVLSLLDKFLIKNCNDFQYESK VFYLKMKGDYYRYLAEVASGEKKNSVVEASEAAYKEAFEISKEQMQPTHPIRLGLALNFS VFYYEIQNAPEQACLLAKQAFDDAIAELDTLNEDSYKDSTLIMQLLRDNLTLWTSDQQDE EAGEGN >gi568815576f:31844734_32056789|GENSCAN_predicted_CDS_3|741_bp atgggggaccgggagcagctgctgcagcgggcgcggctggccgagcaggcggagcgctac gacgacatggcctccgctatgaaggcggtgacagagctgaatgaacctctctccaatgaa gatcgaaatctcctctctgtggcctacaagaatgtggttggtgccaggcgatcttcctgg agggtcattagcagcattgagcagaaaaccatggctgatggaaacgaaaagaaattggag aaagttaaagcttaccgggagaagattgagaaggagctggagacagtttgcaatgatgtc ctgtctctgcttgacaagttcctgatcaagaactgcaatgatttccagtatgagagcaag gtgttttacctgaaaatgaagggtgattactaccgctacttagcagaggtcgcttctggg gagaagaaaaacagtgtggtcgaagcttctgaagctgcctacaaggaagcctttgaaatc agcaaagagcagatgcaacccacgcatcccatccggctgggcctggccctcaacttctcc gtgttctactatgagatccagaatgcacctgagcaagcctgcctcttagccaaacaagcc ttcgatgatgccatagctgagctggacacactaaacgaggattcctataaggactccacg ctgatcatgcagttgctgcgagacaacctcaccctctggacgagcgaccagcaggatgaa gaagcaggagaaggcaactga >gi568815576f:31844734_32056789|GENSCAN_predicted_peptide_4|104_aa MNKFQINNKEFVRAKEKSSQRHNCYPFGNVMASRKGPRPKARSGMTPGVGLEMRATKYAV PGNQLANVKKVIEQRLPGKEERGLIKEDLEWQAFSPCLEDNIAS >gi568815576f:31844734_32056789|GENSCAN_predicted_CDS_4|315_bp atgaacaagttccagataaacaataaggaatttgtcagggcaaaagagaagagctcccaa agacacaactgttacccctttgggaacgttatggccagcaggaaaggcccaagacccaag gcaagatctgggatgacgccaggtgtaggcctggaaatgagagctacaaagtatgctgtc cctgggaaccagctggcgaatgtaaagaaggtgattgagcaaaggctacctggcaaggaa gaaagaggtcttatcaaggaagacctggagtggcaggctttcagcccctgtcttgaagac aacatcgcatcatga >gi568815576f:31844734_32056789|GENSCAN_predicted_peptide_5|212_aa MHVSFGLEPEPRLLELPHPLDLKSQTGFREEPWEPFTGNCLNTVLNLGQQEECVGWKSEA CVQEPVPLALGKSFPSLKLSPHLSNLAEFDQQFDTWTQLYESERHFNESLIITNAGKSQQ CNVGIIVCQTPWFSENSEPRMRAGCQRFRFHPEPESDTAGDVCEESVILGGCRNPKPDVC LTQVLKELALKDPHQQLPVLKKYKQFSHIRHV >gi568815576f:31844734_32056789|GENSCAN_predicted_CDS_5|639_bp atgcatgtcagctttggattggaaccagagcccaggttactggaactgcctcatcctcta gacctcaaatcccagactggcttcagagaggagccatgggagcccttcacagggaactgc ctcaacactgtcctcaatctggggcaacaggaagaatgtgttgggtggaagtcggaggcc tgtgttcaagagccggttcctctggccttaggcaagtcatttccctctctgaaactgtct ccgcacctgtcaaatttggcagaatttgaccagcagtttgacacttggacccagctgtac gaaagtgagaggcatttcaatgagtcattaattataacaaatgcaggcaaaagtcagcag tgtaatgtaggcatcatcgtctgccaaacaccttggttttctgaaaactcagagcccagg atgagagcaggctgtcaacgcttcaggttccatccagaacctgagtctgacacagctgga gatgtgtgtgaagagtctgttattcttggaggttgtagaaatccaaaacctgatgtatgc ctgactcaggtattaaaggaactggccctcaaggaccctcaccagcagctacctgtgctc aagaagtacaagcagttctcccatatcaggcatgtatga >gi568815576f:31844734_32056789|GENSCAN_predicted_peptide_6|282_aa MCRVCTKTVKKAARVIIEKYYTRLGNDFHTNKRVCKEIAIIPSKKLRNKIAGYVTHLMKW IQRGPVRGISIKLQEEERERRDNYVPEVSALDQEIIEVDPDTKEMLKLLDFGSLSNLQPH LSCGLSVWLAFFFLGCSANKGPFSDQILVLIGTNDGKRESNRILPVSSDPAAQASCMGAG ASGPGEREGRNAATMDSSTWSPKTTAVTRPVETHELIRNAADISIIVIYFVVVMAVGLWA MFSTNRGTVGGFFLAGRSMVWWPEEWAAEKLLEEGTTCIDLG >gi568815576f:31844734_32056789|GENSCAN_predicted_CDS_6|849_bp atgtgccgcgtttgcaccaaaaccgtgaagaaggcggcccgggtcatcatagaaaagtac tacacacgcctgggcaacgacttccacacgaacaagcgcgtgtgcaaggagatcgccatt atccccagcaagaagctccgcaacaagatagcaggctatgtcacgcatctgatgaaatgg attcagagaggcccagtaagaggtatctccatcaagctgcaggaggaggagagagaaagg agagacaattatgttcctgaggtctcagccttggatcaggagataattgaagtagatcct gacactaaggaaatgctgaagcttttggacttcggcagtctgtccaacctgcagcctcat ctttcctgcggcctgagtgtctggctggcttttttctttctcgggtgttctgctaacaag ggcccctttagtgaccagatcctggttttgattggcaccaatgacggcaagagagagtcc aacaggatcctaccagtgagcagtgacccagcagctcaggccagctgcatgggagcagga gctagcggccctggcgagagggaaggacgcaacgctgccaccatggacagtagcacctgg agccccaagaccaccgcggtcacccggcctgttgagacccacgagctcattcgcaatgca gccgatatctccatcatcgttatctacttcgtggtagtgatggccgtcggactgtgggct atgttttccaccaatcgtgggactgttggaggcttcttcctggcaggccgaagtatggtg tggtggccggaagagtgggcagcagagaagcttctggaagaaggaacaacttgtattgac ctgggttga