GENSCAN 1.0    Date run: 2-Nov-116    Time: 21:06:01

Sequence gi568815593r:91271403_91483092 : 211690 bp : 40.02% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.00 Prom +   5048   5087   40                              -3.25
 1.01 Init +   8765   8816   52  1  1   74   96    34 0.981   4.20
 1.02 Term +   9111   9226  116  1  2   72   47   163 0.899   8.35
 1.03 PlyA +   9653   9658    6                               1.05

 2.05 PlyA -  12401  12396    6                               1.05
 2.04 Term -  31759  31652  108  1  0  103   43    48 0.425  -0.67
 2.03 Intr -  39630  39517  114  0  0   61   15   160 0.346   5.72
 2.02 Intr -  41093  40938  156  2  0   69   70    66 0.328   2.19
 2.01 Init -  45645  45568   78  0  0   46   73    83 0.401   3.71
 2.00 Prom -  49569  49530   40                              -3.85

 3.04 PlyA -  49930  49925    6                               1.05
 3.03 Term -  64188  64055  134  1  2   70   35    83 0.775  -1.53
 3.02 Intr -  65309  65124  186  0  0   81   79    92 0.761   6.34
 3.01 Init -  74115  74064   52  0  1   52  116    26 0.431   3.17
 3.00 Prom -  75124  75085   40                              -5.05

 4.02 PlyA -  75139  75134    6                               1.05
 4.01 Sngl -  76571  75996  576  2  0  100   41   266 0.977  19.02
 4.00 Prom -  77367  77328   40                              -3.75

 5.00 Prom +  77494  77533   40                              -6.95
 5.01 Init +  79464  79551   88  2  1   45   97    45 0.286   1.96
 5.02 Intr +  81921  82032  112  1  1   54   75    71 0.234   0.92
 5.03 Term +  82998  83415  418  0  1   62   46   296 0.245  16.56
 5.04 PlyA +  84042  84047    6                               1.05

 6.09 PlyA -  84505  84500    6                               1.05
 6.08 Term - 100054  99998   57  1  0  119   49    64 0.978   2.31
 6.07 Intr - 102436 102282  155  2  2   75  107   119 0.998  11.37
 6.06 Intr - 102874 102712  163  1  1   74   86    87 0.998   5.73
 6.05 Intr - 103776 103520  257  1  2   97   80   186 0.998  14.94
 6.04 Intr - 104211 104109  103  0  1  110   91    25 0.998   3.83
 6.03 Intr - 105366 105219  148  2  1   71   99   126 0.999  11.32
 6.02 Intr - 107373 107292   82  1  1   98  105    41 0.999   4.58
 6.01 Init - 111690 111411  280  0  1   84  107   243 0.905  23.02
 6.00 Prom - 117805 117766   40                              -4.85

 7.00 Prom + 132836 132875   40                              -1.75
 7.01 Init + 157392 157462   71  2  2   69   84    82 0.533   6.47
 7.02 Term + 167538 167682  145  0  1   39   53   108 0.040  -1.20
 7.03 PlyA + 169195 169200    6                               1.05

 8.04 PlyA - 169491 169486    6                               1.05
 8.03 Term - 175657 175101  557  2  2  102   49   182 0.908   9.20
 8.02 Intr - 178854 178828   27  1  0   88  110     9 0.530   0.27
 8.01 Init - 179109 179085   25  0  1   82  109    40 0.964   5.31
 8.00 Prom - 189816 189777   40                              -7.35

 9.00 Prom + 192604 192643   40                              -1.35
 9.01 Sngl + 205349 205663  315  1  0   99   40   467 0.918  38.70
 9.02 PlyA + 206594 206599    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term + 139138 139437  300  0  0   83   55   130 0.837   3.44

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815593r:91271403_91483092|GENSCAN_predicted_peptide_1|55_aa
MPSSNLICVTTVVGLRQGKTKEAAVSAKSCELPATGVARAADGGNPLAASRMSQR
>gi568815593r:91271403_91483092|GENSCAN_predicted_CDS_1|168_bp
atgccaagttccaacctgatttgcgtcaccacagttgtgggtttgcgccaaggaaagact
aaggaggcagcggtatctgcgaagtcctgcgagctcccggcgacaggagtagcgcgagca
gctgacgggggaaatccactggcagcgagtcggatgagtcagcgctga
>gi568815593r:91271403_91483092|GENSCAN_predicted_peptide_2|151_aa
MCDPKSLFARDNTYVAKEIPVRKRFAGFSLSLTTIYSIASSIQYIQNRIISPAAELGPCL
GQTENGLPADMPRSKPGSIHEKKEGEKEERRAEGKEEGREEEKKEAEKGREKKLRPMDKQ
RQPEDKGLAQCHTFQSLNKTQWMPVFSSPHK
>gi568815593r:91271403_91483092|GENSCAN_predicted_CDS_2|456_bp
atgtgtgacccaaagtccttgttcgccagagacaatacttatgtagccaaggaaattcct
gttagaaaacggtttgcaggtttttccttaagcctcacaacaatttacagcattgccagc
agcattcagtacattcagaacaggataatttcacccgcagctgaactagggccatgcctg
ggacagacagagaatggccttccagcagacatgccaagaagcaagccagggtcaatacat
gaaaagaaagaaggagagaaggaagaaaggagggcggaaggaaaggaagaagggagagaa
gaagagaagaaagaagcagagaaaggaagagaaaagaaattacggccaatggataaacag
aggcaacccgaggataaaggccttgctcagtgtcacacatttcagtcactaaataagaca
caatggatgccagtattctcatcccctcacaaataa
>gi568815593r:91271403_91483092|GENSCAN_predicted_peptide_3|123_aa
MLMDAGKYGYHLHYRWEENSGSRLPTGQEQLWENVYTDKFCFQDLSYFQIAQHILASFLL
PESLHLCIMSSEHSIDLHHNLLSVIAFDTFTQTSNQWQGLSIYLQVHGESICDNLSPFKA
ILL
>gi568815593r:91271403_91483092|GENSCAN_predicted_CDS_3|372_bp
atgctcatggatgcgggtaagtatggttatcatcttcactacagatgggaagaaaatagt
ggctctaggctcccaacaggacaggaacagttatgggaaaatgtatacacggacaaattt
tgttttcaggacctgtcatacttccagattgctcagcacatccttgcaagttttcttctc
ccagagtcactccatctatgcattatgtcctcagaacacagcatagacctgcatcacaat
ctcctctccgtaattgcattcgacacttttacacaaactagtaaccagtggcaaggacta
tccatatacctgcaggttcatggggagagcatttgtgacaacctctctccctttaaggca
attcttctttaa
>gi568815593r:91271403_91483092|GENSCAN_predicted_peptide_4|191_aa
MAERGQHRARAVASEGASPKPWQLPCGVEPASAQKSQIGVWEPPPRLQMMYGNARMTRQK
FAAGAGPSWRTSARAVQKKNVGSEPTHRVPTRALPSGAVRRRPPSSRLQNGRSTNSLHCV
PGKATGTHCQPVKAARREAIPCKVTGAKLSKTLGTYLLPQRDPDVRHGVKGDHFGALRFD
CPAGFQTAWGL
>gi568815593r:91271403_91483092|GENSCAN_predicted_CDS_4|576_bp
atggctgaaaggggccaacatagagctcgggctgtggcttcagaaggtgcaagccccaag
ccttggcagcttccatgtggtgttgagcctgcaagtgcacagaagtcacaaattggggtt
tgggaacctccacctagattgcagatgatgtatggaaatgcccggatgaccaggcagaag
tttgctgcaggggcagggccttcatggagaacctctgctagggcagtgcagaagaaaaat
gtagggtcagagcccacacacagagtacccactagagcactgcctagtggagctgtgaga
agaaggccaccatcctccagactccagaatggtaggtccaccaacagcttgcactgtgta
cctggaaaagccacaggcactcattgccagcctgtgaaagcagccaggagggaggctatc
ccctgcaaagtcacaggggcgaagctgtccaagaccttgggaacctacctcttgcctcag
cgtgacccggatgtgagacatggagtcaaaggagatcattttggagctttaagatttgac
tgccctgctggatttcagactgcatggggcctgtag
>gi568815593r:91271403_91483092|GENSCAN_predicted_peptide_5|205_aa
MLISGFTQTGLSKKLHLTKSSDDFPLHTEEGCFWLQVIKGQTQSGFRNEGNLLAHKAKKC
KRQSGFSSGGWEVLDQGAGCFHRRPTCCHCAAAMSLVIPEKFQRILRVFKTNIDEWCKIT
FAITAIKGVGQRYAHVVLRKADIDLTSRVGELTEDEVECVITMMQNPHQYKIPGWFLNRQ
KDVKDGKYSQALANGLDNKLHEDLE
>gi568815593r:91271403_91483092|GENSCAN_predicted_CDS_5|618_bp
atgctaatctcaggcttcacccagaccggcttgagtaagaaactgcatttgacaaaatcc
tcagatgattttcctttgcatactgaagaaggatgtttctggctgcaagtaattaaaggc
cagactcaaagtggcttccgcaatgaaggaaatttattagctcacaaagctaaaaagtgc
aagaggcagtcaggtttcagttctggaggctgggaagttctagatcaaggtgcaggctgt
ttccacaggaggcctacgtgctgccactgtgctgctgccatgtctctagtgatccctgaa
aagttccagcgtattttgagagtattcaaaaccaacattgatgagtggtgtaaaataacc
tttgccatcactgccattaagggtgtgggtcaaagatatgctcatgtggtgttgaggaaa
gcagacatcgacctcaccagcagggtgggagaactcactgaggatgaggtagaatgtgtg
atcaccatgatgcagaatccacaccagtacaagatcccaggctggttcttgaacagacag
aaggatgtaaaggatggaaaatatagccaggccctagccaatggtctggacaacaagctc
catgaagacctggagtga
>gi568815593r:91271403_91483092|GENSCAN_predicted_peptide_6|414_aa
MVLGKVKSLTISFDCLNDSNVPVYSSGDTVSGRVNLEVTGEIRVKSLKIHARGHAKVRWT
ESRNAGSNTAYTQNYTEEVEYFNHKDILIGHERDDDNSEEGFHTIHSGRHEYAFSFELPQ
TPLATSFEGRHGSVRYWVKAELHRPWLLPVKLKKEFTVFEHIDINTPSLLSPQAGTKEKT
LCCWFCTSGPISLSAKIERKGYTPGESIQIFAEIENCSSRMVVPKAAIYQTQAFYAKGKM
KEVKQLVANLRGESLSSGKTETWNGKLLKIPPVSPSILDCSIIRVEYSLMVYVDIPGAMD
LFLNLPLVIGTIPLHPFGSRTSSVSSQCSMNMNWLSLSLPERPEAPPSYAEVVTEEQRRN
NLAPVSACDDFERALQGPLFAYIQEFRFLPPPLYSEIDPNPDQSADDRPSCPSR
>gi568815593r:91271403_91483092|GENSCAN_predicted_CDS_6|1245_bp
atggtgctgggaaaggtgaagagtttgacaataagctttgactgtcttaatgacagcaat
gtccctgtgtattctagtggggataccgtctcaggaagggtaaatttagaagttactggg
gaaatcagagtaaaatctcttaaaattcatgcaagaggacatgcgaaagtacgctggact
gaatctagaaacgccggctccaatactgcctatacacagaattacactgaagaagtagag
tatttcaaccataaagacatcttaattgggcacgaaagagatgatgataattccgaagaa
ggcttccacactattcattcaggaaggcatgaatatgcattcagcttcgagcttccacag
acaccactcgctacctcattcgaaggccgacatggcagtgtgcgctattgggtgaaagcc
gaattgcacaggccttggctactaccagtaaaattaaagaaggaatttacagtctttgag
catatagatatcaacactccttcattactgtcaccccaagcaggcacaaaagaaaagaca
ctctgttgctggttctgtacctcaggcccaatatccttaagtgccaaaattgaaaggaag
ggctataccccaggtgaatcaattcagatatttgctgagattgagaactgctcttcccga
atggtggtgccaaaggcagccatttaccaaacacaggccttctatgccaaagggaaaatg
aaggaagtaaaacagcttgtggctaacttgcgtggggaatccttatcatctggaaagaca
gagacgtggaatggcaagttgctgaaaattccaccagtttctccctctatcctcgactgt
agtataatccgcgtggaatattcactaatggtatatgtggatattcctggagctatggat
ttatttcttaatttgccacttgtcatcggtaccattcctctacatccatttggtagcaga
acctcaagtgtaagcagtcagtgtagcatgaatatgaactggctcagtttatcacttcct
gaaagacctgaagcaccacccagctatgcagaagtggtaacagaggaacaaaggcggaac
aatcttgcaccagtgagtgcttgtgatgactttgagagagcccttcaaggaccactgttt
gcatatatccaggagtttcgattcttgcctccacctctttattcagagattgatccaaat
cctgatcagtcagcagatgatagaccatcctgcccctctcgttga
>gi568815593r:91271403_91483092|GENSCAN_predicted_peptide_7|71_aa
MEEEDSTVSQTEAAAGFEDGKRSLFTSEDTCIKGRGPYEVVMVHEEEAWKSKQGFQLPAS
CLSPGTYPTYR
>gi568815593r:91271403_91483092|GENSCAN_predicted_CDS_7|216_bp
atggaagaggaagacagcacggtgagccagacagaggctgctgctggctttgaagatgga
aaaaggagtctatttacctcagaagacacctgtataaaaggaagaggtccctatgaagtg
gtgatggtccatgaggaagaggcctggaaatctaagcagggcttccagctccctgcttcc
tgtctaagtccaggtacttaccccacctataggtga
>gi568815593r:91271403_91483092|GENSCAN_predicted_peptide_8|202_aa
MAVKLQLAGYSCNPNLPGTSAQLAELIALTRAIELSKGKVANIYTDSQYAFLVLHAHAAI
WKERHFLTTNGSPIKHHQEIKRLLSSVFLPREITEMHCRGHQKGTDEVAEGNRLASQVAR
SAARKPQDINTLQTPLIWEGSIREIKPQYSTTEVEWATSRGCTFQPSGELQSEVGKLHLP
ASSQRKVTWERIRRINVARDCF
>gi568815593r:91271403_91483092|GENSCAN_predicted_CDS_8|609_bp
atggcggtcaaactacagctagcaggatattcttgcaatccaaatttgccaggcacaagc
gctcaattagctgagctaatagctcttacaagagcaattgaattaagcaaaggaaaggta
gctaatatttacactgactcccagtatgctttcctagttctccatgctcatgcagccatt
tggaaggaaaggcattttcttaccaccaatggatctcctataaaacatcaccaggaaatt
aaaaggttactttcctcagttttccttccacgagaaatcacagaaatgcattgtagggga
catcagaagggaacagatgaggtagccgaaggaaatagattagccagtcaggtagctagg
tcagcggcaagaaagcctcaagacatcaacacacttcaaacccctctaatctgggaaggc
tccataagagaaattaaacctcagtactccactacagaagtagaatgggccacttctcga
gggtgtacatttcagccctcaggagagctacagtcagaggttggcaagctccacttgcca
gcctctagccaacggaaagtcacttgggaaaggataagacgtatcaatgtggctagagat
tgtttttag
>gi568815593r:91271403_91483092|GENSCAN_predicted_peptide_9|104_aa
MVLALTGNKADLASKRPLEFQEAQAYADDDSWLFMETSAKGAMNVNEIFMAIAEKLPKNK
PQNAAGAPDRNQGVDLQENHPASPNRCCSNGAPLACPLPPSLPP
>gi568815593r:91271403_91483092|GENSCAN_predicted_CDS_9|315_bp
atggtccttgcgctcacgggtaacaaggcagacctggccagcaagagacccctggaattc
caggaagcacaagcctatgcagacgatgacagttggctgttcatggagacatcagcaaag
ggtgcaatgaatgtgaatgaaattttcatggcaatagctgagaaacttcccaagaacaag
ccccagaatgcagctggtgctccagacagaaaccaaggtgtggacctccaggagaaccac
ccggccagccccaaccggtgttgcagcaatggagccccccttgcctgtccactgccccca
agtcttccaccttag