GENSCAN 1.0 Date run: 5-Nov-116 Time: 20:08:14 Sequence gi568815596f:161079663_161335515 : 255853 bp : 37.11% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.00 Prom + 7756 7795 40 -1.75 1.01 Init + 10292 10344 53 1 2 60 94 6 0.170 -0.92 1.02 Intr + 10841 10944 104 2 2 113 109 13 0.293 4.80 1.03 Term + 34469 34614 146 0 2 97 51 126 0.949 6.89 1.04 PlyA + 36553 36558 6 1.05 2.03 PlyA - 36658 36653 6 1.05 2.02 Term - 46138 46047 92 2 2 13 55 144 0.032 0.50 2.01 Init - 64090 63982 109 1 1 48 81 101 0.652 5.83 2.00 Prom - 65343 65304 40 -6.15 3.00 Prom + 68479 68518 40 -3.85 3.01 Init + 81656 81750 95 1 2 72 54 119 0.038 6.80 3.02 Intr + 99952 100099 148 1 1 84 93 176 0.609 17.02 3.03 Intr + 123825 123933 109 2 1 31 95 140 0.102 7.94 3.04 Intr + 125013 125131 119 1 2 86 69 70 0.364 4.16 3.05 Intr + 140062 140117 56 0 2 89 95 -21 0.054 -4.44 3.06 Intr + 144969 145084 116 0 2 82 99 50 0.542 4.67 3.07 Intr + 151309 151889 581 2 2 128 93 363 0.975 32.29 3.08 Term + 155680 155856 177 0 0 90 47 146 0.979 7.40 3.09 PlyA + 156319 156324 6 1.05 4.07 PlyA - 156668 156663 6 1.05 4.06 Term - 157927 157808 120 1 0 73 43 100 0.339 1.49 4.05 Intr - 159587 159435 153 2 0 77 -12 170 0.464 5.25 4.04 Intr - 175018 174923 96 1 0 100 75 52 0.420 4.39 4.03 Intr - 201330 201169 162 0 0 -25 34 209 0.194 3.45 4.02 Intr - 201646 201552 95 2 2 14 59 151 0.754 3.56 4.01 Init - 202214 202076 139 2 1 69 105 223 0.993 22.45 4.00 Prom - 202346 202307 40 -16.85 5.00 Prom + 202521 202560 40 -14.71 5.01 Init + 202667 203038 372 1 0 104 33 293 0.843 22.41 5.02 Intr + 207351 207513 163 2 1 78 71 58 0.483 1.73 5.03 Intr + 208626 210083 1458 1 0 -1 72 481 0.015 26.33 5.04 Term + 210439 211061 623 2 2 11 49 319 0.454 13.89 5.05 PlyA + 211436 211441 6 1.05 6.00 Prom + 212192 212231 40 -3.65 6.01 Init + 212369 212496 128 1 2 83 60 46 0.105 0.98 6.02 Intr + 224840 224870 31 2 1 73 70 38 0.072 -2.29 6.03 Term + 228535 228753 219 0 0 64 40 160 0.109 4.86 6.04 PlyA + 229691 229696 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 19214 18918 297 2 0 62 49 167 0.826 5.69 S.002 Init + 33064 33139 76 0 1 68 81 32 0.860 1.80 S.003 Intr - 87820 87720 101 1 2 102 110 59 0.825 7.49 S.004 Term + 109274 109384 111 1 0 99 32 97 0.880 2.78 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596f:161079663_161335515|GENSCAN_predicted_peptide_1|100_aa MPYAPLRKNMLITKEEMSWCVNIDKALRKEKKIKKKLYSFFTQQHIARKCWLERSTCQAT EELRLPSDSTACGCEPTLHISPYCQKLYQQSSELETAVTE >gi568815596f:161079663_161335515|GENSCAN_predicted_CDS_1|303_bp atgccatatgctcctctaagaaaaaatatgctaattaccaaggaagaaatgagctggtgt gtgaatatagataaagctttaaggaaagagaaaaaaataaaaaagaaactatacagcttt ttcacccaacaacacattgccaggaaatgctggctcgaaaggagcacgtgccaagcgacc gaagagttgaggctaccttctgactctacagcttgtggttgtgaaccaacattacacatc tccccatactgtcagaaactgtatcagcagtcctcagagctggaaacagcagtcacagaa tga >gi568815596f:161079663_161335515|GENSCAN_predicted_peptide_2|66_aa MNISAKILNKVLANRIQKHIKKLIHQDQVSFIPGTQVHVGQGWEADFRDTCEIHREAVNR NATATR >gi568815596f:161079663_161335515|GENSCAN_predicted_CDS_2|201_bp atgaacatcagtgcaaaaatcctcaataaagtactggcaaaccgaatccaaaagcacatc aaaaagcttatccaccaagatcaagtcagcttcatccctgggacgcaagtgcacgtgggc caaggctgggaagctgactttcgggacacctgtgagattcatcgggaagctgtcaatagg aatgccactgcaactcgatga >gi568815596f:161079663_161335515|GENSCAN_predicted_peptide_3|466_aa MVVKEVKKRREIEGKENDLPSYRGWNTHPNERPVIYSILYSDATGRRGMDKNIGEQLNKA YEAFRQACMDRDSAVKELQQKTENYEQRIREQQEQLSLQQTIIDKLKSQLLLVNSTQDNN YGCVPLLEDSETRKNNLTLDQPQDKVISGIAREKLPKLYPVLDFTITSFVSVYVDWGNIE KTFWDLKEEFHKICMLAKAQKDHLSKLNIPDTATETQCSVPIQCTDKTDKQEALFKPQAK DDINRGAPSITSVTPRGLCRDEEDTSFESLSKFNVKFPPMDNDSTFLHSTPERPGILSPA TSEAVCQEKFNMEFRDNPGNFVKTEETLFEIQGIDPIASAIQNLKTTDKTKPSNLVNTCI RTTLDRAACLPPGDHNALYVNSFPLLDPSDAPFPSLDSPGKAIRGPQQPIWKPFPNQDSD SVVLSGTDSELHIPRVCEFCQAVFPPSITSRGDFLRHLNSHFNGET >gi568815596f:161079663_161335515|GENSCAN_predicted_CDS_3|1401_bp atggtagtaaaggaagtgaagaagcgacgtgaaattgaaggaaaagaaaatgacctgcct tcttaccgcggttggaatacacacccaaacgagagacctgtcatttactccatcctttat agtgatgctacaggacgaagaggaatggataaaaacattggcgagcaactcaataaagcg tatgaagccttccggcaggcatgcatggatagagattctgcagtaaaagaattacagcaa aagactgagaactatgagcagagaatacgtgaacaacaggaacagctgtcacttcaacag actattattgacaagctaaaatctcagttacttcttgtgaattccactcaagataacaat tatggctgtgttcctctgcttgaagacagtgaaacaagaaagaataatttgactcttgat cagccacaagataaagtgatttcaggaatagcaagagaaaaactaccaaagctgtatcca gtgcttgacttcactataacatcttttgtctcagtatatgtagactggggtaatatagag aagactttctgggatctgaaagaagaatttcataaaatatgcatgctagcaaaagcacag aaagaccacttaagcaaacttaatataccagacactgcaactgaaacacagtgctctgtg cctatacagtgtacggataaaacagataaacaagaagcgctgtttaagcctcaggctaaa gatgatataaatagaggtgcaccatccatcacatctgtcacaccaagaggactgtgcaga gatgaggaagacacctcttttgaatcactttctaaattcaatgtcaagtttccacctatg gacaatgactcaactttcttacatagcactccagagagacccggcatccttagtcctgcc acgtctgaggcagtgtgccaagagaaatttaatatggagttcagagacaacccagggaac tttgttaaaacagaagaaactttatttgaaattcagggaattgaccccatagcttcagct atacaaaaccttaaaacaactgacaaaacaaagccctcaaatctcgtaaacacttgtatc aggacaactctggatagagctgcgtgtttgccacctggagaccataatgcattatatgta aatagcttcccacttctggacccatctgatgcaccttttccctcactcgattccccggga aaagcaatccgaggaccacagcagcccatttggaagccctttcctaatcaagacagtgac tcggtggtactaagtggcacagactcagaactgcatatacctcgagtatgtgaattctgt caagcagttttcccaccatccattacatccaggggggatttccttcggcatcttaattca cacttcaatggagagacttaa >gi568815596f:161079663_161335515|GENSCAN_predicted_peptide_4|254_aa MWYIIDERGDSEESSEEDTGEPNGFIDCPSRRGQGNGSLWDTGDDRVARRCELQVHAESA KRSGRCSALRRSEYRPARWFCGSHVVVLTPAASLQPSNPGDPQICVVTSPPPDPDAWQFK NYPCKRKDFEEELLATTILLDASMNLTTLGTSDEWNYAVFVLLCPFLNSPGLPVALAGRR GRFTRQRVGVRTQALRPLPQYVDPGARVLAPPPRGLFYLGAVLTRPRLADAKSQTLHDSV MRQRERPGGCPLHL >gi568815596f:161079663_161335515|GENSCAN_predicted_CDS_4|765_bp atgtggtatattattgatgaacgtggtgactctgaagaatcctcagaggaggacacggga gagcccaatggcttcattgattgcccatcacggagaggacagggaaatgggagcttgtgg gatactggtgatgacagagtcgcaagacgctgtgagctccaagtccacgcagagtccgcc aaacgctccggccgctgctccgctctgcgaagatctgagtacaggcctgccaggtggttc tgtggttcccacgttgtggttttgacaccagcagcatccttgcaaccctcaaaccccgga gatccacagatctgtgttgtaacaagccctccccctgaccctgatgcatggcagtttaag aattacccgtgtaagcgaaaagactttgaagaagagctcctggcaaccaccattctactt gatgcctctatgaatctgactactctaggtacctcagatgagtggaattatgcagtattt gtccttttgtgtccctttctgaattcgcctggactcccagtcgcgctcgctggccgacgc ggtcgattcaccagacagcgagtgggcgtgagaacgcaggctcttcggccgcttccccag tacgtggatcccggagcgcgggtcctcgccccaccgccccggggacttttttacctggga gctgttcttacccggccacgcttagcagatgccaaaagccagaccctgcacgactcagtc atgagacagagggagagacctggcgggtgcccactccacctttag >gi568815596f:161079663_161335515|GENSCAN_predicted_peptide_5|871_aa MMQLPGHFSTFLVRAPHPAAEKPLLHSILTHRPVHPLATVTLLPAEEAPVMADITAQSIL TPQLCSGTLLLAHVAARAHAGTTLQHVGVPAGHTTGDMASMTLLPGSDTADVNPISTKNT KISWPLLVIPRQTGSGVDLQQTPTGLQLRVLTVRRKTNKQKEHPHQNPICTLPSSKTKGS NSRITILTLNVNGPNAPIKRHRLANRIKTQDPSVCCIQETHLTGRDTHRLKIKGWRKIYQ ANGKQNKTKQQKKQGLQILVSDKRDFKPTKIKRDKEGHYIMVKGPIQQEEQTILNIHAPN TGEPRFIKKVLRDLQRDLDSHTIIMGDFNMPLSTLDRSTRHKVNKDIQELNSALHQVDLI DIYRTLHPKSTAYTFFSAPHCTYSKIDHRVGSKALLSKCKRTEIITNCLSDHSAIKLELR IKKLTQNRSTTWKLNNLLLNNYWVHKEMKAEIKMFFEINEDKDKTYQNLWDTFKAVCRGK FIALNAHKRRQERSKIDTLTLQLKELEKQEQTHSKASRRQEITKIRAELKETETQKTLQK PNESRSWFFEKINKIDRPLARLIKKKREKNQIDAIKNDKGDITTDPTEIQTTIRDYYKHL YTNKLENLEEMDKFLDTYTLPRLNQEEVEPLNRLIIGSEIEAIINSLPTKKVQDQTDSQP NFTRDAEKAFDKIQQPFRLKPLNKLSIDGTYLKIIRAIYDKSTANIILNGQKLQAFPLKT GTRQGCPLSPLLFNIALEVLARAIRQEKEIKGIQLGKEEVKLSLFADDMTVYLENPIVSA QNLLKLISNFSKVSGYKINVQKSQALLYTNNRQTESQIMSELPFTTASKRIKYLGIQLTR DVKDLFKENYKPLLNDIKEDTNKWKNIHAHG >gi568815596f:161079663_161335515|GENSCAN_predicted_CDS_5|2616_bp atgatgcagctacctggccatttctccacatttctggtgagggccccacacccagccgca gaaaagcccctcctgcattccatcctcacacacaggcctgtccatccacttgctactgtc acactcttgccagcagaagaggcccctgtaatggccgatatcaccgcccagtctatcctc accccacagctatgcagcgggaccctcctgctggcccacgtggctgccagagcccatgct ggcactacactccagcatgtcggcgtccctgcgggccacactaccggtgacatggctagc atgaccctccttcctggcagtgacactgctgatgtgaaccccatctctactaaaaataca aaaattagctggcctctgctggtgatacccaggcaaacagggtctggagtggacctccag caaactccaacaggcttgcagctgagggtcctgactgttagaaggaaaactaacaaacag aaagaacatccacaccaaaaccccatctgtacattaccatcatcaaagaccaaaggatca aattcacgtataacaatattaaccttaaatgtaaatggaccaaatgctccaattaaaaga cacagactggcaaatcggataaagactcaagacccatcagtgtgctgcattcaggaaacc catctcacgggcagagacacacataggctcaaaataaagggatggaggaagatctaccaa gcaaatggaaaacaaaacaaaacaaaacaacaaaaaaagcaggggttgcaaatcctagtc tctgataaaagagactttaaaccaacaaagatcaaaagagacaaagaaggccattacata atggtaaagggaccaattcaacaagaagagcaaactatcctaaatatacatgcacccaat acaggagaacccagattcataaagaaagtccttagagacctacaaagagacttagactcc cacacaataataatgggagactttaacatgccactgtcaacattagacagatcaacgaga cataaagttaacaaggatatccaggaattgaactcagctctgcaccaagtggacctaata gacatctacagaactctccaccccaaatcaacagcatatacattcttctcagcaccacat tgcacttattccaaaattgaccacagagttggaagtaaagctctcctcagcaaatgtaaa agaacagaaattataacgaactgtctctcagaccacagtgcaatcaaactagaactcagg attaagaaactcactcaaaaccgctcaactacatggaaactgaacaacctgcttctgaat aactactgggtacataaagaaatgaaggcagaaataaagatgttctttgaaatcaatgag gacaaggacaaaacataccagaatctctgggacacatttaaagcagtgtgtagagggaaa tttatagcactaaatgcccacaagagaaggcaggaaagatctaaaattgacaccctaaca ttacagttaaaagaactagagaagcaagagcaaacacattcaaaagctagcagaaggcaa gaaattactaagatcagagcagaactgaaggagacagagacacaaaaaacccttcaaaaa cccaatgaatccaggagctggttttttgaaaagatcaacaaaattgatagaccgctagca agactaataaagaagaaaagagagaagaatcaaatagacgcaataaaaaatgataaaggg gatatcaccactgatcccacagaaatacaaactaccatcagagactactataaacacctc tacacaaataaactagaaaatctagaagaaatggataaattcctggacacatacaccctc ccaagattaaaccaggaagaagttgaacccctgaatagactaataataggctctgaaatt gaggcaataattaatagcctaccaaccaaaaaagtccaggaccagacggattcacagccg aattttaccagagatgcagaaaaggcctttgacaaaattcaacagcccttcaggctaaaa cctctcaataaactaagtattgatgggacgtatctcaaaataataagagctatttatgac aaatccacagccaatatcatactgaatgggcaaaaactgcaagcattccctctgaaaact ggcacaagacagggatgccctctctcaccactcctattcaacatagcgttggaagttctg gccagggcaatcaggcaggagaaggaaataaagggtattcaattaggaaaagaggaagtc aaattgtccctgtttgcagatgacatgactgtatatttagaaaaccccatcgtctcagcc caaaatctccttaagctgataagcaacttcagcaaagtctcaggatacaaaatcaatgtg caaaaatcacaagcattgctctacaccaataacagacaaacagagagccaaatcatgagt gaactcccattcacaactgcttcaaagagaataaaatacctaggaatccaacttacaagg gatgtcaaggacctcttcaaagagaactacaaaccactgctcaatgatataaaagaggac acaaacaaatggaagaacattcatgctcacggatag >gi568815596f:161079663_161335515|GENSCAN_predicted_peptide_6|125_aa MDEGGSHHSEQTFARTENQTQHFLTHRWELNNENTWTQGPVVGKILWIVDIDKISAAAAE RRGRERKRRRRRGASDQGAKCRLTVLPETGDAGPPAPSVPRAPPGHPLDSPSSPIGRRGT EAEAN >gi568815596f:161079663_161335515|GENSCAN_predicted_CDS_6|378_bp atggatgaaggtggaagccatcattctgagcaaacttttgcaaggacagaaaaccaaaca cagcattttctcactcataggtgggaactgaacaatgagaacacttggacacaggggcct gttgtggggaaaatcctttggattgttgacatcgacaagatctcagcggcggccgcagag cgcagaggtagggagagaaaaaggaggcgtcggcggggcgcgagcgaccagggcgctaag tgccgcctcacagtcctgccagagacaggagacgccgggccgcccgcgccgtctgtccca agagctcctcctggacatccgcttgactctccgtcgtctccaattggccgtcggggaacg gaagccgaagcaaactag