GENSCAN 1.0 Date run: 8-Nov-116 Time: 14:36:30 Sequence gi568815575f:74204267_74404764 : 200498 bp : 40.62% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 8878 8958 81 0 0 64 100 73 0.602 7.12 1.02 Intr + 44682 44885 204 2 0 132 38 48 0.017 2.67 1.03 Intr + 45757 45935 179 0 2 56 54 107 0.009 1.90 1.04 Intr + 65653 65871 219 1 0 100 113 99 0.723 10.10 1.05 Intr + 70362 70462 101 0 2 -23 76 99 0.135 -3.57 1.06 Intr + 80427 80495 69 1 0 97 52 103 0.180 5.84 1.07 Intr + 83412 83483 72 1 0 50 92 72 0.274 2.26 1.08 Term + 87640 87908 269 2 2 52 47 266 0.837 13.57 1.09 PlyA + 89835 89840 6 1.05 2.00 Prom + 97122 97161 40 -8.25 2.01 Init + 100001 100497 497 1 2 97 53 391 0.738 31.12 2.02 Intr + 109952 110042 91 2 1 78 105 99 0.988 9.68 2.03 Intr + 116163 116298 136 2 1 98 87 52 0.983 5.32 2.04 Intr + 116668 116805 138 2 0 41 86 99 0.951 4.51 2.05 Term + 116953 117314 362 2 2 89 37 346 0.997 23.31 2.06 PlyA + 118225 118230 6 1.05 3.07 PlyA - 118433 118428 6 -5.80 3.06 Term - 118853 118578 276 2 0 58 49 168 0.421 4.38 3.05 Intr - 119029 118944 86 2 2 116 105 21 0.270 5.22 3.04 Intr - 141606 141506 101 2 2 124 68 -15 0.053 -1.07 3.03 Intr - 147728 147566 163 0 1 69 68 114 0.697 5.61 3.02 Intr - 164020 163905 116 0 2 84 113 124 0.924 13.67 3.01 Init - 164254 164202 53 1 2 86 2 49 0.518 -3.22 3.00 Prom - 165780 165741 40 -7.75 4.05 PlyA - 165954 165949 6 1.05 4.04 Term - 172158 171859 300 0 0 22 38 262 0.058 8.84 4.03 Intr - 175211 175095 117 2 0 111 34 92 0.561 5.84 4.02 Intr - 184652 184594 59 0 2 104 94 -15 0.018 -1.62 4.01 Init - 193055 192653 403 2 1 67 108 204 0.577 17.14 4.00 Prom - 193172 193133 40 -5.35 5.00 Prom + 193707 193746 40 -7.45 5.01 Sngl + 193999 194298 300 0 0 74 46 672 0.996 55.14 5.02 PlyA + 194450 194455 6 -3.64 6.03 PlyA - 194557 194552 6 -4.04 6.02 Term - 195079 194607 473 2 2 5 42 361 0.694 17.31 6.01 Init - 195284 195188 97 2 1 101 31 130 0.951 9.12 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815575f:74204267_74404764|GENSCAN_predicted_peptide_1|397_aa MTLTVATEVLQRNNFSPIHIKKVWIYEEGRLYSASSRKQPALEVKWRAGWGQLKPSIPSS FEIPDPWLCLKQKYCGSCSISSSFPETEQEKPPWKPFYISKPLSRRHSEASRLYLSGLAA IKQTPITNHPDRTEGGRGKHKQTLPTISGTINVTRWPANNLSLCELNRESVSSNHKCLTN SKKQNSPLRGMQFRKLATNLKRILFVLMMLCKPACDIMELLELLPRGRKTHDKEEIPLRD GNNMAREEYSWYSYEYSAIHAPSLKGENQAEAAQKTTYPKGREVTSKMLKNGSLQLTGLT QCNRIAKSRKKIKSFRPTLTLDSHTTTTVFCYSDGVTVFRAKYSDPRSESQRALRMRTAQ MWPVSKCSLFTHFLRTASCRNLLQSHHRLLIFVPTYR >gi568815575f:74204267_74404764|GENSCAN_predicted_CDS_1|1194_bp atgactctcacagtagcaacagaggtattacagcgcaacaacttctctccaatccacatt aaaaaggtatggatttatgaagaaggccgcctctactcagccagcagcaggaaacagcct gctttggaggtcaaatggagggctgggtggggccagctcaaaccttctatccccagctct tttgaaattccagatccctggctttgtcttaagcaaaagtactgtggaagctgcagcatt tcctcctcctttccagaaactgagcaagaaaagccaccctggaagcctttttacatttct aagcccttatctaggcgccacagtgaagccagcagactttacctatcaggcctcgctgct ataaagcaaaccccaattacaaaccatccggaccgcacagagggaggtcgtgggaagcat aaacaaactttacctacaatctccggtaccataaacgtcacaaggtggcctgcaaataac ctgtcattatgtgagctcaacagagaatctgtttctagcaatcacaagtgcctaaccaat agtaagaagcagaacagccccctaagaggaatgcagtttaggaagctggctacgaacctg aagagaatcctctttgttttgatgatgttgtgcaaaccagcttgtgatatcatggaactt ttagaacttttacccaggggaagaaagacccacgacaaggaagagattcctctaagggat ggtaataacatggcccgtgaagaatattcatggtattcgtatgaatattctgcaattcat gcaccaagccttaaaggagagaaccaagcagaagctgctcagaaaaccacctaccctaaa ggcagagaagtgactagcaaaatgctgaaaaatgggagcttacaactaactggattaacc cagtgtaaccggatagcaaaaagcaggaagaaaatcaagagctttcggccaacattaaca ttggattctcacactacaaccacagtgttttgctactctgatggggtgactgtattcaga gccaagtacagcgacccaagatcggaaagccagagagccctccgaatgcgtactgcccaa atgtggcctgtctccaagtgctccctttttacgcatttcctacgcaccgccagctgccgc aaccttttgcagagtcatcatcgcctcctcatttttgtacccacttaccgatag >gi568815575f:74204267_74404764|GENSCAN_predicted_peptide_2|407_aa MSSKDFFACGHSGHWARGCPRGGAGGRRGGGHGRGSQCGSTTLSYTCYCCGESGRNAKNC VLLGNICYNCGRSGHIAKDCKDPKRERRQHCYTCGRLGHLARDCDRQKEQKCYSCGKLGH IQKDCAQVKCYRCGEIGHVAINCSKARPGQLLPLRQIPTSSQGMSQRWRPCEPYSYKIDR LGVQLVEDILGEKDTKCHPPPPPPYIGEDQQGCTKGAALLRDLYVHCQGQWKLNHRHWFE EEGDQFLPKATFSKKPELTPPEMDTASDTEGKCWKGKIVVPLNDTEEEPPDPSTHTYKLL DVQRSKSQKTQAAGRGAHQRRNTWAAGRREEHTGGWTSRGTHGRLDVERNAPTGTGAASP DQQSNARSLAGTLRGEPRPLSGPTPGENPPTPSPSGFPHLLRATSTQ >gi568815575f:74204267_74404764|GENSCAN_predicted_CDS_2|1224_bp atgagcagtaaggatttcttcgcgtgtggacactctggccattgggctcggggatgccct agaggaggagctggagggcgaagaggtggaggccatggcagaggttctcaatgtggttcc accaccctatcttacacctgttactgctgtggtgagtccggtcgtaatgctaagaactgt gtccttctcggaaacatctgctacaactgtgggagaagcggccacatcgccaaagactgt aaggatcctaaacgagagagacgccaacactgttatacctgcggcagactaggacatctg gctcgtgactgtgatcgtcagaaagagcagaaatgctactcttgcggcaaacttgggcac attcagaaagactgcgcccaggtcaagtgttaccgatgcggcgagattggccacgtggcc atcaattgcagcaaggcgaggccaggtcaactgctaccgctgcggcaaatcccgacatct agccaaggaatgtcccagaggtggaggccttgtgaaccatacagctataagattgatcgg ctgggggtgcaactggtagaggacattctgggagagaaggatacaaagtgtcatccccca ccaccacccccatacataggagaagaccagcaaggatgtacaaaaggggcagccttgctc agggacctttatgttcattgtcaaggtcaatggaagctaaatcacagacactggtttgaa gaagagggagaccaatttctccctaaggctaccttctccaagaaaccagaactgacccct ccagaaatggacacagccagtgatacggaagggaagtgctggaaagggaagatcgtagtc cctttaaatgatacagaagaggaacccccagatcctagcacacacacatacaagctgctg gacgtccagagaagcaaatcgcagaagacacaggcggctggacgaggagcacatcagcgg aggaacacatgggcggctggacgtcgagaggaacacacgggcggctggacgtcgagagga acacacgggcggctggacgtcgagaggaacgcaccaacaggcaccggcgccgcaagccca gaccagcagagcaacgcgcggagtttggccgggacactcagaggagagcccaggccactg agcggcccgactccaggggaaaaccctcccactccatccccttctggcttccctcatctg ctgagagccacctccactcaataa >gi568815575f:74204267_74404764|GENSCAN_predicted_peptide_3|264_aa MDQKWPTPNEVEMTELSWSEIIVGTAATKLGSLNAIEAIGSWGSRGQMAELNHQRQDSSG WFSPLMLDDEQAQAEDYSRVDGLLQVLIGIEEEVARLCIFYEEKRKSESIRWLPAKQRAS HPSHYCGDTMLLFFFSSEHQSSSTGDCPWVHVPQSPPHPEETFIPKYLCPTPPGRVHCKY REVSGERSQILAREPAISNARVTVSLLSSAGARECPGNILGLSILRTGALASPLPPIGLL PSGDKKFSSVQQRYSRKSARLSCG >gi568815575f:74204267_74404764|GENSCAN_predicted_CDS_3|795_bp atggaccaaaagtggcctacaccaaatgaagttgaaatgacagaactgtcttggtcagaa attatagtgggaactgctgccaccaaactgggatccctgaatgcaatagaggcaatcgga tcctggggcagcaggggccaaatggcagaactaaatcatcaaagacaagatagtagtggg tggttttctccacttatgctagatgatgagcaggcgcaggctgaggattattccagagtt gatggtttacttcaggtcctcattgggattgaggaggaagtggctaggctctgtattttc tatgaagagaaaaggaaaagtgaaagtatcaggtggttaccagcaaagcagagggcaagt catcccagtcactactgtggtgacaccatgctgcttttttttttttctagtgagcaccag tcctcctctacaggtgactgcccttgggtgcatgtacctcagtctccccctcacccagaa gaaacttttattcctaaatatctgtgccccacacccccggggagggtgcactgcaagtac agggaagtctccggggagagatcacagatcctagccagggagccagccatcagcaatgcc agagtgacagtgtccctcttgtcctcagcaggggcaagggagtgtccaggcaacatcctg ggcctgagcatcctgaggacaggggctttggcctcacctctccctcccataggcctttta ccctctggggacaagaagttctcatctgtacagcaaagatactcgagaaaatcagccagg ctgtcctgtggttga >gi568815575f:74204267_74404764|GENSCAN_predicted_peptide_4|292_aa MGQLKFNTSEEHYADTYRSDLPNSDMLSAKLHCWRIKWKHRGKDIELPSTIYEALHLPDI KFFPNVYALLKVLCILPVMKVENEQYENGRKRLKAYLRIILTDQRSSILALLNINFDIKH HLDLVVDTYIKLYTKIRQGFFFNFNCETLVRLLESPPVPPTLNDLKNSFVVEKVQDGDKK TWVQVLAFPPTSQMMFAKYVVRKPLNKECKKPRTKAPKIQHLVNPCVLQHKQWRIALKKQ GTKKNKEETAEYAKLLAKRMKEAKEKRQEQIAKRHRLSSLRPSTSKPESSQK >gi568815575f:74204267_74404764|GENSCAN_predicted_CDS_4|879_bp atgggacaactcaaattcaatacatcagaggaacactatgctgacacgtatagaagtgac ttacccaattctgacatgctctcagccaagcttcattgttggagaatcaaatggaaacac agggggaaagatatagagcttccatccaccatctatgaagccctccacctgcctgacatc aagttttttcctaatgtgtatgcattgctgaaggtcctgtgtattcttcctgtgatgaag gttgagaatgagcagtatgaaaatggacgaaagcgccttaaagcatatttgaggatcatt ttgacagaccaaaggtcgagtatcttggctttgcttaacataaattttgatataaaacac cacctggatttagtggtggacacatatattaaactctatacaaaaatcagacagggattt ttcttcaattttaactgtgagactctggtaaggctccttgagagccctccagtaccccct accctaaatgatctgaaaaattcatttgtggtagaaaaagttcaggatggagataagaaa acctgggttcaagtattagcttttccacctactagtcaaatgatgtttgccaagtatgtt gtaagaaagcccttaaacaaagaatgtaagaaacctaggaccaaagcacccaagattcag catcttgttaatccatgtgtcctgcaacacaaacagtggcgtattgctctgaagaagcag ggtactaagaaaaataaggaagagactgcagaatatgctaaacttttggccaagagaatg aaggaggctaaagagaagcgccaagaacaaattgcaaagagacacagactttcttctctg cgaccttctacttctaagcctgaatccagtcagaaataa >gi568815575f:74204267_74404764|GENSCAN_predicted_peptide_5|99_aa MARTWEAELAVSPDGATALQPGRQSETLSQKKKKKEEEEEEEEEEEEEEEEEEEEEEEEE EEEEEEEEEEEEEEEEEEEEEEEGEEEEGEEEEDQLDTM >gi568815575f:74204267_74404764|GENSCAN_predicted_CDS_5|300_bp atggcgcgaacctgggaggcagagcttgcagtgagcccagatggcgccactgcactccag cctgggcgacagagcgagactctgtctcaaaaaaaaaaaaaaaaagaagaagaagaagaa gaagaagaagaagaggaggaggaggaggaggaggaggaggaggaggaggaggaagaagaa gaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaa gaagaagaaggagaagaagaagaaggagaagaggaagaagatcaactagacacaatgtaa >gi568815575f:74204267_74404764|GENSCAN_predicted_peptide_6|189_aa MTNFCAAPKCTWKSTQSDLAFSGSRGTRPDARKLVLIDNVIPTIFDLTSHLNNPHSRHRK RIKELSADKIRTLKQKKFDETSEQEPKHKETNNSNSQNPSEEESEGQDEDILPLTLEEKE NKEYLKSLFEILILMGKQNIPLDGHEADEIPEGLFTPDNFQALLECRINSGEEVLRKQFD TTAVNTLFC >gi568815575f:74204267_74404764|GENSCAN_predicted_CDS_6|570_bp atgacgaacttctgcgctgcccccaagtgcacgtggaagagcacgcagtccgacctggcc ttttcaggttcccgcggaacccggccagatgccagaaaactagtccttatagataatgta ataccaacaatatttgatcttaccagtcatttgaacaacccacatagtagacacagaaaa cgaataaaagaactgagtgcagacaaaatcaggacactgaaacagaaaaaattcgatgaa acttctgagcaggaaccaaaacataaagaaaccaacaatagcaattctcagaaccccagt gaagaagagagtgaagggcaagatgaagacattttacctctaacccttgaagaaaaggaa aacaaagaatacctaaaatctctatttgaaatcttgatcctgatgggaaagcaaaacata cctctggatggacatgaggctgatgaaatcccagaaggtctctttactccagataacttt caggcactgctggagtgccgaataaattctggtgaagaagttctgagaaagcagtttgac acaacggcagttaacacgttgttttgttaa