GENSCAN 1.0    Date run: 4-Nov-116    Time: 05:53:16

Sequence gi568815592f:38948841_39186070 : 237230 bp : 44.20% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +    612    730  119  2  2   67   94    70 0.957   4.76
 1.02 Intr +   2478   2680  203  0  2   76   86   135 0.995  11.03
 1.03 Intr +  22752  22825   74  1  2   96   91    50 0.059   5.23
 1.04 Intr +  33506  33622  117  1  0   58   87    79 0.673   5.46
 1.05 Intr +  35366  35467  102  1  0   44   89   101 0.518   6.17
 1.06 Intr +  41172  41332  161  2  2   92  113    83 0.992   9.99
 1.07 Intr +  51788  51845   58  2  1  113  101     7 0.031   3.39
 1.08 Intr +  51962  52062  101  1  2   74   59    42 0.040  -1.19
 1.09 Intr +  59974  60130  157  1  1   85  107    45 0.714   6.11
 1.10 Intr +  63608  63797  190  1  1  102   95    25 0.907   3.76
 1.11 Intr +  69849  69947   99  1  0   68   86    82 0.756   6.08
 1.12 Intr +  77706  77827  122  1  2   89   49   108 0.892   7.21
 1.13 Intr +  81265  81548  284  0  2   69  108   132 0.883   9.52
 1.14 Intr +  84468  84582  115  0  1   60   99    66 0.668   5.35
 1.15 Term +  87666  87737   72  2  0   70   44    55 0.099  -2.79
 1.16 PlyA +  89721  89726    6                              -0.45

 2.06 PlyA -  90608  90603    6                               1.05
 2.05 Term -  91144  90828  317  2  2  -39   46   365 0.789  14.70
 2.04 Intr -  92262  92112  151  0  1  103  110    18 0.793   5.24
 2.03 Intr -  94422  94311  112  2  1   99   39    56 0.795   2.18
 2.02 Intr -  95173  95024  150  0  0   61   82    53 0.411   1.28
 2.01 Init -  97793  97753   41  2  2   91   85    19 0.558   1.56
 2.00 Prom -  98401  98362   40                              -6.46

 3.00 Prom +  99171  99210   40                              -8.56
 3.01 Init + 100001 100259  259  1  1  112   94   224 0.525  20.70
 3.02 Intr + 100710 100888  179  1  2   64   43    63 0.278  -0.96
 3.03 Intr + 107557 107653   97  0  1   86  109   102 0.998  11.68
 3.04 Intr + 108632 108739  108  0  0   78   77   149 0.854  13.06
 3.05 Intr + 116871 116989  119  1  2   54   70   154 0.975  10.38
 3.06 Intr + 117357 117463  107  2  2   88  110   209 0.993  22.11
 3.07 Intr + 124022 124175  154  2  1   68   89   225 0.871  20.67
 3.08 Intr + 124770 124929  160  2  1  119   78   241 0.993  25.76
 3.09 Intr + 129482 129542   61  0  1  143   84   108 0.999  13.69
 3.10 Intr + 130117 130186   70  1  1  100   84    98 0.990   9.78
 3.11 Intr + 130272 130360   89  2  2  100   64   133 0.999  10.87
 3.12 Intr + 130724 130862  139  2  1   92   62   221 0.999  20.37
 3.13 Intr + 131858 131899   42  1  0  101   99    19 0.841   2.74
 3.14 Term + 137066 137233  168  1  0   51   53   161 0.721   6.78
 3.15 PlyA + 139208 139213    6                               1.05

 4.03 PlyA - 141014 141009    6                               1.05
 4.02 Term - 156936 156592  345  0  0   93   48   181 0.075   9.09
 4.01 Init - 166372 166043  330  1  0   40   94   251 0.644  16.22
 4.00 Prom - 171368 171329   40                              -3.76

 5.06 PlyA - 171526 171521    6                              -0.45
 5.05 Term - 172636 172527  110  2  2   50   46    79 0.280  -1.43
 5.04 Intr - 178971 178855  117  1  0   82   68    62 0.464   4.04
 5.03 Intr - 182460 182367   94  0  1   99   70    20 0.611   0.84
 5.02 Intr - 184498 184260  239  2  2   48   65   104 0.321   1.43
 5.01 Init - 187146 187089   58  0  1   42  111    58 0.838   4.97
 5.00 Prom - 198698 198659   40                              -3.46

 6.06 PlyA - 200090 200085    6                               1.05
 6.05 Term - 212011 211922   90  1  0   83   43    73 0.163   0.02
 6.04 Intr - 223617 223472  146  1  2   95   48    80 0.284   4.70
 6.03 Intr - 223881 223730  152  2  2   73   57    61 0.423   1.31
 6.02 Intr - 224635 224603   33  0  0   92   84    35 0.269   0.84
 6.01 Init - 229917 229841   77  0  2   90   57    44 0.117   2.06

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term +  42001  42166  166  1  1   99   48    87 0.843   3.19

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815592f:38948841_39186070|GENSCAN_predicted_peptide_1|657_aa
ISRNEKGWKSWFDKDAPEEEIIPDGYNDSLDTCHKLLLIRSWCPDRTVFQARKYIADSLE
EKYTEPVILNLEKTWEESDTRTPLICFLSMGSDPTNQIDALAKKLKLECRTISMGQGQEV
HARKLIQMSMQQERRKFGPLGWNIPYEFNSADFSASVQFIQNHLDECDIKKGVSWNTVRY
MIGEVQYGGRVTDDFDKRLLNCFARVWFSEKMFEPSFCFYTGYKIPLCKTLDQYFEYIQS
LPSLDNPEVFGLHPNADITWDRLVSGKQAQGSHRFYMMSMEKCLPQNQSLVPKRLGTAVL
KDVQEFLRLEMRYQSNTASAVLETITNIQPKESGGGVGETREAIVYRLSEDMLSKLPPDY
IPHENLRDALDNMYDARIPQLWKRVSWDSSTLGFWFTELLERNAQFSTWIFEGRPNVFWM
TGFFNPQGHLDSCEWAPHVKQCGRGKPGQHGVMEGKEKESSFLTAMRQEVTRAHKGWALD
TVTIHNEVLRQTKEEITSPPGEGVYIYGLYMDGAAWDRRNGKLMESTPKVLFTQLPVLHI
FAINSTAPKDPKLYVCPIYKKPRRTDLTFITVVYLRTVLSPDHWILRGVALLCDIKYYPR
EMKVYVHTKTYIPVLIEALLEEPTIGNNPNAHQQHLALTADKNRGFEHSSLDLHPTF

>gi568815592f:38948841_39186070|GENSCAN_predicted_CDS_1|1974_bp
atatctcgtaatgagaaggggtggaaaagctggtttgataaagatgctccagaggaggaa
attatccctgatggatataatgattcactagatacctgccataaacttttacttatcagg
tcttggtgcccagaccgtactgtttttcaagcaagaaagtatattgcagattctttggag
gagaagtacacagaaccagttatcttaaatctggagaaaacttgggaagaaagtgatacc
cggacacctctgatatgcttcctgtccatgggatctgaccccaccaatcaaattgatgca
ttggccaagaaactgaaactggaatgtagaactatctcaatggggcaaggacaagaagta
catgctcgaaagctgattcagatgtcaatgcagcaggagcgacgaaaatttggcccctta
ggatggaatattccctacgaattcaattctgctgacttttcagccagtgttcagtttatt
cagaatcaccttgatgaatgcgatattaagaaaggtgtatcatggaatacggttcggtac
atgatcggagaagtacaatatggaggcagagtgacagatgactttgacaaacgtctactt
aattgctttgccagagtctggttcagtgagaagatgtttgaaccgtcattctgcttttat
actggatataaaatccccttatgcaaaaccttagaccagtattttgaatacatccagtca
ctgccatccctagataaccctgaagtctttgggcttcaccctaatgctgatatcacatgg
gaccgtctagtttcaggaaaacaagctcagggctcccacagattttacatgatgtctatg
gaaaaatgtcttccacaaaaccagtccctggtgccaaaaaggttgggaaccgctgtctta
aaggatgtgcaggagtttctcagattggagatgaggtatcagagtaacactgcttctgct
gttcttgaaacaattaccaacattcaacccaaagagagtggaggtggtgtgggagagacc
cgggaggctattgtttatagattatctgaagatatgctgagtaaactccctcctgattac
attcctcatgagaatctgagagatgctctggacaacatgtatgatgctcgtatacctcag
ctctggaaaagagtgtcttgggattcgtccacactgggcttctggttcactgaacttttg
gaaagaaatgctcagttttctacgtggatatttgaagggaggcctaatgtgttttggatg
actggtttctttaatccccaaggtcacctggactcttgtgaatgggctcctcatgtcaaa
cagtgtggaagagggaaaccaggacagcatggtgtcatggaaggcaaggagaaggagagt
agcttcctcacagcaatgaggcaagaagtgacccgtgcccacaaaggctgggcactggac
actgtgaccatccacaatgaagttctgagacagaccaaggaggagatcacgtcaccccct
ggggaaggtgtgtatatttatgggctctacatggatggagcagcctgggacagacggaat
gggaagctcatggaatccacccccaaggtactcttcacgcagttacccgtgctccacatc
tttgccattaactccacggcacccaaggaccccaagctgtatgtgtgtcctatttacaag
aaacccaggcgaactgatttgaccttcatcactgtggtatatttacgaacagtgttgtcc
ccggatcactggatcctgagaggagtggcccttttgtgtgacatcaagtattaccctaga
gaaatgaaagtgtatgtccatacaaagacatacattccagtgctcatagaagctttattg
gaagagccaacaattggaaataatccaaatgcccatcaacagcatctggctttgactgca
gataagaaccgaggttttgaacactcaagtcttgatttacatccaaccttctaa

>gi568815592f:38948841_39186070|GENSCAN_predicted_peptide_2|256_aa
MDGQEWMDKYPIQRLALARQSEDLLLREGRDLSSGPGIHFLPGEPGCQQVLGAKGGITRG
CLFSPLSSLLRVHQLCQVSSRQQVQLLMPRGSGAEHHFLAQTFMGRRRGQPSTRIFSEKN
GPGARSGEAAAGFQGQGLAPALSKAASESPPDKEWCSITKSPCQVKDMKIKSLKVYLFSL
SIKEFEVTDFLLGVPLKDKVLKLMFVQKQPKAGRRIRFKVIVTIRDCNDNVSLGVEWPKE
LLIAVCRTNILSIILV

>gi568815592f:38948841_39186070|GENSCAN_predicted_CDS_2|771_bp
atggatggacaggagtggatggataaataccccatccagaggctggcccttgccagacag
tctgaagatctgctgctgagagaaggcagagacctcagcagtgggccaggaatccacttt
ctgccaggagaacctggctgccagcaagtcctgggggcaaaggggggtatcacgagagga
tgcctctttagccctctatcttccctgctgcgggtgcaccagctgtgccaggtctcatcc
aggcagcaagtacagctgctaatgcccaggggctctggtgcagaacatcacttcctggct
cagaccttcatggggaggagaagaggccagccaagcacccgaattttctctgaaaagaat
ggccctggagccaggagtggggaagctgctgctggctttcagggacaaggattagcacca
gcactgagtaaagctgcctcagagtcccccccagacaaggagtggtgctccatcaccaag
tcaccttgccaggtcaaggacatgaagatcaaatccctgaaggtctatctcttctctctg
tccatcaaggagtttgaggtcactgacttcctcctgggggtgcccctcaaggacaaggtt
ctgaagctcatgtttgtgcaaaagcagcccaaggctggccggcggatcaggttcaaagtg
attgtcaccataagggactgcaatgataacgtcagtctgggtgttgagtggcccaaggag
ttactcattgctgtctgcaggaccaacatcctctccatcatcctggtgtag

>gi568815592f:38948841_39186070|GENSCAN_predicted_peptide_3|583_aa
MAGAPGPLRLALLLLGMVGRAGPRPQVRSRDPDDTGGGGWLCGLQAQCPGGGPRDLNETP
RPLAGASESLGGSRDWRLGAPGLEGPASVANRQKAQCPPGSSQAPPLTHSADLPFSSCTD
SRPQEKAYWDTQEHLGSLVPPESRPQGATVSLWETVQKWREYRRQCQRSLTEDPPPATDL
FCNRTFDEYACWPDGEPGSFVNVSCPWYLPWASSVPQGHVYRFCTAEGLWLQKDNSSLPW
RDLSECEESKRGERSSPEEQLLFLYIIYTVGYALSFSALVIASAILLGFRHLHCTRNYIH
LNLFASFILRALSVFIKDAALKWMYSTAAQQHQWDGLLSYQDSLSCRLVFLLMQYCVAAN
YYWLLVEGVYLYTLLAFSVLSEQWIFRLYVSIGWGVPLLFVVPWGIVKYLYEDEGCWTRN
SNMNYWLIIRLPILFAIGVNFLIFVRVICIVVSKLKANLMCKTDIKCRLAKSTLTLIPLL
GTHEVIFAFVMDEHARGTLRFIKLFTELSFTSFQGLMVAILYCFVNNEVQLEFRKSWERW
RLEHLHIQRDSSMKPLKCPTSSLSSGATAGSSMYTATCQASCS

>gi568815592f:38948841_39186070|GENSCAN_predicted_CDS_3|1752_bp
atggccggcgcccccggcccgctgcgccttgcgctgctgctgctcgggatggtgggcagg
gccggcccccgcccccaggtgagatccagggaccccgacgacaccgggggaggcgggtgg
ctctgcgggctgcaggcgcagtgtcctggtggagggccccgggacttgaacgaaactccg
agaccgctggcgggggcatccgaaagcctcggggggagtagggactggagacttggtgcc
cctgggctggaaggcccagcctctgtagcaaacaggcagaaggctcagtgcccccctgga
agcagccaggcgcccccactcacccactctgctgacctcccattctcatcctgcactgac
agcaggccccaagagaaggcatactgggacacgcaagaacacctgggctccttggtgcca
ccagagagcaggccacagggtgccactgtgtccctctgggagacggtgcagaaatggcga
gaataccgacgccagtgccagcgctccctgactgaggatccacctcctgccacagacttg
ttctgcaaccggaccttcgatgaatacgcctgctggccagatggggagccaggctcgttc
gtgaatgtcagctgcccctggtacctgccctgggccagcagtgtgccgcagggccacgtg
taccggttctgcacagctgaaggcctctggctgcagaaggacaactccagcctgccctgg
agggacttgtcggagtgcgaggagtccaagcgaggggaaagaagctccccggaggagcag
ctcctgttcctctacatcatctacacggtgggctacgcactctccttctctgctctggtt
atcgcctctgcgatcctcctcggcttcagacacctgcactgcaccaggaactacatccac
ctgaacctgtttgcatccttcatcctgcgagcattgtccgtcttcatcaaggacgcagcc
ctgaagtggatgtatagcacagccgcccagcagcaccagtgggatgggctcctctcctac
caggactctctgagctgccgcctggtgtttctgctcatgcagtactgtgtggcggccaat
tactactggctcttggtggagggcgtgtacctgtacacactgctggccttctcggtctta
tctgagcaatggatcttcaggctctacgtgagcataggctggggtgttcccctgctgttt
gttgtcccctggggcattgtcaagtacctctatgaggacgagggctgctggaccaggaac
tccaacatgaactactggctcattatccggctgcccattctctttgccattggggtgaac
ttcctcatctttgttcgggtcatctgcatcgtggtatccaaactgaaggccaatctcatg
tgcaagacagacatcaaatgcagacttgccaagtccacgctgacactcatccccctgctg
gggactcatgaggtcatctttgcctttgtgatggacgagcacgcccgggggaccctgcgc
ttcatcaagctgtttacagagctctccttcacctccttccaggggctgatggtggccata
ttatactgctttgtcaacaatgaggtccagctggaatttcggaagagctgggagcgctgg
cggcttgagcacttgcacatccagagggacagcagcatgaagcccctcaagtgtcccacc
agcagcctgagcagtggagccacggcgggcagcagcatgtacacagccacttgccaggcc
tcctgcagctga

>gi568815592f:38948841_39186070|GENSCAN_predicted_peptide_4|224_aa
MLSRGFCHAGARRPARAERRQLLCGLRARAPLSANGREARAMEQRLAEFRAARKRAGLAA
QPPAASQGAQTPGEKAEAAATLKAAPGWLKRFLVWKPRPASARAQPGLVQEAAQPQGSTS
ETPWNTAIPLPSCWDQSFLTNITFLKVLLWLVLLGLFVELEFGLAYFVLSLFYWMYVGTR
GPEEKKEGEKSAYSVFNPGCEAIQGTLTAEQLERELQLRPLAGR

>gi568815592f:38948841_39186070|GENSCAN_predicted_CDS_4|675_bp
atgctcagcaggggcttctgccacgcaggcgcacggcggccggcgcgggccgagcggagg
caactgctgtgcggcctgcgggcgcgcgctcccttatcggccaacggacgcgaggcgcgc
gccatggaacagcggttagctgagtttcgggcggcgcggaaacgggcgggtctggcggcc
caaccccctgctgccagtcagggcgcacaaaccccaggagagaaggcggaagcagcagcg
actctaaaggcagccccaggctggctaaagcggttcctggtatggaaacctaggcccgcg
agtgcccgggcccagcccggcctagttcaggaagcggctcagccccagggcagcacatca
gagacaccatggaacacagccattcctctgccgtcgtgctgggaccagtctttcctgacc
aatatcaccttcttgaaggttcttctctggttggtcctgctgggactgtttgtggaactg
gaatttggcctggcatattttgtcctgtccttgttctattggatgtacgtcgggacacga
ggccctgaagagaagaaagagggagagaagagcgcctactctgtgttcaatccaggctgt
gaagccatccagggcaccctgactgcagagcagttggagcgcgagttacagttgagaccc
ctggcagggagatag

>gi568815592f:38948841_39186070|GENSCAN_predicted_peptide_5|205_aa
MTLGPQINQPEEHLANFKSGTTVNIYTDSKYVFHILHHLAVIWAETGFLTMQWSSIINAS
LIKTRLKAALLPKKAGVIHCEGHQKASDPIAQGNAYADKKDATNQGPSEKNGSFPNNSIC
SEKQGISLWKKHSTMEMSTIEPPTMESSTMGPSTMESLTTEPSMSLAALEIRVSLREVGE
STCDFQEEEEKSGWLSNGHSFILEQ

>gi568815592f:38948841_39186070|GENSCAN_predicted_CDS_5|618_bp
atgaccttgggtcctcagatcaaccagcccgaggaacatctcgccaatttcaaatcaggg
actacagtcaatatttatactgactctaaatatgtcttccatatcctgcaccaccttgct
gttatatgggctgaaacaggtttcctcactatgcaatggtcttccatcattaatgcctct
ttaataaaaactcgtctcaaggctgctttgcttccaaagaaagctggagttattcactgt
gagggccatcaaaaggcatcagatcccatcgctcagggcaacgcttatgctgataagaaa
gatgctacaaaccaagggccgagtgagaagaatggttcatttccgaacaattcaatctgc
tccgagaaacaaggaattagcctgtggaagaaacactccaccatggaaatgtctaccata
gaacccccaaccatggaatcctccaccatggggccctccaccatggaatccctcaccaca
gagccttccatgtctttagcagctctggaaattcgagtgagtttaagggaagtaggggag
agtacctgtgacttccaggaggaagaggagaaatcaggttggctgtccaatggacacagt
tttattctggaacagtga

>gi568815592f:38948841_39186070|GENSCAN_predicted_peptide_6|165_aa
MKKPVGAKDPAVLLTLPPISNNIPLRPGSYGAAASRGTFSNNTSGFQSGLIHPHSQETFG
NLWEGVGVPLASHGSSPGMLLNVLQCPDKTSMESRGPKRQPLPAKAGMSCPDSPQGPSPW
NLLVLAVLAFSESDNLPSVFLHLRDTSLVTMGGTIVSCEGLEFSA

>gi568815592f:38948841_39186070|GENSCAN_predicted_CDS_6|498_bp
atgaaaaagcctgtaggtgcaaaagaccctgctgtgctcctcactttgcccccaatatcc
aacaacatcccactgaggccaggcagctacggtgcagctgcttcccgggggaccttttcc
aacaacaccagtggttttcagtcaggattgattcacccacactcccaggagacgtttggc
aacttgtgggagggagtaggggtgccactggcatctcatgggtcgagcccagggatgctg
ctcaacgtcctacaatgcccagataaaacaagcatggagagcaggggccccaaaaggcag
ccattgccagcaaaggctgggatgagctgcccagacagcccgcagggcccaagcccctgg
aatctgctggtgttggctgttttggccttctcagagtcagataacctgccctctgtcttc
ctccatcttcgtgacacctccctggtcacaatgggaggaaccatcgtctcctgtgagggc
ctggagttttctgcctag