GENSCAN 1.0 Date run: 6-Nov-116 Time: 07:07:01 Sequence gi568815583r:83157445_83368234 : 210790 bp : 40.88% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.04 Intr - 129 54 76 0 1 85 54 87 0.434 3.17 1.03 Intr - 597 459 139 2 1 77 95 119 0.401 11.05 1.02 Intr - 6631 6555 77 1 2 69 111 34 0.136 1.19 1.01 Init - 9286 9281 6 1 0 84 87 0 0.086 0.44 1.00 Prom - 19781 19742 40 -5.05 2.00 Prom + 26014 26053 40 -3.85 2.01 Init + 49189 49235 47 0 2 73 80 53 0.429 3.21 2.02 Intr + 49442 49665 224 2 2 54 11 209 0.448 6.65 2.03 Intr + 50812 50963 152 2 2 90 46 156 0.941 10.56 2.04 Term + 51901 51957 57 0 0 111 34 128 0.989 6.41 2.05 PlyA + 54319 54324 6 1.05 3.00 Prom + 58942 58981 40 -6.15 3.01 Sngl + 64868 65290 423 1 0 60 46 346 0.969 23.64 3.02 PlyA + 65989 65994 6 1.05 4.04 PlyA - 68458 68453 6 1.05 4.03 Term - 86339 86191 149 0 2 105 48 83 0.702 3.08 4.02 Intr - 86520 86424 97 0 1 45 80 57 0.571 -0.74 4.01 Init - 94496 94446 51 2 0 91 93 83 0.901 8.32 4.00 Prom - 97422 97383 40 -3.45 5.07 PlyA - 98479 98474 6 1.05 5.06 Term - 100682 99998 685 1 1 82 42 459 0.963 32.81 5.05 Intr - 107371 105507 1865 1 2 46 110 1261 0.988 110.47 5.04 Intr - 109627 109392 236 2 2 74 92 214 0.045 16.88 5.03 Intr - 118970 118775 196 2 1 89 12 106 0.017 1.17 5.02 Intr - 126976 126764 213 1 0 -23 47 193 0.042 2.09 5.01 Init - 127184 127086 99 2 0 42 99 250 0.757 19.81 5.00 Prom - 129373 129334 40 -8.45 6.06 PlyA - 129691 129686 6 1.05 6.05 Term - 131513 131301 213 2 0 77 38 111 0.321 1.25 6.04 Intr - 133388 133365 24 2 0 67 115 24 0.296 0.30 6.03 Intr - 135664 135517 148 0 1 47 37 155 0.098 5.62 6.02 Intr - 142288 142216 73 2 1 38 84 90 0.023 1.15 6.01 Init - 170213 169991 223 2 1 85 119 39 0.450 5.46 6.00 Prom - 171774 171735 40 -3.65 7.05 PlyA - 172835 172830 6 1.05 7.04 Term - 179288 179119 170 0 2 66 49 101 0.392 1.06 7.03 Intr - 199443 199362 82 0 1 79 82 71 0.008 3.89 7.02 Intr - 202134 201948 187 2 1 77 6 155 0.028 4.97 7.01 Init - 205178 205165 14 2 2 62 99 36 0.343 0.48 7.00 Prom - 208907 208868 40 -4.15 8.02 PlyA - 208993 208988 6 1.05 8.01 Term - 209678 209634 45 2 0 117 45 97 0.629 4.53 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 74981 75029 49 1 1 77 89 38 0.926 1.96 S.002 Term + 77911 78017 107 2 2 92 43 88 0.907 2.29 S.003 Init - 109601 109392 210 2 0 53 92 235 0.862 19.23 S.004 Term + 126202 126649 448 1 1 77 51 295 0.813 18.40 S.005 Term - 199408 199212 197 2 2 8 38 178 0.803 1.39 S.006 Intr - 202134 201986 149 2 2 77 84 133 0.834 10.76 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815583r:83157445_83368234|GENSCAN_predicted_peptide_1|100_aa MIIDELPEGAVKPPANKYPIFFFGTHETAFLGPKDLFPYKEYKDKFGKSNKRKGFNEGLW EIENNPGVKFTGYQAIQQQSSSETEGEGGNTADASSEEEX >gi568815583r:83157445_83368234|GENSCAN_predicted_CDS_1|300_bp atgattattgatgaactcccagagggcgctgtgaagcctccagcaaacaagtatcctatc ttcttttttggcacccatgaaactgcatttctaggtcccaaagacctttttccatataag gagtacaaagacaagtttggaaagtcaaacaaacggaaaggatttaacgaaggattgtgg gaaatagaaaataacccaggagtaaagtttactggctaccaggcaattcagcaacagagc tcttcagaaactgagggagaaggtggaaatactgcagatgcaagcagtgaggaagaagnn >gi568815583r:83157445_83368234|GENSCAN_predicted_peptide_2|159_aa MEVERHKKLAGPTSQRIYCPPALAGSRTAAHPAAVPAPTFARDPGYPPMTLPGAEASVAG PLPGGVASRSAPAFQEEGSVGPAGAGPAFETSHFRGCSLVSGGQRRSRPGGRRREGTLCW APSCRKEHRSGVSDLIGFLPQNLPYAAAMMSRIAHEAVE >gi568815583r:83157445_83368234|GENSCAN_predicted_CDS_2|480_bp atggaggtggaaaggcacaagaagcttgctggacccaccagtcagaggatttactgcccg ccggcactcgccggctcgcggacagccgctcaccccgctgccgtacctgccccgacgttc gcccgggacccagggtatccaccaatgaccctgcccggagccgaagcctccgtcgcaggc ccgctgcccggcggcgtggcttcccggtcggcaccggctttccaggaggaaggcagcgtc gggcctgcgggggccggacccgccttcgaaaccagccacttccgcggctgctctctggtg agtgggggacaaaggcgctccaggcctgggggacggcggcgcgaggggaccctctgctgg gcaccaagctgccggaaagagcaccgctccggagtcagcgaccttattggattcttgccc cagaacctaccgtatgcagctgccatgatgtcacgcattgctcatgaagctgtagaatga >gi568815583r:83157445_83368234|GENSCAN_predicted_peptide_3|140_aa MKLPEERSGSNICCSAVFAVLQPLLVTPRKTGSGVDLQQTPADLQLRVLSVRRKTNRKDI HTKTPSVRHHHQRPKVDKTTKMGRNQNRKGDNSKNQSASSPPKECSCLPVTEQSWTENDF EKLREEGFRQSNFSELKEDV >gi568815583r:83157445_83368234|GENSCAN_predicted_CDS_3|423_bp atgaagcttccagaggaacgatcaggcagcaacatttgctgttccgcagtattcgctgtt ctgcagcctctgctggtgacacccaggaaaacagggtctggagtggacctccagcaaact ccagcagacctgcagctgagggtcctgtctgttagaaggaaaactaacagaaaggacatc cacaccaaaaccccatctgtacgtcaccatcatcaaagaccaaaggtagataaaaccaca aagatggggagaaaccagaatagaaaaggtgacaattctaaaaatcagagtgcctcttct cctccaaaggaatgcagctgcttgccagtaacagaacaaagctggacagagaatgacttt gagaagttgagagaagaaggcttcagacaatcaaacttctccgagctaaaggaggatgtt tga >gi568815583r:83157445_83368234|GENSCAN_predicted_peptide_4|98_aa MPGLSRLTPALAPQSPLNQSRCPFLLHFPLSLFLSWVNGPKRGGLLEFLARTPLGSSKAS QVRTLLAADRDSLRSGCAVGCTLNSFPKDREKNLSHNA >gi568815583r:83157445_83368234|GENSCAN_predicted_CDS_4|297_bp atgcctggcctcagcaggctgaccccagcccttgctccacagtctccgctgaaccagtcc cgctgtcctttcctccttcactttccactctccctcttcctgagctgggtcaatggacca aaacgtggaggcttgcttgagtttctggctcggaccccactcggttcttccaaagctagt caagtgaggactctgctggccgctgacagggacagcctgaggtcaggatgtgcagtaggg tgtaccctgaatagcttcccgaaggacagggagaaaaacctatcgcacaatgcttga >gi568815583r:83157445_83368234|GENSCAN_predicted_peptide_5|1097_aa MRRRPPSRGGRGAARARETRRQPRHRSGRRMAEGQLATCAAARPRGAAGGTVVSGREAAA GGLGLWDPSLPDVPELANGLRAGSAGASEVPAGQPVEIGGGTRRIIPVEGLPSIRPSAKC WFGVLVVSLPLAGMIHCEATFLGAFTLDIFGSSVQSRTTRAQKTEELTCSLSKLRIPPMY PTSQVEIVQSNVVFDISSLMLYGTQAIPVRLKILLDRLFSVLKQDEVLQILHALDWTLQD YIRGYVLQDASGKVLDHWSIMTSEEEVATLQQFLRFGETKSIVELMAIQEKEEQSIIIPP STANVDIRAFIESCSHRSSSLPTPVDKGNPSSIHPFENLISNMTFMLPFQFFNPLPPALI GSLPEQYMLEQGHDQSQDPKQEVHGPFPDSSFLTSSSTPFQVEKDQCLNCPDAITKKEDS THLSDSSSYNIVTKFERTQLSPEAKVKPERNSLGTKKGRVFCTACEKTFYDKGTLKIHYN AVHLKIKHKCTIEGCNMVFSSLRSRNRHSANPNPRLHMPMNRNNRDKDLRNSLNLASSEN YKCPGFTVTSPDCRPPPSYPGSGEDSKGQPAFPNIGQNGVLFPNLKTVQPVLPFYRSPAT PAEVANTPGILPSLPLLSSSIPEQLISNEMPFDALPKKKSRKSSMPIKIEKEAVEIANEK RHNLSSDEDMPLQVVSEDEQEACSPQSHRVSEEQHVQSGGLGKPFPEGERPCHRESVIES SGAISQTPEQATHNSERETEQTPALIMVPREVEDGGHEHYFTPGMEPQVPFSDYMELQQR LLAGGLFSALSNRGMAFPCLEDSKELEHVGQHALARQIEENRFQCDICKKTFKNACSVKI HHKNMHVKEMHTCTVEGCNATFPSRRSRDRHSSNLNLHQKALSQEALESSEDHFRAAYLL KDVAKEAYQDVAFTQQASQTSVIFKGTSRMGSLVYPITQVHSASLESYNSGPLSEGTILD LSTTSSMKSESSSHSSWDSDGVSEEGTVLMEDSDGNCEGSSLVPGEDEYPICVLMEKADQ SLASLPSGLPITCHLCQKTYSNKGTFRAHYKTVHLRQLHKCKVPGCNTMFSSVRSRNRHS QNPNLHKSLASSPSHLQ >gi568815583r:83157445_83368234|GENSCAN_predicted_CDS_5|3294_bp atgcggcggcgcccgccgagccggggcggacgcggggcggcccgggcccgggagacgcgc cggcagccccggcaccgcagcggtcgcaggatggccgagggccagctggctacctgcgcg gcagcgcggccccgaggcgcggcgggagggaccgtggtctccggccgggaggcggcggcg ggggggttggggctctgggacccgtccctgcccgacgtcccggagctggcaaacggtctc cgcgccggatctgcgggcgcgtcggaggtgccggcgggccagcctgtggaaatcggcggc ggcactcggcggataataccagtggagggcttaccaagtatcaggcccagtgctaagtgt tggtttggagttttagttgtttcattgcctcttgcaggaatgattcattgtgaagcaact tttttaggggcttttaccttagatatttttggtagctctgtgcagagtcggacaaccaga gcccagaagaccgaagaacttacttgttctctaagtaagctaaggatcccccccatgtat ccaacaagccaggtggagattgtccagtccaatgtagtgtttgatattagcagcctcatg ctctatgggacccaggccatccccgttcgcctaaaaatcctactggaccggctcttcagt gtgttgaagcaagatgaggttctccagatcctccatgccttggactggacacttcaggat tatatccgtggatacgtactgcaggatgcatcaggaaaggtgttggatcactggagcatc atgaccagtgaggaagaagtggccaccttgcagcagttccttcgttttggagagaccaaa tctatagttgaactcatggcaattcaagagaaagaagagcaatccatcatcataccacct tccacagcaaatgtagatatcagggctttcatcgagagctgcagtcacaggagttctagc ctccccactcctgtggacaaaggaaaccccagcagtatacacccctttgagaacctcata agcaacatgactttcatgctgcctttccagttcttcaaccctctgcctcctgcactgata gggtcattgcccgaacaatatatgttggagcagggtcatgaccaaagtcaggaccccaaa caggaagtccatgggcccttccctgacagcagcttcttaacttccagttccacaccattt caggttgaaaaagatcagtgtttaaactgtccggatgctattactaaaaaagaagacagc acccatttaagtgactccagctcatacaacattgtcactaagtttgaaaggacacagtta tcccctgaggccaaagtgaagcctgagaggaatagccttggtacaaagaagggccgggtg ttctgcactgcatgtgagaagaccttctatgacaaaggcaccctcaaaatccactacaat gccgtccacttgaagatcaagcataagtgcaccatcgaagggtgtaacatggtgttcagc tccctaaggagccggaatcgccatagcgccaaccccaaccctcggctgcacatgccaatg aacagaaataaccgggacaaagacctcaggaacagcctgaacctggccagctctgagaac tacaagtgcccaggtttcacagtgacgtccccagactgtaggcctcctcccagctaccct ggttcaggagaggattccaaaggccaaccagccttcccaaacattgggcaaaatggtgtg ctttttcccaacctaaagacagtccagccagtccttcctttctaccgcagtccagccacg cctgccgaggtagcaaacacgcctgggatactcccttccctcccgctgttgtcctcttca atcccagaacagctcatttcaaacgaaatgccatttgatgcccttcccaagaagaaatcc aggaagtccagtatgcctatcaaaatagagaaagaagctgtggaaatagctaatgagaaa agacacaacctcagctcagatgaagacatgcccctacaggtggtcagtgaagatgagcag gaggcctgcagtcctcagtcacacagagtatctgaggagcagcatgtacagtcaggaggc ttagggaagcctttccctgaaggggagaggccctgccatcgtgaatcagtaattgagtcc agtggagccatcagccaaacccctgagcaggccacacacaattcagagagggagactgag cagacaccagcattgatcatggtgccaagggaggtcgaggatggtggccatgaacactac ttcacacctgggatggaaccccaagttcctttttctgactacatggaactgcagcagcgc ctgctggctgggggactcttcagtgctttgtccaacaggggaatggcttttccttgtctt gaagattctaaagaactggagcacgtgggtcagcatgcattagcaaggcagatagaagaa aatcgcttccagtgtgacatctgcaagaagacctttaaaaatgcttgtagtgtgaaaatt catcacaagaatatgcatgtcaaagaaatgcacacatgcacagtggagggctgtaatgct acctttccctcccgcaggagcagagacagacacagctcaaacctaaacctccaccaaaaa gcattgagccaggaagcattggagagtagtgaagatcatttccgtgcagcttaccttctg aaagatgtggctaaggaagcctatcaggatgtggcttttacacagcaagcctcccagaca tctgtcatcttcaaaggaacaagtcgaatgggcagtctggtttacccaataacgcaagtc cacagtgccagcctggagagctacaactctggccccttgagcgagggcaccatcctggat ttgagcactacctcgagcatgaagtcagagagtagcagccattcttcctgggactctgac ggggtgagtgaggaaggcactgtgcttatggaggacagtgatgggaactgtgaagggtcg agccttgtccctggggaagatgagtaccccatctgtgtcctgatggagaaggctgaccag agccttgctagcctgccttctgggttgcccataacctgtcatctctgccaaaagacatac agtaacaaagggacctttagggcccactacaaaactgtgcacctccggcagctccacaaa tgcaaagtaccaggctgcaacaccatgttttcgtctgttcgcagtcgaaacagacacagc cagaatcccaacctgcacaaaagcctggcctcatctccaagtcacctccagtaa >gi568815583r:83157445_83368234|GENSCAN_predicted_peptide_6|226_aa MGLSAWLHIGDFKYSHGHMIQRGWLSSWPMIQFLPYSHLPLGHCCSLASLPGIEEKKERR ANICLSIWLPISNSDPCNKKLIPLANSQPEPDAFLPTAIKMETSPPKAIVSYGIQLKEEG GDMLLFPTGPKFQSFKCVVNPTKDKDLQNNLPVKLKALCEKLRNRNNLLPPKPWATFGMA SCPLPQKGAALELKRGNCDTCARFVVGQPVPQQTAAWGGQFGPSAA >gi568815583r:83157445_83368234|GENSCAN_predicted_CDS_6|681_bp atgggattatcggcctggctccacattggtgacttcaagtacagccatggccacatgatt cagaggggatggctatccagttggcccatgatccagtttcttccttactctcacctgcca cttggccactgttgttcattagcatctctcccaggaatagaggagaaaaaggaaaggaga gccaatatttgcttgtccatctggctccccatcagcaactctgacccatgtaacaagaag ctgatacctctggcaaacagccagccagaacctgatgccttcctgccaacagccataaaa atggaaacatctcccccaaaggctattgtcagttatggaattcaacttaaggaggaaggt ggtgacatgctcctctttcccactggaccaaaatttcaaagcttcaaatgtgtggtaaat ccaaccaaagataaagatttgcagaataatcttccagtaaaactgaaggccttgtgtgag aaattgaggaacaggaacaatcttttgccaccaaaaccctgggccacatttgggatggcc tcatgtcctctccctcaaaagggagcagccctggagctgaagaggggaaattgtgacacc tgtgccaggtttgtggtaggccagcctgtcccccagcagacagctgcgtgggggggccag tttggcccaagtgctgcttag >gi568815583r:83157445_83368234|GENSCAN_predicted_peptide_7|150_aa MVLRSCEGARNVKFFAKRFKYWRLTNIPFLATPSGKEDAVMSKTVLGKRAQEMKCKSLLL GDAGSKQALPFIEDMIALRESQPEKRTVEEGRGEEPGKYLLNESMNEWKNTDNHFQRRGN DVFSFGRGRLKESVGYLSGEISMRPLYLQV >gi568815583r:83157445_83368234|GENSCAN_predicted_CDS_7|453_bp atggtgctgcggagttgtgagggagcaagaaatgtaaaattttttgcaaaacgtttcaaa tactggagactcactaacatcccatttctggcaacacctagcggaaaggaggacgcagta atgagcaaaaccgtcctggggaaaagagcacaggagatgaagtgtaagtcgttactcctc ggtgatgcagggtccaagcaagccttgcctttcatagaagacatgattgctttaagagag tcccaaccagagaagaggacggtggaggagggaagaggggaagagcctgggaaatattta ttaaatgaatcaatgaacgaatggaagaatacggacaatcattttcaaaggagaggaaat gatgtgttcagttttggacgtggtaggcttaaggaatctgtgggatatctgagtggagaa atatccatgagaccgttgtatctgcaggtctga >gi568815583r:83157445_83368234|GENSCAN_predicted_peptide_8|14_aa PTQREDDEDEGLPF >gi568815583r:83157445_83368234|GENSCAN_predicted_CDS_8|45_bp cctactcagcgtgaagatgacgaggatgaaggcttgcctttttga