GENSCAN 1.0 Date run: 6-Nov-116 Time: 19:38:08 Sequence gi568815575f:46737101_46979766 : 242666 bp : 41.56% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Term + 6729 6867 139 0 1 104 42 126 0.363 6.05 1.02 PlyA + 8779 8784 6 1.05 2.03 PlyA - 10284 10279 6 1.05 2.02 Term - 12061 11955 107 2 2 80 49 73 0.574 0.19 2.01 Init - 21929 21362 568 2 1 109 20 486 0.295 37.17 2.00 Prom - 39901 39862 40 -5.75 3.00 Prom + 43540 43579 40 -8.25 3.01 Init + 43604 43733 130 1 1 39 119 131 0.628 11.66 3.02 Intr + 46994 47096 103 0 1 24 115 76 0.078 2.21 3.03 Intr + 83042 83162 121 2 1 -35 72 146 0.083 0.28 3.04 Term + 84038 84454 417 1 0 -24 43 246 0.080 3.19 3.05 PlyA + 84644 84649 6 1.05 4.00 Prom + 88917 88956 40 -3.45 4.01 Init + 100001 100102 102 1 0 103 90 173 0.121 19.29 4.02 Intr + 116376 117041 666 2 0 112 94 441 0.694 38.10 4.03 Intr + 122888 123002 115 1 1 76 77 82 0.924 5.00 4.04 Intr + 140405 140490 86 0 2 54 82 77 0.436 2.32 4.05 Term + 142586 142669 84 1 0 46 45 94 0.288 -2.43 4.06 PlyA + 142796 142801 6 1.05 5.00 Prom + 147622 147661 40 -4.15 5.01 Init + 153175 153360 186 0 0 76 108 73 0.376 7.21 5.02 Intr + 166668 166854 187 2 1 0 96 133 0.024 3.64 5.03 Term + 169952 170091 140 0 2 12 42 146 0.021 -0.46 5.04 PlyA + 171295 171300 6 1.05 6.05 PlyA - 172270 172265 6 1.05 6.04 Term - 176153 175929 225 2 0 97 42 244 0.756 16.50 6.03 Intr - 177703 177512 192 1 0 34 33 137 0.439 1.57 6.02 Intr - 177831 177768 64 2 1 56 98 65 0.211 2.10 6.01 Init - 209635 209577 59 1 2 62 62 96 0.019 5.23 6.00 Prom - 213649 213610 40 -2.15 7.04 PlyA - 213782 213777 6 1.05 7.03 Term - 224174 224061 114 2 0 51 41 89 0.071 -1.91 7.02 Intr - 229376 229270 107 0 2 88 97 55 0.130 5.41 7.01 Init - 241698 241677 22 0 1 105 88 3 0.417 2.08 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 100395 99948 448 2 1 65 40 344 0.874 21.00 S.002 Init - 212436 212373 64 0 1 80 85 86 0.925 8.86 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815575f:46737101_46979766|GENSCAN_predicted_peptide_1|46_aa XIPSSATFIIGTNICTKSSKQPCMSHPSSKCNSVPDVKVTFRFSSK >gi568815575f:46737101_46979766|GENSCAN_predicted_CDS_1|141_bp nnaatcccaagcagtgccacattcatcataggtacgaacatctgcaccaagtcctcaaaa cagccctgtatgtcacacccaagttccaagtgcaactcagtgcctgatgtaaaagttaca ttcaggttcagcagcaaataa >gi568815575f:46737101_46979766|GENSCAN_predicted_peptide_2|224_aa MEPGDAARPGSGRATGAPPPRLLLLPLLLGWGLRVAAAASASSSGAAAEDSSAMEELATE KEAEESHRQDSVSLLTFILLLTLTILTIWLFKHRRVRFLHETGLAMIYGEHPCYTPRPPH PSAAALLLAPRGAPPSLANVFRFFYVSRCDGVGSQGQLSAFSGSLLPLKTIPPLFHVSAL GTQENERDRVLMPQLHTDSEVSEKETFVCENICLNCSRLFKEGG >gi568815575f:46737101_46979766|GENSCAN_predicted_CDS_2|675_bp atggagcctggtgacgcggcgcgccctggctcgggtcgggctaccggggcgccgccgccg cggctgctgctgctgccgctgctgctgggttgggggctgcgagtcgcggccgcggcctcg gcctcctcctctggggcggcggcggaggacagcagcgccatggaggagctcgctactgag aaggaggcggaggagagccaccggcaagacagcgtgagcctgctcaccttcatcctgctg ctcacgctcaccatcctcaccatctggctcttcaagcaccgccgggtgcgctttctgcac gagaccgggctggccatgatctatggtgagcacccctgttacaccccccgccctccacac ccctcggccgccgcgctcctcctggcgccgcgcggtgctcccccttcacttgccaacgtg ttccgtttcttttacgtttcccgctgcgacggtgttggttcccaaggtcaactgtctgca ttttctggctctctgctaccgttaaaaacaattccacccctattccacgtttctgcattg ggaacccaggaaaatgaaagggacagagttcttatgcctcagcttcacactgacagtgaa gtttctgagaaggaaacctttgtgtgtgaaaacatttgcctgaattgcagcagactgttt aaggaaggaggatga >gi568815575f:46737101_46979766|GENSCAN_predicted_peptide_3|256_aa MTGRLQAEEREKPVVAQSESKNLKTREADSAAPSRRLKGLRVPVSSVWYSSELSPHSPNT RTLSHSQTHEGHRPGEERVSPSPDSIHQELTEEPLGLKEASAVAWQYSLWACDGGGHGNQ VEILELKNAIGILKNASESFNSRIDQAEERISELEDRLYEETEEKRIKNNDAHLQDLENS LKRANLRAIGLKEEVEKEIGIENLPKGIITENIPSLEKYTNIQVQEGYRIPSRFNPKKTT PRHLIIKLPKIKDKGP >gi568815575f:46737101_46979766|GENSCAN_predicted_CDS_3|771_bp atgacaggccgtctgcaagctgaggaaagagagaagccggtagtggctcagtcggagtcc aaaaacctcaaaaccagggaagccgacagtgcagcccccagtcggaggctgaagggcctg agagtccccgtctccagtgtctggtattcctctgagctgtctccccacagccccaataca cgtacactctcacattctcaaacacatgaagggcacaggcctggagaagaaagagtgagt cccagtccagacagcattcaccaagagttgactgaagagcccttgggccttaaggaagca tcagcagtagcttggcagtactccctgtgggcctgtgatggtggtggccatgggaatcaa gtagaaattctggagctgaaaaatgcaattggcatattgaagaatgcatcagagtctttt aatagcagaattgatcaagcagaagaaagaattagtgagcttgaagacaggctatatgag gagacagaagaaaaaagaataaaaaacaatgatgcacacctacaagatctagaaaatagc ctcaaaagggcaaatctaagagctattggccttaaagaggaggtagagaaagagataggg atagagaatttacccaaagggataataacggagaacatcccaagcctagagaaatatacc aatatccaagtacaagaaggttatagaataccaagcagatttaacccaaagaagactacc ccaaggcatttaataatcaaactcccaaagatcaaggataaaggaccctaa >gi568815575f:46737101_46979766|GENSCAN_predicted_peptide_4|350_aa MGCFFSKRRKADKESRPENEEERPKQYSWDQREKVDPKDYMFSGLKDETVGRLPGTVAGQ QFLIQDCENCNIYIFDHSATVTIDDCTNCIIFLGPVKGSVFFRNCRDCKCTLACQQFRVR DCRKLEVFLCCATQPIIESSSNIKFGCFQWYYPELAFQFKDAGLSIFNNTWSNIHDFTPV SGELNWSLLPEDAVVQDYVPIPTTEELKAVRVSTEANRSIVPISRGQRQKSSDESCLVVL FAGDYTIANARKLIDEMVGKGFFLVQTKEVSMKAEDAQRVFREKAPDFLPLLNKGPVIAL EFNGDGAVEVCQLIVNEIFNGTKMFVSESKETASGDVDSFYNFADIQMGI >gi568815575f:46737101_46979766|GENSCAN_predicted_CDS_4|1053_bp atgggctgcttcttctccaagagacggaaggctgacaaggagtcgcggcccgagaacgag gaggagcggccaaagcagtacagctgggatcagcgcgagaaggttgatccaaaagactac atgttcagtggactgaaggatgaaacagtaggtcgcttacctgggacggtagcaggacaa cagtttctcattcaagactgtgagaactgtaacatctatatttttgatcactctgctaca gttaccattgatgactgtactaactgcataatttttctgggacccgtgaaaggcagcgtg tttttccggaattgcagagattgcaagtgcacattagcctgccaacaatttcgtgtgcga gattgtagaaagctggaagtctttttgtgttgtgccactcaacccatcattgagtcttcc tcaaatatcaaatttggatgttttcaatggtactatcctgaattagctttccagttcaaa gatgcagggctaagtatcttcaacaatacatggagtaacattcatgactttacacctgtg tcaggagaactcaactggagccttcttccagaagatgctgtggttcaggactatgttcct atacctactaccgaagagctcaaagctgttcgtgtttccacagaagccaatagaagcatt gttccaatatcccggggtcagagacagaagagcagcgatgaatcatgcttagtggtatta tttgctggtgattacactattgcaaatgccagaaaactaattgatgagatggttggtaaa ggctttttcctagttcagacaaaggaagtgtccatgaaagctgaggatgctcaaagggtt tttcgggaaaaagcacctgacttccttcctcttctgaacaaaggtcctgttattgccttg gagtttaatggggatggtgctgtagaagtatgtcaacttattgtaaacgagatattcaat gggaccaagatgtttgtatctgaaagcaaggagacggcatctggagatgtagacagcttc tacaactttgctgatatacagatgggaatatga >gi568815575f:46737101_46979766|GENSCAN_predicted_peptide_5|170_aa METTTYLFSLLKVPLSMTVVQPSAMESSQSRLTFLLGGGVGGEKEKRKSELPLPLRPALP QQTGLSGSPSGTREALSYHAGQQPRARGSPSHTRQPEEACSALLAASAYTQHGVQTLVPD PAAADPAGIERKIEKFYQEPYIYKFNDLDGMDQFLETTNYQNSTNMKFII >gi568815575f:46737101_46979766|GENSCAN_predicted_CDS_5|513_bp atggagacaaccacttacctctttagtcttcttaaagttcctctttctatgactgtggta cagcctagtgccatggaatcttcccaatccagattaacattccttctgggtgggggggtt ggaggagaaaaagagaagagaaaatctgagcttcctttacccttaagaccagcattacca cagcaaacaggattgagtgggagtcccagcggtaccagggaagcactttcctaccacgct gggcaacagccaagagctagggggagccccagtcacaccaggcagccggaggaggcgtgc tcagccctgctggcagcatcagcgtatacccagcacggagtccagacgcttgtccccgac ccagcagcagcagatcctgcaggcattgaaaggaaaatagagaaattctatcaagaaccc tacatatataaattcaacgacttagatggaatggaccaattccttgaaactacaaactac caaaactcgaccaatatgaaatttatcatctga >gi568815575f:46737101_46979766|GENSCAN_predicted_peptide_6|179_aa MMSLIMEFESHARKSALAIRYDASPAPSPVLLLGISPTGLKIHIALPRASNMTQRRGERE SVNSRARLAPLSIPELTGQKARQQYWVKSYSLANKSIKKSTTGEKALGSFRPPTAKQPGR ALAPWSIRPLAADRAGPPATGLAPSAPKPGKGQERRRGRQEGAPEERAATVMALNSCPV >gi568815575f:46737101_46979766|GENSCAN_predicted_CDS_6|540_bp atgatgtcactcatcatggaatttgaaagccatgcacgcaaatcggctcttgcaatccgc tatgatgccagccctgcaccatcacctgtcctgctcctgggaatctcccctacgggcctg aagattcacatcgcattgccgagggcaagcaacatgactcagagaagaggggagagagaa agtgtgaattctagagccagacttgcacccctgagtattccagaattgactggacagaaa gcaaggcaacaatattgggttaaaagctattctctagcaaacaaaagcataaagaagagc acaactggagaaaaggcactcggttctttccgccccccaacagctaaacagccagggcgc gccctcgctccctggagcatccgacccctggcggccgacagagcaggtcctcccgccacg ggtctggccccgtcagccccgaagccaggcaaagggcaggaacggaggagggggcgccag gagggggcaccggaggaaagagccgcaacagttatggcgctcaattcctgcccggtgtag >gi568815575f:46737101_46979766|GENSCAN_predicted_peptide_7|80_aa MATRGQSGDGGSLKCPTHMLMAPKTRSSGPELEIERKGERDRKKYSQGGTKSHMFTLTGP IKELNKDLMKELRLEFGKCG >gi568815575f:46737101_46979766|GENSCAN_predicted_CDS_7|243_bp atggccaccagggggcagtcaggtgatggtggttccctgaagtgccctacccacatgctt atggctcccaaaacaaggtcctctggtccagagctggagatagaaaggaaaggcgaaaga gacaggaagaagtacagccaaggaggaactaagtctcacatgttcactctcacagggccc atcaaagaactcaataaagatctgatgaaggaacttaggctggaatttggaaaatgtgga taa