GENSCAN 1.0 Date run: 8-Nov-116 Time: 07:20:22 Sequence gi568815597f:144492943_144693434 : 200492 bp : 39.75% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 764 1095 332 1 2 57 48 149 0.166 4.64 1.02 Intr + 21124 21280 157 1 1 14 92 149 0.565 6.99 1.03 Intr + 26977 27141 165 0 0 61 3 147 0.073 2.74 1.04 Intr + 30920 31071 152 1 2 27 102 81 0.046 1.44 1.05 Intr + 38586 38782 197 0 2 13 110 136 0.017 6.44 1.06 Intr + 46856 47063 208 0 1 82 68 182 0.228 12.81 1.07 Intr + 52065 52247 183 0 0 79 64 80 0.289 2.68 1.08 Term + 53669 53840 172 2 1 46 41 198 0.377 7.22 1.09 PlyA + 55382 55387 6 1.05 2.00 Prom + 56781 56820 40 -7.35 2.01 Init + 56852 56899 48 1 0 79 111 16 0.066 4.00 2.02 Intr + 58209 58239 31 2 1 108 94 56 0.038 4.99 2.03 Intr + 61231 61429 199 2 1 24 12 148 0.008 -1.71 2.04 Intr + 67106 67136 31 2 1 100 94 43 0.286 3.31 2.05 Term + 67916 68221 306 1 0 65 45 225 0.319 10.03 2.06 PlyA + 71214 71219 6 1.05 3.05 PlyA - 71259 71254 6 1.05 3.04 Term - 75133 74671 463 0 1 9 43 250 0.553 6.14 3.03 Intr - 75388 75289 100 2 1 99 51 77 0.921 3.35 3.02 Intr - 83378 83145 234 0 0 43 86 121 0.040 4.14 3.01 Init - 97365 97287 79 0 1 86 25 54 0.060 0.17 3.00 Prom - 97451 97412 40 -3.25 4.00 Prom + 99925 99964 40 -3.95 4.01 Sngl + 100001 100495 495 1 0 86 39 606 0.998 51.30 4.02 PlyA + 100692 100697 6 1.05 5.00 Prom + 103039 103078 40 -4.85 5.01 Init + 139693 139849 157 0 1 29 91 99 0.321 4.42 5.02 Intr + 148695 148851 157 1 1 75 100 85 0.613 6.55 5.03 Intr + 149363 149759 397 2 1 47 65 428 0.788 30.06 5.04 Intr + 159305 159469 165 1 0 72 13 222 0.543 12.34 5.05 Term + 159502 159906 405 0 0 57 46 558 0.837 42.70 5.06 PlyA + 161138 161143 6 1.05 6.00 Prom + 161746 161785 40 -4.25 6.01 Init + 165993 166086 94 2 1 28 76 96 0.733 2.99 6.02 Intr + 168307 168377 71 2 2 91 44 36 0.490 -2.52 6.03 Intr + 171145 171261 117 0 0 77 60 97 0.907 5.54 6.04 Intr + 176794 176895 102 0 0 101 36 91 0.640 4.65 6.05 Intr + 177101 177192 92 1 2 -21 92 105 0.057 -2.13 6.06 Intr + 184410 184519 110 0 2 44 95 117 0.112 7.01 6.07 Intr + 186360 186572 213 1 0 34 75 176 0.806 8.66 6.08 Intr + 191922 191993 72 1 0 71 91 49 0.702 1.96 6.09 Intr + 196850 197036 187 0 1 67 89 196 0.482 15.43 6.10 Intr + 197183 197572 390 2 0 9 39 607 0.473 40.81 6.11 Intr + 197875 198213 339 1 0 65 -17 341 0.975 14.96 6.12 Term + 198246 198543 298 0 1 33 54 466 0.990 31.45 6.13 PlyA + 198798 198803 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 58337 58198 140 0 2 105 48 140 0.906 8.84 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597f:144492943_144693434|GENSCAN_predicted_peptide_1|521_aa MKCLLIEWLKYRQMQLLSMKGILLAPCTVSSENSCFSAQLQHFSHHKEHKALASISTLKL SSPICVPSAACSGIEEGRAGLQSLLFFLGSVNGKGSCEGIQIDTLGSGVRSLQHCTWESV KELTVYVIGVPEEENRERGAEQLMRGKPTTYNQEYCRLLIGIYATAGCGQGSQQHPLASA ASHNASPQVAKRSVQVSGREAERSSEGRGCGREEEVGRKKARTQFPTTTNYSPVSPHVEV TGVSQHILSTMEKSPPRNTNFMAVFSPVTEVFYNILNSHFGEAVYRMVGSLLPVEAPTLK GAVRTEVSGEGPEPWGLSSGDGDGRRGKAEPEGSEPTRQYNCERPWKTGAMRGVNRVGLI GDPQVGRFWNQDDLAPHLQQKPPETPGPDPDLSRLGAQGKLNARGTRALAKPFPYKEQAC YVYNPMAPVPSTISTWFYELSTCSPQHRNPTNSHECYREKQGLASTTPANPQPPRALLAL THSWDSHVQPVVLDLLLQRGKSFVAKQQPLRCLIYIETPYR >gi568815597f:144492943_144693434|GENSCAN_predicted_CDS_1|1566_bp atgaaatgtttgttgattgaatggctgaagtatagacagatgcagcttctgtccatgaaa ggcatcctgcttgctccttgcactgtgtccagtgaaaactcttgtttctctgcgcagctg caacacttttcacatcacaaggagcacaaggcactggcttccatttccaccctgaagctc agcagccccatttgcgtcccatcagctgcctgctccggaatagaggagggaagggctggg cttcagagtttgcttttttttcttggttctgtgaatggaaaaggatcctgtgaagggatc cagattgataccttggggtctggtgtacgtagtctccagcactgcacttgggaaagcgtc aaggagttgactgtatatgtaattggagtgccagaagaagagaaccgagagagaggggca gaacagctcatgaggggaaaacccactacatacaatcaagaatactgcagactcctcatc ggaatctatgcaactgctgggtgtgggcagggatcccagcagcacccactggcgtcggct gcctcccacaatgccagcccccaggtagccaagaggagtgtccaggtgtcggggagagaa gctgagagaagcagcgagggccgtggttgtgggagagaggaggaagttgggaggaagaaa gcgcgaacgcaattccccaccaccacaaattacagtccagtttccccacatgtggaagta acaggagtcagtcagcacatactgagtacaatggagaaatcgcccccgagaaacaccaac ttcatggccgtattttctcctgttacggaagtcttttataatattcttaattcacatttt ggtgaagcagtatacagaatggtaggctcccttcttcctgtggaagcccctaccttgaaa ggggctgtcagaacagaagtttcaggggaggggccagagccctggggactttcctcaggt gatggtgatggaagaagaggcaaggcagaaccggagggctcagagcccacccggcagtac aactgtgaaaggccttggaaaactggagcgatgagaggggtgaatcgtgttggtctcatt ggagacccgcaggttgggagattctggaaccaggacgaccttgcccctcacctgcagcag aagcccccggaaacacccggccccgacccggacctgagccgcctgggggcccaagggaag ctgaacgcccggggcaccagggcactggcaaagccctttccatacaaagagcaagcgtgt tatgtctacaacccaatggcaccagttccaagtacaatttctacttggttctatgagctg agtacatgttccccccagcacagaaatcctacaaactcccatgaatgctatagggaaaag caggggctagccagcaccactccagccaatccccaaccaccccgagccctcctagcccta acacacagctgggactctcatgtccagccagtggtcctggacttgctcctacagcgcggg aaatccttcgtggcgaagcagcagcccctgcgctgcctcatctacatagaaacgccctat cggtga >gi568815597f:144492943_144693434|GENSCAN_predicted_peptide_2|204_aa MHGDLLHTKINAGAEQGGPQRISAKTGAVRTEVSGKGPEPWGLSSGDGDGRRGKAGQEGS GKKDYAGAGAGGEGAVGGERDKASSIFLNSENVEDRSAFRPRQPHTLHPLHARSLAPRSP TPPSPPSPDTQLGLSGPTSGPESAPTAREILRGEAAGGEAAAPALPHLHRSRPIRDVTDS AFPSPRLPFCRSAYQPAAGAGRGK >gi568815597f:144492943_144693434|GENSCAN_predicted_CDS_2|615_bp atgcatggtgatctcctccacacaaaaataaatgctggtgcagaacagggaggaccgcag cgcatttcggccaagacaggggctgtcagaacagaagtttcagggaaggggccagagccc tggggactttcctcaggtgatggtgatggaagaagaggcaaggcaggacaggaaggctca ggtaagaaagactatgcaggtgcaggggcaggaggagaaggagcagttggaggcgaaaga gacaaagcctcctcaatatttctcaattcagaaaacgtggaggatcgcagcgcatttcgg ccaagacagcctcacacgcttcaccctttacacgcacggtcacttgccccgcgcagcccc accccccccagccctcctagccctgacacacagctgggactctcaggtccgaccagcggt cctgaatccgctcccacggcacgggaaatccttcgtggcgaagcagcaggtggggaagca gcagcccctgcgctgcctcatctacatagaagtcgccctatccgtgatgtcaccgacagt gcctttcccagtccccgtctgcctttctgccgctcagcctaccaacccgctgccggagcc ggcagggggaagtga >gi568815597f:144492943_144693434|GENSCAN_predicted_peptide_3|291_aa MTSHEVKGPGDKMRSPKSKASKFSVSGFKNEEEFAPMYVSISHCFTIIFEIIPRNPTYKG CEGPIQGELQTTAQGNKRGHKQMEEHSMLMDRKNQYRENGHTAQGPDHRPAEPPRLHGAE GALEALPVAAPRGAKKSTGTHEPPSPRSLPWWSPRAEHRIRRWQRKIRSGPKSDWAGRAQ APSLGEGGAKNGKSHPGSHRAFSLPRAPRRFGPGSQPGRFRGFLKQVRGRAGEGHSAILL APESPNAQVSNVTSATYRSNLSGLPRPAQSSCRAGPEDARDGGSFVRAPFL >gi568815597f:144492943_144693434|GENSCAN_predicted_CDS_3|876_bp atgacaagtcatgaagtaaaaggaccaggtgataaaatgaggtctccaaaatccaaagca tcaaagttctcagttagtggatttaaaaatgaagaagagtttgctccaatgtatgtttca atatctcattgttttacaataatatttgaaataatacctaggaacccaacttacaaggga tgtgaaggacctattcaaggagaactacaaaccactgctcaaggaaataaaagaggacac aaacaaatggaagaacattccatgctcatggataggaagaatcaatatcgtgaaaatggc catactgcccaaggacccgaccaccgcccagctgagcccccgcggctccacggcgcagaa ggggcactggaggccctgcccgttgccgccccgcggggtgccaagaagtccacggggacc cacgagccaccctcaccacgatccctgccctggtggagcccccgtgcggaacacaggatc cgaagatggcagcggaagatccgcagcggccccaaaagcgactgggcagggagggcacag gctccctcactgggtgaaggcggcgcaaagaacgggaagagccatcccgggagccaccgg gcgttcagcctccctagggcccccaggcggttcgggccggggtctcaaccggggcgtttc cgggggtttctgaagcaggtgaggggcagggcgggcgaaggccattcggctatccttctg gctccagaatctcccaacgcgcaggtgtccaacgtgaccagcgcgacttaccgctccaat ctctccggtcttccaaggcctgctcagtcgtcctgccgggcgggccctgaggatgcaagg gacggaggaagtttcgtgcgtgcgcccttcctatag >gi568815597f:144492943_144693434|GENSCAN_predicted_peptide_4|164_aa MVNSVVFFEITRDGKPLGRISIKLFADKIPKTAENFRALSTGEKGFRYKGSCFHRIIPGF MCQGGDFTRPNGTGDKSIYGEKFDDENLIRKHTGSGILSMANAGPNTNGSQFFICAAKTE WLDGKHVAFGKVKERVNIVEAMEHFGYRNSKTSKKITIADCGQF >gi568815597f:144492943_144693434|GENSCAN_predicted_CDS_4|495_bp atggtcaactccgtcgtcttttttgaaatcaccagggatggcaagcccttgggccgcatc tccatcaaactgtttgcagacaagattccaaagacagcagaaaactttcgtgctctgagc actggagagaaaggatttcgttataagggttcctgctttcacagaattattccagggttt atgtgtcagggtggtgacttcacacgccctaatggcaccggtgacaagtccatctatggg gagaaatttgatgatgagaacctcatccgaaagcatacaggttctggcatcttgtccatg gcaaatgctggacccaacacaaatggttcccagtttttcatctgcgctgccaagactgag tggttggatggcaagcatgtggcgtttggcaaggtgaaagaacgtgtgaatattgtggaa gccatggagcactttgggtacaggaatagcaagaccagcaagaagatcaccattgctgac tgtggacaattctaa >gi568815597f:144492943_144693434|GENSCAN_predicted_peptide_5|426_aa MLSTIKVFQIYQLRSYWSWTNTPEKKKNKQNKTIPKNLDKMAYRGHRQCIQAGLRFPHLQ DERAGPDGNSGGKGMSSRSPPPPPRDPGASIPVPTTHPLHKSCPRTPVELAAESAAPALR AVAAGTKILGGGGKTPRRRKSRCGRGTKSRDSGAQKAAAGKRPWRVKSRDGKKPRRAKSH GGGGAKSSGGGGGAKSRGSEGAESRGGKKPRRRVCKKLCRRWGKKPRRQRGKKPRRRSHG AFTGCSGSRAPRAVPAGPDQDRGCRGQSSAKGTGPGQAGQGYPKRIEAAGVNEWKPLMKS KLPPPSGRLCSHSWRTGSGAAPGGSAGAAEGFFAGTVQQAARDKTARQRPMARGSSEPES PAARRFSIPGSVQGHLDAVGKSRSGDIGSSLRVEAGDKRTQASPERQPHCGAHDAQGERH EAQEIG >gi568815597f:144492943_144693434|GENSCAN_predicted_CDS_5|1281_bp atgttaagcactattaaagtcttccaaatctaccagttacggagttattggtcttggact aacactcctgaaaagaaaaaaaacaaacaaaataagacaatacctaaaaacctggataaa atggcctaccgtgggcaccggcaatgcatccaagcaggccttcggtttcctcacctacaa gatgagagggctggaccagatggaaattcagggggtaaggggatgtcctcacgcagccca cccccacccccacgggaccctggagcctccatcccagttcccaccacgcacccgctccac aaatcctgcccaaggacgccagtagagctggcagccgagtctgccgctcccgccctcaga gccgtggcggcggggacaaaaatcctcggcggcgggggcaaaacgccgcggcggcgaaaa agtcgctgtggcagggggacaaaaagccgtgacagcggggcgcaaaaagccgcggcgggt aaaaggccgtggcgagtaaaaagccgcgatggcaaaaagccgcggcgggcaaaaagccac ggcggcggtggggcaaaaagcagcggcggtggcggaggggcaaaaagccgcggcagcgag ggggcagaaagccgcggcggcaaaaagccaaggcggcgagtgtgcaaaaagctgtgtcgg cggtggggcaaaaagccgcggcggcagaggggcaaaaagccgcggcggcggagccatggg gccttcacaggctgcagtgggtcccgagcccccagggctgtgcctgctggtcctgaccaa gatcgcggctgccgaggtcagtccagcgccaagggcacagggccagggcaggcggggcag ggctacccgaagcgcatagaggctgctggtgtcaacgagtggaagccgctgatgaagtca aagctgcctcctccttcaggaagactttgctcccatagctggcgaacaggaagcggagca gcgccaggaggatctgcgggcgctgctgagggcttctttgcagggacagtgcagcaggca gccagggacaagactgcacggcagcgccccatggccaggggaagctcagaaccggagtcg cccgctgcccggcgattctccatccctggatcggtacaggggcatttggacgctgtgggg aagtcgcggtctggggatattgggtccagccttcgggtagaagcaggtgataaacgcact caggccagcccggagcgtcagccacactgcggtgcccacgatgcccagggtgagcgccac gaggcgcaggaaattggctag >gi568815597f:144492943_144693434|GENSCAN_predicted_peptide_6|694_aa MFEGQQGSQNSWSIAAGRQVPQDELEKGTGKDISGGREIFKPRQLPGSAIWSIKVGHGSG FPGKRRPRGAGLSGRGGRGRSKLKSGIGAVVLPGVSTADISSNKDDEENSVLDMVVLFSS SDKFTLNQFVAVLAKEQKEDYLPVLSVVSVTIHTVSVLRWMHAVRQNLNTEEEVENVADI GFDCNMCRPYMPASNDPPKTYTQDGVCLTESGKTQLQSLTVTVPRRKLSKPKLKLKIINQ NSVAVLQTPPDIQSEHSRDGDMDDSRGKWACKGEFHAKSKDIMHLKLFLADTAAVAAMWR AADSREQPWLSAGTSRFLCPSRRLEGRTTATEPEALQEAREVPARSGTRRSSWSNGTAYP GQLALYQQLAQGNAVGGSAGAPPLGPVQVVTACLLTLLIIWTLLGNVLVSAAIVRSRHLR AKMTNVFIVSLPVSDLFVALLVMSWKAVAEVAGYWPFEAFCDVWVAFDIMCSTASILNLC VSSFYIPMAIMIVTYTRIYRIAQVQIRRISSLERAAEHVQSCRSSAGCAPDTSLRFSIKK ETEVLKTLSVIMGVFVCCWLPFFILNCMVPFCSGHPKGPPAGFPCVSETTFDVFICHYAF NADFRKVFAQLLGCSHVCSRTPVETVNISNELISYNQDTVFHKEIAAAYIHMMPNAVPPG DREVDNDEEEESPFDRMSQIYQTSPDGDPVAESV >gi568815597f:144492943_144693434|GENSCAN_predicted_CDS_6|2085_bp atgtttgaaggacagcaaggaagccagaatagctggagcatagcagcagggagacaagtg ccacaagatgagctggagaaaggcactgggaaagacatttcaggaggtcgggaaattttt aaacccaggcagcttcctggcagtgccatttggagcatcaaagtgggccatgggtctgga tttccaggaaagcggagacctcgaggtgcaggactgtcggggcgaggtggccgaggcagg tcaaagctgaaaagtggaatcggagctgttgtattgcctggggtgtctactgcagatatt tcatcaaataaggatgatgaagaaaactctgtgctcgatatggttgtgttgttttctagc agtgacaaattcactttgaatcagtttgtggcagttttggccaaggagcagaaggaagat tacttgcctgttctcagtgtggtcagtgttaccatccatactgtgtcagtattaagatgg atgcatgcagttcgtcagaacttaaatactgaggaagaagtggaaaatgtagcagacatt ggttttgattgtaacatgtgcagaccctatatgcctgcgtctaatgacccacccaagact tatacccaggatggtgtgtgtttgactgaatcagggaagactcagttacagagcctcaca gttacagttccaagaagaaaactgtcaaaaccaaaactgaaattgaagattataaatcag aatagcgtggccgtccttcagacccctccagacatccaatcagaacattcaagggatggt gatatggatgatagtcgaggtaagtgggcctgcaaaggtgaattccatgctaaatccaaa gacataatgcatttgaaactgtttttggcagacacagccgctgtcgctgccatgtggcgc gccgcagactcccgagaacagccctggctgtcagcgggcaccagccgcttcctgtgccca tcgcgtagactggaggggcgcaccacggccaccgagccagaggcgcttcaggaagcaaga gaagtccccgcgcgctccgggacccggcgcagctcatggagcaacggcaccgcgtacccg gggcagttagcgctgtaccagcagctggcgcaggggaatgccgtggggggctcggcgggg gcaccgccactggggcccgtgcaggtggtcaccgcctgcctgctgaccctactcatcatc tggaccttgctgggcaacgtgctggtgtccgcagccatcgtgcggagccgccacctgcgc gccaagatgaccaacgtcttcatcgtgtctctacctgtgtcagacctcttcgtggcgctg ctggtcatgtcctggaaggcagtcgccgaggtggccggttactggccctttgaagcgttc tgcgacgtctgggtggccttcgacatcatgtgctccaccgcctccatcctgaacctgtgc gtcagcagcttctacatccccatggccatcatgatcgtgacctacacgcgcatctaccgc atcgcccaggtgcagatccgcaggatttcctccctggagagggccgcagagcacgtgcag agctgccggagcagcgcaggctgcgcgcccgacaccagcctgcggttttccatcaagaag gagaccgaggttctcaagaccctgtcggtgatcatgggggtcttcgtgtgttgctggctg cccttcttcatccttaactgcatggtccctttctgcagtggacaccccaaaggccctccg gccggcttcccctgcgtcagtgagaccacattcgacgtcttcatctgtcactatgccttc aacgccgacttccggaaggtgtttgcccagctgctggggtgcagccacgtctgctcccgc acgccggtggagacggtgaacatcagcaatgagctcatctcctacaaccaagacacggtc ttccacaaggaaatcgcagctgcctacatccacatgatgcccaacgccgttccccccggg gaccgggaggtggacaacgatgaggaggaggagagtcctttcgatcgcatgtcccagatc tatcagacatccccagatggtgaccctgttgcagagtctgtctga