GENSCAN 1.0 Date run: 3-Nov-116 Time: 20:30:17 Sequence gi568815583f:51946461_52164997 : 218537 bp : 44.18% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.11 Intr - 929 832 98 0 2 79 94 61 0.907 5.53 1.10 Intr - 3534 3348 187 0 1 68 105 270 0.973 25.96 1.09 Intr - 5519 5384 136 1 1 90 95 -3 0.609 1.17 1.08 Intr - 6803 6669 135 1 0 81 93 43 0.919 3.78 1.07 Intr - 8115 8021 95 0 2 68 105 54 0.980 3.86 1.06 Intr - 12366 12282 85 2 1 17 76 114 0.639 2.92 1.05 Intr - 13584 13439 146 0 2 72 110 89 0.978 8.58 1.04 Intr - 14273 14179 95 0 2 52 81 76 0.969 2.98 1.03 Intr - 16033 15929 105 2 0 62 63 107 0.970 5.79 1.02 Intr - 20044 19289 756 2 0 51 90 996 0.984 87.54 1.01 Init - 25285 24937 349 1 1 92 57 255 0.701 20.15 1.00 Prom - 45428 45389 40 -0.76 2.03 PlyA - 45498 45493 6 1.05 2.02 Term - 72558 72349 210 0 0 39 44 103 0.610 -1.71 2.01 Init - 73033 72935 99 1 0 83 38 182 0.923 10.96 2.00 Prom - 98256 98217 40 0.04 3.00 Prom + 99066 99105 40 -5.46 3.01 Init + 100001 100555 555 1 0 68 95 316 0.957 25.27 3.02 Intr + 103533 103677 145 2 1 88 92 57 0.930 5.96 3.03 Intr + 112173 112337 165 1 0 87 67 80 0.975 5.73 3.04 Intr + 114839 115040 202 0 1 101 94 112 0.997 11.44 3.05 Term + 117442 118540 1099 1 1 64 41 508 0.999 35.26 3.06 PlyA + 118745 118750 6 1.05 4.00 Prom + 133551 133590 40 -2.06 4.01 Init + 141304 141311 8 0 2 103 91 0 0.860 2.30 4.02 Intr + 147797 148028 232 2 1 99 74 88 0.903 6.28 4.03 Term + 148784 148951 168 1 0 37 42 202 0.999 8.38 4.04 PlyA + 149039 149044 6 1.05 5.03 PlyA - 149980 149975 6 1.05 5.02 Term - 156263 156174 90 2 0 92 39 57 0.045 -1.08 5.01 Init - 166266 165778 489 0 0 85 80 833 0.610 77.40 5.00 Prom - 170033 169994 40 -4.36 6.13 PlyA - 171931 171926 6 1.05 6.12 Term - 173446 173366 81 1 0 93 48 -1 0.191 -5.91 6.11 Intr - 173910 173842 69 0 0 63 86 75 0.399 4.18 6.10 Intr - 178179 178013 167 1 2 61 52 107 0.870 4.08 6.09 Intr - 179584 179488 97 1 1 68 83 93 0.984 6.38 6.08 Intr - 181784 181736 49 1 1 63 92 55 0.545 2.08 6.07 Intr - 187009 186918 92 1 2 88 77 183 0.438 15.99 6.06 Intr - 188645 188532 114 2 0 114 93 -16 0.797 2.14 6.05 Intr - 189296 189153 144 2 0 86 55 204 0.974 17.38 6.04 Intr - 194812 194680 133 0 1 86 99 181 0.983 19.65 6.03 Intr - 201075 200999 77 0 2 65 106 8 0.047 -1.39 6.02 Intr - 203465 203424 42 2 0 141 100 42 0.970 9.14 6.01 Init - 207719 207480 240 2 0 58 91 211 0.795 16.38 6.00 Prom - 211104 211065 40 -0.66 7.00 Prom + 211851 211890 40 -5.46 7.01 Init + 211900 211909 10 0 1 99 79 9 0.893 0.67 7.02 Intr + 212833 212963 131 2 2 58 77 108 0.922 7.11 7.03 Term + 213026 213160 135 1 0 76 47 72 0.689 -0.08 7.04 PlyA + 214162 214167 6 1.05 8.02 PlyA - 215862 215857 6 1.05 8.01 Term - 217526 217053 474 2 0 97 36 121 0.209 2.69 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 198440 198370 71 2 2 98 45 101 0.949 7.26 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815583f:51946461_52164997|GENSCAN_predicted_peptide_1|729_aa MADMEDLFGSDADSEAERKGGFAGLRRMGAWQAFRCGSRTTVEAPAPRARGGVSSTPASA PPALPEVPSSSGGQDGESDSLRLEALGVGGADGSWELVERKGRGAWVEVQASLRSADSDS GSDSDSDQENAASGSNASGSESDQDERGDSGQPSNKELFGDDSEDEGASHHSGSDNHSER SDNRSEASERSDHEDNDPSDVDQHSGSEAPNDDEDEGHRSDGGSHHSEAEGSEKAHSDDE KWGREDKSDQSDDEKIQNSDDEERAQGSDEDKLQNSDDDEKMQNTDDEERPQLSDDERQQ LSEEEKANSDDERPVASDNDDEKQNSDDEEQPQLSDEEKMQNSDDERPQASDEEHRHSDD EEEQDHKSESARGSDSEDEVLRMKRKNAIASDSEADSDTEVPKDNSGTMDLFGGADDISS GSDGEDKPPTPGQPVDENGLPQDQQEEEPIPETRIEVEIPKVNTDLGNDLYFVKLPNFLS VEPRPFDPQYYEDEFEDEEMLDEEGRTRLKLKVENTIRWRIRRDEEGNEIKESNARIVKW SDGSMSLHLGNEVFDVYKAPLQGDHNHLFIRQGTGLQGQAVFKTKLTFRPHSTDSATHRK MTLSLADRCSKTQKIRILPMAGRDPECQRTEMIKKEEERLRASIRRESQQRRMREKQHQR GLSASYLEPDRYDEEEEGEESISLAAIKNRYKGGIREERARIYSSDSDEGSEEDKAQRLL KAKKLTSDE >gi568815583f:51946461_52164997|GENSCAN_predicted_CDS_1|2187_bp atggcggatatggaggatctcttcgggagcgacgccgacagcgaagctgagcgtaaaggt gggttcgctggtcttcggaggatgggtgcgtggcaggccttccgctgtgggagccggaca accgtggaggctccagcgccgcgggcaagaggtggcgtttcctccactcctgcctcagcg ccccctgcactcccggaggtgccatcctcctccggcgggcaggatggagagtcagacagt ctgcgcctcgaggctctgggtgtcggtggagcagacggttcctgggagcttgtggagcga aaggggcgcggggcatgggtggaagtgcaggcgagcctgcgcagcgcggattctgattct ggatctgactcagattctgatcaagagaatgctgcctctggcagtaatgcctctggaagt gaaagtgatcaggatgaaagaggtgattcaggacaaccaagtaataaggaactgtttgga gatgacagtgaggacgagggagcttcacatcatagtggtagtgataatcactctgaaaga tcagacaatagatcagaagcttctgagcgttctgaccatgaggacaatgacccctcagat gtagatcagcacagtggatcagaagcccctaatgatgatgaagacgaaggtcatagatcg gatggagggagccatcattcagaagcagaaggttctgaaaaagcacattcagatgatgaa aaatggggcagagaagataaaagtgaccagtcagatgatgaaaagatacaaaattctgat gatgaggagagggcacaaggatctgatgaagataagctgcagaattctgacgatgatgag aaaatgcagaacacagatgatgaggagaggcctcagctttccgatgatgagagacaacag ctatctgaggaggaaaaggctaattctgatgatgaacggccggtagcttctgataatgat gatgagaaacagaattctgatgatgaagaacaaccacagctgtctgatgaagagaaaatg caaaattctgatgatgaaaggccacaggcctcagatgaagaacacaggcattcagatgat gaagaggaacaggatcataaatcagaatctgcaagaggcagtgatagtgaagatgaagtt ttacgaatgaaacgcaagaatgcgattgcatctgattcagaagcggatagtgacactgag gtgccaaaagataatagtggaaccatggatttatttggaggtgcagatgatatctcttca gggagtgatggagaagacaaaccacctactccaggacagcctgttgatgaaaatggattg cctcaggatcaacaggaagaggagccaattcctgagaccagaatagaagtagaaataccc aaagtaaacactgatttaggaaacgacttatattttgttaaactgcccaactttctcagt gtagagcccagaccttttgatcctcagtattatgaagatgaatttgaagatgaagaaatg ctggatgaagaaggtagaaccaggttaaaattaaaggtagaaaatactataagatggagg atacgccgagatgaagaaggaaatgaaattaaagaaagcaatgctcggatagtcaagtgg tcagatggaagcatgtccctgcatttaggcaatgaagtgtttgatgtgtacaaagcccca ctgcagggcgaccacaatcatctttttataagacaaggtactggtctacagggacaagca gtctttaaaacgaaactcaccttcagacctcactctacggacagtgccacacatagaaag atgactctgtcacttgcagataggtgttcaaagacacagaagattagaatcttgccaatg gctggtcgtgatcctgaatgccaacgcacagaaatgattaagaaagaagaagaacgtttg agggcttccatacgtagggaatctcagcagcgccgaatgagagagaaacagcaccagcgg gggctgagcgccagttacctggaacctgatcgatacgatgaggaggaggaaggcgaggag tccatcagcttggctgccattaaaaaccgatataaagggggcattcgagaggaacgagcc agaatctattcatcagacagtgatgagggatcagaagaagataaagctcaaagattactc aaagcaaagaaacttaccagtgatgag >gi568815583f:51946461_52164997|GENSCAN_predicted_peptide_2|102_aa MEEGAGSAVAAAAAAAGPAAAPSRAAPPAHPAHGGRQLGPFALLPALALREQSDQTQCQR LLSTYYVQGTGQSERKRPQSPGRVDLKMDWLEGDSPSTRQTR >gi568815583f:51946461_52164997|GENSCAN_predicted_CDS_2|309_bp atggaggagggagccggctccgccgtcgccgccgccgccgccgccgccgggcccgccgcg gcgccctcccgagccgccccgccggcgcaccccgcacacggaggaagacagctggggccg tttgctctactacctgccctggcattaagggaacaaagtgaccaaacccagtgccagcgt ttactaagcacctactacgtgcaaggcactgggcaatccgagaggaagaggcctcagtct ccaggaagggtggaccttaagatggattggctggaaggcgacagcccgagcacaagacag acgcgttag >gi568815583f:51946461_52164997|GENSCAN_predicted_peptide_3|721_aa MAEKFESLMNIHGFDLGSRYMDLKPLGCGGNGLVFSAVDNDCDKRVAIKKIVLTDPQSVK HALREIKIIRRLDHDNIVKVFEILGPSGSQLTDDVGSLTELNSVYIVQEYMETDLANVLE QGPLLEEHARLFMYQLLRGLKYIHSANVLHRDLKPANLFINTEDLVLKIGDFGLARIMDP HYSHKGHLSEGLVTKWYRSPRLLLSPNNYTKAIDMWAAGCIFAEMLTGKTLFAGAHELEQ MQLILESIPVVHEEDRQELLSVIPVYIRNDMTEPHKPLTQLLPGISREALDFLEQILTFS PMDRLTAEEALSHPYMSIYSFPMDEPISSHPFHIEDEVDDILLMDETHSHIYNWERYHDC QFSEHDWPVHNNFDIDEVQLDPRALSDVTDEEEVQVDPRKYLDGDREKYLEDPAFDTNYS TEPCWQYSDHHENKYCDLECSHTCNYKTRSSSYLDNLVWRESEVNHYYEPKLIIDLSNWK EQSKEKSDKKGKSKCERNGLVKAQIALEEASQQLAGKEREKNQGFDFDSFIAGTIQLSSQ HEPTDVVDKLNDLNSSVSQLELKSLISKSVSQEKQEKGMANLAQLEALYQSSWDSQFVSG GEDCFFINQFCEVRKDEQVEKENTYTSYLDKFFSRKEDTEMLETEPVEDGKLGERGHEEG FLNNSGEFLFNKQLESIGIPQFHSPVGSPLKSIQATLTPSAMKSSPQIPHQTYSSILKHL N >gi568815583f:51946461_52164997|GENSCAN_predicted_CDS_3|2166_bp atggcagagaaatttgaaagtctcatgaacattcatggttttgatctgggttctaggtat atggacttaaaaccattgggttgtggaggcaatggcttggttttttctgctgtagacaat gactgtgacaaaagagtagccatcaagaaaattgtccttactgatccccagagtgtcaaa catgctctacgtgaaatcaaaattattagaagacttgaccatgataacattgtgaaagtg tttgagattcttggtcccagtggaagccaattaacagacgatgtgggctctcttacggaa ctgaacagtgtttacattgttcaggagtacatggagacagacttggctaatgtgctggag cagggccctttactggaagagcatgccaggcttttcatgtatcagctgctacgggggctc aagtatattcactctgcaaatgtactgcacagagatctcaaaccagctaatcttttcatt aatacggaagacttggtgctgaagataggtgactttggtcttgcacggatcatggatcct cattattcccataagggtcatctttctgaaggattggttactaaatggtacagatctcca cgtcttttactttctcctaataattatactaaagccattgacatgtgggctgcaggctgc atctttgctgaaatgctgactggtaaaaccctttttgcaggtgcacatgaacttgaacag atgcagctgattttagaatctattcctgttgtacatgaggaagatcgtcaggagcttctc agcgtaattccagtttacattagaaatgacatgactgagccacacaaacctttaactcag ctgcttccaggaattagtcgagaagcactggatttcctggaacaaattttgacatttagc cccatggatcggttaacagcagaagaagcactctcccatccttacatgagcatatattct tttccaatggatgagccaatttcaagccatccttttcatattgaagatgaagttgatgat attttgcttatggatgaaactcacagtcacatttataactgggaaaggtatcatgattgt cagttttcagagcatgattggcctgtacataacaactttgatattgatgaagttcagctt gatccaagagctctgtccgatgtcactgatgaagaagaagtacaagttgatccccgaaaa tatttggatggagatcgggaaaagtatctggaggatcctgcttttgataccaattactct actgagccttgttggcaatactcagatcatcatgaaaacaaatattgtgatctggagtgt agccatacttgtaactacaaaacgaggtcatcatcatatttagataacttagtttggaga gagagtgaagttaaccattactatgaacccaagcttattatagatctttccaattggaaa gaacaaagcaaagaaaaatctgataagaaaggcaaatcaaaatgtgaaaggaatggattg gttaaagcccagatagcgctagaggaagcatcacagcaactggctggaaaagaaagggaa aagaatcagggatttgattttgattcctttattgcaggaactattcagcttagttcccag catgagcctactgatgttgttgataaattaaatgacttgaatagctcagtgtcccaacta gaattgaaaagtttgatatcaaagtcagtaagccaagaaaaacaggaaaaaggaatggca aatctggctcaattagaagccttgtaccagtcttcttgggacagccagtttgtgagtggt ggggaggactgttttttcataaatcagttttgtgaggtaaggaaggatgaacaagttgag aaggaaaacacttacactagttacttggacaagttctttagcaggaaagaagatactgaa atgctagaaactgagccagtagaggatgggaagcttggggagagaggacatgaggaagga tttctgaacaacagtggggagttcctctttaacaagcagctcgagtccataggcatccca cagtttcacagtccagttgggtcaccacttaagtcaatacaggccacattaacaccttct gctatgaaatcttcccctcaaattcctcatcaaacatacagcagcattctgaaacatctg aactaa >gi568815583f:51946461_52164997|GENSCAN_predicted_peptide_4|135_aa MPSSSQLLALSSFPWKNIPAVGAWNTPASATPLWPDVYGCVNVQQEASGFHPGAKVYGAS RLERTVWLTEVACLAAGSTYVGTFNNVATLHCLTAAIMGRMHAPGKGPSRLALPYRHSIP TWLKLTSDDVKEQIY >gi568815583f:51946461_52164997|GENSCAN_predicted_CDS_4|408_bp atgcccagctcatctcagctgcttgccctctcatcctttccttggaaaaacatcccagct gttggggcctggaatactccagcatcagccacacccctctggcctgatgtgtatggctgt gtcaatgtgcagcaggaggcaagtggcttccaccctggtgccaaggtctatggagccagc agactggagaggacagtctggctgacagaggtggcctgccttgcagctggaagcacttac gtgggaacatttaataacgtggccacacttcactgcctgactgccgccatcatgggtcgc atgcatgctcctgggaagggcccgtcccggttggctttgccctatcgccacagcatcccc acttggctgaagttgacatctgatgacgtgaaggagcagatttactaa >gi568815583f:51946461_52164997|GENSCAN_predicted_peptide_5|192_aa MVDQLRERTTMADPLRERTELLLADYLGYCAREPGTPEPAPSTPEAAVLRSAAARLRQIH RSFFSAYLGYPGNRFELVALMADSVLSDSPGPTWGRVVTLVTFAGTLLERGPLVTARWKK WGFQPRLKEQEGDVARDCQRLVALLSSRLMGQHRAWLQAQGGWGELIACMNADGSDARDG AFDDAGRRGEEI >gi568815583f:51946461_52164997|GENSCAN_predicted_CDS_5|579_bp atggttgaccagttgcgggagcgcaccaccatggccgacccgctgcgggagcgcaccgag ctgttgctggccgactacctggggtactgcgcccgggaacccggcacccccgagccggcg ccatccacgcccgaggccgccgtgctgcgctccgcggccgccaggttacggcagattcac cggtcctttttctccgcctacctcggctaccccgggaaccgcttcgagctggtggcgctg atggcggattccgtgctctccgacagccccggccccacctggggcagagtggtgacgctc gtgaccttcgcagggacgctgctggagagagggccgctggtgaccgcccggtggaagaag tggggcttccagccgcggctaaaggagcaggagggcgacgtcgcccgggactgccagcgc ctggtggccttgctgagctcgcggctcatggggcagcaccgcgcctggctgcaggctcag ggcggctggggagaattaatagcatgcatgaatgcagatggaagtgatgcaagagatgga gcatttgatgatgcaggcaggagaggagaggagatctag >gi568815583f:51946461_52164997|GENSCAN_predicted_peptide_6|434_aa MGPPPSAEHMSVLCQPKAQNLVLSKGTDYGLVIPVHQVAERVEALGQFVMKTRRTLKGHG NKVLCMDWCKDKRRIVSSSQDGKVIVWDSFTTNKEHAVTMPCTWVMACAYAPSGCAIACG GLDNKCSVYPLTFDKNENMAAKKKSVAMHTNYLSACSFTNSDMQILTASGDGTCALWDVE SGQLLQSFHGHGADVLCLDLAPSETGNTFVSGGARNRGFESSELADRATETQRSTRLGVR VPGSGLCYLTGCDKKAMVWDMRSGQCVQAFETHESDINSVRYYPSGDAFASGSDDATCRL YDLRADREVAIYSKESIIFGASSVDFSLSGRLLFAGYNDYTINVWDVLKGSRVSILFGHE NRVSTLRVSPDGTAFCSGSWDHTLRWVLSGFADEILDLDYVTACGDSGVISGLKQHRVVR AFVSRALIVARPAG >gi568815583f:51946461_52164997|GENSCAN_predicted_CDS_6|1305_bp atggggccgcctccctctgcggaacatatgtctgtgctgtgtcagcccaaagcccagaac ctggtccttagcaaggggactgactatggccttgtcattccagtgcaccaggtggcggag cgggtggaggccctggggcagtttgtcatgaagaccagaaggaccctcaaaggccacggg aacaaagtcctgtgcatggactggtgcaaagataagaggaggatcgtgagctcgtcacag gatgggaaggtgatcgtgtgggattccttcaccacaaacaaggagcacgcggtcaccatg ccctgcacgtgggtgatggcatgtgcttatgccccatcgggatgtgccattgcttgtggt ggtttggataataagtgttctgtgtaccccttgacgtttgacaaaaatgaaaacatggct gccaaaaagaagtctgttgctatgcacaccaactacctgtcggcctgcagcttcaccaac tctgacatgcagatcctgacagcgagcggcgatggcacatgtgccctgtgggacgtggag agcgggcagctgctgcagagcttccacggacatggggctgacgtcctctgcttggacctg gccccctcagaaactggaaacaccttcgtgtctgggggtgcaagaaacagaggctttgaa agctcagaattggcagacagggccacagagactcagagaagcactagactaggagtcagg gtgcctgggtcagggctttgctacctgactggatgtgacaagaaagccatggtgtgggac atgcgctccggccagtgcgtgcaggcctttgaaacacatgaatctgacatcaacagtgtc cggtactaccccagtggagatgcctttgcttcagggtcagatgacgctacgtgtcgcctc tatgacctgcgggcagatagggaggttgccatctattccaaagaaagcatcatatttgga gcatccagcgtggacttctccctcagtggtcgcctgctgtttgctggatacaatgattac actatcaacgtctgggatgttctcaaagggtcccgggtctccatcctgtttggacatgaa aaccgcgttagcactctacgagtttcccccgatgggactgctttctgctctggatcatgg gatcataccctcagatgggtgctgtcgggatttgctgatgagatacttgatttggactat gtgacagcatgtggagactcaggggttataagtggcttaaaacagcacagggttgtaagg gcctttgtgtcccgggctctcatagttgccaggccagctggctga >gi568815583f:51946461_52164997|GENSCAN_predicted_peptide_7|91_aa MAYTLEDSVDGDNTGGGKPGKRLLFFGSPLSSGHTICRTTLLCGPSKGEDACLCGLPAQL GPLKLEMLNILDASPDQGIRIKVMAGTQEEV >gi568815583f:51946461_52164997|GENSCAN_predicted_CDS_7|276_bp atggcgtacacgttggaggacagcgtcgatggagacaacactggaggagggaaaccaggg aagagactcctcttctttggcagccccctctccagcggacacaccatctgccgcaccacg cttctttgtggtccctccaagggagaggacgcttgcctctgtggcctccctgctcagttg ggtcctctgaagcttgagatgctgaatattctggatgccagtccagaccaaggaatcaga atcaaagtcatggctggaactcaagaggaagtgtag >gi568815583f:51946461_52164997|GENSCAN_predicted_peptide_8|157_aa GCCGLLGVHSRPLLPGSLLHLEVSPVKAAKQQRWHPAPSSGDSIPEGHQPDASWNAPGGG ICRPLLGGLTQSRGMGSGTRLKKQSGCPLVEQIRCAGGNLPHPDHPDSPEHPGPTESSSL AMICHSSCAALWGIPPSLDHPDSQELAGQDGLLELQR >gi568815583f:51946461_52164997|GENSCAN_predicted_CDS_8|474_bp ggctgctgcggtttgctgggggtccactccagacccttattgcctgggtccctcctgcac ctggaggtatcaccagtgaaggctgcaaaacagcaaagatggcatccagctccttcctct ggagactccatcccagaggggcaccaacctgatgccagctggaacgctcctggaggaggc atttgtagacccctgttgggaggtctcacccagtcaagaggaatgggatcagggacccgc ttaaagaagcagtctggctgccccctggtggagcagatacgctgtgctgggggaaacctc cctcatccagaccacccagactctccagagcatccaggtcccactgaaagtagcagtctg gccatgatctgccacagcagctgtgctgcgctgtggggaattcctcccagtttggaccac cctgactcccaggagctggcaggccaggatggcctacttgagctgcagagatag