GENSCAN 1.0 Date run: 3-Nov-116 Time: 22:20:45 Sequence gi568815593f:175392104_175626731 : 234628 bp : 43.47% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Sngl + 6573 6902 330 2 0 88 44 245 0.933 16.12 1.02 PlyA + 6986 6991 6 1.05 2.00 Prom + 7986 8025 40 -4.96 2.01 Init + 8079 9116 1038 2 0 44 41 360 0.744 20.90 2.02 Term + 9406 10686 1281 0 0 -10 50 434 0.911 21.40 2.03 PlyA + 11683 11688 6 1.05 3.00 Prom + 18633 18672 40 -2.66 3.01 Init + 42066 42140 75 2 0 74 76 41 0.350 2.59 3.02 Term + 46350 46418 69 2 0 94 35 70 0.503 0.24 3.03 PlyA + 47980 47985 6 1.05 4.05 PlyA - 48709 48704 6 1.05 4.04 Term - 51000 49656 1345 2 1 59 43 1123 0.097 95.88 4.03 Intr - 52920 52614 307 1 1 110 64 128 0.087 8.11 4.02 Intr - 61126 61103 24 2 0 135 87 13 0.090 3.90 4.01 Init - 86788 86254 535 1 1 55 30 268 0.122 10.72 4.00 Prom - 89531 89492 40 -5.66 5.00 Prom + 89962 90001 40 -3.06 5.01 Init + 92349 92455 107 2 2 67 17 139 0.250 2.46 5.02 Intr + 93400 93499 100 1 1 123 56 35 0.238 3.91 5.03 Intr + 99992 100164 173 1 2 76 92 106 0.219 8.54 5.04 Intr + 116929 117099 171 1 0 66 91 92 0.553 6.36 5.05 Intr + 118006 118104 99 1 0 51 80 118 0.987 6.53 5.06 Intr + 119348 119423 76 2 1 107 73 45 0.756 4.42 5.07 Intr + 120008 120093 86 1 2 89 116 -43 0.755 -2.78 5.08 Intr + 121360 121487 128 1 2 99 28 182 0.648 13.62 5.09 Intr + 130740 130921 182 1 2 71 84 33 0.276 0.79 5.10 Term + 131138 131239 102 1 0 67 37 83 0.422 -0.62 5.11 PlyA + 131554 131559 6 1.05 6.09 PlyA - 131578 131573 6 1.05 6.08 Term - 134293 134199 95 2 2 52 48 77 0.541 -1.91 6.07 Intr - 139220 139104 117 0 0 57 81 120 0.035 8.64 6.06 Intr - 150241 150175 67 1 1 92 79 45 0.030 2.48 6.05 Intr - 171296 171156 141 2 0 70 84 60 0.116 4.35 6.04 Intr - 200002 199942 61 0 1 60 66 51 0.022 -1.16 6.03 Intr - 204556 204308 249 0 0 78 52 117 0.036 3.65 6.02 Intr - 205128 205005 124 1 1 70 55 106 0.833 5.14 6.01 Init - 214248 214161 88 0 1 73 57 53 0.170 1.50 6.00 Prom - 216765 216726 40 -5.76 7.00 Prom + 221225 221264 40 -4.96 7.01 Sngl + 228419 228808 390 1 0 66 33 522 0.416 38.72 7.02 PlyA + 229095 229100 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 50996 49656 1341 2 0 73 43 1131 0.849 103.24 S.002 Term - 178674 178647 28 2 1 118 40 44 0.803 0.15 S.003 Init - 180178 180120 59 1 2 93 85 90 0.894 7.98 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815593f:175392104_175626731|GENSCAN_predicted_peptide_1|109_aa MGKIQNRKTGNSKKQSASPPPKERSSSPATEQSWMENDFDELREEGFRRSNYSELQEDIQ TKGKEVENFEKNLEECITRITNTEKCLKELMELKTKARELREECRSLRS >gi568815593f:175392104_175626731|GENSCAN_predicted_CDS_1|330_bp atggggaaaatacagaacagaaaaactggaaactctaaaaagcagagcgcctctcctcct ccaaaggaacgcagttcctcaccagcaacggaacaaagctggatggagaatgactttgac gagctgagagaagaaggcttcagacgatcaaattactctgagctacaggaggacattcaa accaaagggaaagaagttgaaaactttgaaaaaaatttagaagaatgtataactagaata accaatacagagaagtgcttaaaggagctgatggagctgaaaaccaaggctcgagaacta cgtgaagaatgcagaagcctcaggagctga >gi568815593f:175392104_175626731|GENSCAN_predicted_peptide_2|772_aa MGDFNTPLSTLDRSTRQKVNKDTQELNSVLHQADLIDIYRTLHPKSTECTFFSAPHHTYS KIDHIVGSKALLSKCKRTEIITNYLSDHSAIKLELRIKNLTQNHSTTWKLNNLLLNDYWV HNEMKAEIKMFFETNENKDTTYQNLWDAFKAVCRGKFIALNAHKRKQERSKIDTLTSQLK ELEKQEQTHSKASRRQEITKIRPELKEIETQKTLQKINESRSWFFERINKIDRPLARLIK KKREKNQIDAIKNDKGDITTNPTEIQTTIREYYKHLYANKLENLEEMDKFLDTYTLPRLN QEEVESLNRPITGSEIVAIINSLPTKKSPGPDGFTAEFYQKYKEELHINGAKDKNHMIIS IDAEKAFDKIQQRFMLKTLNKLGIDGTYFKIIRAIYDKPTANIILNGQKLEAFPLKTGTR QGCPLSPLLFNIVLEVLARAIRQEKEIKGIQLGKEEVKLSLFADDMIVYLENPIVSAQNL LKLISNFSKVSGYKINVQKSQAFLYTNNRQTESQIMSELPFTIASKRIKYLGIQLTRDVK DLFKENYKPLLKEIKEDTNKWKNIPCSWVGRINIMKMAILPKVIYRFNAIPIKLPMPFFT ELEKTTLKFIWNQKRARIAKSILSQKNKAGGITLPDFKLYYKATVTKTAWYWYQNRDIDQ WNRTEPSEIMPHIYNYLIFDKPEKNKQWGKDSLFNKWCWENWLAIGRKLKLDPFLTPYTK INSRWIKDLNIRPKTIKTLEENLGITIQDIGMGKDFVSKTPKAMATKDKIDK >gi568815593f:175392104_175626731|GENSCAN_predicted_CDS_2|2319_bp atgggagactttaacaccccactgtcaacattagacagatcaacgagacagaaagtcaac aaggatacccaggaattgaactcagttctgcaccaagcagacctaatagacatctacaga actctccaccccaaatcaacagagtgtacatttttttcagcaccacaccacacctattcc aaaattgaccacatagttggaagtaaagcactcctcagcaaatgtaaaagaacagaaatt ataacaaactatctctcagaccatagtgcaatcaaactagaactcaggattaagaatctc acgcaaaaccactcaactacatggaaactgaacaacttgctcctgaatgactactgggta cataatgaaatgaaggcagaaataaagatgttctttgaaaccaacgagaacaaagacaca acataccagaatctctgggacgcattcaaggcagtgtgtagagggaaatttatagcacta aatgcccacaagagaaagcaggaaagatccaaaattgacaccctaacatcacaattaaaa gaactagaaaagcaagagcaaacacattcaaaagctagcagaaggcaagaaataactaaa atcagaccagaactgaaggaaatagagacacaaaaaacccttcaaaaaattaatgaatcc aggagctggttttttgaaaggatcaacaaaattgatagaccgctagcaagactaataaag aaaaaaagagagaagaatcaaatagatgcaataaaaaatgataaaggggatatcaccacc aatcccacagaaatacaaactaccatcagagaatactacaaacacctctatgcaaataaa ctagaaaatctagaagaaatggataaattcctcgacacatacactctcccaagactaaac caggaagaagttgaatctctgaatagaccaataacaggatctgaaattgtggcaataatc aatagcttaccaaccaaaaagagtccaggaccagatggattcacagccgaattctaccag aagtacaaggaggaactgcatataaacggagccaaagacaaaaaccacatgattatctca atagatgcagaaaaagcctttgacaaaattcaacaacgcttcatgctaaaaactctcaat aaattaggtattgatgggacatatttcaaaataataagagctatctatgacaaacccaca gccaatatcatactgaatgggcaaaaactggaagcattccctttgaaaactggcacaaga cagggatgccctctctcaccactcctattcaacatagtgttggaagttctggccagggca attaggcaggagaaggaaataaagggtattcaattaggaaaagaggaagtcaaattgtcc ctgtttgcagacgacatgattgtatatctagaaaaccccattgtctcagcccaaaatctc cttaagctgataagcaacttcagcaaagtctcaggatacaaaatcaatgtacaaaaatca caagcattcttatacaccaacaacagacaaacagagagccaaatcatgagtgaactccca ttcacaattgcttcaaagagaataaaatacctaggaatccaacttacaagggatgtgaag gacctcttcaaggagaactacaaaccactgctcaaggaaataaaagaggatacaaacaaa tggaagaacattccatgctcatgggtaggaagaatcaatatcatgaaaatggccatactg cccaaggtaatttacagattcaatgccatccccatcaagctaccaatgcctttcttcaca gaattggaaaaaactactttaaaattcatatggaaccaaaaaagagcccgtatcgccaag tcaatcctaagccaaaagaacaaagctggaggcatcacactacctgacttcaaactatac tacaaggctacagtaaccaaaacagcatggtactggtaccaaaacagagatatagatcaa tggaacagaacagagccctcagaaataatgccacatatctacaactatctgatctttgac aaacctgagaaaaacaagcaatggggaaaggattccctatttaataaatggtgctgggaa aactggctagccataggtagaaagctgaaactggatcccttccttacaccttatacaaaa atcaattcaagatggattaaagacttaaacattagacctaaaaccataaaaaccctagaa gaaaacctaggcattaccattcaggacataggcatgggcaaggacttcgtgtctaaaaca ccaaaagcaatggcaacaaaagacaaaattgacaaatga >gi568815593f:175392104_175626731|GENSCAN_predicted_peptide_3|47_aa MEKPNLAERISPYKLSRVNATTKIKALNCGLALGERVAEMLLVQAWR >gi568815593f:175392104_175626731|GENSCAN_predicted_CDS_3|144_bp atggagaaacctaatttagcagaaaggatcagcccttataaactcagcagagtcaatgcc acaacaaaaattaaggccctcaactgtgggctggctctgggggaaagagtggctgaaatg ctgctggtgcaggcctggagataa >gi568815593f:175392104_175626731|GENSCAN_predicted_peptide_4|736_aa MLGLGMLEAWCCPAAGCQGSPGAGTPPPARGIPGPQRGFRPPSPGSPRSRPAGARPEPAQ ACAPPPGPLRVSPPTWLGLPPATHRFPRAAARAARRSRSVQPPPPPETLLPPRRPRASPE RVLARACERASGPGSYEAQGPALRVQARVQAPPRARDSASGLATAPRVQEELQAHPHGAD QIELSDWCFKRRSPEWRRFRFPVFRFWQSWTGKSESGRGSLAVAGTLVSEVGAREGTAGA REVAAAGRGRPGHTERAALGLGRSGGAGHRLESGVDRSRDAPATWAGKEKMRTLNTSAMD GTGLVVERDFSVRILTACFLSLLILSTLLGNTLVCAAVIRFRHLRSKVTNFFVISLAVSD LLVAVLVMPWKAVAEIAGFWPFGSFCNIWVAFDIMCSTASILNLCVISVDRYWAISSPFR YERKMTPKAAFILISVAWTLSVLISFIPVQLSWHKAKPTSPSDGNATSLAETIDNCDSSL SRTYAISSSVISFYIPVAIMIVTYTRIYRIAQKQIRRIAALERAAVHAKNCQTTTGNGKP VECSQPESSFKMSFKRETKVLKTLSVIMGVFVCCWLPFFILNCILPFCGSGETQPFCIDS NTFDVFVWFGWANSSLNPIIYAFNADFRKAFSTLLGCYRLCPATNNAIETVSINNNGAAM FSSHHEPRGSISKECNLVYLIPHAVGSSEDLKKEEAAGIARPLEKLSPALSVILDYDTDV SLEKIQPITQNGQHPT >gi568815593f:175392104_175626731|GENSCAN_predicted_CDS_4|2211_bp atgctgggcctcgggatgctggaagcgtggtgttgtccggccgcaggatgccagggatcc ccaggtgcagggaccccccccccggcccgaggcatccccggcccgcaaaggggcttccga cctccgagcccggggtctccccgcagccggcccgcgggtgcacggcccgagcctgcccaa gcctgcgcgcctccgccgggccccctgcgcgtttccccacccacctggctgggcctgccg cccgcgactcaccgcttcccccgggctgccgcccgcgctgctcgcaggtcccgctcagtg cagcctccgccgccaccggaaacgcttctcccgccacggcgtcctcgcgcgtccccggag cgcgtcctcgcgcgcgcctgtgagcgcgcgtccgggcccgggagctacgaggcccaggga cccgccctccgggtccaggcccgggtccaggccccgccccgcgcccgcgactccgcttct ggcttagcgaccgcgccccgggtccaagaggaactccaagcccatcctcacggtgctgat cagatagagctctctgattggtgtttcaagcggaggagtcctgagtggagaagattccgc ttccctgttttcaggttctggcagtcatggaccgggaagagcgagtcggggcgcgggtcc ctggcggtcgctggaacccttgtgagtgaggttggtgcccgcgagggcacagcgggcgct cgggaagtcgcagccgccggcagagggcgccccgggcacacggagcgcgcggcgctgggg ttggggcgctcgggaggtgcggggcaccggctggagtccggcgttgaccgcagccgggac gcgcccgccacctgggcggggaaggagaagatgaggactctgaacacctctgccatggac gggactgggctggtggtggagagggacttctctgttcgtatcctcactgcctgtttcctg tcgctgctcatcctgtccacgctcctggggaacacgctggtctgtgctgccgttatcagg ttccgacacctgcggtccaaggtgaccaacttctttgtcatctccttggctgtgtcagat ctcttggtggccgtcctggtcatgccctggaaggcagtggctgagattgctggcttctgg ccctttgggtccttctgtaacatctgggtggcctttgacatcatgtgctccactgcatcc atcctcaacctctgtgtgatcagcgtggacaggtattgggctatctccagccctttccgg tatgagagaaagatgacccccaaggcagccttcatcctgatcagtgtggcatggaccttg tctgtactcatctccttcatcccagtgcagctcagctggcacaaggcaaaacccacaagc ccctctgatggaaatgccacttccctggctgagaccatagacaactgtgactccagcctc agcaggacatatgccatctcatcctctgtaataagcttttacatccctgtggccatcatg attgtcacctacaccaggatctacaggattgctcagaaacaaatacggcgcattgcggcc ttggagagggcagcagtccacgccaagaattgccagaccaccacaggtaatggaaagcct gtcgaatgttctcaaccggaaagttcttttaagatgtccttcaaaagagaaactaaagtc ctgaagactctgtcggtgatcatgggtgtgtttgtgtgctgttggctacctttcttcatc ttgaactgcattttgcccttctgtgggtctggggagacgcagcccttctgcattgattcc aacacctttgacgtgtttgtgtggtttgggtgggctaattcatccttgaaccccatcatt tatgcctttaatgctgattttcggaaggcattttcaaccctcttaggatgctacagactt tgccctgcgacgaataatgccatagagacggtgagtatcaataacaatggggccgcgatg ttttccagccatcatgagccacgaggctccatctccaaggagtgcaatctggtttacctg atcccacatgctgtgggctcctctgaggacctgaaaaaggaggaggcagctggcatcgcc agacccttggagaagctgtccccagccctatcagtcatattggactatgacactgacgtc tctctggagaagatccaacccatcacacaaaacggtcagcacccaacctga >gi568815593f:175392104_175626731|GENSCAN_predicted_peptide_5|407_aa MPSAHLCLARGQQVHAGMLRTNQVARPALTPAGGKGEPCDDIGPTKYSRIMPASLKILNL IISAKSLHHSGTMSGELPPNINIKEPRWDQSTFIGRANHFFTVTDPRNILLTNEQLESAR KIVHDYRQGIVPPGLTENELWRAKYIYDSAFHPDTGEKMILIGRMSAQVPMNMTITGCMM TFYRTTPAVLFWQWINQSFNAVVNYTNRSGDAPLTVNELGTAYVSATTGAVATALGLNAL TKHVSPLIGRFVPFAAVAAANCINIPLMRQRELKVGIPVTDENGNRLGESANAAKQAITQ VVVSRILMAAPGMVFCLKSFLVHASHRKLPLNPAFDVRTGLTGVKSAFSTSGLQPALDLT PTHKAAAFCCLSFSKFKPHGQEPFITLEPTEEPPLKRAGPSELALGA >gi568815593f:175392104_175626731|GENSCAN_predicted_CDS_5|1224_bp atgccctcagcgcatctctgccttgcccggggccagcaggtgcacgccggaatgctgcgg accaaccaagtggcccggccagccctgacccctgcgggtggaaaaggggagccttgtgat gacattgggccaaccaaatattccaggatcatgcctgcatctctcaagatccttaactta atcatatcagcaaagtccctgcaccattccgggaccatgtctggagaactaccaccaaac attaacatcaaggaacctcgatgggatcaaagcactttcattggacgagccaatcatttc ttcactgtaactgaccccaggaacattctgttaaccaacgaacaactcgagagtgcgaga aaaatagtacatgattacaggcaaggaattgttcctcctggtcttacagaaaatgaattg tggagagcaaagtacatctatgattcagcttttcatcctgacactggtgagaagatgatt ttgataggaagaatgtcagcccaggttcccatgaacatgaccatcacaggttgtatgatg acgttttacaggactacgccggctgtgctgttctggcagtggattaaccagtccttcaat gccgtcgtcaattacaccaacagaagtggagacgcacccctcactgtcaatgagttggga acagcttacgtttctgcaacaactggtgccgtagcaacagctctaggactcaatgcattg accaagcatgtctcaccactgataggacgttttgttccctttgctgccgtagctgctgct aattgcattaatattccattaatgaggcaaagggaactcaaagttggcattcccgtcacg gatgagaatgggaaccgcttgggggagtcggcgaacgctgcgaaacaagccatcacgcaa gttgtcgtgtccaggattctcatggcagcccctggcatggttttttgccttaaaagcttt cttgtgcacgcctctcatcgcaaactgcccttgaatcctgcctttgatgtgagaacagga ctaactggcgttaaatcggcattttctacatcaggtctccagcctgctcttgacctgact ccaacacacaaagcagcggcattttgttgtctgtcattttctaaatttaaacctcatgga caggagcccttcatcaccttggagcctactgaagagccccccttgaagagggcaggacct tcagaacttgctctgggagcctga >gi568815593f:175392104_175626731|GENSCAN_predicted_peptide_6|313_aa MQPEATRFLTSSIASVDNIIIVIPEIGVRERKAHVGIPSVPGPGGAGTAEYFTDLTIEEP ESQVLSKTLRRGNGTLALLVTAFWWLSTTYTRGSQNKTPRPAAAAAPENLLEKQILGPTP DLLTQNLRDGAQISVFHKPPGGPDAGSTQDCSFKRQEGGRKEGHTSSFEEEDFQKSQAVT SVATYTSLPEHACFFSKIPESGGLWYGDLPVSQLPKTTLLSVFAFQGLLILGKPKQLFPC YLHSAGDMAMDRTEIQILAVMSLLLCAPKYVNDDPSHGFLESGEISYCRQECMVMTLSGP AGVGKTICGNATS >gi568815593f:175392104_175626731|GENSCAN_predicted_CDS_6|942_bp atgcagccagaggccacaagattcctaacctcctcaattgcttctgtagataacatcatt attgtaatacctgagattggtgttcgagaaagaaaggctcatgtgggaatccccagcgtc ccaggccccgggggagcaggcactgctgagtacttcaccgatttgacaattgaggaacct gagtcccaggtcttgtccaagacactgagaaggggaaatgggactctggctctcctggtc acagccttttggtggctttccacgacctacaccagggggtctcaaaataagacccccaga ccagctgcagcagcagcacctgagaacctgttagaaaagcaaattcttggccccacccca gacctactgacccagaacctcagagatggtgcccagatatctgtatttcacaagcccccg ggtggtcctgatgctggctccacacaggattgttcatttaagcgtcaggaaggaggaagg aaagaaggacatacttcctcctttgaagaagaagacttccagaaatcgcaggctgtgaca tctgtagccacctacacgtcccttccggagcatgcatgctttttctccaagatacctgag tccgggggattgtggtatggagatctacctgtctcacagctgcccaaaaccacacttctg tctgtctttgcctttcagggcctgttgatcttggggaagccaaagcagctcttcccgtgc tatctgcacagtgctggggatatggcaatggacagaacagaaatccaaatccttgccgtc atgagcttattgttatgtgcccccaagtacgtcaacgatgacccttcccatggtttcttg gaatctggtgaaatctcttattgccggcaagagtgtatggtgatgacactctcaggccct gctggtgttggaaaaaccatttgtggcaatgccacttcctaa >gi568815593f:175392104_175626731|GENSCAN_predicted_peptide_7|129_aa MMIMVMVMVVTVMVMVRMMMMMVMVVMVRMTMVMVRMMIAMVNDDNYYCGDYNYGDDGED DDGEDDDGKDDDGDGDNVGNGDDGDGDDKDNDGGTDYGDDDDAEDEDDDDDSDCYVLNVF VLAKIHVET >gi568815593f:175392104_175626731|GENSCAN_predicted_CDS_7|390_bp atgatgatcatggtgatggtgatggtggtgacagtgatggtgatggtgagaatgatgatg atgatggtgatggtggtgatggtgaggatgacaatggtgatggtgaggatgatgatagcg atggtgaatgatgataattattactgtggtgattataattatggtgatgatggtgaggat gatgatggtgaggatgatgatggtaaggatgatgatggtgatggtgataatgttggtaat ggtgatgatggtgatggtgatgataaggataatgatggtggtactgattatggtgatgat gatgatgctgaggatgaggatgatgatgatgatagtgattgctatgttttaaatgttttt gtccttgccaaaattcatgttgaaacttaa