GENSCAN 1.0 Date run: 6-Nov-116 Time: 04:59:41

Sequence gi568815588f:123048379_123250120 : 201742 bp : 44.61% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   2671   2808  138  0  0   74   83    29 0.422   1.66
 1.02 Intr +   4683   4782  100  2  1   54   27   104 0.392   0.58
 1.03 Term +   5317   5387   71  2  2   90   45    98 0.841   3.80
 1.04 PlyA +   5476   5481    6                               1.05

 2.08 PlyA -   6174   6169    6                               1.05
 2.07 Term -   6793   6771   23  2  2  105   48    16 0.334  -2.23
 2.06 Intr -  14647  14464  184  1  1   32   36   157 0.041   4.26
 2.05 Intr -  25097  24864  234  2  0  -30   72   235 0.058   7.99
 2.04 Intr -  37286  37155  132  2  0   -3   89   178 0.958   9.64
 2.03 Intr -  40676  40524  153  2  0   53  119    52 0.803   5.07
 2.02 Intr -  50598  50518   81  0  0   69   75    81 0.402   4.73
 2.01 Init -  52481  52356  126  2  0   91   42    33 0.296  -0.81
 2.00 Prom -  58688  58649   40                              -1.56

 3.03 PlyA -  58863  58858    6                               1.05
 3.02 Term -  63680  63532  149  0  2   54   38   117 0.539   1.36
 3.01 Init -  69885  69705  181  0  1   83    0   238 0.349  11.85
 3.00 Prom -  75076  75037   40                              -4.26

 4.00 Prom +  79195  79234   40                              -5.86
 4.01 Init +  86055  86637  583  2  1   54   64   310 0.839  20.55
 4.02 Intr +  87572  88072  501  0  0   54   81   484 0.020  37.25
 4.03 Term +  88680  89353  674  1  2   83   54  1200 0.986 110.02
 4.04 PlyA +  92276  92281    6                               1.05

 5.00 Prom +  92354  92393   40                              -5.46
 5.01 Init +  99571  99683  113  0  2   44   58   141 0.121   4.43
 5.02 Intr + 100000 100268  269  1  2   47   82   242 0.139  16.68
 5.03 Term + 101192 101745  554  0  2  116   47   879 0.999  81.08
 5.04 PlyA + 102275 102280    6                               1.05

 6.00 Prom + 103529 103568   40                              -7.16
 6.01 Init + 106369 106738  370  0  1   75   17   498 0.782  36.56
 6.02 Intr + 109351 109502  152  2  2  106  102    29 0.970   5.98
 6.03 Intr + 112029 112187  159  2  0   29   72   160 0.990   8.68
 6.04 Intr + 113858 114035  178  1  1  101  103   103 0.999  12.59
 6.05 Term + 114234 114454  221  1  2   70   48   174 0.540   8.80
 6.06 PlyA + 115493 115498    6                               1.05

 7.03 PlyA - 115529 115524    6                               1.05
 7.02 Term - 123497 123154  344  0  2   32   49   308 0.603  15.97
 7.01 Init - 155200 155083  118  1  1   54   64   103 0.723   4.89
 7.00 Prom - 160475 160436   40                              -1.26

 8.00 Prom + 164280 164319   40                              -5.86
 8.01 Init + 170916 171022  107  2  2   81   37   131 0.154   5.49
 8.02 Term + 177558 177690  133  0  1   56   53   110 0.188   1.86
 8.03 PlyA + 178887 178892    6                               1.05

 9.03 PlyA - 182715 182710    6                               1.05
 9.02 Term - 192226 191990  237  1  0   67   48   279 0.906  17.97
 9.01 Intr - 197190 197117   74  1  2   17  111    64 0.344   0.73

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init +  11924  11988   65  1  2   37  116    60 0.820   4.32
S.002 Intr -  78880  78752  129  1  0   64  103    83 0.861   8.29
S.003 Term +  86919  87076  158  1  2  112   44    45 0.891   0.60
S.004 Init +  87673  88072  400  0  1  104   81   557 0.967  51.43
S.005 Init + 100001 100268  268  1  1   83   82   233 0.855  19.50

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815588f:123048379_123250120|GENSCAN_predicted_peptide_1|102_aa
GLQHQVAHVATQLEAARLLTYNAARLLEAGKPFIKEASMAKYYASEIAGQTTSKCIEWMG
GVGYTKDYPVEKYFRDAKIGTIYEGASNIQLNTIAKHIDAEY

>gi568815588f:123048379_123250120|GENSCAN_predicted_CDS_1|309_bp
ggcctccaacaccaagtggctcacgtggccacccagctggaagctgcaagattactaaca
tacaatgctgctaggcttttagaagctggaaagccattcataaaagaagcgtcaatggcc
aaatactatgcatcagagattgcaggacaaacaacgagtaaatgtatcgagtggatgggg
ggagtaggctacaccaaagattaccctgtggagaaatacttccgagatgcaaagattggt
acgatatatgaaggagcttccaacatccagttgaacaccattgcaaagcatatcgatgca
gaatactga

>gi568815588f:123048379_123250120|GENSCAN_predicted_peptide_2|310_aa
MTCRRMSGEGSFSLEGLSQNYSQKALETYSQDMPQRSSLSRQSGNDLSILHRHSLAEPRE
THEDVKDSKHQGFSRWALVIFQQQHFPRVAPQSVQQDDGLCPLIVPEGPRPVSWHLKGAL
TGLGAVKKSEDDIIEGDLAVSEIKEEGGCEPDEDPKRMCCQEAQVRQANSLSPPQTVHGT
SRGQGSFLNPLLFVESTCSLGLRHRCGLHGPTEDVPESLDVAVTGECIYSGVFGALPFMD
SKITKHDNEDGGMSRADSVDDQEGGRADNMDTLDKGMIHVLIGTEQNCARFHHATQNDVQ
FKTCMSLSAA

>gi568815588f:123048379_123250120|GENSCAN_predicted_CDS_2|933_bp
atgacctgcaggagaatgagtggagagggctctttctctctcgaggggctttcacagaac
tactcccagaaggccctggagacctactctcaggacatgcctcagcggtcaagtctttcc
agacagagtggcaatgatcttagcatccttcaccggcacagccttgccgagcccagggag
acgcatgaagacgtgaaggactccaagcatcaaggcttctctcggtgggccctggtcata
ttccaacagcagcatttccccagagtggctccccagagtgttcaacaggacgatggcctg
tgtcccctcattgttccagaaggccccaggcccgtctcatggcatctgaagggggctctg
actgggttaggagcagtgaagaagagtgaggatgatatcatagagggtgacttggcagtt
agtgagatcaaggaggagggtggctgcgagccagatgaggaccccaagaggatgtgctgc
caggaagcacaggtaagacaagcaaacagcctgagtcccccacaaacagtgcatggaacc
agcagaggccagggctccttcctcaaccccttgctcttcgtggagtccacctgctccctg
ggcctcagacacagatgtggcctgcatgggcccacggaggacgttccagagtccctcgat
gtggccgtcacaggggagtgcatctactcaggagtgtttggggcattacccttcatggat
tcaaagatcaccaagcatgacaatgaagatggcgggatgtcaagagcagacagtgttgat
gaccaagaagggggtagagcagacaacatggatacgctggacaaagggatgattcacgtt
cttattgggacggagcagaattgtgcaagatttcatcatgctactcagaacgacgtgcaa
tttaaaacttgtatgtctttatcagcagcgtga

>gi568815588f:123048379_123250120|GENSCAN_predicted_peptide_3|109_aa
MVSAVQLGACLPARAGGRLAAVLLPVQALGGGSGGEAVTDFECPDPSNTPFGACFRCSRI
ARIVKLFDFSQPGTSGGVESASGKSAIGFLFREAKIHATQHNLSIFISI

>gi568815588f:123048379_123250120|GENSCAN_predicted_CDS_3|330_bp
atggtttcggccgtccagcttggggcctgcctgccagctcgggcgggtggccgacttgct
gcagtgctgctcccagtccaggcactgggtggcggctctggtggtgaagcagtcactgac
ttcgaatgtcccgatccatccaacactccgtttggggcgtgcttccggtgttcccgcata
gctaggattgtcaagctctttgattttagccaaccaggtacaagtggaggtgttgaaagc
gcctctggaaagtctgctataggatttctgttccgtgaagccaaaattcatgctacccag
cacaaccttagtatcttcatctctatttaa

>gi568815588f:123048379_123250120|GENSCAN_predicted_peptide_4|585_aa
MSLKCLPHGLLVRTGELFLKPVEWNVLSLSRSAKVQCGSLRTSLPSRRAADAAACRGGGS
YAAAPSVHLAPIKALVRQLQTPAGRRDFLGESAVVGVTDSELIREPWAPAARAEGCPSES
SRRCREAVAPAAPPSDARVLAGSARIRTQAHPLLVPGEGSLFGFPGLKAKATRISESEDA
VRGLTPGFPYYRLDVEFRLWSDFRPLACSPRRPPVPLPPSRGPGGEGTMPEPGPDAAGTA
SAQPQPPPPPPPAPKESPFSIKNLLNGDHHRPPPKPQPPPRTLFAPASAAAAAAAAAAAA
AKGALEGAAGFALSQVGDLAFPRFEIPAQRFALPAHYLERSPAWWYPYTLTPAGGHLPRP
EASEKALLRDSSPASGTDRDSPEPLLKADPDHKELDSKSPDEIILEESDSEESKKEGEAA
PGAAGASVGAAAATPGAEDWKKGAESPEKKPACRKKKTRTVFSRSQVFQLESTFDMKRYL
SSSERAGLAASLHLTETQVKIWFQNRRNKWKRQLAAELEAANLSHAAAQRIVRVPILYHE
NSAAEGAAAAAAGAPVPVSQPLLTFPHPVYYSHPVVSSVPLLRPV

>gi568815588f:123048379_123250120|GENSCAN_predicted_CDS_4|1758_bp
atgagtcttaaatgtctgcctcatgggcttctcgtgcgaaccggggagctgtttttaaag
ccagtggagtggaatgtcctttcactaagccgttctgcaaaggtccagtgtggttctcta
aggacttcgctgccgtcgagacgcgcagcggatgccgctgcttgccgcggcggaggaagc
tacgccgctgcgccttcggttcaccttgcccccattaaagcgttagttagacagctccaa
acgccagctggccggagagacttcctgggagaaagtgctgtcgtcggggtcacagactcg
gagttgatccgcgagccttgggcaccggcggccagagcagaggggtgcccttccgagagc
agccgtcggtgccgagaagctgttgccccagcagccccaccgagtgacgcgcgggttttg
gccgggtcagcgcgcatacgcacacaggcacacccactcctggtcccgggggaaggctca
ctcttcggttttccagggctaaaggccaaggccacgcggatttctgagtccgaagacgca
gtgcgaggacttacacctggattcccttactatcggctggatgttgagttccgtttatgg
tctgatttccggcctctcgcctgctcgccccgccgcccgcctgtcccgctccctccctcc
cggggacccggaggagaggggaccatgccggaacccgggccggacgctgccggcaccgcc
agcgcacagccccaaccgccgccgccccccccacccgctcccaaggagtccccgttctcc
atcaagaacctgctcaacggagaccaccaccggccgccccctaagcctcagccgccccca
cggacgctcttcgcgccagcctcggctgccgccgccgccgccgctgccgctgccgcggcg
gccaagggggccctggagggcgccgcgggcttcgcgctctcgcaggtgggcgacctggct
ttccctcgctttgagatcccggcgcagaggtttgccctgcccgcgcactacctggagcgc
tccccagcctggtggtacccctacaccctgacccccgccggcggccacctcccgcgacct
gaagcctcggagaaggccttgctgagagactcctcccccgcctccggcacagaccgcgac
tctccggagccactgctcaaggccgaccccgatcacaaggagctggactccaagagcccg
gacgagatcattctggaggagagcgactccgaggaaagcaaaaaggaaggcgaagcggcg
ccaggcgcggccggggcgagcgtaggggcggcggcggccactccgggcgcagaagactgg
aagaagggcgctgaaagtccagagaagaagccggcgtgccgcaagaagaagacgcgcaca
gtcttctcgcgcagccaggtcttccagctcgagtccaccttcgacatgaagcgctatctg
agcagctcggagcgagccggcctggccgcgtccctgcacctcaccgagacgcaggtcaag
atctggttccagaaccgccgcaacaagtggaagcggcagctggcggcggagctggaggcg
gccaacctgagccatgccgcggcgcagcgcatcgtgcgggtgcccatcctctaccacgag
aactcggcggccgagggcgcggcggctgcagccgcgggggccccggtgccagtcagccag
ccgctgctcaccttcccgcaccccgtctactactcgcacccggtggtctcttccgtgccg
ctgctacggccggtctga

>gi568815588f:123048379_123250120|GENSCAN_predicted_peptide_5|311_aa
MPARPSARSGGAPVLCAWPERFEIFRGRSAALRLAAFGMGSKEDAGKGCPAAGGVSSFTI
QSILGGGPSEAPREPVGWPARKRSLSVSSEEEEPDDGWKAPACFCPDQHGPKEQGPKHHP
PIPFPCLGTPKGSGGSGPGGLERTPFLSPSHSDFKEEKERLLPAGSPSPGSERPRDGGAE
RQAGAAKKKTRTVFSRSQVYQLESTFDMKRYLSSSERACLASSLQLTETQVKTWFQNRRN
KWKRQLSAELEAANMAHASAQTLVSMPLVFRDSSLLRVPVPRSLAFPAPLYYPGSNLSAL
PLYNLYNKLDY

>gi568815588f:123048379_123250120|GENSCAN_predicted_CDS_5|936_bp
atgcccgccaggccctcggcgcgctctgggggcgcaccggtgctgtgtgcctggccggag
cgttttgaaattttccggggccgctcggcggcgctgcgattggccgcgttcgggatgggc
agcaaagaagatgcgggcaaggggtgtccggcggccggtggcgtctccagcttcaccatc
cagtccatcctgggcgggggcccctcggaggcaccgcgggagcccgtcggctggccagcc
aggaagcgcagcctgtccgtgtcctcggaggaggaggagccggacgacggctggaaggcg
cccgcctgcttctgcccagaccagcacggccctaaggagcagggccccaagcaccatccc
cccatcccttttccttgcctgggtacccccaagggcagcggaggctcgggcccgggcggc
ttggagcgcacgcctttcctctctccttcgcactcggactttaaagaagagaaagagagg
ctcctgcccgcgggctcgccctcgccggggtccgagcggccgcgggacggcggcgctgag
cggcaggccggcgcggccaagaagaagacgcgcaccgtcttttcgcgcagccaggtgtac
cagctcgagtccaccttcgacatgaagcgctacctgagcagctcggagcgcgcctgcctc
gcctccagcctgcagctcacggagacccaggtaaagacttggttccagaaccgccgcaac
aagtggaagcggcagctctcggctgagctggaggcggccaacatggcgcacgcgtcggcg
cagactctggtgagcatgccgctggtgttccgggacagttcgctgctgcgcgtgccggtg
ccgcgctcgctcgcctttcccgcgccgctctactacccgggaagcaacctctcggcctta
cctctctacaacctatacaacaagctcgactactga

>gi568815588f:123048379_123250120|GENSCAN_predicted_peptide_6|359_aa
MGRVRTLAGECSAQAQAQSLLAVVLSAPPSGGTPSARLSVRSPSPRDPWGLWAPVLQMTG
SNEFKLNQPPEDGISSVKFSPNTSQFLLVSSWDTSVRLYDVPANSMRLKYQHTGAVLDCA
FYVENLVGTHDAPIRCVEYCPEVNVMVTGSWDQTVKLWDPRTPCNAGTFSQPEKVYTLSV
SGDRLIVGTAGRRVLVWDLRNMGYVQQRRESSLKYQTRCIRAFPNKQGYVLSSIEGRVAV
EYLDPSPEVQKKKYAFKCHRLKENNIEQIYPVNAISFHNIHNTFATGGSDGFVNIWDPFN
KKRLCQFHRYPTSIASLAFSNDGTTLAIASSYMYEMDDTEHPEDGIFIRQVTDAETKPK

>gi568815588f:123048379_123250120|GENSCAN_predicted_CDS_6|1080_bp
atggggcgagtccggaccttggcgggcgagtgctcggcgcaggcgcaagcgcagagtctc
ctcgcggtcgtcctctcggcccctccctctggggggacccccagtgccaggctgtcagtg
cgcagccccagcccgcgggacccctggggactctgggcgcctgttctgcagatgaccggt
tctaacgagttcaagctgaaccagccacccgaggatggcatctcctccgtgaagttcagc
cccaacacctcccagttcctgcttgtctcctcctgggacacgtccgtgcgtctctacgat
gtgccggccaactccatgcggctcaagtaccagcacaccggcgccgtcctggactgcgcc
ttctacgtagaaaatcttgttgggacccatgatgcccctatcagatgtgttgaatactgt
ccagaagtgaatgtgatggtcactggaagttgggatcagacagttaaactgtgggatccc
agaactccttgtaatgctgggaccttctctcagcctgaaaaggtatataccctctcagtg
tctggagaccggctgattgtgggaacagcaggccgcagagtgttggtgtgggacttacgg
aacatgggttacgtgcagcagcgcagggagtccagcctgaaataccagactcgctgcata
cgagcgtttccaaacaagcagggttatgtattaagctctattgaaggccgagtggcagtt
gagtatttggacccaagccctgaggtacagaagaagaagtatgccttcaaatgtcacaga
ctaaaagaaaataatattgagcagatttacccagtcaatgccatttcttttcacaatatc
cacaatacatttgccacaggtggttctgatggctttgtaaatatttgggatccatttaac
aaaaagcgactgtgccaattccatcggtaccccacgagcatcgcatcacttgccttcagt
aatgatgggactacgcttgcaatagcgtcatcatatatgtatgaaatggatgacacagaa
catcctgaagatggtatcttcattcgccaagtgacagatgcagaaacaaaacccaagtga

>gi568815588f:123048379_123250120|GENSCAN_predicted_peptide_7|153_aa
MIPDSKTDSGPWMLGYLLPRALLICHSLGLQFVLSFRNGNDKRRNNGRAKKGHGHVQPIC
CTNCAQCVPKDKAIKKFVIRNIVEAAAVRDISEAGIFDAYVLPKLYVKLHYCVSCAIHNK
VVRNQSCEARKDRTPPPRFRPAGAAPPPLPKPI

>gi568815588f:123048379_123250120|GENSCAN_predicted_CDS_7|462_bp
atgatacctgattccaagaccgactctggtccctggatgctgggctacctccttcctagg
gccctgctgatctgccactcactgggcttgcagtttgtgctgtccttccgtaatgggaat
gacaaaagaaggaacaacggtcgtgccaaaaagggccatggccacgtgcagcctatttgc
tgcactaactgtgcccaatgcgtgcccaaggacaaggccattaagaaattcgtcattcga
aacatagtggaggccgcagcagtcagggacatttctgaagcgggcatcttcgatgcctat
gtgcttcccaagctgtatgtgaagctacattactgtgtgagttgtgcaattcacaacaaa
gtagtcaggaatcaatcttgtgaagcccgcaaggaccgaacacccccaccccgatttaga
cctgcgggtgctgccccacctcccctaccaaagcccatataa

>gi568815588f:123048379_123250120|GENSCAN_predicted_peptide_8|79_aa
MMFFLFAMEVGWSCRFCSLILDHGSTLVLVLTSLQGFPKSSPSSSKFYKSPGQGQNATSR
FAKTARVTFTPVHKFLISI

>gi568815588f:123048379_123250120|GENSCAN_predicted_CDS_8|240_bp
atgatgttctttctcttcgcaatggaagtaggatggagctgccgattctgctccctgatc
ctggaccacggctccacccttgtcctggtgttgacatcccttcagggattccctaaatca
tctccctcaagttcaaagttctacaaatctccagggcaggggcaaaatgccaccagtcgc
tttgctaaaacagcaagagtcacctttactccagttcacaagttcctcatctccatctga

>gi568815588f:123048379_123250120|GENSCAN_predicted_peptide_9|103_aa
XRLTVELEKPFGQAELYKMSSFLMQVIHTARQIPGGPSKGPRAKVIPVEISGTWIHKIPL
EILSLAFDDDDDDDDDDDDAETWGLPRLYLLGILGKGAHQNIF

>gi568815588f:123048379_123250120|GENSCAN_predicted_CDS_9|312_bp
ntgcgcctcaccgtggagctggagaagccctttgggcaggcagaactctacaaaatgagc
tcttttctgatgcaggtgatacacacagccagacaaatcccaggtggcccatccaaagga
ccaagagcaaaagttattccagttgagatcagtggaacttggattcacaaaatccccctg
gagatcctctctctggcctttgatgatgatgatgatgatgatgatgatgatgatgacgca
gaaacatgggggcttccaaggctgtacctcctaggcatcctggggaaaggagcccatcag
aacatcttctag