GENSCAN 1.0 Date run: 5-Nov-116 Time: 17:36:47

Sequence gi568815579f:43253715_43462317 : 208603 bp : 46.30% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.05 Intr -   4742   4488  255  0  0  103  119   129 0.954  14.92
 1.04 Intr -   5421   5143  279  1  0   72   65   247 0.994  18.25
 1.03 Intr -   8424   8146  279  1  0   69   75   181 0.824  12.35
 1.02 Intr -  14435  14070  366  0  0  100   74   197 0.881  14.62
 1.01 Init -  15717  15654   64  0  1   98   94    36 0.628   4.88
 1.00 Prom -  19339  19300   40                              -4.86

 2.13 PlyA -  20048  20043    6                               1.05
 2.12 Term -  25305  25214   92  1  2  100   43    92 0.560   3.78
 2.11 Intr -  29976  29722  255  1  0   95  119   207 0.732  21.92
 2.10 Intr -  35397  35355   43  0  1   65   74    21 0.295  -3.79
 2.09 Intr -  36523  36396  128  2  2   46   92   105 0.698   7.10
 2.08 Intr -  49639  49457  183  2  0   -5   56   155 0.072   2.76
 2.07 Intr -  49822  49667  156  2  0   86    0   114 0.270   2.38
 2.06 Intr -  50789  50755   35  1  2   78   94    38 0.560   1.27
 2.05 Intr -  51510  51349  162  2  0   39  100   102 0.931   5.59
 2.04 Intr -  51916  51835   82  2  1  102   89     8 0.316   1.00
 2.03 Intr -  74589  74550   40  0  1   93  115    43 0.120   5.30
 2.02 Intr -  79581  79369  213  0  0   96   70    50 0.086   2.91
 2.01 Init -  88042  87908  135  1  0   45  109    73 0.405   5.24
 2.00 Prom -  91780  91741   40                              -7.36

 3.00 Prom +  95426  95465   40                              -3.96
 3.01 Init + 100001 100052   52  1  1   69  109    20 0.834   3.31
 3.02 Intr + 100139 100279  141  0  0   71   81   170 0.999  14.92
 3.03 Intr + 100493 100678  186  0  0  100   51   209 0.999  18.06
 3.04 Intr + 101947 102069  123  2  0  111   99   -23 0.742   1.66
 3.05 Intr + 102278 102394  117  0  0  136  113   127 0.999  20.44
 3.06 Intr + 106551 106691  141  1  0   79   84    69 0.973   5.92
 3.07 Intr + 107429 107614  186  0  0  106   86   109 0.996  12.16
 3.08 Intr + 107731 107865  135  2  0  109  105   -22 0.721   2.14
 3.09 Intr + 108374 108514  141  0  0  135   59    48 0.584   6.92
 3.10 Term + 111900 112027  128  1  2   87   46    65 0.803   0.64
 3.11 PlyA + 113359 113364    6                               1.05

 4.12 PlyA - 114366 114361    6                              -1.95
 4.11 Term - 114792 114691  102  0  0   74   50    54 0.810  -1.52
 4.10 Intr - 115285 115115  171  1  0   23  103   107 0.476   5.84
 4.09 Intr - 115608 115499  110  1  2   87   56    56 0.850   2.30
 4.08 Intr - 119260 119120  141  2  0  135   59    42 0.890   6.32
 4.07 Intr - 119621 119487  135  0  0  109  105   -22 0.724   2.14
 4.06 Intr - 119842 119738  105  2  0   14   86   106 0.725   3.19
 4.05 Intr - 120801 120661  141  1  0   89   84    81 0.913   8.12
 4.04 Intr - 125077 124961  117  2  0  136  113   127 0.999  20.44
 4.03 Intr - 125409 125287  123  1  0  116   99   -23 0.797   2.16
 4.02 Intr - 126984 126889   96  1  0   79   68    34 0.518   0.48
 4.01 Init - 136288 136147  142  1  1   97   17   140 0.341   8.10
 4.00 Prom - 144084 144045   40                              -6.06

 5.00 Prom + 151040 151079   40                              -6.66
 5.01 Init + 154438 154518   81  0  0   85   77    87 0.971   6.27
 5.02 Term + 154589 154711  123  1  0   39   52   123 0.907   2.18
 5.03 PlyA + 154785 154790    6                               1.05

 6.00 Prom + 159248 159287   40                              -5.26
 6.01 Init + 162206 162269   64  1  1   81   38   134 0.941   7.01
 6.02 Intr + 162385 162528  144  2  0   75   67    72 0.942   4.05
 6.03 Intr + 162659 162841  183  0  0   92   80   115 0.803  10.86
 6.04 Intr + 164164 164292  129  2  0  120   82   -30 0.543   0.27
 6.05 Term + 164454 164683  230  1  2  116   48    89 0.682   4.49
 6.06 PlyA + 164864 164869    6                               1.05

 7.04 PlyA - 165493 165488    6                               1.05
 7.03 Term - 166013 165810  204  2  0   56   38   109 0.008   0.07
 7.02 Intr - 171246 171109  138  0  0   70  113    19 0.065   3.26
 7.01 Init - 195308 195228   81  2  0   75   59    38 0.131   0.57
 7.00 Prom - 200151 200112   40                              -2.16

 8.02 PlyA - 201190 201185    6                               1.05
 8.01 Term - 208133 207637  497  0  2  136   49   297 0.989  25.33

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init -  67102  66986  117  1  0   53  107    75 0.824   6.10
S.002 Term + 142383 142501  119  1  2  125   43    36 0.902   1.50

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815579f:43253715_43462317|GENSCAN_predicted_peptide_1|415_aa
MGPLPAPSCTQRITWKGLLLTASLLNFWNPPTTAEVTIEAQPPKVSEGKDVLLLVHNLPQ
NLPGYFWYKGEMTDLYHYIISYIVDGKIIIYGPAYSGRETVYSNASLLIQNVTRKDAGTY
TLHIIKRGDETREEIRHFTFTLYLETPKPYISSSNLNPREAMEAVRLICDPETLDASYLW
WMNGQSLPVTHRLQLSKTNRTLYLFGVTKYIAGPYECEIRNPVSASRSDPVTLNLLPKLP
IPYITINNLNPRENKDVLAFTCEPKSENYTYIWWLNGQSLPVSPGVKRPIENRILILPSV
TRNETGPYQCEIRDRYGGLRSNPVILNVLYGPDLPRIYPSFTYYRSGENLDLSCFTESNP
PAEYFWTINGKFQQSGQKLFIPQITRNHSGLYACSVHNSATGKEISKSMTVKVSX

>gi568815579f:43253715_43462317|GENSCAN_predicted_CDS_1|1245_bp
atggggcccctcccagccccttcctgcacacagcgcatcacctggaaggggctcctgctc
acagcatcacttttaaacttctggaacccgcccaccactgccgaagtcacgattgaagcc
cagccacccaaagtttctgaggggaaggatgttcttctacttgtccacaatttgccccag
aatcttcctggctacttctggtacaaaggggaaatgacggacctctaccattacattata
tcgtatatagttgatggtaaaataattatatatgggcctgcatacagtggaagagaaaca
gtatattccaacgcatccctgctgatccagaatgtcacccggaaggatgcaggaacctac
accttacacatcataaagcgaggtgatgagactagagaagaaattcgacatttcaccttc
accttatacttggagactcccaagccctacatctccagcagcaacttaaaccccagggag
gccatggaggctgtgcgcttaatctgtgatcctgagactctggacgcaagctacctatgg
tggatgaatggtcagagcctccctgtgactcacaggttgcagctgtccaaaaccaacagg
accctctatctatttggtgtcacaaagtatattgcaggaccctatgaatgtgaaatacgg
aacccagtgagtgccagtcgcagtgacccagtcaccctgaatctcctcccgaagctgccc
atcccctacatcaccatcaacaacttaaaccccagggagaataaggatgtcttagccttc
acctgtgaacctaagagtgagaactacacctacatttggtggctaaacggtcagagcctc
cccgtcagtcccggggtaaagcgacccattgaaaacaggatactcattctacccagtgtc
acgagaaatgaaacaggaccctatcaatgtgaaatacgggaccgatatggtggcctccgc
agtaacccagtcatcctaaatgtcctctatggtccagacctccccagaatttacccttca
ttcacctattaccgttcaggagaaaacctcgacttgtcctgcttcacggaatctaaccca
ccggcagagtatttttggacaattaatgggaagtttcagcaatcaggacaaaagctcttt
atcccccaaattactagaaatcatagcgggctctatgcttgctctgttcataactcagcc
actggcaaggaaatctccaaatccatgacagtcaaagtctctgnn

>gi568815579f:43253715_43462317|GENSCAN_predicted_peptide_2|507_aa
MTQFSMDKRQGRSRSVTITNTAVTSLEWVPFHAFAIPRSTLVSQQGATSHTDNTQEMDSP
GQEPGFAKEVEVEPGFQPLPQMTLLVPAMCLLLHGACSAKGFCAAPHFLLASPMGKGQVP
LNPFSFTLSEELDLPQSLKRNPKGCIARRAKPILAAERHKRLILHTSTKENTLLDNRVIE
CLTMEAVAKFNIMKEREGPSHQPALSPPKARWHILWQKLMFTASLLTFWNPPTTARVTTE
AMPFNATEEEEEEFFLLAHNLPLKVMGYNWHKGRRYATGIQRSSSGPAYSSQEVIFANAS
LLTRKVTQDDTGFYTLQIIKTDFTIEEATGYFRVHPEKQAALQAAENFGDEQHISYNTPK
GKEGDTESEEIAETPFQIRELLANNQLVKQEIHGPDEPTTYSSDTYYYAGSNLNLSCLMG
SNPSAEYSWLLNGNNQQTGQELFIPQVTTENSGDYLCYVHNPVTNGKNFATKKITVPDTS
VQGSSPGLSGRATVSILIEVLASVALI

>gi568815579f:43253715_43462317|GENSCAN_predicted_CDS_2|1524_bp
atgacccagttctctatggataaacgtcagggtcgttcccggtctgtgaccatcaccaac
actgccgtgactagtctagaatgggtgccatttcatgcatttgcaattcccagaagtaca
ctggttagtcagcagggggcgacatcacacacagataacactcaggaaatggattcccct
ggacaggaacctggctttgctaaggaggtagaggtggagcctggtttccagcccttgccc
caaatgacccttctggtccctgccatgtgcctgctccttcatggtgcctgctctgccaag
ggtttttgtgcagctccccacttcctcctggcatcacccatggggaaggggcaagtacca
ctcaaccccttctccttcacccttagcgaggagcttgatctgcctcaatccctcaagagg
aaccccaagggatgtatagcaagaagggctaagcccatcctggctgcagagaggcataag
agattgatcctgcacacctccacgaaggaaaataccctcttggacaaccgggtcatcgaa
tgtctaaccatggaagctgtagcaaagttcaacataatgaaagagagggagggtccttct
caccagccagcactcagcccacccaaagcaagatggcacatcctctggcagaagctcatg
ttcacagcctcacttttaaccttctggaacccacccaccactgccagagtcactactgaa
gccatgccattcaatgccacagaggaggaggaggaggagttttttctacttgcccacaat
ctgcccctgaaagtgatgggctacaattggcacaaaggaagaagatatgcaacaggcatt
caaagatctagctcaggacctgcatacagtagtcaggaggtaatatttgccaatgcatcc
ctgctgacccggaaagtcacccaggatgacacaggattctataccctacaaatcataaag
acagattttaccattgaagaagcaactggatatttccgtgtacaccctgaaaaacaggca
gctctgcaagcagcagagaattttggagatgagcaacatatctcctataatacaccaaaa
gggaaggaaggagatacagaaagtgaagaaatagcagaaacaccattccaaataagggag
ctcttagctaacaatcagctggtaaagcaagaaatacatggcccagatgaacccacaact
tattcttcagacacctattactatgcagggtcaaacctcaacctctcctgcctcatgggc
tctaacccatcagcagagtattcttggctgctgaatgggaataaccagcaaacaggacaa
gagctctttatcccccaagtcactacagagaatagtggggactatctgtgttatgtccat
aacccagtcactaatggcaaaaacttcgcaaccaagaaaatcacagtccctgatacttca
gtacaaggaagttctcctggcctttcaggtagagccacggtcagcatcttgattgaagta
ctggccagtgtggctctgatctag

>gi568815579f:43253715_43462317|GENSCAN_predicted_peptide_3|449_aa
MSAVLLLALLGFILPLPGVQALLCQFGTVQHVWKVSDLPRQWTPKNTSCDSGLGCQDTLM
LIESGPQVSLVLSKGCTEAKDQEPRVTEHRMGPGLSLISYTFVCRQEDFCNNLVNSLPLW
APQPPADPGSLRCPVCLSMEGCLEGTTEEICPKGTTHCYDGLLRLRGGGIFSNLRVQGCM
PQPVCNLLNGTQEIGPVGMTENCDMKDFLTCHRGTTIMTHGNLAQEPTDWTTSNTEMCEV
GQVCQETLLLLDVGLTSTLVGTKGCSTVGAQNSQKTTIHSAPPGVLVASYTHFCSSDLCN
SASSSSVLLNSLPPQAAPVPGDRQCPTCVQPLGTCSSGSPRMTCPRGATHCYDGYIHLSG
GGLSTKMSIQGCVAQPSSFLLNHTRQIGIFSAREKRDVQPPASQHEGEPLRFCPSEGFPF
ITPQDHLKWALRQPPSTSAGRQLTVRTHP

>gi568815579f:43253715_43462317|GENSCAN_predicted_CDS_3|1350_bp
atgagcgcggtattactgctggccctcctggggttcatcctcccactgccaggagtgcag
gcgctgctctgccagtttgggacagttcagcatgtgtggaaggtgtccgacctgccccgg
caatggacccctaagaacaccagctgcgacagcggcttggggtgccaggacacgttgatg
ctcattgagagcggaccccaagtgagcctggtgctctccaagggctgcacggaggccaag
gaccaggagccccgcgtcactgagcaccggatgggccccggcctctccctgatctcctac
accttcgtgtgccgccaggaggacttctgcaacaacctcgttaactccctcccgctttgg
gccccacagcccccagcagacccaggatccttgaggtgcccagtctgcttgtctatggaa
ggctgtctggaggggacaacagaagagatctgccccaaggggaccacacactgttatgat
ggcctcctcaggctcaggggaggaggcatcttctccaatctgagagtccagggatgcatg
ccccagccagtttgcaacctgctcaatgggacacaggaaattgggcccgtgggtatgact
gagaactgcgatatgaaagattttctgacctgtcatcgggggaccaccattatgacacac
ggaaacttggctcaagaacccactgattggaccacatcgaataccgagatgtgcgaggtg
gggcaggtgtgtcaggagacgctgctgctcctagatgtaggactcacatcaaccctggtg
gggacaaaaggctgcagcactgttggggctcaaaattcccagaagaccaccatccactca
gcccctcctggggtgcttgtggcctcctatacccacttctgctcctcggacctgtgcaat
agtgccagcagcagcagcgttctgctgaactccctccctcctcaagctgcccctgtccca
ggagaccggcagtgtcctacctgtgtgcagccccttggaacctgttcaagtggctccccc
cgaatgacctgccccaggggcgccactcattgttatgatgggtacattcatctctcagga
ggtgggctgtccaccaaaatgagcattcagggctgcgtggcccaaccttccagcttcttg
ttgaaccacaccagacaaatcgggatcttctctgcgcgtgagaagcgtgatgtgcagcct
cctgcctctcagcatgagggagaacccctgaggttctgcccatctgaaggatttcccttc
atcactcctcaggaccacctgaagtgggcactaaggcagccgccgtccacttctgctggg
cgccagctcaccgtgaggacccatccataa

>gi568815579f:43253715_43462317|GENSCAN_predicted_peptide_4|460_aa
MAGGGLKKIISATEKKLARMLLAPVETGRPGETTVFIQGGMGGGANLDQDEEGISAIVAP
NSKEKRRKKNRKPSTGSAADPGSLRCPVCLSMEGCLEGTTEEICPKGTTHCYDGLLRLRG
GGIFSNLRVQGCMPQPVCNLLNGTQEIGPVGMTENCDMKDVLTCHRGTTLKKQENLSKEP
TDWATSNTETCEVGQVCQEMLLLIDVAPPGVLVASYTHFCSSDLCNSASSSSVLLNSLPP
QAAPVPGDRQCPTCVQPLGTCSSGSPRMTCPRGATHCYDGYIHLSGGGLTTRMSIQGCVA
QPSSSLLNHTRQIGIFSVCEKGDEPPPASQHEGEPLRFCPSEGFPFITPQDHLKWALRQP
PSTSAGRQLTLNVRFVFGQRKAISFGLFTPLASVNNAAMNKDMQIPAGAPALSSFQNTPE
VELLHRVCILEVFPSGSTEGYFNKNGQHRWTAIRAMKSLA

>gi568815579f:43253715_43462317|GENSCAN_predicted_CDS_4|1383_bp
atggccggaggaggcctgaaaaagatcatctctgccacagagaagaagctggctcggatg
ctgctggccccagtggagacagggagacccggggagacaactgtattcattcaaggagga
atgggtggtggtgcaaacttagaccaagatgaggaaggaatttctgcgattgttgcacct
aacagcaaagaaaagagaagaaaaaaaaacagaaaacccagtaccggcagtgctgcagac
ccaggatccttgaggtgcccagtctgcttgtctatggaaggctgtctggaggggacaaca
gaagagatctgccccaaggggaccacacactgttatgatggcctcctcaggctcagggga
ggaggcatcttctccaatctgagagtccagggatgcatgccccagccagtttgcaacctg
ctcaatgggacacaggaaattgggcccgtgggtatgactgagaactgcgatatgaaagat
gttctgacctgtcatcgggggaccaccttgaagaagcaggaaaacttgagtaaagaaccc
actgattgggccacatctaataccgagacgtgcgaggtggggcaggtgtgtcaggagatg
ctgctgctcatagatgtagcccctcctggggtgcttgtggcctcctatacccacttctgc
tcctcggacctgtgcaatagtgccagcagcagcagcgttctgctgaactccctccctcct
caagctgcccctgtcccaggagaccggcagtgtcctacctgtgtgcagccccttggaacc
tgttcaagtggctccccccgaatgacctgccccaggggcgccactcattgttatgatggg
tacattcatctctcaggaggtgggctgaccaccagaatgagcattcagggctgtgtggcc
caaccttccagctccttgttgaaccacaccagacaaattgggatcttctctgtgtgtgag
aagggtgatgagccgcctcctgcctctcagcatgagggagaacccctgaggttctgccca
tctgaaggatttcccttcatcactcctcaggaccacctgaagtgggcactaaggcagccg
ccgtccacttctgctgggcgccagctcaccctgaacgtgagatttgtgttcggacagagg
aaagccatatcatttggcttgtttacacctttggctagtgtgaataatgctgctatgaac
aaagacatgcagatacctgctggggcccctgctctcagttcttttcagaatacaccagaa
gtagaattgctgcatcgtgtgtgtatcttggaagtctttccaagtgggtcgacagaagga
tacttcaataaaaatgggcaacaccgctggacagccatcagagccatgaagtccctggct
tag

>gi568815579f:43253715_43462317|GENSCAN_predicted_peptide_5|67_aa
MPSGGARPALWAFRGLQGTCASGGGEAGLDRGSGSEQGTSEGGPVAPIRGVAVGQEVAVA
AASPVAI

>gi568815579f:43253715_43462317|GENSCAN_predicted_CDS_5|204_bp
atgccctcaggaggcgcccgccccgccctctgggctttccgagggctgcaaggaacatgc
gcttccgggggtggagaggcggggctagatcggggttcggggagcgaacaaggcacgtcg
gagggtggcccggtggcccctattcgaggggtggctgtgggccaggaggtagcggtggcg
gcggcctcgcctgtggccatctga

>gi568815579f:43253715_43462317|GENSCAN_predicted_peptide_6|249_aa
MGTPRIQHLLILLVLGASLLTSGLELYCQKGLSMTVEADPANMFNWTTEEVETCDKGALC
QETILIIKAGTETAILATKGCIPEGEEAITIVQHSSPPGLIVTSYSNYCEDSFCNDKDSL
SQFWEFSETTASTVSTTLHCPTCVALGTCFSAPSLPCPNGTTRCYQGKLEITGGGIESSV
EVKGCTAMIGCRLMSGILAVGPMFVREACPHQLLTQPRKTENGATCLPIPVWGLQLLLPL
LLPSFIHFS

>gi568815579f:43253715_43462317|GENSCAN_predicted_CDS_6|750_bp
atgggaacccctcgtatccagcatttgctgatcctcctggtcctaggagcctccctcctg
acctcgggcctagagctgtattgtcaaaagggtctgtccatgactgtggaagcagatcca
gccaatatgtttaactggaccacagaggaagtggagacttgtgacaaaggggcactttgc
caggaaaccatactaataattaaagcagggactgagacagccattttggccacgaagggc
tgcatcccggaaggggaggaggccataacaattgtccagcactcttcacctcccggcctg
atcgtgacctcctacagtaactactgtgaggattccttctgtaatgacaaagacagcctg
tctcagttttgggagttcagtgagaccacagcttccactgtgtcaacaaccctccattgt
ccaacctgtgtggctttggggacctgtttcagtgctccttctcttccctgtcccaatggt
acaactcgatgctatcaaggaaaacttgagatcactggaggtggcattgagtcgtctgtg
gaggtcaaaggctgtacagccatgattggctgcaggctgatgtctggaatcttagcagta
ggacccatgtttgtgagggaagcgtgcccacatcagctgctcactcaacctcgaaagact
gaaaatggggccacctgtcttcccattcctgtttgggggttacagctactgctgccattg
ctgctgccatcatttattcacttttcctaa

>gi568815579f:43253715_43462317|GENSCAN_predicted_peptide_7|140_aa
MNHNELSEPLAFCIGLPNGETLARNWRHCCEINEKIYGMNGEEDTVLLMSLEKGVLIQTP
REDFWILCKKEFKPGLSKVNLKNWFRPCCGKGESAMPQYTRLPFGFWEKLTSMNINTDLK
SDKHLQCILSEACDMVASFV

>gi568815579f:43253715_43462317|GENSCAN_predicted_CDS_7|423_bp
atgaaccacaatgagctttctgaacctctagctttctgcatcggtttgcccaatggggag
acactggcaagaaactggaggcactgttgtgaaataaatgaaaagatatatggtatgaat
ggggaagaagacacagtattattaatgtcgctggaaaaaggggtcttgatccagactcca
agagaggatttttggatcttatgcaagaaagaattcaagccagggctttccaaagttaac
ctgaaaaactggttcaggccatgctgtgggaagggagagtcggccatgcctcaatatacc
cgcctcccttttggattttgggaaaagctgaccagcatgaacatcaacacagaccttaag
tctgataaacatttacaatgtattctctctgaagcctgcgacatggtggcttcatttgtg
taa

>gi568815579f:43253715_43462317|GENSCAN_predicted_peptide_8|165_aa
XNVTVSLPVRGCVQDEFCTRDGVTGPGFTLSGSCCQGSRCNSDLRNKTYFSPRIPPLVRL
PPPEPTTVASTTSVTTSTSAPVRPTSTTKPMPAPTSQTPRQGVEHEASRDEEPRLTGGAA
GHQDRSNSGQYPAKGGPQQPHNKGCVAPTAGLAALLLAVAAGVLL

>gi568815579f:43253715_43462317|GENSCAN_predicted_CDS_8|498_bp
nctaatgtgactgtgtccttgcctgtccggggctgtgtccaggatgaattctgcactcgg
gatggagtaacaggcccagggttcacgctcagtggctcctgttgccaggggtcccgctgt
aactctgacctccgcaacaagacctacttctcccctcgaatcccaccccttgtccggctg
ccccctccagagcccacgactgtggcctcaaccacatctgtcaccacttctacctcggcc
ccagtgagacccacatccaccaccaaacccatgccagcgccaaccagtcagactccgaga
cagggagtagaacacgaggcctcccgggatgaggagcccaggttgactggaggcgccgct
ggccaccaggaccgcagcaattcagggcagtatcctgcaaaaggggggccccagcagccc
cataataaaggctgtgtggctcccacagctggattggcagcccttctgttggccgtggct
gctggtgtcctactgtga