GENSCAN 1.0 Date run: 4-Nov-116 Time: 06:23:28 Sequence gi568815587f:113804934_114046134 : 241201 bp : 44.81% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.20 Intr - 1651 1556 96 1 0 85 65 101 0.602 7.71 1.19 Intr - 3203 3018 186 2 0 90 77 135 0.975 12.49 1.18 Intr - 3504 3365 140 1 2 58 78 147 0.999 10.88 1.17 Intr - 4321 4130 192 2 0 109 65 147 0.999 13.96 1.16 Intr - 7571 7343 229 2 1 78 98 151 0.539 12.54 1.15 Intr - 9022 8952 71 2 2 74 75 35 0.842 -0.30 1.14 Intr - 10449 10241 209 2 2 81 58 90 0.880 4.02 1.13 Intr - 12904 12725 180 0 0 53 90 75 0.489 3.18 1.12 Intr - 16373 16296 78 1 0 35 49 116 0.324 1.07 1.11 Intr - 18767 18672 96 1 0 53 113 115 0.978 9.62 1.10 Intr - 22427 22300 128 2 2 45 109 37 0.799 0.98 1.09 Intr - 24412 24264 149 2 2 78 94 144 0.936 13.95 1.08 Intr - 26135 25934 202 2 1 59 94 30 0.349 -0.44 1.07 Intr - 28624 28487 138 1 0 71 87 49 0.731 3.76 1.06 Intr - 29402 29316 87 2 0 69 98 42 0.714 3.47 1.05 Intr - 36224 36143 82 1 1 56 77 22 0.022 -2.46 1.04 Intr - 36835 36730 106 2 1 95 30 51 0.016 -0.73 1.03 Intr - 47700 47568 133 0 1 111 58 106 0.204 10.12 1.02 Intr - 69676 69584 93 1 0 44 94 38 0.234 0.16 1.01 Init - 70727 70512 216 2 0 66 60 217 0.363 13.39 1.00 Prom - 90602 90563 40 -4.46 2.00 Prom + 96638 96677 40 -6.46 2.01 Init + 100016 100052 37 1 1 53 109 94 0.489 6.28 2.02 Intr + 104362 104522 161 2 2 68 79 121 0.620 8.91 2.03 Intr + 126825 126934 110 2 2 75 81 98 0.359 6.88 2.04 Intr + 127356 127525 170 0 2 49 97 82 0.997 4.79 2.05 Intr + 128003 128160 158 0 2 77 51 159 0.895 10.83 2.06 Intr + 138049 138259 211 0 1 70 99 236 0.241 21.39 2.07 Intr + 139640 139822 183 0 0 98 89 160 0.997 16.86 2.08 Term + 140969 141204 236 0 2 70 49 194 0.999 10.18 2.09 PlyA + 141410 141415 6 1.05 3.06 PlyA - 141896 141891 6 1.05 3.05 Term - 142700 142557 144 2 0 40 33 70 0.296 -5.39 3.04 Intr - 143163 143117 47 1 2 79 80 33 0.379 -0.27 3.03 Intr - 144472 144360 113 0 2 27 105 124 0.538 8.02 3.02 Intr - 147683 147519 165 1 0 57 98 73 0.158 4.28 3.01 Init - 159700 159564 137 1 2 73 -5 156 0.094 4.41 3.00 Prom - 161922 161883 40 -4.16 4.00 Prom + 162597 162636 40 -3.56 4.01 Init + 170393 170459 67 1 1 56 121 186 0.988 17.93 4.02 Intr + 172838 172989 152 0 2 80 65 145 0.967 11.28 4.03 Intr + 174300 174344 45 2 0 105 99 101 0.997 11.51 4.04 Intr + 176270 176379 110 1 2 112 62 195 0.999 18.38 4.05 Intr + 178187 178356 170 2 2 84 81 282 0.963 26.69 4.06 Intr + 181025 181242 218 0 2 99 73 147 0.529 12.52 4.07 Intr + 181585 181795 211 0 1 91 87 359 0.656 34.59 4.08 Intr + 181892 182113 222 0 0 62 94 159 0.941 12.00 4.09 Term + 184532 184830 299 0 2 96 48 461 0.993 38.33 4.10 PlyA + 185357 185362 6 1.05 5.13 PlyA - 185449 185444 6 1.05 5.12 Term - 190125 190094 32 1 2 109 50 7 0.131 -2.98 5.11 Intr - 191137 190937 201 2 0 85 94 46 0.337 4.16 5.10 Intr - 195962 195764 199 2 1 93 47 104 0.259 5.72 5.09 Intr - 210127 210042 86 2 2 132 89 8 0.039 4.84 5.08 Intr - 210524 210439 86 1 2 67 16 51 0.014 -4.74 5.07 Intr - 216645 216544 102 2 0 15 119 121 0.839 7.19 5.06 Intr - 218187 218051 137 0 2 109 94 -21 0.834 -0.03 5.05 Intr - 219780 219713 68 1 2 100 90 58 0.886 5.82 5.04 Intr - 222623 222495 129 0 0 76 97 20 0.207 2.37 5.03 Intr - 224499 224359 141 1 0 51 107 29 0.124 1.42 5.02 Intr - 233256 233121 136 0 1 77 87 63 0.689 5.24 5.01 Init - 235256 235197 60 2 0 70 62 68 0.766 3.65 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815587f:113804934_114046134|GENSCAN_predicted_peptide_1|937_aa MPGAGHVGQASGARCGRGRDGAGWRAGAGPSPERLGRRRLRRETGDPRLGASAMTAELQQ DDAAGAADGHGSRKREICASCSNTSLEMRNYICRTKLADRYPQASNGDITQAVSLLTDER VKEPSQDTVATEPSEVEGSAANKEVLAKVIDLTHDNKDDLQAAIALSLLESPKIQADGRD LNREHRAVFYVLGNLEALIFSTVSLLPYSMSLFQLPEFRRLVLSYSLPQNVLENCRSHTE KRNIMFMQELQYLFALMMGSNRKFVDPSAALDLLKGAFRSSEEQQIIPGSLGVDENALFL LLNDAWILCSTYLNPNNPELSPFPKSSSPRNKSENPMVQLFYGTFLTEGVREGKPFCNNE TFGQYPLQVNGYRNLDECLEGAMVEGDVELLPSDHSVKYGQERWFTKLPPVLTFELSRFE FNQSLGQPEKIHNKLEFPQIIYMDRYMYRSKELIRNKRECIRKLKEEIKILQQKLESTVS YSGDVLHNEYILPPGHTVDCQTRYVKYGSGPARFPLPDMLKYVIEFASTKPASESCPPES DTHMTLPLSSVHCSVSDQTSKESTSTESSSQDVESTFSSPEDSLPKSKPLTSSRSSMEMP SQPAPRTVTDEEINFVKTCLQRWRSEIEQDIQDLKTCIASTTQTIEQMYCDPLLRQVPYR LHAVLVHEGQANAGHYWAYIYNQPRQSWLKYNDISVTESSWEEVERDSYGGLRNVSAYCL MYINDKLPYFNAEAAPTESDQMSEVEALSVELKHYIQEDNWRFEQEVEEWEEEQSCKIPQ MESSTNSSSQDYSTSQEPSVASSHGVRCLSSEHAVIVKEQTAQAIANTARAYEKSGVEAA LSELKEAEPKKPMPQETNLAEQSEQPPKANDAESTAQPNSEVSEVEIPSVGRILVRSDAD GYDEEVMLSPAMQGVILAIAKARQTFDRDGSEAGLIK >gi568815587f:113804934_114046134|GENSCAN_predicted_CDS_1|2811_bp atgccgggggcggggcatgtgggccaagcctccggcgcgcgctgcgggcgggggcgggac ggagcggggtggagggcgggggcggggcctagtcctgagaggctgggccggcggcggctg cggcgggagaccggtgacccgcggctgggcgcctcggccatgactgcggagctgcagcag gacgacgcggccggcgcggcagacggccacggctcgagaaaacgggaaatctgtgccagt tgttcaaatacgagtttggagatgagaaactatatttgtcgtacaaagctggcagacaga taccctcaggccagtaatggtgacattactcaggcagtcagccttctcactgatgagaga gttaaggagcccagtcaagacactgttgctacagaaccatctgaagtagaggggagtgct gccaacaaggaagtattagcaaaagttatagaccttactcatgataacaaagatgatctt caggctgccattgctttgagtctactggagtctcccaaaattcaagctgatggaagagat cttaacagggaacacagggctgtgttttatgtcctgggcaatttggaggctcttatcttt agcacagtgtctctcctgccctattcgatgtctctctttcaattgcctgaatttcgaaga cttgttctcagttatagtctgccacaaaatgtacttgaaaattgtcgaagtcatacagaa aagagaaatatcatgtttatgcaagagcttcagtatttgtttgctctaatgatgggatca aatagaaaatttgtagacccgtctgcagccctggatctattaaagggagcattccgatca tctgaggaacagcagatcatacctggctctttgggggttgatgaaaatgcactatttttg cttttgaatgatgcttggattttatgtagcacatatcttaatccaaacaatccagaattg tctccttttcctaaaagcagcagtcccaggaacaaatctgaaaatccaatggtgcagctg ttctatggtactttcctgactgaaggggttcgtgaaggaaaacccttttgtaacaatgag accttcggccagtatcctcttcaggtaaacggttatcgcaacttagacgagtgtttggaa ggggccatggtggagggtgatgttgagcttcttccctccgatcactcggtgaagtatgga caagagcgttggtttacaaagctacctccagtgttgacctttgaactctcaagatttgag tttaatcagtcccttgggcagccagagaaaattcacaataagctggaatttcctcagatt atttatatggacaggtacatgtacaggagcaaggagcttattcgaaataagagagagtgt attcgaaagttgaaggaggaaataaaaattctgcagcaaaaattggaaagcactgtcagc tacagcggagatgtgcttcataacgagtacatccttccccctggccacactgtggactgc cagaccaggtatgtgaaatatggctcaggcccagctcggttcccgctcccggacatgctg aaatatgttattgaatttgctagtacaaaacctgcctcagaaagctgtccacctgaaagt gacacacatatgacattaccactttcttcagtgcactgctcggtttctgaccagacatcc aaggaaagtacaagtacagaaagctcttctcaggatgttgaaagtaccttttcttctcct gaagattctttacccaagtctaaaccactgacatcttctcggtcttccatggaaatgcct tcacagccagctccacgaacagtcacagatgaggagataaattttgttaagacctgtctt cagagatggaggagtgagattgaacaagatatacaagatttaaagacttgtattgcaagt actactcagactattgaacagatgtactgcgatcctctccttcgtcaggtgccttatcgc ttgcatgcagttcttgttcatgaaggacaagcaaatgctggacactattgggcctatatc tataatcaaccccgacagagctggctcaagtacaatgacatctctgttactgaatcttcc tgggaagaagttgaaagagattcctatggaggcctgagaaatgttagtgcttactgtctg atgtacattaatgacaaactaccctacttcaatgcagaggcagccccaactgaatcagat caaatgtcagaagtggaagccctatctgtggaactcaagcattacattcaggaggataac tggcggtttgagcaggaagtagaggagtgggaagaagagcagtcttgcaaaatccctcaa atggagtcctccaccaactcctcatcacaggactactctacatcacaagagccttcagta gcctcttctcatggggttcgctgcttgtcgtctgagcatgctgtgattgtaaaggagcaa actgcccaggctattgcaaacacagcccgtgcctatgagaagagcggtgtagaagcggca ctgagtgagcttaaagaagctgaacccaagaagcccatgccccaggaaacaaaccttgca gagcagtcagaacagcccccaaaggctaatgatgcagagtctactgcccagcctaattct gaggtctctgaagtcgagattcccagtgtgggaaggattctggttagatctgatgcagat ggatatgatgaggaggtgatgctgagccctgccatgcaaggggtcatcctggccatagct aaagcccgtcagacctttgaccgagatgggtctgaagcagggctgattaag >gi568815587f:113804934_114046134|GENSCAN_predicted_peptide_2|421_aa MAPLWACILVAAGILATDTHHPQDSALYHLSKQLLQKYHKEVRPVYNWTKATTVYLDLFV HAILDVVWNDEFLSWNSSMFDEIREISLPLSAIWAPDIIINEFVDIERYPDLPYVYVNSS GTIENYKPIQVVSACSLETYAFPFDVQNCSLTFKSILHTVEDVDLAFLRSPEDIQHDKKA FLNDSEWELLSVSSTYSILQSSAGGFAQIQFNVVMRRHPLVYVVSLLIPSIFLMLVDLGS FYLPPNCRARIVFKTSVLVGYTVFRVNMSNQVPRSVGSTPLIGHFFTICMAFLVLSLAKS IVLVKFLHDEQRGGQEQPFLCLRGDTDADRPRVEPRAQRAVVTESSLYGEHLAQPGTLKE VWSQLQSISNYLQTQDQTDQQEAEWLVLLSRFDRLLFQSYLFMLGIYTITLCSLWALWGG V >gi568815587f:113804934_114046134|GENSCAN_predicted_CDS_2|1266_bp atggctcccctgtgggcctgcatcctggtggctgcaggaattctagccacagatacacat catccccaggattctgctctgtatcatctcagcaagcagctattacagaaatatcataaa gaagtgagacctgtttacaactggaccaaggccaccacagtctacctggacctgttcgtc catgctatattggatgtggtctggaatgatgaatttttatcctggaactccagcatgttt gatgagattagagagatctccctacctctaagtgccatctgggcccccgatatcatcatc aatgagtttgtggacattgaaagataccctgaccttccctatgtttatgtgaactcatct gggaccattgagaactataagcccatccaggtggtctctgcgtgcagtttagagacatat gcttttccatttgatgtccagaattgcagcctgaccttcaagagcattctgcatacagtg gaagacgtagacctggcctttctgaggagcccagaagacattcagcatgacaaaaaggcg tttttgaatgacagtgagtgggaacttctatctgtgtcctccacatacagcatcctgcag agcagcgctggaggatttgcacagattcagtttaatgtggtgatgcgcaggcaccccctg gtctatgtcgtgagtctgctgattcctagcatctttctcatgctggtggacctggggagc ttctacctgccacccaactgccgagccaggattgtgttcaagaccagtgtgctggtgggc tacaccgtcttcagggtcaacatgtccaaccaggtgccacggagtgtagggagcacccct ctgattgggcacttcttcaccatctgcatggccttcttggttctcagcttagctaagtcc atcgtgttggtcaaattcctccatgatgagcagcgtggtggacaggagcagcccttcttg tgccttcgaggggacaccgatgctgacaggcctagagtggaacccagggcccaacgtgct gtggtaacagagtcctcgctgtatggagagcacctggcccagccaggaaccctgaaggaa gtctggtcgcagcttcaatctatcagcaactacctccaaactcaggaccagacagaccaa caggaggcagagtggctggtcctcctgtcccgctttgaccgactgctcttccaaagctac cttttcatgctggggatctacaccatcactctgtgctccctctgggcactgtggggcggc gtgtga >gi568815587f:113804934_114046134|GENSCAN_predicted_peptide_3|201_aa MREQRQAFSPIRSGNGGLTGSGAQQTTCWIQKDGSQRQVCNGGKQHLALFKEDEQGDQEL AVHMQPGLPVTLRRDDLPARLTHTPGLQRPTQAVGEVQRVRRRKCTGDQFGGQASLLKGC LLEADVKNLGNVICEWNKAAVEFHGQLALSTNWEKRKSGHRHEHAQRKGHVKTKQEVSNL QAKDRAQKKATHPVCTLILGF >gi568815587f:113804934_114046134|GENSCAN_predicted_CDS_3|606_bp atgcgggaacaaaggcaagcctttagcccaatcaggagtggcaatgggggcctcactgga tcaggagcacagcagacaacctgctggatccagaaggatggaagtcagcggcaggtctgc aacggcggcaaacagcacctggcgctcttcaaagaggatgagcagggtgaccaggagctg gctgtccacatgcagcccgggttaccagtgacactgaggagggatgaccttcccgctaga cttacacacaccccaggccttcaaaggccaacacaggctgtgggagaggtacagagggta agacgaaggaagtgtaccggggaccagtttggcggccaggcttcactcttaaagggctgc ttgttagaggctgatgtaaagaacctgggcaatgtcatctgcgaatggaacaaagcagca gtggaatttcatggacaattggcgctgagcactaactgggagaagagaaaatctggacac agacatgagcatgcacagaggaaaggtcatgtgaaaacaaagcaagaagtcagcaacctg caagccaaggacagggctcagaagaaagccacccaccctgtttgcaccttgatcttaggc ttctag >gi568815587f:113804934_114046134|GENSCAN_predicted_peptide_4|497_aa MLLWVQQALLALLLPTLLAQGEARRSRNTTRPALLRLSDYLLTNYRKGVRPVRDWRKPTT VSIDVIVYAILNVDEKNQVLTTYIWYRQYWTDEFLQWNPEDFDNITKLSIPTDSIWVPDI LINEFVDVGKSPNIPYVYIRHQGEVQNYKPLQVVTACSLDIYNFPFDVQNCSLTFTSWLH TRSSRLWVLDSKAGLLYSLSVQDINISLWRLPEKVKSDRSVFMNQGEWELLGVLPYFREF SMESSNYYAEMKFYVVIRRRPLFYVVSLLLPSIFLMVMDIVGFYLPPNSGERVSFKITLL LGYSVFLIIVSDTLPATAIGTPLIGVYFVVCMALLVISLAETIFIVRLVHKQDLQQPVPA WLRHLVLERIAWLLCLREQSTSQRPPATSQATKTDDCSAMGNHCSHMGGPQDFEKSPRDR CSPPPPPREASLAVCGLLQELSSIRQFLEKRDEIREVARDWLRVGSVLDKLLFHIYLLAV LAYSITLVMLWSIWQYA >gi568815587f:113804934_114046134|GENSCAN_predicted_CDS_4|1494_bp atgctgctgtgggtccagcaggcgctgctcgccttgctcctccccacactcctggcacag ggagaagccaggaggagccgaaacaccaccaggcccgctctgctgaggctgtcggattac cttttgaccaactacaggaagggtgtgcgccccgtgagggactggaggaagccaaccacc gtatccattgacgtcattgtctatgccatcctcaacgtggatgagaagaatcaggtgctg accacctacatctggtaccggcagtactggactgatgagtttctccagtggaaccctgag gactttgacaacatcaccaagttgtccatccccacggacagcatctgggtcccggacatt ctcatcaatgagttcgtggatgtggggaagtctccaaatatcccgtacgtgtatattcgg catcaaggcgaagttcagaactacaagccccttcaggtggtgactgcctgtagcctcgac atctacaacttccccttcgatgtccagaactgctcgctgaccttcaccagttggctgcac accaggtccagcaggctctgggtactagattccaaagctggcttgctttattctctctca gtccaggacatcaacatctctttgtggcgcttgccagaaaaggtgaaatccgacaggagt gtcttcatgaaccagggagagtgggagttgctgggggtgctgccctactttcgggagttc agcatggaaagcagtaactactatgcagaaatgaagttctatgtggtcatccgccggcgg cccctcttctatgtggtcagcctgctactgcccagcatcttcctcatggtcatggacatc gtgggcttctacctgccccccaacagtggcgagagggtctctttcaagattacactcctc ctgggctactcggtcttcctgatcatcgtttctgacacgctgccggccactgccatcggc actcctctcattggtgtctactttgtggtgtgcatggctctgctggtgataagtttggcc gagaccatcttcattgtgcggctggtgcacaagcaagacctgcagcagcccgtgcctgct tggctgcgtcacctggttctggagagaatcgcctggctactttgcctgagggagcagtca acttcccagaggcccccagccacctcccaagccaccaagactgatgactgctcagccatg ggaaaccactgcagccacatgggaggaccccaggacttcgagaagagcccgagggacaga tgtagccctcccccaccacctcgggaggcctcgctggcggtgtgtgggctgctgcaggag ctgtcctccatccggcaattcctggaaaagcgggatgagatccgagaggtggcccgagac tggctgcgcgtgggctccgtgctggacaagctgctattccacatttacctgctagcggtg ctggcctacagcatcaccctggttatgctctggtccatctggcagtacgcttga >gi568815587f:113804934_114046134|GENSCAN_predicted_peptide_5|458_aa MGLHQNGIYQPLDLGLLNLQGQGKTQNHIHKGPKFGQQVKFQTAPQLSHSGVWTQNVCPA SAPGIGVESSTGVPDAKLDGELFPLEGPHPALTLNFPFSAPDGGRRNTYTLPGCCGTWNA NFSPGSARSAGEMMQEACSWTSWLVQVTCSSIRSQVIQQPLGFLDDQQQPLSGSVRGKEI WGSAAHPCACGYPPLSSSLSHKSLKIPQGHAHAPQSFSYQCISRSGVDNRHFSKFPSDAD AAAQGPDIEATDFSGHGWCQQPSVALPQVQQKLQAVQVVPRGCLLGVHLMCLSVPGIQMG ILRVPVPIHQPSSTQGCCCSPDQRTGDLGAPCESPCAPRPSSTDEITGSCNDEAQYCTDE SLIYVKDISAEGAINRKAKYEVLIQRVCDEPYLNFVVQEASRFIHTNNNKTIVEICQPFT LCQTLYLYQLLYSSQQSIRQCFYDVTFTGPLRSTSKPV >gi568815587f:113804934_114046134|GENSCAN_predicted_CDS_5|1377_bp atgggccttcaccagaatggaatctaccagcccctggatcttggtcttctcaacctccag ggccagggcaagacccagaaccacatccacaagggccccaagtttggtcaacaggtcaag tttcaaacagccccacaattatcccatagtggggtgtggacccagaatgtttgccctgca tcagctcctgggattggtgttgaaagctccacaggtgttcctgatgcaaagctggatgga gaactatttcccctggaaggtccacacccagccctcactctaaacttccccttctcagct ccagatggtggaaggagaaatacctacactctccctggatgctgtgggacttggaatgca aatttctctcccggctctgcgaggtcagcaggagagatgatgcaagaggcctgcagctgg acctcctggctggtacaagtcacctgctccagcatccgatctcaagttattcagcagccc ctggggtttcttgatgaccaacagcagcccttaagtggtagtgtcagaggaaaggagatt tgggggtctgcagcccacccatgtgcatgtggctacccacctctctcctcctctctttct cacaaaagtctaaagatcccccagggccacgcccatgccccacagtcattcagttaccag tgcatctccaggtcaggggtggataaccggcatttcagcaagttcccaagtgatgcagat gctgctgctcagggaccagacattgaagccactgacttcagtggtcatggctggtgtcag cagccaagcgtagctctgccccaggtgcaacagaagctgcaggcagtgcaggtggtgcca cgtggctgccttctgggagtgcacctgatgtgcctttctgtgcctggaatccagatgggg atacttagagtccctgtccccatacaccaaccatcttcgacccaaggttgttgctgtagc cctgatcagaggacaggggatctgggggctccatgtgaatcaccttgtgctccaaggcca agttccactgatgaaatcacagggagctgcaatgatgaagcccagtactgtacagatgaa agtcttatttatgtgaaagatatatctgcagagggagcaataaaccgcaaagcaaagtat gaagtattaattcaacgggtctgtgatgaaccatacctcaactttgttgtgcaggaagct tctagatttatacatacgaacaataataaaaccatagttgaaatttgtcaaccatttact ttgtgccagaccctttacctgtatcagctcctttattcttcacagcaatccataaggcag tgcttttatgatgtcacctttacaggaccattgaggagcacctcaaagccagtgtga