GENSCAN 1.0 Date run: 19-Jun-119 Time: 14:28:00 Sequence gi568815586f:54449873_54677325 : 227453 bp : 44.50% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.00 Prom + 9356 9395 40 -2.06 1.01 Init + 23516 23623 108 1 0 63 92 84 0.827 6.52 1.02 Intr + 23701 23758 58 0 1 54 59 54 0.120 -2.44 1.03 Intr + 47910 48019 110 1 2 63 121 70 0.156 7.80 1.04 Intr + 49483 49593 111 0 0 71 93 105 0.998 9.88 1.05 Intr + 50661 50753 93 2 0 71 88 93 0.990 7.76 1.06 Intr + 57981 58037 57 2 0 104 60 74 0.969 5.28 1.07 Intr + 58517 58659 143 1 2 77 92 137 0.999 12.15 1.08 Intr + 60023 60113 91 2 1 80 79 40 0.780 2.30 1.09 Intr + 61931 61979 49 1 1 131 89 31 0.881 5.75 1.10 Intr + 62077 62233 157 2 1 77 76 207 0.579 17.47 1.11 Intr + 66367 66423 57 1 0 96 103 39 0.516 4.20 1.12 Intr + 67024 67135 112 1 1 93 -53 95 0.558 -3.72 1.13 Intr + 67661 67770 110 1 2 76 23 138 0.686 5.18 1.14 Intr + 67934 68066 133 2 1 50 80 41 0.635 0.15 1.15 Intr + 68779 68860 82 0 1 65 88 42 0.952 1.11 1.16 Intr + 69042 69100 59 1 2 110 95 26 0.966 4.10 1.17 Intr + 69315 69460 146 2 2 69 84 90 0.999 5.78 1.18 Intr + 70822 70954 133 1 1 94 92 72 0.999 8.85 1.19 Intr + 71247 71366 120 2 0 92 74 111 0.994 10.79 1.20 Intr + 73522 73667 146 0 2 40 61 160 0.901 7.58 1.21 Intr + 73953 74084 132 0 0 91 69 68 0.967 4.96 1.22 Intr + 76656 76874 219 0 0 56 65 245 0.999 16.42 1.23 Intr + 78375 78505 131 0 2 60 93 131 0.996 11.14 1.24 Intr + 81388 81485 98 2 2 63 74 125 0.844 8.33 1.25 Intr + 81619 81712 94 0 1 69 115 93 0.999 9.64 1.26 Intr + 81871 81953 83 2 2 107 105 101 0.999 13.06 1.27 Intr + 82298 82378 81 1 0 116 86 31 0.979 5.53 1.28 Intr + 85232 85325 94 1 1 100 99 103 0.999 12.14 1.29 Intr + 86257 86373 117 2 0 100 115 50 0.998 9.34 1.30 Intr + 87072 87181 110 1 2 36 89 168 0.995 11.70 1.31 Intr + 89012 89101 90 1 0 110 111 50 0.993 9.69 1.32 Term + 92703 92813 111 2 0 111 42 138 0.928 10.06 1.33 PlyA + 95130 95135 6 1.05 2.00 Prom + 97025 97064 40 -5.36 2.01 Init + 99144 99171 28 2 1 86 75 27 0.304 1.04 2.02 Intr + 99813 99900 88 1 1 -26 75 98 0.350 -3.87 2.03 Intr + 99988 100113 126 1 0 96 84 156 0.669 15.79 2.04 Intr + 117102 117215 114 0 0 115 93 187 0.995 21.46 2.05 Intr + 119312 119494 183 2 0 75 89 358 0.999 33.50 2.06 Intr + 119674 119740 67 1 1 106 84 30 0.948 3.31 2.07 Intr + 120369 120485 117 2 0 86 53 102 0.960 7.16 2.08 Intr + 122729 122869 141 1 0 106 66 170 0.995 17.15 2.09 Intr + 123276 123376 101 2 2 118 61 208 0.999 20.01 2.10 Intr + 123483 123608 126 0 0 76 30 205 0.998 13.29 2.11 Intr + 123736 123837 102 1 0 86 23 122 0.652 4.79 2.12 Intr + 125226 125346 121 0 1 72 74 148 0.989 12.30 2.13 Intr + 125560 125628 69 0 0 79 82 27 0.599 0.58 2.14 Intr + 125679 125760 82 2 1 117 95 24 0.999 5.21 2.15 Intr + 126120 126228 109 1 1 70 94 190 0.981 17.14 2.16 Intr + 126699 126829 131 0 2 59 52 146 0.995 8.44 2.17 Term + 127353 127456 104 1 2 117 41 205 0.999 17.14 2.18 PlyA + 127678 127683 6 -3.64 3.18 PlyA - 127880 127875 6 -0.45 3.17 Term - 128089 127955 135 1 0 92 49 159 0.995 10.42 3.16 Intr - 131178 131072 107 1 2 102 103 34 0.978 6.23 3.15 Intr - 132259 132104 156 2 0 104 53 73 0.972 5.38 3.14 Intr - 132923 132860 64 2 1 99 47 34 0.976 -1.31 3.13 Intr - 133376 133339 38 0 2 129 105 50 0.994 8.78 3.12 Intr - 134448 134388 61 0 1 98 115 21 0.881 4.11 3.11 Intr - 138644 138533 112 1 1 98 80 187 0.589 19.28 3.10 Intr - 140741 140689 53 2 2 84 78 12 0.167 -2.49 3.09 Intr - 154409 154318 92 0 2 109 74 62 0.604 6.61 3.08 Intr - 159805 159761 45 2 0 103 95 47 0.596 5.28 3.07 Intr - 181967 181866 102 0 0 56 87 61 0.612 2.95 3.06 Intr - 182509 182279 231 2 0 66 85 145 0.602 9.74 3.05 Intr - 184938 184912 27 1 0 82 105 33 0.052 2.49 3.04 Intr - 195390 195301 90 1 0 87 113 17 0.152 4.07 3.03 Intr - 195835 195734 102 2 0 95 94 14 0.203 2.85 3.02 Intr - 197287 197249 39 2 0 123 84 27 0.332 4.00 3.01 Init - 198431 198374 58 2 1 74 76 136 0.757 10.47 3.00 Prom - 202466 202427 40 -6.36 4.00 Prom + 205631 205670 40 -5.26 4.01 Init + 209587 209667 81 0 0 79 100 63 0.394 7.63 4.02 Intr + 216351 216415 65 2 2 119 78 45 0.470 4.12 4.03 Term + 218608 218692 85 1 1 58 36 98 0.225 -1.27 4.04 PlyA + 218752 218757 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 47918 48019 102 1 0 86 121 50 0.835 8.34 S.002 Init - 184969 184912 58 1 1 37 105 83 0.815 5.18 S.003 Sngl - 193444 193076 369 1 0 58 43 190 0.907 7.61 S.004 Term - 194884 194774 111 1 0 85 42 116 0.822 5.26 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586f:54449873_54677325|GENSCAN_predicted_peptide_1|1144_aa MQRNRLVAKIAALRAHKAEGYQQKSEMRILEGEGIPSPSSHASPSNGHFRRLTLGVAIMS LTSAYQHKLAEKLTILNDRGQGVLIRMYNIKKTCSDPKSKPPFLLEKSMEPSLKYINKKF PNIDVRNSTQHLGPVHREKAEIIRFLTNYYQSFVDVMEFRDHVYELLNTIDACQCHFDIN LNFDFTRSYLDLIVTYTSVILLLSRIEDRRILIGMYNCAHEMLHGHGNQGAEQWRSAQLL SLISNPPAMINPANSDTMACEYLSVEVMERWIIIGFLLCHGCLNSNSQCQKLWKLCLQGS LYITLIREDVLQVHKVTEDLFSSLKGYGKRVADIKESKEHVIANSGQFHCQRRQFLRMAV KELETVLADEPGLLGPKARGRNALFAFMALSFIRDEVTWLVRHTENVTKTKTPEDYADSS IAELLFLLEGIRSLVRRHIKVIQQYHLQYLARFDALVLSDIIQNLSVCPEEESIIMSSFV SILSSLNLKQVDNGEKFEFSGLRLDWFRLQAYTSVAKAPLHLHENPDLAKVMNLIVFHSR MLDSVEKLLVETSDLSTFCFHLRIFEKMFAMTLEESAMLRYAIAFPLICAHFVHCTHEMC PEEYPHLKNHGLHHCNSFLEELAKQTSNCVLEICAEQRNLSEQLLPKHCATTISKAKNKK TRKQRQTPRKGEPERDKPGAESHRKNRSIVTNMDKLHLNLTELALTMNHVYSFSVFEHTI FPSEYLSSHLEARLNRAIVWLAGYNATTQEIVRPSELLAGVKAYIGFIQSLAQFLGADAS RVIRNALLQQTQPLDSCGEQTITTLYTNWYLESLLRQASSGTIILSPAMQAFVSLPREGE QNFSAEEFSDISEMRALAELLGPYGMKFLSENLMWHVTSQIVELKKLVVENMDILVQIRS NFSKPDLMASLLPQLTGAENVLKRMTIIGVILSFRAMAQEGLREVFSSHCPFLMGPIECL KEFVTPDTDIKVTLSIFELASAAGVGCDIDPALVAAIANLKADTSSPEEEYKVACLLLIF LAVSLPLLATDPSSFYSIEKDGYNNNIHCLTKAIIQVSAALFTLYNKNIETHLKEFLVVA SVSLLQLGQETDKLKTRNRESISLLMRLVVEESSFLTLDMLESCFPYVLLRNAYREVSRA FHLN >gi568815586f:54449873_54677325|GENSCAN_predicted_CDS_1|3435_bp atgcagcgaaacaggttggtagccaaaatagctgcattgagagcccacaaggctgaaggc taccagcagaaaagcgaaatgaggatattagaaggtgaaggtatcccgtcaccttcctcc cacgcaagccccagcaacggccacttccggcgcctaacgttgggagtggccatcatgtct ttgacatctgcttaccagcataaattagcagagaagctcactatcctgaatgatcgcggt cagggggttctcatccgtatgtataacatcaagaagacttgttcagaccccaaatctaag ccacctttcttactggaaaagtccatggaaccatctctcaagtatatcaacaagaaattt cccaacatagatgtccgaaacagcacgcaacatttaggaccagtacatcgtgaaaaagcc gagataattagattcctcaccaactactaccagtcatttgtggatgtcatggaatttcgg gatcatgtatatgaacttctcaacaccattgatgcctgccagtgccattttgatatcaat ctcaactttgatttcactcggagttacctggacttgattgtaacttacacctcagtcatt ttacttctgtcacggattgaagatcggcggatactcattggcatgtacaattgtgcccat gagatgctgcatgggcatggaaaccagggggctgagcagtggcgcagtgcccaacttcta agcctcatcagcaaccccccagccatgattaaccctgctaattcagatacaatggcctgt gagtatctgtctgtggaagtaatggagcgctggattatcattgggtttcttctttgtcat gggtgcctcaactccaatagccagtgccagaagctgtggaagctgtgtctgcagggctcc ctctacatcacccttatccgtgaggatgtgctgcaggtgcacaaagtcaccgaggacctg tttagcagtttgaaagggtatggcaagagagtggcagacataaaggagagcaaggaacat gtaattgcaaacagtggccagtttcattgtcaacggcggcaatttctgcggatggcagtg aaggagctggagactgtgttggctgatgaaccgggactactgggtcctaaggcaagagga agaaatgctctttttgctttcatggccctgtccttcattcgtgatgaggtcacctggctg gttcgccacacagagaatgtcaccaagacaaagacacctgaggactatgctgactcgagc attgcagagctacttttcttgttggaggggattaggtctctggtccgaagacacatcaaa gtgatacagcaataccaccttcagtacttggcaagatttgatgctcttgtgctcagtgac atcattcagaacttgtctgtgtgtccagaggaggagtccatcatcatgtcctcattcgtc agtatcctctcctctctgaatctcaaacaagttgataatggagaaaaatttgaattctca ggattgaggctggactggttccgcctacaggcatacactagcgtggctaaggcccctctg cacctgcatgagaaccctgacttagccaaggtgatgaacctcattgtcttccactcccga atgctggactccgtagaaaaattgctggtggaaacttctgatctgtctactttctgcttt catcttcgtatctttgagaagatgtttgccatgaccttggaggaatctgccatgttgcgt tatgccattgctttccccctgatttgtgctcactttgtccactgcactcatgagatgtgc ccagaggagtacccccacctcaagaaccatggtcttcaccactgcaactccttcctggaa gagttggccaagcagaccagcaattgcgtcctggagatctgtgctgagcagcgaaacctg agcgagcagcttctacctaagcactgtgccactacaatcagcaaagccaagaacaagaaa accaggaagcagaggcagactcccagaaaaggagagcccgagagggacaagccaggagct gagagtcaccggaagaaccgcagcattgtcaccaacatggacaagctacacctaaacttg acagaactggcactgacaatgaatcatgtatacagtttctccgtgtttgaacatactatc ttcccttctgagtacctcagcagccacctggaggccagactcaacagagccattgtgtgg ctggctggctacaatgccacgacccaggagatcgtacggccttctgagctgttggcagga gtcaaagcatacattggtttcatacagtcactggcccagtttttgggtgcagatgcttcc agagtcatccgcaacgccctcctgcagcagacacaaccactggattcctgtggggaacag acaatcaccacactctacacaaactggtacctggaaagtctgcttagacaggcaagcagt gggaccatcatcctctccccagccatgcaggccttcgtcagcctgcccagagaaggggag cagaacttcagtgcagaggagttctctgacatctctgagatgcgggccttggcagaactc ctgggcccctatggcatgaagttcctgagtgaaaacctgatgtggcatgtgacctctcag attgtggagctgaagaagctggtggtggaaaacatggacatacttgttcagatcagatcc aactttagcaagccggacttgatggcttccctgctgccccagctgacaggggctgaaaat gtgctaaagcgcatgaccatcattggggttatcctcagtttcagggccatggcccaagag ggacttcgggaggttttctcctcccactgcccatttcttatgggtcccattgagtgcttg aaggagtttgtcactccagacacagacatcaaggtgaccttgagtatctttgagctggca tctgctgcaggtgtgggctgtgacattgacccagccttggtggctgccattgctaatctg aaagctgatacttcatctcctgaggaggaatataaggtggcctgcctgctcttgatcttt ctggcagtttccctcccactccttgccactgacccttcttccttttatagcattgagaag gatggttacaacaacaatattcattgcttgaccaaagccatcatccaggtgtctgctgcc ctcttcacgctctacaacaagaacattgaaactcacctcaaggaatttctggtggtggcc tctgtcagcctcttgcagctgggccaggagactgacaagcttaaaaccagaaatcgagaa tccatttctctgctcatgcgcttggtggtggaggagtcatccttcctgaccctggacatg ctggagtcctgtttcccttatgtcctgcttcgaaatgcctatcgggaggtgtctcgggcc ttccacctaaactga >gi568815586f:54449873_54677325|GENSCAN_predicted_peptide_2|602_aa MEKATVLETAGEEEPQELQLCQLGPSLETPAWLVHASRRPWLSMELSPRSPPEMLEESDC PSPLELKSAPSKKMWIKLRSLLRYMVKQLENGEINIEELKKNLEYTASLLEAVYIDETRQ ILDTEDELQELRSDAVPSEVRDWLASTFTQQARAKGRRAEEKPKFRSIVHAVQAGIFVER MFRRTYTSVGPTYSTAVLNCLKNLDLWCFDVFSLNQAADDHALRTIVFELLTRHNLISRF KIPTVFLMSFLDALETGYGKYKNPYHNQIHAADVTQTVHCFLLRTGMVHCLSEIELLAII FAAAIHDYEHTGTTNSFHIQTKSECAIVYNDRSVLENHHISSVFRLMQDDEMNIFINLTK DEFVELRALVIEMVLATDMSCHFQQVKTMKTALQQLERIDKPKALSLLLHAADISHPTKQ WLVHSRWTKALMEEFFRQVNSAEISHNNPPPPPLATCTPDLGDKEAELGLPFSPLCDRTS TLVAQSQIGFIDFIVEPTFSVLTDVAEKSVQPLADEDSKSKNQPSFQWRQPSLDVEVGDP NPDVVSFRSTWVKRIQENKQKWKERAASGITNQMSIDELSPCEEEAPPSPAEDEHNQNGN LD >gi568815586f:54449873_54677325|GENSCAN_predicted_CDS_2|1809_bp atggagaaggccacagtcttggagacagcgggagaggaggagccgcaggagctgcagctc tgccagcttgggccgagcctagagacaccggcctggctggtccacgccagccgcagaccg tggctgagcatggagctgtccccccgcagtcctccggagatgctggaggagtcggattgc ccgtcacccctggagctgaagtcagcccccagcaagaagatgtggattaagcttcggtct ctgctgcgctacatggtgaagcagttggagaatggggagataaacattgaggagctgaag aaaaatctggagtacacagcttctctgctggaagccgtctacatagatgagacacggcaa atcttggacacggaggacgagctgcaggagctgcggtcagatgccgtgccttcggaggtg cgggactggctggcctccaccttcacccagcaggcccgggccaaaggccgccgagcagag gagaagcccaagttccgaagcattgtgcacgctgtgcaggctgggatcttcgtggaacgg atgttccggagaacatacacctctgtgggccccacttactctactgcggttctcaactgt ctcaagaacctggatctctggtgctttgatgtcttttccttgaaccaggcagcagatgac catgccctgaggaccattgtttttgagttgctgactcggcataacctcatcagccgcttc aagattcccactgtgtttttgatgagtttcctggatgccttggagacaggctatgggaag tacaagaatccttaccacaaccagatccacgcagccgatgttacccagacagtccattgc ttcttgctccgcacagggatggtgcactgcctgtcggagattgagctcctggccatcatc tttgctgcagctatccatgattatgagcacacgggcactaccaacagcttccacatccag accaagtcagaatgtgccatcgtgtacaatgatcgttcagtgctggagaatcaccacatc agctctgttttccgattgatgcaggatgatgagatgaacattttcatcaacctcaccaag gatgagtttgtagaactccgagccctggtcattgagatggtgttggccacagacatgtcc tgccatttccagcaagtgaagaccatgaagacagccttgcaacagctggagaggattgac aagcccaaggccctgtctctactgctccatgctgctgacatcagccacccaaccaagcag tggttggtccacagccgttggaccaaggccctcatggaggaattcttccgtcaggtcaac agtgcagaaatttcacacaacaaccccccacccccaccacttgccacctgtaccccagat ctgggtgacaaggaggcagagttgggcctgcccttttctccactctgtgaccgcacttcc actctagtggcacagtctcagatagggttcatcgacttcattgtggagcccacattctct gtgctgactgacgtggcagagaagagtgttcagcccctggcggatgaggactccaagtct aaaaaccagcccagctttcagtggcgccagccctctctggatgtggaagtgggagacccc aaccctgatgtggtcagctttcgttccacctgggtcaagcgcattcaggagaataagcag aaatggaaggaacgggcagcaagtggcatcaccaaccagatgtccattgacgagctgtcc ccctgtgaagaagaggcccccccatcccctgccgaagatgaacacaaccagaatgggaat ctggattag >gi568815586f:54449873_54677325|GENSCAN_predicted_peptide_3|503_aa MRFMTLLFLTALAGALVCAYDPEAASAPGSGNPCHEASAAQKENAGEDPGLARQAPKPRK QRSSLLEKGLDGAKKAVGGLGKLGKDAVEDLESVGKAVAGALVYAAKPNEEISGPAEPAS PPETTTTAQETSAAAVQGTAKVTSSRQELNPLSKSLSLCQINNLEKSLAAGPHHTSTHRD KPESIVEKSILLTEQALAKAGKGMHGGVPGGKQFIEKPEGEIHSETQPADAWVEGGRKKP STQVQGAERDVMEGQRSAAVPQEITIKPAATINRDLLCTSAGPPAAAPAMEQDNSPRKIQ FTVPLLEPHLDPEAAEQIRRRRPTPATLVLTSDQSSPEIDEDRIPNPHLKSTLAMSPRQR KKMTRITPTMKELQMMVEHHLGQQQQGEEPEGAAESTGTQESRPPGIPDTEVESRLGTSG TAKKTAECIPKTHERGSKEPSTKEPSTHIPPLDSKGANSVGCFFHDHAPLEQMGEPVLMD GADDIKHFGLNEDLGKEGETSVP >gi568815586f:54449873_54677325|GENSCAN_predicted_CDS_3|1512_bp atgaggttcatgactctcctcttcctgacagctctggcaggagccctggtctgtgcctat gatccagaggccgcctctgccccaggatcggggaacccttgccatgaagcatcagcagct caaaaggaaaatgcaggtgaagacccagggttagccagacaggcaccaaagccaaggaag cagagatccagccttctggaaaaaggcctagacggagcaaaaaaagctgtggggggactc ggaaaactaggaaaagatgcagtcgaagatctagaaagcgtgggtaaagctgtagcaggg gccctggtctatgctgctaagcctaatgaagagatctcaggtccagcagaaccagcttca cccccagagacaaccacaacagcccaggagacttcggcggcagcagttcaggggacagcc aaggtcacctcaagcaggcaggaactaaaccccctgagtaagtctctgtctctatgccag atcaacaacctagaaaagtctctggctgcaggcccacatcacacctccacgcacagagat aagcctgaatccatagtggagaaaagtatcttactaacagaacaagcccttgcaaaagca ggaaaaggaatgcacggaggcgtgccaggtggaaaacaattcatcgaaaagccagaaggt gaaattcactcggagactcagcctgcagatgcctgggtagaaggtggaagaaaaaaacct tctacacaggttcaaggtgcagagagagatgtcatggagggccagcgctcagccgcagtg cctcaggaaataacaataaaaccagcagctacaattaaccgtgaccttctatgcaccagc gcgggcccaccggccgccgccccagccatggagcaagacaacagcccccgaaagatccag ttcacggtcccgctgctggagccgcaccttgaccccgaggcggcggagcagattcggagg cgccgccccacccctgccaccctcgtgctgaccagtgaccagtcatccccagagatagat gaagaccggatccccaacccacatctcaagtccactttggcaatgtctccacggcaacgg aagaagatgacaaggatcacacccacaatgaaagagctccagatgatggttgaacatcac ctggggcaacagcagcaaggagaggaacctgagggggccgctgagagcacaggaacccag gagtcccgcccacctgggatcccagacacagaagtggagtcaaggctgggcacctctggg acagcaaaaaaaactgcagaatgcatccctaaaactcacgagagaggcagtaaggaaccc agcacaaaagaaccctcaacccatataccaccactggattccaagggagccaactcggtg ggttgtttcttccacgaccacgctcccttggagcagatgggggagccagtcctgatggat ggtgctgatgacatcaaacactttggactcaatgaagacctggggaaagagggagaaact tcagtgccctga >gi568815586f:54449873_54677325|GENSCAN_predicted_peptide_4|76_aa MDVGSMPPLPPVEVVLVLSIVVRPEAKSDWLKDVPWSEQMEEGVEQDLRVEKSNNNKELD QSSKDSEGKKEQEKNK >gi568815586f:54449873_54677325|GENSCAN_predicted_CDS_4|231_bp atggacgtgggcagcatgccacctcttcctcctgtagaggtggttctggtgctcagcatt gtggtgaggcctgaggcgaagagtgactggttgaaggatgtgccctggagtgagcaaatg gaggaaggagtcgagcaggacctgagagtagaaaagagcaacaacaacaaagaacttgat caatccagcaaagattcagaaggaaagaaagaacaagaaaagaataaataa