GENSCAN 1.0 Date run: 3-Nov-116 Time: 01:44:01

Sequence gi568815596r:44221359_44446342 : 224984 bp : 38.65% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   9055   9346  292  0  1    7   12   387 0.131  18.78
 1.02 Intr +  26896  26991   96  2  0    6   97   120 0.231   3.76
 1.03 Intr +  39399  39847  449  1  2   67   86   173 0.239   7.14
 1.04 Term +  49071  49619  549  2  0    8   42   465 0.794  27.21
 1.05 PlyA +  49773  49778    6                               1.05

 2.00 Prom +  52320  52359   40                              -7.75
 2.01 Init +  54178  54607  430  0  1   85   83   503 0.991  45.96
 2.02 Intr +  59358  59537  180  1  0   85  115   174 0.993  18.72
 2.03 Intr +  60029  60183  155  0  2   42  106   102 0.903   6.27
 2.04 Intr +  64674  64845  172  2  1   84   36    84 0.012   1.39
 2.05 Intr +  65131  65295  165  2  0   41  101   166 0.008  12.21
 2.06 Intr +  71278  71486  209  2  2   10   95    76 0.008  -1.63
 2.07 Intr +  78613  78732  120  0  0  106   60   103 0.464   9.07
 2.08 Term +  79645  79809  165  0  0   63   36   174 0.369   6.63
 2.09 PlyA +  80705  80710    6                               1.05

 3.00 Prom +  80964  81003   40                              -6.25
 3.01 Init +  81215  81279   65  1  2   85   68    62 0.827   4.57
 3.02 Intr +  82785  82980  196  0  1   97  115   168 0.985  18.90
 3.03 Intr +  84628  84765  138  0  0   64   75    97 0.977   5.74
 3.04 Intr +  91228  91395  168  0  0   53   87   239 0.940  19.52
 3.05 Intr +  92477  92593  117  1  0   49   95    47 0.536   1.24
 3.06 Term +  98841  99281  441  2  0   34   48   207 0.843   5.57
 3.07 PlyA +  99446  99451    6                               1.05

 4.14 PlyA -  99496  99491    6                               1.05
 4.13 Term - 100087  99998   90  1  0   90   47   137 0.918   6.54
 4.12 Intr - 100542 100469   74  1  2   63   95     5 0.738  -3.09
 4.11 Intr - 101496 101373  124  0  1   94   97   209 0.896  21.64
 4.10 Intr - 102053 101904  150  2  0   80   72   158 0.996  12.84
 4.09 Intr - 105570 105354  217  2  1   73  105   150 0.819  12.78
 4.08 Intr - 107754 107579  176  0  2   70  109    94 0.996   7.52
 4.07 Intr - 111298 111101  198  1  0   42  100   150 0.729  10.23
 4.06 Intr - 117178 116993  186  1  0   73   95    43 0.745   2.46
 4.05 Intr - 118005 117789  217  2  1  138   84   108 0.914  12.98
 4.04 Intr - 121194 121059  136  1  1  101   74   208 0.999  19.41
 4.03 Intr - 122593 122387  207  2  0   61   99   141 0.826  10.63
 4.02 Intr - 123228 123162   67  0  1   75   81    39 0.990  -0.54
 4.01 Init - 124984 124910   75  1  0   89   63    89 0.950   7.64
 4.00 Prom - 126913 126874   40                              -5.95

 5.00 Prom + 136857 136896   40                              -5.35
 5.01 Init + 140365 140787  423  0  0   63  110   212 0.125  17.30
 5.02 Intr + 151358 151530  173  1  2   66   86   110 0.030   6.42
 5.03 Term + 206786 206909  124  2  1   92   54   151 0.784   8.98
 5.04 PlyA + 208481 208486    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815596r:44221359_44446342|GENSCAN_predicted_peptide_1|461_aa
ASDEAEESGSQGKLVEALRQMRINHRGNYRQLLEEMLTSYRLAKVEGEESPAEPAATATS
SNSDAGNPVTMQESHTESESGLAELDSSNEDAGTKMSAASQLQLSYYGRQEAQCESNSCV
CHRQPFSRAVLEVLARAIRQEKEIKGIQLGKEEFKLSLFADDMIVYLENPIVSAQNLLKL
ISNFSKVSGYKINVQKSQAFLYTNNRQTESQIMSELPFTIASKRIKYLGIQLTRDLKDLF
KENYKPLLNEIKEDTNKWKNIPCSWVGRINIVKMAILPKRDVTANCPGSWRFEQRIGQNT
QQRKNEATKERKQLSRLSARRRPRCNFLRSSRIRVHPTPAASTMPPKFHPNEIKVVYFTC
TGVEVGATSALAPKIGPAGLSPKEVGDDIAKVTGDWKGLRITVKLTIQNRQAQIEVVPSA
SALIIKRNYQETERNRKTLNTGGISLFMRSSTLLDRCSTDP

>gi568815596r:44221359_44446342|GENSCAN_predicted_CDS_1|1386_bp
gcctccgatgaagcagaggaaagtggatcacagggaaaattggtggaagctctcaggcaa
atgagaattaatcataggggaaactaccgacaacttctggaggagatgctgactagttac
aggctagctaaagtagagggagaagaaagccctgctgaaccagctgccacagctacttct
tcgaacagtgatgctggaaacccagtgacaatgcaggaaagccatactgaatcagaaagt
ggtcttgctgaattagacagctctaatgaagatgcagggacaaagatgagtgctgcatcg
cagctacaactcagttactatggaagacaggaagcacagtgtgagagcaactcctgtgtc
tgccacaggcagccattctccagagcagtgttggaagttctggccagggcaattaggcag
gagaaggaaataaagggtattcaattaggaaaagaggaattcaaattgtccctgtttgca
gatgacatgattgtatatctagaaaaccccattgtctcagcccaaaatctccttaagctg
ataagcaacttcagcaaagtctcaggatacaaaatcaatgtacaaaaatcacaagcattc
ttatacaccaataacagacaaacagagagccaaatcatgagtgaactcccattcacaatt
gcttcaaagagaataaaatacctaggaatccaacttacaagggatctgaaggacctcttc
aaggagaactacaaaccactgctcaatgaaataaaagaggatacaaacaaatggaagaac
attccatgctcatgggtaggaagaatcaatatcgtgaaaatggccatactgcccaagagg
gacgtgactgcaaattgtccaggttcttggcgttttgaacaaagaattggacaaaacacg
cagcaaagaaagaatgaagcaacaaaagaacgaaagcagctctctcggctttcggctcgg
aggaggccaaggtgcaactttcttcggtcgtctcgaatccgggttcatccgacaccagcc
gcctccaccatgccgccgaagttccaccccaatgagatcaaagtcgtatacttcacgtgc
actggagttgaagtcggtgccacttctgctctggcccccaagattggccccgcgggtctg
tctccaaaagaggttggtgatgacattgccaaggtaacgggtgactggaagggcctgagg
attacagtgaaactgaccattcagaacagacaggcccagattgaagtggtaccttctgcc
tccgccctgatcatcaaaaggaactaccaagagacagaaagaaacagaaaaacattaaac
acagggggaatatcactttttatgagatcgtcaacattgctcgacagatgcagcactgat
ccttag

>gi568815596r:44221359_44446342|GENSCAN_predicted_peptide_2|531_aa
MAEDKSKRDSIEMSMKGCQTNNGFVHNEDILEQTPDPGSSTDNLKHSTRGILGSQEPDFK
GVQPYAGMPKEVLFQFSGQARYRIPREILFWLTVASVLVLIAATIAIIALSPKCLDWWQE
GPMYQIYPRSFKDSNKDGNGDLKGIQDKLDYITALNIKTVWITSFYKSSLKDFRYGVEDF
REVDPIFGTMEDFENLVAAIHDKGLKLIIDFIPNHTSDKHIWFQLSRTRTGKYTDYYIWH
DCTHENGKTIPPNNWLSVYGNSSWHFDEVRNQCYFHQFMKEQPDLNFRNPDVQEEIKVSI
DTHTDFSINGGLVSCTVPVTVSCAVPVTVSCAVPVTVSCAVPVTVSCAVSVTVSCAVSVT
ELCTVCDEEVRYLRNRHELSEEAWVGVFPAEESVDRDTLEVLGVVQYHWDVIYRAGSDPM
RLERSLGVRSGAPLKALEILRFWLTKGVDGFSLDAVKFLLEAKHLRDEIQVNKTQIPDTV
TQYSELYHDFTTTQVGMHDIVRSFRQTMDQYSTEPGRYRLTTAYALISSQA

>gi568815596r:44221359_44446342|GENSCAN_predicted_CDS_2|1596_bp
atggctgaagataaaagcaagagagactccatcgagatgagtatgaagggatgccagaca
aacaacgggtttgtccataatgaagacattctggagcagaccccggatccaggaagctca
acagacaacctgaagcacagcaccaggggcatccttggctcccaggagcccgacttcaag
ggcgtccagccctatgcggggatgcccaaggaggtgctgttccagttctctggccaggcc
cgctaccgcatacctcgggagatcctcttctggctcacagtggcttctgtgctggtgctc
atcgcggccaccatagccatcattgccctctctccaaagtgcctagactggtggcaggag
gggcccatgtaccagatctacccaaggtctttcaaggacagtaacaaggatgggaacgga
gatctgaaaggtattcaagataaactggactacatcacagctttaaatataaaaactgtt
tggattacttcattttataaatcgtcccttaaagatttcagatatggtgttgaagatttc
cgggaagttgatcccatttttggaacgatggaagattttgagaatctggttgcagccata
catgataaaggtttaaaattaatcatcgatttcataccaaaccacacgagtgataaacat
atttggtttcaattgagtcggacacggacaggaaaatatactgattattatatctggcat
gactgtacccatgaaaatggcaaaaccattccacccaacaactggttaagtgtgtatgga
aactccagttggcactttgacgaagtgcgaaaccaatgttattttcatcagtttatgaaa
gagcaacctgatttaaatttccgcaatcctgatgttcaagaagaaataaaagtgagtata
gatacccacacagacttctccattaatggaggtttagtgagctgtacggtgcctgtgacg
gtgagctgtgcggtgcctgtgacggtgagctgtgcggtgcctgtgacggtgagctgtgcg
gtgcctgtgacggtgagctgtgcggtgtctgtgacggtgagctgtgcggtgtctgtgact
gagctgtgcactgtctgtgacgaggaggtgagatacctgaggaacaggcatgagctaagt
gaagaggcatgggtgggagtgtttccagcagaggaaagcgtggatcgtgacacattagag
gtgctgggagttgtccagtatcactgggatgtgatatatagagcagggagtgacccgatg
agactagagaggtcactgggggtgaggtcaggagcaccactgaaggccctggaaatttta
cggttctggctcacaaagggtgttgatggttttagtttggatgctgttaaattcctccta
gaagcaaagcacctgagagatgagatccaagtaaataagacccaaatcccggacacggtc
acacaatactcggagctgtaccatgacttcaccaccacgcaggtgggaatgcacgacatt
gtccgcagcttccggcagaccatggaccaatacagcacggagcccggcagatacaggttg
accacggcatatgctctcatttcttcccaggcttag

>gi568815596r:44221359_44446342|GENSCAN_predicted_peptide_3|374_aa
MHLSINMLPQIAKASIDRSHFRFMGTEAYAESIDRTVMYYGLPFIQEADFPFNNYLSMLD
TVSGNSVYEVITSWMENMPEGKWPNWMTVHWLRAEAGSQHPCTVPTSHGGAQAQGLFVTG
AVVIRMQFSDLPKIGGPDSSRLTSRLGNQYVNVMNMLLFTLPGTPITYYGEEIGMGNIVA
ANLNESYDINTLRSKSPMQWDNSSNAGFSEASNTWLPTNSDYHTVNVDVQKTQPRSALKL
YQDLSLLHANELLLNRGWFCHLRNDSHYVVYTRELDGIDRIFIVVLNFGESTLLNLHNMI
SGLPAKMRIRLSTNSADKGSKVDTSGIFLDKGEGLIFEHNTKNLLHRQTAFRDRCFVSNR
ACYSSVLNILYTSC

>gi568815596r:44221359_44446342|GENSCAN_predicted_CDS_3|1125_bp
atgcacttatccataaatatgctgcctcaaatagctaaagccagcattgatcggagtcat
ttcaggttcatggggactgaagcctatgcagagagtattgacaggaccgtgatgtactat
ggattgccatttatccaagaagctgattttcccttcaacaattacctcagcatgctagac
actgtttctgggaacagcgtgtatgaggttatcacatcctggatggaaaacatgccagaa
ggaaaatggcctaactggatgacagttcactggttaagggcagaggcaggttcccagcat
ccttgcacggtgcccacctcacacggaggtgctcaggctcagggtttgtttgttactggt
gctgttgttattcgtatgcagttcagcgacttaccaaagattggtggaccagacagttca
cggctgacttcgcgtttggggaatcagtatgtcaacgtgatgaacatgcttcttttcaca
ctccctggaactcctataacttactatggagaagaaattggaatgggaaatattgtagcc
gcaaatctcaatgaaagctatgatattaatacccttcgctcaaagtcaccaatgcagtgg
gacaatagttcaaatgctggtttttctgaagctagtaacacctggttacctaccaattca
gattaccacactgtgaatgttgatgtccaaaagactcagcccagatcggctttgaagtta
tatcaagatttaagtctacttcatgccaatgagctactcctcaacaggggctggttttgc
catttgaggaatgacagccactatgttgtgtacacaagagagctggatggcatcgacaga
atctttatcgtggttctgaattttggagaatcaacactgttaaatctacataatatgatt
tcgggccttcccgctaaaatgagaataaggttaagtaccaattctgccgacaaaggcagt
aaagttgatacaagtggcatttttctggacaagggagagggactcatctttgaacacaac
acgaagaatctccttcatcgccaaacagctttcagagatagatgctttgtttccaatcga
gcatgctattccagtgtactgaacatactgtatacctcgtgttag

>gi568815596r:44221359_44446342|GENSCAN_predicted_peptide_4|638_aa
MDAFEKVRTKLETQPQEEYEIINVEVKHGGFVYYQEGCCLVRSKDEEADNDNYEVLFNLE
ELKLDQPFIDCIRVAPDEKYVAAKIRTEDSEASTCVIIKLSDQPVMEASFPNVSSFEWVK
DEEDEDVLFYTFQRNLRCHDVYRATFGDNKRNERFYTEKDPSYFVFLYLTKDSRFLTINI
MNKTTSEVWLIDGLSPWDPPVLIQKRIHGVLYYVEHRDDELYILTNVGEPTEFKLMRTAA
DTPAIMNWDLFFTMKRNTKVIDLDMFKDHCVLFLKHSNLLYVNVIGLADDSVRSLKLPPW
ACGFIMDTNSDPKNCPFQLCSPIRPPKYYTYKFAEGKLFEETGHEDPITKTSRVLRLEAK
SKDGKLVPMTVFHKTDSEDLQKKPLLVHVYGAYGMDLKMNFRPERRVLVDDGWILAYCHV
RGGGELGLQWHADGRLTKKLNGLADLEACIKTLHGQGFSQPSLTTLTAFSAGGVLAGALC
NSNPELVRAVTLEAPFLDVLNTMMDTTLPLTLEELEEWGNPSSDEKHKNYIKRYCPYQNI
KPQHYPSIHITAYENDERVPLKGIVSYTEKLKEAIAEHAKDTGEGYQTPNIILDIQPGGN
HVIEDSHKKITAQIKFLYEELGLDSTSVFEDLKKYLKF

>gi568815596r:44221359_44446342|GENSCAN_predicted_CDS_4|1917_bp
atggatgcatttgaaaaagtgagaacaaaattagaaacacagccacaagaagaatatgaa
atcatcaatgtggaagttaaacatggtggttttgtttattaccaagaaggttgttgcttg
gttcgttccaaagatgaagaagcagacaatgataattatgaagttttattcaatttggag
gaacttaagttagaccagcccttcattgattgtatcagagttgctccagatgaaaaatat
gtggctgccaagataagaactgaagattctgaagcatctacctgtgtaattataaagctc
agcgatcagcccgtaatggaagcttctttcccgaatgtgtccagttttgaatgggtaaag
gacgaggaagatgaagatgttttattctacaccttccagaggaaccttcgctgtcatgac
gtatatcgagccacttttggtgataacaaacgtaatgaacgcttttacacagaaaaagac
ccaagctactttgttttcctttatcttacaaaagacagtcgtttcctcaccataaatatt
atgaacaagactacttctgaagtgtggttgatagatggcctgagcccttgggacccacca
gtacttatccagaagcgaatacatggggtcctttactatgttgaacacagagatgatgaa
ttatacattctcactaatgttggagaacctacagaatttaagctaatgagaacagcggct
gatacccctgcaattatgaattgggatttattttttacaatgaagagaaatacaaaagtg
atagacttggacatgtttaaggatcactgtgttctatttctgaagcacagcaatctcctt
tatgttaatgtgattggtctggctgatgattcagttcggtctctaaagctccctccttgg
gcctgtggattcataatggatacaaattctgacccaaagaactgcccctttcaactttgc
tctccaatacgtcccccaaaatattacacatacaagtttgcagaaggcaaactgtttgag
gaaactgggcatgaagacccaatcacaaagactagtcgcgttttacgtctagaagccaaa
agcaaggatggaaaattagtgccaatgactgttttccacaaaactgactctgaggacttg
cagaagaaacctctcttggtacatgtatatggagcttatggaatggatttgaaaatgaat
ttcaggcctgagaggcgggtcctggtggatgatggatggatattagcatactgccatgtt
cgaggtggtggtgagttaggcctccagtggcacgctgatggccgcctaactaaaaaactc
aatggccttgctgatttagaggcttgcattaagacgcttcatggccaaggcttttctcag
ccaagtctaacaaccctgactgctttcagtgctggaggggtgcttgcaggagcattgtgt
aattctaatccagagctggtgagagcggtgactttggaggcacctttcttggatgttctc
aacaccatgatggacactacacttcctctgacattagaagaattagaagaatgggggaat
ccttcatctgatgaaaaacacaagaactacataaaacgttactgtccctatcaaaatatt
aaacctcagcattatccttcaattcacataacggcatatgaaaacgatgaacgggtacct
ctgaaaggaattgtaagttatactgagaaactcaaggaagccatcgcggagcatgctaag
gacacaggtgaaggctatcagacccctaatattattctagatattcagcctggaggcaat
catgtaattgaggattctcacaaaaagattacagcccaaattaaattcctgtacgaggaa
cttggacttgacagcaccagtgttttcgaggatcttaagaaatacctgaaattctga

>gi568815596r:44221359_44446342|GENSCAN_predicted_peptide_5|239_aa
MRDSVPVALPQLSHPLVRPHSRWLPARMVQWRDSSALVAKEMRVPEMRNSGRARMPGPVE
GGCRASQVLTGKKLAGPGRGRAAAVAPPGVEGSSEMESRVADAGTGETARAAGGSPAVGC
TTRGPVVSAPLGAARWKLLRQVLKQKHLDDCLRHVSVRRFESFNLFSVTEGKERETEEEV
GAWVQYTSIFCPEYSISLRFLSSVQEESDHTDLKDGECGDFIEWWKWLSAGWRVGNGVE

>gi568815596r:44221359_44446342|GENSCAN_predicted_CDS_5|720_bp
atgagggacagcgttccagtagccctgccacagctctctcatcctctggtccgccctcac
tcaagatggctaccagcaagaatggtgcaatggcgtgacagctcggccctggttgccaag
gagatgcgagtaccggaaatgcggaattcggggagagcgcgcatgcctggtccggtggag
ggagggtgccgggcgtcacaggtcctgacagggaagaagttggcaggtcctggcagggga
cgagctgcggcggtggcacctccgggtgtggaaggctccagtgagatggagtcgcgagtc
gcggacgctgggaccggcgagaccgcgcgagcagcgggcgggagtccggcagttggctgc
accactcgggggcccgtagtctcggcgcccctgggagccgcccggtggaagctcctgcgg
caggttctgaagcaaaaacacctggatgattgcctgcgacatgtatctgtaagaagattt
gaatcatttaatctgttttcagtaacagaaggcaaagaaagggaaactgaagaggaggtt
ggtgcatgggtccaatatacaagcatcttctgtcctgaatacagtatctccttaaggttc
ttgtccagtgtccaggaagaatcagatcacacggacttgaaggatggtgaatgcggagat
ttcattgagtggtggaagtggctctcagcaggatggagagttggaaatggggtggagtga