GENSCAN 1.0 Date run: 4-Nov-116 Time: 02:35:38 Sequence gi568815578r:1272115_1492998 : 220884 bp : 45.76% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Term + 9490 9703 214 1 1 90 47 106 0.358 3.40 1.02 PlyA + 12042 12047 6 1.05 2.04 PlyA - 12252 12247 6 1.05 2.03 Term - 12510 12470 41 1 2 114 47 27 0.431 -1.45 2.02 Intr - 14293 14149 145 1 1 111 59 140 0.851 13.26 2.01 Init - 15651 15640 12 0 0 64 81 8 0.141 -2.05 2.00 Prom - 16879 16840 40 -2.96 3.00 Prom + 17386 17425 40 -7.46 3.01 Init + 23465 23597 133 1 1 94 62 110 0.485 8.89 3.02 Intr + 24211 24307 97 2 1 110 109 23 0.897 5.67 3.03 Intr + 24932 25138 207 2 0 39 86 260 0.667 19.09 3.04 Intr + 28448 28597 150 2 0 103 105 278 0.998 30.28 3.05 Term + 32764 33940 1177 1 1 79 55 1955 0.993 182.57 3.06 PlyA + 35932 35937 6 -3.44 4.12 PlyA - 35967 35962 6 -3.84 4.11 Term - 36465 36165 301 2 1 114 42 122 0.904 4.89 4.10 Intr - 38777 38686 92 2 2 102 78 176 0.966 16.79 4.09 Intr - 40394 40223 172 1 1 75 102 322 0.999 32.25 4.08 Intr - 40648 40473 176 1 2 114 109 224 0.998 25.84 4.07 Intr - 41384 41226 159 2 0 110 91 167 0.951 19.38 4.06 Intr - 46304 46204 101 0 2 96 31 48 0.778 -0.27 4.05 Intr - 46847 46798 50 1 2 60 81 29 0.103 -2.28 4.04 Intr - 47545 47476 70 2 1 121 109 26 0.130 6.24 4.03 Intr - 48277 48249 29 0 2 72 111 4 0.020 -1.14 4.02 Intr - 57072 56971 102 2 0 35 115 86 0.205 5.29 4.01 Init - 91608 91526 83 0 2 65 70 83 0.300 4.64 4.00 Prom - 95473 95434 40 -4.86 5.08 PlyA - 96893 96888 6 1.05 5.07 Term - 100126 99998 129 1 0 114 48 134 0.959 10.18 5.06 Intr - 103489 103377 113 2 2 97 85 32 0.133 3.90 5.05 Intr - 120767 120720 48 0 0 102 117 60 0.002 8.95 5.04 Intr - 120987 120848 140 2 2 30 113 66 0.002 3.41 5.03 Intr - 121203 121012 192 2 0 89 41 75 0.001 1.51 5.02 Intr - 121612 121441 172 2 1 53 77 84 0.002 2.80 5.01 Init - 149946 149871 76 0 1 45 82 168 0.722 11.15 5.00 Prom - 161882 161843 40 -2.96 6.11 PlyA - 167048 167043 6 1.05 6.10 Term - 171797 171635 163 1 1 92 41 372 0.836 30.31 6.09 Intr - 173703 173552 152 0 2 82 82 73 0.946 5.06 6.08 Intr - 174010 173881 130 0 1 78 95 67 0.951 7.10 6.07 Intr - 180516 180379 138 2 0 122 110 202 0.998 25.38 6.06 Intr - 181026 180917 110 0 2 102 96 54 0.998 6.68 6.05 Intr - 182191 182099 93 1 0 69 76 68 0.935 3.86 6.04 Intr - 183018 182853 166 2 1 50 70 183 0.965 12.66 6.03 Intr - 186160 186086 75 0 0 103 105 89 0.996 10.73 6.02 Intr - 192312 192215 98 0 2 77 110 66 0.998 6.51 6.01 Init - 194710 194606 105 1 0 106 74 180 0.962 18.63 6.00 Prom - 198846 198807 40 -7.46 7.06 PlyA - 201785 201780 6 1.05 7.05 Term - 204222 204053 170 1 2 102 55 104 0.862 6.54 7.04 Intr - 205289 205224 66 0 0 83 99 62 0.972 5.68 7.03 Intr - 206493 206152 342 1 0 47 100 273 0.969 19.80 7.02 Intr - 207951 207586 366 1 0 97 86 229 0.956 18.72 7.01 Init - 219233 219161 73 2 1 80 111 139 0.882 14.63 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 109476 109576 101 2 2 58 114 35 0.864 2.83 S.002 Term + 113444 113552 109 2 1 107 48 74 0.899 3.28 S.003 Sngl + 120862 121224 363 0 0 78 38 266 0.969 14.69 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815578r:1272115_1492998|GENSCAN_predicted_peptide_1|71_aa XMWNAPPQAAPGYLENNYAPVLQSSAQGHQLLSSAEKPLSLLPNPSVTRASIMGLLAEEE VAHNRHLVNID >gi568815578r:1272115_1492998|GENSCAN_predicted_CDS_1|216_bp nnaatgtggaatgctcctccacaggcagctcctgggtacctggagaacaactatgcacct gtcctccagagctctgctcaagggcaccagcttctgagttctgcagagaagcccctctct cttctgcccaacccctcagtgacacgtgcttcgattatgggactattggcagaggaggaa gtggcacacaatagacacttggtaaatattgactga >gi568815578r:1272115_1492998|GENSCAN_predicted_peptide_2|65_aa MRKMLQLPCCPQQLIPVHLIPRGTSHAMLNSFTFNKIQGLNNAQCGSHLLEEGLCSKVTL SVNLL >gi568815578r:1272115_1492998|GENSCAN_predicted_CDS_2|198_bp atgaggaaaatgctgcagctgccctgctgtcctcagcagctgatccctgtccacctgatt ccccgtggtacttcccatgccatgctgaattctttcaccttcaataagatccagggcctc aacaatgcccagtgtggctcacacttgctggaagagggtctttgctccaaggtcaccctc tcagtgaatcttctctga >gi568815578r:1272115_1492998|GENSCAN_predicted_peptide_3|587_aa MPVCRMGAGGPGCMSCVMGWLPEHVHGDACGMSWNGRDMVPSRVAPGIPPIPPLTRTHSL MAMSLPGSRRTSAGSRSGGTLGRSGLAVFAQCPQLPASQNEHLPLLPASRRTSPPVSVRD AYGTSSLSSSSNSGSYKGSDSSPTPRRSMKYTLCSDNHGIKPPTPEQYLTPLQQKEVCIR HLKARLKDTQDRLQDRDTEIDDLKTQLSRMQEDWIEEECHRVEAQLALKEARKEIKQLKQ VIDTVKNNLIDKDKGLQKYFVDINIQNKKLETLLHSMEVAQNGMAKEDGTGESAGGSPAR SLTRSSTYTKLSDPAVCGDRQPGDPSSGSAEDGADSGFAAADDTLSRTDALEASSLLSSG VDCGTEETSLHSSFGLGPRFPASNTYEKLLCGMEAGVQASCMQERAIQTDFVQYQPDLDT ILEKVTQAQVCGTDPESGDRCPELDAHPSGPRDPNSAVVVTVGDELEAPEPITRGPTPQR PGANPNPGQSVSVVCPMEEEEEAAVAEKEPKSYWSRHYIVDLLAVVVPAVPTVAWLCRSQ RRQGQPIYNISSLLRGCCTVALHSIRRISCRSLSQPSPSPAGGGSQL >gi568815578r:1272115_1492998|GENSCAN_predicted_CDS_3|1764_bp atgcctgtgtgccgcatgggtgcgggaggacctggctgcatgtcctgcgtcatgggctgg ctgcctgagcacgttcatggagacgcgtgtggcatgtcttggaatggcagggacatggta ccttctcgtgttgcccccgggataccgcccatcccaccccttactcggacccacagcctc atggccatgtccctgccaggaagtagacggacctctgctggatcacgcagcgggggcact ttgggccgcagcggcctggcagtgttcgcccagtgtccgcagctgcccgccagccagaac gagcacctgcctcttcttcctgcctccaggcgcacctctccacctgtgagcgtgcgggat gcctacggcacctcttcgctcagcagcagcagcaattctggctcctacaagggcagtgac agcagtcccacgccaaggcgctccatgaaatacacgctgtgcagtgacaaccatggcatc aagcccccgaccccggagcagtacctgacccccctgcagcagaaggaggtgtgcatccgg cacctgaaagcccggctgaaggacacacaggaccggctccaggaccgggacacagagatt gatgacctgaagacgcagctgtcacgcatgcaggaggactggattgaggaggagtgccac cgcgtggaggcccagctggccctgaaggaggcccgaaaggagatcaagcagctcaagcag gtcatcgacactgtcaagaacaacctgattgacaaggacaaggggctgcagaagtacttc gtggacatcaacatccagaacaagaagctggagacgctgctgcacagcatggaggtggcc cagaatggcatggccaaggaggatggcactggggagtcagccggtgggtcccctgcccgc tccctcacccgcagctccacctacaccaagctgagtgacccggctgtctgtggtgaccgc cagccgggtgatccctccagcggctctgctgaggatggggcagacagtggctttgcagca gccgatgacacactgagccggacggacgcgctggaagccagcagcctgctgtcgtcgggg gtggactgtggcaccgaggagacctcgctgcacagctccttcggcctgggcccccgcttc cctgccagcaacacctatgagaagctgctgtgtggcatggaggctggtgtgcaggccagc tgcatgcaggagcgtgccatccagacagacttcgtgcagtaccagcctgaccttgacacc atcctggagaaagtgacccaggcccaggtctgtgggacagaccctgagtcaggggacagg tgcccagagctggatgcccacccttcagggcccagagaccccaactcagcagtggtggtg acagtgggtgatgagctagaggccccagagcccatcacccgtggacccaccccacagcgg cctggtgccaaccccaaccctggccagtcggtgagcgtggtgtgccccatggaagaggag gaggaggctgccgtggctgagaaggagcccaagagctactggagccgccactacatcgtg gatctgctggctgtggtggtgccggccgtgcccacggtggcctggctttgccgctcccag cggcgccagggccagcccatctacaacatcagctccctgctgcggggctgctgcactgtg gccttgcactccatccgcaggatcagctgccgctcgctgagccagccgagtcccagccca gcgggcggcggctcccagctctga >gi568815578r:1272115_1492998|GENSCAN_predicted_peptide_4|444_aa MGPSEAEFAAAAVLYSMQQNGCQLLLGRSGSQLRDRATFQGGKQIQQRQQLEKVVEQHLA CRGPKSGPSHSGPGQSLTQDASPASPGNSHFPTTSSGGWKPKIKVLANSVSVLYPNLAEL ENYMGLSLSSQEVQESLLQIPEGDSTAVSGPGPGQMVAPVTGYSLGVRRAEIKPGVREIH LCKDERGKTGLRLRKVDQGLFVQLVQANTPASLVGLRFGDQLLQIDGRDCAGWSSHKAHQ VVKKASGDKIVVVVRDRPFQRTVTMHKDSMGHVGFVIKKGKIVSLVKGSSAARNGLLTNH YVCEVDGQNVIGLKDKKIMEILATAGNVVTLTIIPSVIYEHMVKKAWHLSSSCRTSTAWS LTGSLHLCIDAGSEHTHMHSRAGPGPPATTVPGPVPGLSGELLPPLTRRKAHRRPQESPR KRGPSTLWARALVTHTGGLRPEQE >gi568815578r:1272115_1492998|GENSCAN_predicted_CDS_4|1335_bp atgggcccatcagaggctgagtttgcagcagcagctgtcctgtacagcatgcagcaaaac ggctgtcagctcttattaggaagatcaggaagccagctgagagacagggctacgtttcag ggagggaaacagattcagcagcggcagcagctggagaaggtcgtggagcagcaccttgcc tgcagaggacctaaaagtggaccaagccattcaggcccaggtcagagcctcacccaagat gccagccctgccagtccaggcaacagccatttccccaccaccagctctggaggctggaag cccaagatcaaggttctggccaattcagtttctgttttgtacccaaacttggcagaactg gaaaattatatgggtctttccctctccagccaagaagtccaggagagcctgcttcagatt ccagagggtgacagtacagcggtctcgggccccgggcccggccagatggtggcaccggta accgggtacagcctgggcgtgcggcgagctgagatcaagcccggggtgcgcgagatccac ctgtgcaaggacgagcgcggcaagaccgggctgaggctgcggaaggtcgaccaggggctc tttgtgcagttggtccaggccaacacccctgcatcccttgtggggctgcgctttggggac cagctcctgcagattgacgggcgtgactgtgctgggtggagctcgcacaaagcccatcag gtggtgaagaaggcatcaggcgataagattgtcgtggtggttcgggacaggccgttccag cggactgtcaccatgcacaaggacagcatgggccacgtcggcttcgtgatcaagaagggg aagattgtctctctggtcaaagggagttctgcggcccgcaacgggctcctcaccaaccac tacgtgtgtgaggtggacgggcagaatgttatcgggctgaaggacaaaaagatcatggag attctggccacggctgggaacgttgtcaccctgaccatcatccccagtgtgatctacgag cacatggtcaaaaaggcatggcacctgagcagcagctgccgcacctccaccgcctggagc ctcacaggcagcctgcacctttgcatagatgcgggatctgagcacacacacatgcactcc agagctgggccagggccaccggccaccacagtcccagggcctgtcccagggctgagtggg gagctgctccctccactgacaagaagaaaagcccatcgaaggccccaggagagcccccga aagagaggaccctcaacactgtgggccagggccctcgtgacccacaccggtggtctcagg ccagaacaggagtag >gi568815578r:1272115_1492998|GENSCAN_predicted_peptide_5|289_aa MAAMVLGTLYCISCLVLALGVKEQLGGQQAISAAVSGNIGSWSRRCVPPVLFPWPLDSDS VGHKKSDPIRIINFHMLNKNVCMEMLNRLEAQRKGRRTKQVTRQTQLYPEASGLPEVSGL LGFRSLFRRPGFPGRRAPCGARTQGWARRAEVLGRAVEPPPGRCWSTPPVAPPARSASAA AMGVQVETISPGDGRTFPKRGQTCVVHYTGMLEDGKKFDSSRDRNKPFKFMLGKQEVIRG WEEGVAQMSVGQRAKLTISPDYAYGATGHPGIIPPHATLVFDVELLKLE >gi568815578r:1272115_1492998|GENSCAN_predicted_CDS_5|870_bp atggctgccatggtcctgggcacactgtactgcatctcctgcttggtgctggctctgggg gtgaaggaacagttagggggacagcaggccatttccgcagcagttagtggaaatattggt tcttggagcaggcgctgcgtgcctccagtgcttttcccctggcccctggattctgattcc gttgggcataagaagagtgacccaattcgtattatcaactttcacatgctcaataagaat gtatgcatggagatgcttaaccggctggaagcacagaggaagggcaggagaacgaagcaa gtcacccggcaaacacagctgtatccggaggcctccgggcttccggaggtctcggggctt ctgggcttccggtccctcttccggaggcctgggtttccgggacgtcgcgcgccgtgtggg gcgcgcacgcagggctgggcgcgacgcgccgaggtactaggcagagccgtggaaccgccg ccaggtcgctgttggtccacgccgcccgtcgcgccgcccgcccgctcagcgtccgccgcc gccatgggagtgcaggtggaaaccatctccccaggagacgggcgcaccttccccaagcgc ggccagacctgcgtggtgcactacaccgggatgcttgaagatggaaagaaatttgattcc tcccgggacagaaacaagccctttaagtttatgctaggcaagcaggaggtgatccgaggc tgggaagaaggggttgcccagatgagtgtgggtcagagagccaaactgactatatctcca gattatgcctatggtgccactgggcacccaggcatcatcccaccacatgccactctcgtc ttcgatgtggagcttctaaaactggaatga >gi568815578r:1272115_1492998|GENSCAN_predicted_peptide_6|409_aa MAAERQEALREFVAVTGAEEDRARFFLESAGWDLQIALASFYEDGGDEDIVTISQATPSS VSRGTAPSDNRVTSFRDLIHDQDEDEEEEEGQRFYAGGSERSGQQIVGPPRKKSPNELVD DLFKGAKEHGAVAVERVTKSPGETSKPRPFAGGGYRLGAAPEEESAYVAGEKRQHSSQDV HVVLKLWKSGFSLDNGELRSYQDPSNAQFLESIRRGEVPAELRRLAHGGQVNLDMEDHRD EDFVKPKGAFKAFTGEGQKLGSPCSVEDTKEEKGRVTCPHYSGARELAITYEIYDSGFEV NSAGPVLSTSSPAQQAENEAKASSSILIDESEPTTNIQIRLADGGRLVQKFNHSHRISDI RLFIVDARPAMAATSFILMTTFPNKELADESQTLKEANLLNAVIVQRLT >gi568815578r:1272115_1492998|GENSCAN_predicted_CDS_6|1230_bp atggcggcggagcgacaggaggcgctgagggagttcgtggcggtgacgggcgccgaggag gaccgggcccgcttctttctcgagtcggccggctgggacttgcagatcgcgctagcgagc ttttatgaggacggaggggatgaagacattgtgaccatttcgcaggcaacccccagttca gtgtccagaggcacagcccccagtgataatagagtgacatccttcagagacctcattcat gaccaagatgaagatgaggaggaagaggaaggccagaggttttatgctgggggctcagag agaagtggacagcagattgttggccctcccaggaagaaaagtcccaacgagctggtggat gatctctttaaaggtgccaaagagcatggagctgtagctgtggagcgagtgaccaagagc cctggagagaccagtaaaccgagaccatttgcaggaggtggctaccgccttggggcagca ccagaggaagagtctgcctatgtggcaggagaaaagaggcagcattccagccaagatgtt catgtagtattgaaactctggaagagtggattcagcctggataatggagaactcagaagc taccaagacccatccaatgcccagtttctggagtctatccgcagaggggaggtgccagca gagcttcggaggctagctcacggtggacaggtgaacttggatatggaggaccatcgggac gaggactttgtgaagcccaaaggagccttcaaagccttcactggcgagggtcagaaactg ggcagcccttgttcagttgaagacaccaaggaggaaaaaggccgagtaacttgcccacat tactcaggagcgagggagctagcaataacgtatgaaatatatgactcagggtttgaagtg aattctgctggacctgtgttgagtaccagctctccagcccaacaggcagaaaatgaagcc aaagccagctcttccatcttaatcgacgaatcagagcctaccacaaacatccaaattcgg cttgcagacggcgggaggctggtgcagaaatttaaccacagccacaggatcagcgacatc cgactcttcatcgtggatgcccggccagccatggctgccaccagctttatcctcatgact actttcccgaacaaagagctggctgatgagagccagaccctgaaggaagccaacctgctc aatgctgtcatcgtgcagcggttaacataa >gi568815578r:1272115_1492998|GENSCAN_predicted_peptide_7|338_aa MSAPTCLAHLPPCFLLLALVLVPSDASGQSSRNDWQVLQPEGPMLVAEGETLLLRCMVVG SCTDGMIKWVKVSTQDQQEIYNFKRGSFPGVMPMIQRTSEPLNCDYSIYIHNVTREHTGT YHCVRFDGLSEHSEMKSDEGTSVLVKGAGDPEPDLWIIQPQELVLGTTGDTVFLNCTVLG DGPPGPIRWFQGAGLSREAIYNFGGISHPKETAVQASNNDFSILLQNVSSEDAGTYYCVK FQRKPNRQYLSGQGTSLKVKAKSTSSKEAEFTSEPATEMSPTGLLVVFAPVVLGLKAITL AALLLALATSRRSPGQEDVKTTGPAGAMNTLAWSKGQE >gi568815578r:1272115_1492998|GENSCAN_predicted_CDS_7|1017_bp atgtcggcccccacctgcctggcccacttgcctccctgcttcctgctgctggcactggtc cttgtcccctcagatgcctctgggcagagcagcaggaatgactggcaggtgctacagccc gagggccccatgctggtggcagaaggtgagacacttctactgaggtgtatggtggtcggc tcctgcactgatggtatgataaaatgggtgaaggtgagcactcaggaccaacaggaaatt tataactttaaacgtggctccttccctggggtaatgcccatgatccaacggacatcagaa ccactgaattgtgattattccatctatatccacaatgtcaccagggagcacactggaacc taccactgtgtgaggtttgatggtttgagtgaacactcagaaatgaaatcggatgaaggc acctcagtgcttgtgaagggagctggggaccctgaaccagacctgtggatcatccagccc caggaattggtgttggggaccactggagacactgtctttctgaactgcacagtgcttgga gacggtccccctggacccatcaggtggttccagggagctggtctgagccgggaggccatt tacaactttggaggcatctcccaccccaaggagacagcggtgcaggcctccaacaatgac ttcagcattcttctgcaaaacgtctccagtgaggatgcaggcacctattactgtgtaaag tttcagaggaaacccaacaggcaatacctgtctggacagggcaccagcctgaaagtgaaa gcaaaatctacctcttccaaagaggcagaattcaccagtgaacctgcaactgagatgtct ccaacaggcctcctggttgtgttcgcacctgtggtcctggggctgaaggcaattaccttg gctgcactcctactggccctggctacctctcggaggagccctgggcaagaagatgtcaag accacaggcccagcaggagccatgaacaccttagcatggagcaagggtcaagagtga