GENSCAN 1.0 Date run: 2-Nov-116 Time: 19:59:03 Sequence gi568815583f:68545160_68825971 : 280812 bp : 48.37% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.05 PlyA - 122 117 6 1.05 1.04 Term - 14978 14343 636 2 0 57 48 216 0.067 8.75 1.03 Intr - 21905 21761 145 1 1 77 49 91 0.100 4.38 1.02 Intr - 32884 32706 179 1 2 36 -13 169 0.499 0.32 1.01 Init - 34105 33422 684 1 0 78 78 326 0.676 23.70 1.00 Prom - 56688 56649 40 -5.66 2.05 PlyA - 57075 57070 6 1.05 2.04 Term - 63771 63611 161 1 2 113 54 72 0.644 4.40 2.03 Intr - 67594 67501 94 1 1 84 70 21 0.075 -0.56 2.02 Intr - 72338 72284 55 1 1 90 105 33 0.817 4.18 2.01 Init - 74177 74122 56 2 2 52 59 53 0.768 -0.44 2.00 Prom - 77754 77715 40 -2.26 3.00 Prom + 78947 78986 40 -6.16 3.01 Init + 94085 94168 84 1 0 75 53 107 0.706 4.72 3.02 Intr + 100001 100201 201 1 0 123 96 492 0.970 52.98 3.03 Intr + 107306 107471 166 1 1 88 12 44 0.002 -3.67 3.04 Intr + 118822 118971 150 2 0 47 52 77 0.001 0.13 3.05 Intr + 128977 129151 175 2 1 65 76 89 0.072 4.40 3.06 Intr + 138572 138643 72 2 0 118 -2 79 0.004 0.42 3.07 Intr + 141851 141951 101 2 2 71 89 34 0.018 1.55 3.08 Intr + 149219 149421 203 0 2 46 49 87 0.631 -0.40 3.09 Intr + 149981 150097 117 1 0 133 97 204 0.958 26.46 3.10 Intr + 151468 151492 25 0 1 87 75 8 0.239 -3.00 3.11 Intr + 154950 155050 101 1 2 67 78 77 0.393 4.43 3.12 Intr + 165573 165722 150 2 0 120 91 319 0.999 35.76 3.13 Intr + 166383 166547 165 2 0 128 44 284 0.997 28.16 3.14 Intr + 168766 168882 117 0 0 133 76 97 0.906 13.66 3.15 Intr + 169400 169504 105 1 0 50 62 132 0.976 7.21 3.16 Intr + 170056 170152 97 0 1 90 97 218 0.999 22.48 3.17 Intr + 173539 173651 113 2 2 60 110 259 0.533 25.40 3.18 Intr + 173985 174075 91 2 1 82 66 80 0.997 4.77 3.19 Intr + 174254 174393 140 0 2 60 94 134 0.652 11.38 3.20 Term + 180684 180815 132 2 0 117 36 356 0.975 31.39 3.21 PlyA + 180993 180998 6 1.05 4.03 PlyA - 182196 182191 6 1.05 4.02 Term - 209956 209821 136 0 1 136 36 112 0.726 8.29 4.01 Init - 223299 223238 62 0 2 78 73 34 0.067 1.62 4.00 Prom - 228382 228343 40 0.14 5.10 PlyA - 232604 232599 6 1.05 5.09 Term - 234983 234922 62 0 2 112 42 114 0.998 7.17 5.08 Intr - 235314 235224 91 0 1 106 29 157 0.581 11.17 5.07 Intr - 237894 237797 98 1 2 38 94 293 0.978 24.63 5.06 Intr - 239436 239238 199 0 1 91 64 428 0.999 39.62 5.05 Intr - 242376 242254 123 0 0 73 119 35 0.983 5.88 5.04 Intr - 242760 242611 150 0 0 105 94 138 0.927 16.46 5.03 Intr - 248483 248254 230 0 2 63 71 83 0.163 1.69 5.02 Intr - 274574 274404 171 0 0 64 52 95 0.283 3.41 5.01 Intr - 276099 275740 360 1 0 111 92 180 0.291 15.89 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 85230 85041 190 2 1 62 48 149 0.944 5.22 S.002 Init - 86796 86636 161 0 2 75 82 76 0.817 3.14 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815583f:68545160_68825971|GENSCAN_predicted_peptide_1|547_aa MQKLRGGAGARARRAGAAAAARGGGGRGSGSSRLAPRAPPAARPSLGAALRAGSAQGPLV GARPPLVADVSPECAFIIDEPPSPQEGEAGRKGERERARAGGRGEEEGPGTRYKGGGGGR RRRKRSPPAAVPRASRSIAGRRAPLLPESGRAPGSAALTPSGLRTPRGWARLAPGTCRLG ERAAPRALGCPAGPGTLVKSQAGRPRRCAGKEGGRVRPLEARKVGPWAIQNRGEGGGEEL AEVEAEGFSKQDSLASPEVLASCSTVITGAIHLDSEGNVIYEHLLCGKSYYDTHFTDEKT EAQNPENREPQTQVGEWGSASLSHLLDTLPLRWQAQAAPSQCLSTARVQVPGHVIPLRDS SNRQSPPGSPCWAGRDLVRSALPSEALPASRPYTGLNPQQTSCTPNAISASAYQRTPTGT HAKSHTPCALRARAFAGDLLVLERRSSEEGCGPRPLVGTPWSCTWSGLGRRSLTDSPPGR TPESTAPTGKPETEAQTGKGLGKEKVWGPLTISTLVPSAPRLVPAALPAGRLGTVNFALP ATPSPPR >gi568815583f:68545160_68825971|GENSCAN_predicted_CDS_1|1644_bp atgcagaaactccgcggcggagcgggggcgcgggcgcggcgtgcgggggcggcggcggcg gcccggggcggcggcggaagggggtccggcagctcccggctcgctccccgcgccccgccg gcggctcggccgagcctgggggctgcgctccgggcgggcagcgcgcagggaccgctcgtc ggcgctcgcccgccgctcgtcgctgatgtcagccccgaatgtgcatttataatagatgag cctccttcgccgcaggaaggcgaggcaggaagaaagggagagagggagagggcccgggct ggggggcggggggaggaagaggggccgggaactcgctacaaaggaggaggcggagggagg aggaggaggaagcgcagcccgccagcggccgtgccccgagcctcccgctcgatcgccggc cgccgggcaccgctgctcccggagtccggacgggcgcccgggtccgcggccttaacccct tcgggcctgcggacgccacgcggctgggcgaggctggctcccggcacctgccggctggga gaacgggcggcgccccgggctctcgggtgtcccgctggcccggggacgctggtgaagtca caggctggacggccgaggcgctgcgcgggcaaagaaggcggtcgggttcgccccctggag gctcgcaaagtcggcccgtgggccatacagaaccgcggagagggtggaggggaggagtta gcagaggtggaagcagaggggttctccaagcaggacagcctggcaagcccggaagtgctg gcttcctgctccacggtaattaccggtgcaattcacctggacagcgaaggcaacgtcata tatgagcacctgctctgtggaaagtcttattatgatacccacttcacagatgagaaaact gaagcacagaacccagagaacagagaacctcagacccaagtgggagagtggggatcagcc agtctgagtcatttattggacacgctgcctcttcgctggcaagctcaggcagcccccagc caatgcctgagcactgcacgggtgcaagtgcccggccacgtcatcccattgcgggactct tcaaatcggcagtctccaccgggatctccctgctgggctggcagagacttggtcagatct gcattgccgtctgaggctctccccgccagtcgtccttacacaggtttaaatccccaacaa acttcttgcactcctaacgccatctcagcgtctgcttaccagaggaccccaaccggcaca catgccaagagccacacgccttgtgctctccgtgcgcgcgcgtttgcgggagacctgctg gtcctggagcgccgcagctccgaggaaggatgcgggccgcgccccctggtgggcacgccg tggagctgcacgtggagcggtctcgggcgccgcagcctcaccgacagtccccctgggaga accccagagagcactgcaccaaccggaaaaccggaaacggaggcccagacagggaaaggg ttagggaaagaaaaagtctgggggcctcttaccatcagcacgcttgttccgtctgctccc cgtctggtccctgcagctcttccagcgggccggcttggcacggtcaacttcgccctcccc gctaccccctcaccaccccgctga >gi568815583f:68545160_68825971|GENSCAN_predicted_peptide_2|121_aa MHTLQQDENLSDKIQAKGKWECGGPVWASDLLKVTQLRRRLRREDHEFLHVKSRAHGQLE SKLSSADQVPSGSFGKHLLNGWMEAEIANTVEGNDDAEDESAEFQLPSWSPITLQSQLWE A >gi568815583f:68545160_68825971|GENSCAN_predicted_CDS_2|366_bp atgcatacattgcagcaggatgaaaatctaagtgataaaattcaggctaaaggaaagtgg gaatgtggaggtccagtatgggcaagtgacctgctcaaggtcacacagctgaggaggaga ctgcggcgtgaggatcatgagtttcttcatgtgaagtcacgagctcatggtcagttagag agcaagttatcatcagcagaccaagtacccagtgggagttttggtaagcacttgttgaac ggatggatggaggctgaaatagcaaacacagtggaagggaatgatgatgctgaggatgag agtgcggaatttcaactcccctcctggagccccataactcttcaaagccagctgtgggaa gcctga >gi568815583f:68545160_68825971|GENSCAN_predicted_peptide_3|834_aa MRLSYKGALNPAWSGVGLCRGQKELDDLMSWRPQYRSSKFRNVYGKVANREHCFDGIPIT KNVHDNHFCAVNTRFLAIVTESAGGGSFLVIPLEQIQATGRTGLETELPATPTGTWQKEK KPGLDIKSLGSPYNPAILSPSCTPGLWGWDERPKFRKLIISSVDEDEEQLELSYYTAGSV QCYSHFGKLDDRSILHFDHGESKSLDVAESSAATALCWWNAGSTAFLLGHDAALWLGEAF QRRSNPLGSESSLLGASGRACSTSRQESVALVKTTVIMVLTACCSGIAISHLSHVDSIWI PVFCAGTLEAWHSVVAERGNRGSEKFINFPKATTFTAKNQQSCNSNTGLWDSEDYAFNHR ALCTGVQDVHCRRTLIHIVERMARTGRIEPNYPKVCGHQGNVLDIKWNPFIDNIIASCSE DTSTLPTFTSGAPEASATEALPPPIPAGMVNLLLTRHDIQTQAWEVRIWEIPEGGLKRNM TEALLELHGHSRRVGLVEWHPTTNNILFSAGYDYKVLIWNLDVGEPVKMIDCHTDVILCM SFNTDGSLLTTTCKDKKLRVIEPRSGRVLQEANCKNHRVNRVVFLGNMKRLLTTGVSRWN TRQIALWDQEDLSMPLIEEEIDGLSGLLFPFYDADTHMLYLAGKGDGNIRYYEISTEKPY LSYLMEFRSPAPQKGLGVMPKHGLDVSACEVFRFYKLVTLKGLIEPISMIVPRRSDSYQE DIYPMTPGTEPALTPDEWLGGINRDPVLMSLKEGYKKSSKMVFKAPIKEKKSVVVNGIDL LENVPPRTENELLRMFFRQQDEIRRLKEELAQKDIRIRQLQLELKNLRNSPKNC >gi568815583f:68545160_68825971|GENSCAN_predicted_CDS_3|2505_bp atgaggctcagctataagggagccctgaacccggcatggagtggggtggggctgtgcagg ggccagaaggagttggatgatctgatgtcctggcgtccgcaataccgtagctccaagttc cggaatgtctacgggaaggtggccaaccgggagcactgcttcgatgggatccccatcacc aagaatgtgcacgacaaccacttctgtgccgtcaacacccgcttcctggccatcgtcacc gagagcgcagggggcggctccttcctcgtcatccccctggagcagatccaggccacagga agaacagggctagaaacagagctacctgctaccccaactggcacctggcagaaagagaag aaacccggcctagatatcaagagtctgggttctccctataacccagccattctcagtccc agctgtacaccgggtctctggggctgggatgaaaggccaaaatttagaaagctgataata tcaagtgttgatgaggatgaggagcagctggaactctcatactacactgctgggagtgta cagtgttacagccactttggaaaactggatgacagaagtattctacattttgatcatggt gagtctaagagcttggatgtggctgagagtagcgcagccaccgccctctgctggtggaat gctggtagcacagctttcctgcttggccacgatgctgccctctggctaggcgaggctttc cagagaagaagcaaccctttggggtctgagagctccctgctgggggcttcgggacgagca tgttctacctctcggcaagagagtgtggccttggtcaagaccactgtcatcatggtgctc acagcctgttgttctggcattgccatctcacatctttctcatgtggactcaatatggatt cctgtgttctgtgctggtacactggaagcatggcactctgtggttgcagagaggggaaac cgaggctcagagaagttcattaactttcctaaggccacaactttcactgctaaaaatcag cagagctgtaattcaaacacaggcctgtgggactccgaagactatgctttcaaccacagg gcactgtgtaccggtgttcaagatgtacactgcagaaggacgctcattcatatcgtagag agaatggctcgtacaggcaggattgaacccaactaccccaaggtctgcggccaccagggc aatgtgctggatatcaaatggaaccccttcatcgacaacatcattgcctcgtgctcggag gacacgtcgactctaccaactttcaccagtggagcacctgaggcatcggccaccgaggcc ctccctccacccattccagctggaatggtcaacttgctgctcaccagacacgacatccag acccaggcttgggaggtgcggatctgggagatccccgagggcgggctgaagcggaacatg acggaggcgctcctggagctgcacgggcacagccggcgtgtggggctggtcgagtggcac cccaccaccaacaacatcctgttcagcgctggctacgactacaaggtcctcatctggaac ctggatgtgggtgagccggtgaagatgattgactgccacacggatgtgatcctctgcatg tccttcaacacggacggcagcctgctcaccaccacgtgcaaggacaagaagctgcgtgtg attgagccccgctctggccgtgttctgcaggaggccaactgcaaaaaccacagagtgaac cgggtggtgttcctggggaacatgaagcggctcctcacgacaggggtctccaggtggaac acaagacagattgccctctgggaccaggaggacctctccatgcccctgatcgaagaggaa attgatgggctctctggcctcctgttccccttctatgatgctgacacccacatgctctac ctggctggaaagggtgatggaaacatccggtactacgagatcagcactgagaagccctac ctgagttacctcatggagttccgctccccagccccgcagaaaggcctaggggtcatgccc aagcacgggctggatgtgtcagcctgcgaggtgttccgcttctacaagctggtgactctc aagggcctgatcgagcccatctccatgatcgtgccccggaggtcagattcctaccaggaa gacatttacccaatgacaccaggcacggagccagcactgaccccggatgaatggctggga ggcatcaaccgagatcccgtgctgatgtctttgaaagaaggctataagaagtcctcaaaa atggtatttaaggctcccatcaaagaaaagaagagtgttgtggtcaacggaatagattta ttagaaaatgtcccacccaggacagagaatgagctccttcgaatgttcttccggcagcag gatgagattcgacggttgaaagaggagctggcccagaaggacatccgcattcggcagctc cagctggaactgaaaaacttgcgcaacagccccaagaactgttag >gi568815583f:68545160_68825971|GENSCAN_predicted_peptide_4|65_aa MSYHSSTKAVKIKETDNTKSCPGILLQSSATPLLRKLPVQVLIKDVCLPCYKQERKHYFL DCVFT >gi568815583f:68545160_68825971|GENSCAN_predicted_CDS_4|198_bp atgagttatcactcatccactaaagcagtgaaaattaaggagactgacaacaccaaaagt tgccctggcatactgctgcaaagctccgcaacgcctttgctcaggaagctccccgtgcag gttctcatcaaggatgtctgcctcccctgctacaaacaggaacgaaaacactacttcctg gattgcgtctttacataa >gi568815583f:68545160_68825971|GENSCAN_predicted_peptide_5|494_aa XRTCGPAPGRTLRRRLAGKAGRLGPRGTVTFPSPAIPPTPLTQPQSSRRAPQPPRNLKGS VSGTTTKWPSLPGRLRSPPIGPFPPPPPLPLAVARRPSAAIGSGRCRSSMGPQIPPPRRS APSGFPKMPAFGPWCPPARTPGPSFSVLGLSAAIRRLLNKWARETLSGRELPGAGAREAF EECTLARPQSCYLAHWTGVVYWWLSPPSRPPHLLLARTPTLAYCFFRVSSANSVAGVPHV ERAGDLHAEHWVIQVKELVLDNSRSNEGKLEGLTDEFEELEFLSTINVGLTSIANLPKLN KLKKLELSDNRVSGGLEVLAEKCPNLTHLNLSGNKIKDLSTIEPLKKLENLKSLDLFNCE VTNLNDYRENVFKLLPQLTYLDGYDRDDKEAPDSDAEGYVEGLDDEEEDEDEEEYDEDAQ VVEDEEDEDEEEEGEEEDVSGEEEEDEEGYNDGEVDDEEDEEELGGMAAFLLSAEEERGQ KRKREPEDEGEDDD >gi568815583f:68545160_68825971|GENSCAN_predicted_CDS_5|1485_bp naacgcacctgcggccccgcccccggccgaacgctgaggcggcggctggctggcaaggcc gggcggctcggccctcgaggcacagtcaccttcccttcccccgccattccgcccaccccc ctcactcagccacagagcagccgccgcgctcctcagccgccccgaaatctaaaggggtcc gtctccggcaccactaccaaatggccgagcctccctggccggctgcgcagcccgcccatt ggtcccttcccccccccgccgccactgccattggctgttgcccggagaccctcggcggcg attggctcgggccgctgccgctcgtccatggggccgcagatcccgcctccacggcgatca gctccttccggcttccccaagatgccagcctttgggccctggtgccccccagctcgaacc cccggcccgtctttctcagtgctgggcttgtcggcagccattcggcgcctgcttaataaa tgggcacgggaaactttgtcaggacgcgagctgcccggagctggcgcccgtgaggcattt gaagaatgcaccctggcccggccccagtcctgctacctggcacactggactggagttgtt tactggtggctgtctcctcccagccggccaccacacctcttgctggcaaggaccccgacc ttggcttactgtttcttccgtgtctcctcagccaacagtgtcgcaggtgtgccccatgtg gagagggctggagacctgcatgcagagcactgggtcattcaggtgaaagaacttgtcctg gacaacagtcggtcgaatgaaggcaaactcgaaggcctcacagatgaatttgaagaactg gaattcttaagtacaatcaacgtaggcctcacctcaatcgcaaacttaccaaagttaaac aaacttaagaagcttgaactaagcgataacagagtctcagggggcctggaagtattggca gaaaagtgtccgaacctcacgcatctaaatttaagtggcaacaaaattaaagacctcagc acaatagagccactgaaaaagttagaaaacctcaagagcttagaccttttcaattgcgag gtaaccaacctgaacgactaccgagaaaatgtgttcaagctcctcccgcaactcacatat ctcgacggctatgaccgggacgacaaggaggcccctgactcggatgctgagggctacgtg gagggcctggatgatgaggaggaggatgaggatgaggaggagtatgatgaagatgctcag gtagtggaagacgaggaggacgaggatgaggaggaggaaggtgaagaggaggacgtgagt ggagaggaggaggaggatgaagaaggttataacgatggagaggtagatgacgaggaagat gaagaagagcttggtggtatggcagcctttttactctcagctgaagaagaaaggggtcag aagcgaaaacgagaacctgaagatgagggagaagatgatgactaa