GENSCAN 1.0    Date run: 4-Aug-121    Time: 20:32:44

Sequence gi568815597r:93054830_93280242 : 225413 bp : 37.66% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +  12586  12753  168  1  0  111   77    39 0.010   3.24
 1.02 Intr +  24331  24606  276  1  0   61  100   204 0.000  14.61
 1.03 Intr +  55401  55540  140  0  2   55   32   162 0.181   6.49
 1.04 Intr +  59859  59954   96  1  0   66   86    82 0.324   4.86
 1.05 Intr +  60159  60259  101  1  2   24   71    56 0.686  -3.59
 1.06 Intr +  60641  60789  149  1  2   80   89   119 0.990   9.31
 1.07 Intr +  64504  64572   69  1  0  111  103    23 0.891   3.48
 1.08 Intr +  74449  74619  171  1  0   33   93   171 0.991  10.24
 1.09 Intr +  81841  82156  316  1  1   64   56   357 0.060  25.24
 1.10 Intr +  85607  85654   48  1  0   79  116    28 0.060   2.66
 1.11 Intr +  94039  94210  172  0  1   42   99    86 0.131   3.69
 1.12 Term +  96237  96337  101  1  2  127   43    35 0.328   0.21
 1.13 PlyA +  97708  97713    6                               1.05

 2.06 PlyA -  97925  97920    6                               1.05
 2.05 Term - 100059  99841  219  0  0   97   41   137 0.994   5.96
 2.04 Intr - 101654 101471  184  1  1   92   65   128 0.991   9.77
 2.03 Intr - 105397 105300   98  1  2   91  113    63 0.881   6.99
 2.02 Intr - 116413 116289  125  2  2   41   89    47 0.078  -0.52
 2.01 Init - 116595 116523   73  0  1   28  101    90 0.150   5.58
 2.00 Prom - 120146 120107   40                              -6.75

 3.03 PlyA - 121040 121035    6                               1.05
 3.02 Term - 125151 125014  138  0  0   74   45   108 0.670   2.08
 3.01 Init - 125413 125225  189  1  0   82  108   321 0.999  30.56
 3.00 Prom - 127160 127121   40                              -8.65

 4.00 Prom + 128282 128321   40                              -8.85
 4.01 Init + 128533 128666  134  0  2   47  111    97 0.729   7.56
 4.02 Intr + 129149 129317  169  2  1   89   25   111 0.958   3.93
 4.03 Intr + 131516 131674  159  1  0  105   53    97 0.983   7.16
 4.04 Intr + 137171 137277  107  1  2   72   84    75 0.991   3.59
 4.05 Intr + 138787 138915  129  1  0   57   95    64 0.874   2.89
 4.06 Intr + 147063 147219  157  0  1   80   97   138 0.583  12.99
 4.07 Intr + 150681 150802  122  2  2   61   87    74 0.927   2.97
 4.08 Intr + 152278 152569  292  1  1   99  100   241 0.980  22.61
 4.09 Intr + 155973 156097  125  2  2   60   72    23 0.609  -3.64
 4.10 Intr + 157272 157432  161  0  2   95   97   110 0.997  11.31
 4.11 Intr + 159914 160137  224  0  2   28   74   176 0.483   7.12
 4.12 Intr + 161807 161917  111  1  0   83   53    91 0.964   4.76
 4.13 Intr + 162909 163040  132  2  0   86   89    89 0.997   8.72
 4.14 Intr + 171504 171620  117  2  0   95   84    91 0.997   9.14
 4.15 Intr + 177612 177764  153  2  0   35   76   132 0.737   6.05
 4.16 Intr + 181419 181561  143  2  2   33   98   176 0.999  11.33
 4.17 Intr + 184481 184644  164  2  2   70  107   132 0.999  12.00
 4.18 Intr + 184854 185067  214  1  1   65   95   147 0.560   9.95
 4.19 Intr + 191276 191375  100  2  1   37   15   112 0.314  -2.01
 4.20 Intr + 192009 192125  117  2  0   40   99    87 0.845   4.74
 4.21 Intr + 199642 199785  144  0  0   60   58   133 0.858   7.06
 4.22 Intr + 201506 201709  204  1  0   78   23   204 0.981  11.37
 4.23 Intr + 203919 204056  138  2  0   38   88   149 0.884   9.64
 4.24 Intr + 209872 210072  201  0  0   19  106   210 0.202  14.46
 4.25 Term + 210148 210186   39  0  0   39   42    18 0.201 -11.49
 4.26 PlyA + 210637 210642    6                               1.05

 5.00 Prom + 210920 210959   40                              -4.65
 5.01 Init + 211393 212049  657  0  0   83   41   295 0.395  19.61
 5.02 Intr + 215518 215985  468  0  0   64   91   303 0.836  20.57

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815597r:93054830_93280242|GENSCAN_predicted_peptide_1|602_aa
XLILCPFFALVCDQARPTSLVYITPAPLPSGFWLGQDTIKAPYKLTVWRSEGTRKESPSR
GRRARCSSALAAQSLPPRGFGELPFGTGVAPRSQNAPAVRGKPKWRGAVLKWAVFEAGVR
TLILPPTLVSKDLGLCVHMCRVPGGAGAQDSTGAGNSLVHKRSPLRRNQKTPTSLTKLSL
QDGHKAKKPACKFEEGATGSGEMVCTICQEEYSEAPNEMVICDKCGQGYHQLCHTPHIDS
SVIDSDEKWLCRQCVFATTTKRGGALKKGPNAKALQVMKQTLPYSVADLEWDAGHKTNVQ
QCYCYCGGPGEFYTFICSVCSSGPEYLKRLPLQWFMSGKEIKKKKHLFGLRIRVPPVPPN
VAFKAEKEPEGTSHEFKIKGRKASKPISDSRLSDSRKRTRTGRSWPAAIPHLRRRRGRLP
RRALQTQNSEIVKDDEGKEDYQFDELNTEILNNLADQELQLNHLKNSITSYFGAAGRIAC
GEKYRVLARRVTLDGKLISHVQENKNWTVKCKGIIRFVRATYNCLIFEHIWSAIAADPNT
YIMQCWISFTIAHVITFIKAAVDELKPETDAGIQSPSPLCPVLCNIILALDSLIGVLSPD
VH

>gi568815597r:93054830_93280242|GENSCAN_predicted_CDS_1|1809_bp
nngttgattctctgtcctttctttgccctggtctgtgaccaggcaagaccgacttctctg
gtctatattaccccagctcccttgccctctggcttctggttgggtcaagacaccataaag
gcaccttataagcttactgtatggagatcagagggcaccaggaaagaaagccccagtcgg
ggacgcagggcgcggtgttcctccgcgctcgccgcgcagtccctgcccccccgcggcttt
ggagagctgccattcggcaccggagtcgctccgcgctcccagaatgcaccggcagtccgc
gggaaaccaaaatggcgaggggctgtattgaagtgggctgtgtttgaggccggtgtaaga
acgctcattctacccccaacccttgtctccaaggacctcggtttgtgcgtgcatatgtgc
cgggtacccggtggggcgggtgcccaagactctacaggggcaggtaattcactggtccac
aagcggtctcctttacgtcgaaaccaaaagaccccaacatccttgaccaagctgtcttta
caggatggacataaagccaaaaagccagcatgtaaatttgaagagggagccactggaagt
ggggaaatggtctgtacaatatgtcaagaagagtattcagaagctcccaatgaaatggtt
atatgtgacaagtgtggccaaggatatcatcagttgtgtcacacacctcatattgattcc
agtgtgattgattcagatgaaaaatggctctgtcggcagtgtgtttttgcaacaacaaca
aagaggggtggtgcacttaagaaaggaccaaatgccaaagcattgcaagtcatgaagcag
acattaccctatagtgtggcagaccttgaatgggatgcaggtcataaaaccaatgtccag
cagtgttactgctattgtggaggccctggagaattttatacgtttatatgctctgtctgc
agttctggaccagaatacctcaaacgtctaccattacagtggtttatgtctgggaaagaa
ataaagaagaagaagcatttgtttgggttgcgaattcgtgttcctcctgtgccaccaaat
gtggctttcaaagcagagaaagaacctgaaggaacatctcatgaatttaaaattaaaggc
agaaaggcatccaaacctatatctgattcaagattatctgactccagaaaaagaacgcgt
acaggaagatcttggcctgctgcaataccacatttgcggagaagaagaggtcgtcttcca
agaagagcactccagactcagaactcagaaattgtaaaagatgatgaaggcaaagaagat
tatcagtttgatgaactcaacacagagattctgaataacttagcagatcaggagttacaa
ctcaatcatctaaagaactccattaccagttattttggtgctgcaggtagaatagcatgt
ggcgaaaaataccgagttttggcacgtcgggtgacacttgatggaaagttgatcagtcat
gttcaggaaaataaaaactggacagtgaaatgtaagggcatcattcggtttgtcagggcc
acatacaactgcctgatatttgaacacatttggtcagcaattgctgcagaccctaatacg
tatataatgcagtgctggatatcattcactattgctcatgtgataacattcatcaaagct
gcagtggatgaattaaaaccagaaacagatgcagggatccagagtccatctccattgtgc
cctgtcttatgtaacatcatcctagctctagattctttaattggtgtcctttccccagac
gtacattag

>gi568815597r:93054830_93280242|GENSCAN_predicted_peptide_2|232_aa
MFHCVRNWRVLGLTDFKNEAADPRGVKLQTFAVRVTALKVACLELFVPPGGFVVSLASGV
KLQTFVVLDGAGLDIDFHLASPEGKTLVFEQRKSDGVHTVETEVGDYMFCFDNTFSTISE
KVIFFELILDNMGEQAQEQEDWKKYITGTDILDMKLEDILESINSIKSRLSKSGHIQTLL
RAFEARDRNIQESNFDRVNFWSMVNLVVMVVVSAIQVYMLKSLFEDKRKSRT

>gi568815597r:93054830_93280242|GENSCAN_predicted_CDS_2|699_bp
atgttccactgtgtccggaattggcgggttcttggtctcactgacttcaagaatgaagcc
gcggaccctcgcggagtgaagctgcagacctttgcagtgagagttacagctcttaaggtg
gcgtgtctggagttgttcgttcctcctggtgggtttgtggtctcgctggcttcaggagtg
aagctgcagaccttcgtggttttagatggagcaggattagatattgatttccatcttgcc
tctccagaaggcaaaaccttagtttttgaacaaagaaaatcagatggagttcacactgta
gagactgaagttggtgattacatgttctgctttgacaatacattcagcaccatttctgag
aaggtgattttctttgaattaatcctggataatatgggagaacaggcacaagaacaagaa
gattggaagaaatatattactggcacagatatattggatatgaaactggaagacatcctg
gaatccatcaacagcatcaagtccagactaagcaaaagtgggcacatacaaactctgctt
agagcatttgaagctcgtgatcgaaacatacaagaaagcaactttgatagagtcaatttc
tggtctatggttaatttagtggtcatggtggtggtgtcagccattcaagtttatatgctg
aagagtctgtttgaagataagaggaaaagtagaacttaa

>gi568815597r:93054830_93280242|GENSCAN_predicted_peptide_3|108_aa
MGDKIWLPFPVLLLAALPPVLLPGAAGFTPSLDSDFTFTLPAGQKECFYQPMPLKASLEI
EYQKLLVDGPGGEEARAVQEASPLRAGARGEARLSLGPGTQNLGWTWN

>gi568815597r:93054830_93280242|GENSCAN_predicted_CDS_3|327_bp
atgggcgacaagatctggctgcccttccccgtgctccttctggccgctctgcctccggtg
ctgctgcctggggcggccggcttcacaccttccctcgatagcgacttcacctttaccctt
cccgccggccagaaggagtgcttctaccagcccatgcccctgaaggcctcgctggagatc
gagtaccaaaaacttctggtggacggccctggtggtgaagaggcgagggccgttcaggaa
gccagcccgctccgcgcaggcgcgcgaggcgaggcgaggctctcgctggggccggggacc
cagaacttaggctggacctggaattga

>gi568815597r:93054830_93280242|GENSCAN_predicted_peptide_4|1251_aa
MESSSSDYYNKDNEEESLLANVASLRHELKITEWSLQSLGEELSSVSPSENSDYAPNPSR
SEKLILDVQPSHPGLLNYSPYENVCKISGSSTDFQKKPRDKMFSSSAPVDQEIKSLREKL
NKLRQQNACLVTQNHSLMTKFESIHFELTQSRAKVSMLESAQQQAASVPILEEQIINLEA
EVSAQDKVLREAENKLEQSQKMVIEKEQSLQESKEECIKLKVDLLEQTKQGKRAERQRNE
ALYNAEELSKAFQQYKKKVAEKLEKVKGSCANSVFCITVYIPTVKVQAEEEILERNLTNC
EKENKRLQERCGLYKSELEILKEKLRQLKEENNNGKEKLRIMAVKNSEVMAQLTESRQSI
LKLESELENKDEILRDKFSLMNENRELKVRVAAQNERLDLCQQEIESSRVELRSLEKIIS
QLPLKRELFGFKSYLSKYQMSSFSNKEDRCIGCCEANKLVISELRIKLAIKEAEIQKLHA
NLTANQLSQSLITCNDSQESSKLSSLETEPVKLGGHQVAESVKDQNQHTMNKQYEKERQR
LVTGIEELRTKLIQIEAENSDLKVNMAHRTSQFQLIQEELLEKASNSSKLESEMTKKCSQ
LLTLEKQLEEKIVAYSSIAAKNAELEQELMEKNEKIRSLETNINTEHEKICLAFEKAKKI
HLEQHKEMEKQIERVRQLDSALEICKEELVLHLNQLEGNKEKFEKQLKKKSEEKELKIKN
HSLQETSEQNVILQHTLQQQQQMLQQETIRNGELEDTQTKLEKQVSKLEQELQKQRESSA
EKLRKMEEKCESAAHEADLKRQKVIELTGTARQVKIEMDQYKEELSKMEKEIMHLKRDGE
NKAMHLSQLDMILDQTKTELEKKTNAVKELEKLQHSTETELTEALQKREVLETELQNAHG
ELKSTLRQLQELRDVLQKAQLSLEEKYTTIKDLTAELRECKMEIEDKKQELLEMDQALKE
RNWELKQRAAQVTHLDMTIREHRGEMEQKIIKLEGTLEKSELELKECNKQIESLNDKLQN
AKEQLREKEFIMLQNEQEISQLKKEIERTQQRMKEMESVMKEQEQYIATQYKEAIDLGQE
LRLTREQVQNSHTELAEARHQQVQAQREIERLSSELEDMKQLSKEKDAHGNHLAEELGAS
KVREAHLEARMQAEIKKLSAEVESLKEAYHMEMISHQENHAKWKISADSQKSSVQQLNEQ
LEKAKLELEEAQDTVSNLHQQVQDRNEVIEAANEALLTKFIEDYMTKYWPV

>gi568815597r:93054830_93280242|GENSCAN_predicted_CDS_4|3756_bp
atggaatctagttcatcagactactataataaagacaatgaagaggaaagtttgcttgca
aatgttgcttccttaagacatgaactgaagataacagaatggagtttgcagagtttaggg
gaagagttatccagtgttagtccaagtgaaaattctgattatgcccctaatccttcaagg
tctgaaaagctaattttggatgttcagcctagccaccctggacttttgaattattcacct
tatgaaaacgtctgtaaaatatctggtagcagcactgattttcaaaaaaagccaagagat
aagatgttttcatcttctgcccctgtggatcaggagattaaaagccttcgagagaaacta
aataaacttaggcaacagaatgcttgtttggtcacacagaatcattccttaatgactaaa
tttgaatctattcactttgaattaacacagtcaagagcaaaagtttctatgcttgagtct
gctcaacagcaggcagccagtgtcccaatcttagaagaacagattataaatttggaagca
gaggtttcagctcaagataaagttttgagagaggcagaaaataagctggaacagagccag
aaaatggtaattgaaaaggaacagagtttgcaggagtccaaagaggaatgtataaaatta
aaggtggacttacttgaacaaaccaaacaaggaaaaagagctgaacgacaaaggaatgaa
gcactatataatgccgaagagctgagtaaagctttccaacaatataaaaaaaaagtggct
gaaaaactggaaaaggtaaaaggcagttgtgcaaattcagtgttttgtattactgtctat
attccaacagtaaaggttcaagctgaagaagaaatattagagagaaatctaactaactgt
gaaaaagaaaataaaaggctacaagaaaggtgtggtctatataaaagtgaacttgaaatt
ctgaaagagaaattaaggcagttaaaagaagaaaataacaacggaaaagaaaaattaagg
atcatggcagtgaaaaattcagaagtcatggcacaactaactgaatctagacaaagtatt
ttgaagctagagagtgagttagagaacaaagacgaaatacttagagacaaattttcttta
atgaatgaaaaccgagaattaaaggtccgtgttgcagcacagaatgagcgactagattta
tgtcaacaagaaattgaaagttcaagggtagaactaagaagtttggaaaagattatatcc
cagttgccattaaaaagagaattatttggctttaaatcatatctttctaaataccagatg
agtagcttctcaaacaaggaagaccgttgcattggctgctgtgaggcaaataaattggtg
atttcggaattgagaattaagcttgcaataaaagaggcagaaattcaaaagcttcatgca
aacctgactgcaaatcagttatctcagagtcttattacttgtaatgacagccaagaaagt
agcaaattaagtagtttagaaacagaacctgtaaagctaggtggtcatcaagtagcagaa
agcgtaaaagatcaaaatcaacatactatgaacaagcaatatgaaaaagagaggcaaaga
cttgttactggaatagaagaactacgtactaagctgatacaaatagaagctgaaaattct
gatttgaaggttaacatggctcacagaactagtcagtttcagctgattcaagaggagctg
ctagagaaagcttcaaactccagcaaactggaaagtgaaatgacaaagaaatgttctcaa
cttttaactcttgagaaacagctggaagaaaagatagttgcttattcctctattgctgca
aaaaatgcagaactagaacaggagcttatggaaaagaatgaaaagataaggagtctagaa
accaatattaatacagagcatgagaaaatttgtttagcctttgaaaaagcaaagaaaatt
cacttggaacagcataaagaaatggaaaagcagattgaaagagttaggcaactagattca
gcattggaaatttgtaaggaagaacttgtcttgcatttgaatcaattggaaggaaataag
gaaaagtttgaaaaacagttaaagaagaaatctgaagagaaagagctaaagataaaaaat
cacagtcttcaagagacttctgagcaaaacgttattctacagcatactcttcagcaacag
cagcaaatgttacaacaagagacaattagaaatggagagctagaagatactcaaactaaa
cttgaaaaacaggtgtcaaaactggaacaagaacttcaaaaacaaagggaaagttcagct
gaaaagttgagaaaaatggaggagaaatgtgaatcagctgcacatgaagcagatttgaaa
aggcaaaaagtgattgagcttactggcactgccaggcaagtaaagattgagatggatcag
tacaaagaagagctgtctaaaatggaaaaggaaataatgcacctaaaacgagatggagaa
aataaagcaatgcacctctctcaattagatatgatcttagatcagacaaagacagagcta
gaaaagaaaacaaatgctgtaaaggagttagaaaagttacagcacagtactgaaactgaa
ctaacagaagccttgcaaaaacgggaagtacttgagactgaactacaaaatgctcatgga
gaattaaaaagtactttaagacaactccaggaattgagagatgtactacagaaggctcaa
ttatcattagaggaaaaatacactactataaaggatctcacagctgaacttagagaatgc
aagatggagattgaagacaaaaagcaggagctccttgaaatggatcaggcacttaaagag
agaaattgggaactaaagcaaagagcagctcaggttacacatttggatatgactattcgt
gagcacagaggagaaatggaacaaaaaataattaaattagaaggtactctggagaaatca
gaattggaacttaaagaatgtaacaaacagatagaaagtctgaatgacaaattacaaaat
gctaaagaacagcttcgagaaaaagagtttataatgctacaaaatgaacaggagataagt
caactgaaaaaagaaattgaaagaacacaacaaaggatgaaagaaatggagagtgttatg
aaagagcaagaacagtacattgccactcagtacaaggaggccatagatttggggcaagaa
ttgaggctgacccgggagcaggtgcagaactctcatacagaattggcagaggctcgtcat
cagcaagtccaagcacagagagaaatagaaaggctctctagtgaactggaggatatgaag
caactctctaaagagaaagatgctcatggaaaccatttagctgaagaactgggggcttct
aaagtacgtgaagctcatttagaagcaagaatgcaagcagaaatcaagaaattgtcagca
gaagtagaatctctcaaagaagcttatcatatggagatgatttcacatcaagagaaccat
gcaaagtggaagatttctgctgactctcaaaagtcttctgttcagcaactaaacgaacag
ttagagaaggcaaaattggaattagaagaagctcaggatactgtaagcaatttgcatcaa
caagtccaagataggaatgaagtaattgaagctgcaaatgaagcattacttactaaattt
atagaggactatatgaccaaatattggcctgtttga

>gi568815597r:93054830_93280242|GENSCAN_predicted_peptide_5|375_aa
MKMFFETNENKDTTYQNLWDTFKAVWRGKFIALNAHKRKQERSKIDILTSQLKELEKQEQ
THSKASRRQEISKIRAELKEIETQKTLQKINESRSWFLEKINKIDRPLARLIKKKREKNQ
IEAIKNDKEDITTDPIEIQTTIREYYKHLYTNKLENLEEMDKVLDTYTLPRLNQEEVESL
SRPTTGSEIEAVINSLPTKKSPGPDRFIAEFYQRYKEELESELTRLQAKISGHEKAEDIK
FLPAPFTSPTEIMPDVQDPKFAKCFHTSFSKCTKLRRSISASDLTFKIHGDEDLSEELLQ
DLKKMQLEQPSTLEESHKNLTYTQPDSFKPLTYNLEADSSENNDFNTLSGMLRYINKEVR
LLKKSSMQTGAGLNQ

>gi568815597r:93054830_93280242|GENSCAN_predicted_CDS_5|1125_bp
atgaagatgttctttgaaaccaatgagaacaaagacacaacataccagaatctctgggac
acatttaaagcagtgtggagagggaaatttatagcactaaatgcccacaagagaaagcag
gaaagatctaaaattgacatcctgacatcacaattaaaagagctagagaagcaagagcaa
acacattcaaaagctagcagaaggcaagaaataagtaagatcagagcagaactgaaggag
atagagacacaaaaaacccttcaaaaaatcaatgaatccaggagctggtttttagaaaag
atcaacaaaattgatagaccgctagcaagactaataaagaagaaaagagagaagaatcaa
atagaggcaataaaaaatgataaagaggatatcaccactgatcccatagaaatacaaact
accatcagagaatactataaacacctctacacaaataaactagaaaacctagaagaaatg
gataaagtcctcgacacatacaccctcccaagactaaaccaggaagaagttgaatccctg
agtagaccaacaacaggttctgaaattgaggcagtaattaatagcctaccaaccaaaaaa
agtccaggaccagacagattcatagctgaattctaccagaggtacaaagaggagctggaa
tcagaattaaccagattacaggccaaaatttctggacatgaaaaggcagaagacatcaag
tttctgccagccccatttacatctccaacagaaattatgcctgatgttcaagatccaaaa
tttgctaaatgttttcacacatctttttccaagtgtacaaaattacgtcgctctattagt
gccagtgatcttactttcaaaattcatggtgatgaagatctttctgaagaattactacag
gacttaaagaaaatgcaattagaacagccttcaacattagaagaaagccataagaatctg
acttacacccagccagactcatttaaacctctcacatataacctagaagctgatagttct
gagaataatgactttaacacgcttagtgggatgctaagatacataaacaaagaagtaaga
ctattaaaaaagtcttctatgcaaacaggtgctggtttaaatcag